/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: WARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.
  logger.deprecation(
=== specification ====================================================
+: rlrd.training:Training
epochs: 10
rounds: 50
steps: 2000
stats_window: null
seed: 0
tag: ''
Env:
   +: rlrd.envs:RandomDelayEnv
   seed_val: 0
   id: Ant-v4
   frame_skip: 0
   min_observation_delay: 0
   sup_observation_delay: 1
   min_action_delay: 0
   sup_action_delay: 1
   real_world_sampler: 7
   action_noise: 0.05
Test:
   +: rlrd.testing:Test
   workers: 1
   number: 1
   device: cpu
Agent:
   +: rlrd.dcac:Agent
   batchsize: 128
   memory_size: 1000000
   lr: 0.0003
   discount: 0.99
   target_update: 0.005
   reward_scale: 5.0
   entropy_scale: 1.0
   start_training: 10000
   device: cpu
   training_steps: 1.0
   loss_alpha: 0.2
   rtac: false
   Model:
      +: rlrd.dcac_models:Mlp
      hidden_units: 256
      num_critics: 2
      act_delay: true
      obs_delay: true
   OutputNorm:
      +: rlrd.nn:PopArt
      beta: 0.0003
      zero_debias: true
      start_pop: 8
__format_version__: '3'
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 1/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [00:03<00:00, 638.02it/s]
episodes                                   18
episode_length                     104.944444
returns                            -56.696618
return_std                         121.783482
average_reward                      -0.527179
round_time             0 days 00:00:03.153001
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       953.077799
return_std_test                     12.979083
average_reward_test                  0.953078
round_time_test        0 days 00:00:04.212889
round_time_total       0 days 00:00:06.992674 

=== epoch 1/10 ===== round 2/50 ======================================
100%|██████████| 2000/2000 [00:02<00:00, 708.96it/s]
episodes                                   18
episode_length                      61.666667
returns                            -36.032525
return_std                          33.242759
average_reward                      -0.587907
round_time             0 days 00:00:04.423759
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       932.837313
return_std_test                     11.640662
average_reward_test                  0.932837
round_time_test        0 days 00:00:03.664539
round_time_total       0 days 00:00:06.691888 

=== epoch 1/10 ===== round 3/50 ======================================
100%|██████████| 2000/2000 [00:03<00:00, 631.51it/s]
episodes                                   14
episode_length                     125.357143
returns                            -79.916512
return_std                         149.493451
average_reward                      -0.604018
round_time             0 days 00:00:05.203137
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       941.444637
return_std_test                     13.391131
average_reward_test                  0.941445
round_time_test        0 days 00:00:03.666633
round_time_total       0 days 00:00:06.639251 

=== epoch 1/10 ===== round 4/50 ======================================
100%|██████████| 2000/2000 [00:02<00:00, 669.09it/s]
episodes                                    4
episode_length                           74.5
returns                            -71.206494
return_std                          45.043772
average_reward                      -0.651825
round_time             0 days 00:00:04.584034
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       944.550067
return_std_test                     21.375674
average_reward_test                   0.94455
round_time_test        0 days 00:00:03.512397
round_time_total       0 days 00:00:06.321873 

=== epoch 1/10 ===== round 5/50 ======================================
100%|██████████| 2000/2000 [00:03<00:00, 627.15it/s]
episodes                                   15
episode_length                      67.666667
returns                            -32.093088
return_std                          20.654029
average_reward                      -0.523399
round_time             0 days 00:00:04.819468
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                        914.57099
return_std_test                      0.497558
average_reward_test                  0.914571
round_time_test        0 days 00:00:03.745562
round_time_total       0 days 00:00:06.601083 

=== epoch 1/10 ===== round 6/50 ======================================
/home/anon/20260123-icml-dcac/dcac/rlrd/nn.py:41: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly.  To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage()
  assert b.storage().data_ptr() == a.storage().data_ptr()
100%|██████████| 2000/2000 [20:32<00:00,  1.62it/s]
starting training
episodes                                    9
episode_length                     193.333333
returns                            -95.674374
return_std                         152.707725
average_reward                      -0.472629
round_time             0 days 00:20:34.167751
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       942.869433
return_std_test                      4.333292
average_reward_test                  0.942869
round_time_test        0 days 00:00:03.394340
round_time_total       0 days 00:20:34.170603
loss_total                         104.043291
loss_critic                        143.180153
loss_actor                          -52.50417
memory_size                         8710.2585 

=== epoch 1/10 ===== round 7/50 ======================================
100%|██████████| 2000/2000 [21:25<00:00,  1.56it/s]
episodes                                   10
episode_length                          105.5
returns                            -45.546039
return_std                          53.931228
average_reward                       -0.53294
round_time             0 days 00:21:27.796187
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       834.289965
return_std_test                      1.448407
average_reward_test                   0.83429
round_time_test        0 days 00:00:04.301852
round_time_total       0 days 00:21:27.798538
loss_total                         110.398788
loss_critic                        166.113091
loss_actor                        -112.458437
memory_size                        10485.7785 

=== epoch 1/10 ===== round 8/50 ======================================
100%|██████████| 2000/2000 [28:44<00:00,  1.16it/s]
episodes                                   11
episode_length                     138.454545
returns                            -94.916573
return_std                         185.754706
average_reward                      -0.688611
round_time             0 days 00:28:46.851412
episodes_test                            18.0
episode_length_test                108.777778
returns_test                        33.490729
return_std_test                     31.864975
average_reward_test                  0.321816
round_time_test        0 days 00:00:04.189783
round_time_total       0 days 00:28:46.854149
loss_total                         134.628317
loss_critic                        208.941167
loss_actor                        -162.623091
memory_size                          11966.72 

=== epoch 1/10 ===== round 9/50 ======================================
100%|██████████| 2000/2000 [29:59<00:00,  1.11it/s]
episodes                                    8
episode_length                        166.125
returns                           -122.441272
return_std                         226.415841
average_reward                       -0.72063
round_time             0 days 00:30:02.020370
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       675.920581
return_std_test                      5.856845
average_reward_test                  0.675921
round_time_test        0 days 00:00:03.535839
round_time_total       0 days 00:30:02.023039
loss_total                         150.989407
loss_critic                        237.555522
loss_actor                        -195.275065
memory_size                         13731.025 

=== epoch 1/10 ===== round 10/50 =====================================
100%|██████████| 2000/2000 [28:24<00:00,  1.17it/s]
episodes                                    6
episode_length                      56.833333
returns                            -39.879442
return_std                          23.246978
average_reward                      -0.716316
round_time             0 days 00:28:26.249307
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                      -142.232621
return_std_test                      8.019225
average_reward_test                 -0.142233
round_time_test        0 days 00:00:04.603388
round_time_total       0 days 00:28:26.251948
loss_total                         179.295442
loss_critic                        276.879396
loss_actor                        -211.040384
memory_size                         15504.438 

=== epoch 1/10 ===== round 11/50 =====================================
100%|██████████| 2000/2000 [27:21<00:00,  1.22it/s]
episodes                                    2
episode_length                          514.0
returns                           -385.441922
return_std                         371.120377
average_reward                        -0.7245
round_time             0 days 00:27:24.022562
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                        401.94209
return_std_test                     59.461927
average_reward_test                  0.401942
round_time_test        0 days 00:00:04.326464
round_time_total       0 days 00:27:24.025097
loss_total                         180.909409
loss_critic                        279.759429
loss_actor                        -214.490686
memory_size                         17333.526 

=== epoch 1/10 ===== round 12/50 =====================================
100%|██████████| 2000/2000 [29:24<00:00,  1.13it/s]
episodes                                   11
episode_length                     156.454545
returns                           -119.998725
return_std                         193.616786
average_reward                      -0.757199
round_time             0 days 00:29:26.700419
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       557.938815
return_std_test                     17.305098
average_reward_test                  0.557939
round_time_test        0 days 00:00:03.741430
round_time_total       0 days 00:29:26.703625
loss_total                          205.00194
loss_critic                        308.946364
loss_actor                         -210.77577
memory_size                        19114.7525 

=== epoch 1/10 ===== round 13/50 =====================================
100%|██████████| 2000/2000 [26:16<00:00,  1.27it/s]
episodes                                   18
episode_length                     102.944444
returns                             -74.44631
return_std                         169.825348
average_reward                      -0.708783
round_time             0 days 00:26:18.981638
episodes_test                             2.0
episode_length_test                     606.5
returns_test                       354.322996
return_std_test                    247.167945
average_reward_test                  0.552198
round_time_test        0 days 00:00:04.870263
round_time_total       0 days 00:26:18.984362
loss_total                         212.897943
loss_critic                        317.208565
loss_actor                         -204.34456
memory_size                        20601.5835 

=== epoch 1/10 ===== round 14/50 =====================================
100%|██████████| 2000/2000 [21:24<00:00,  1.56it/s]
episodes                                   13
episode_length                      87.307692
returns                            -75.357289
return_std                          56.744768
average_reward                       -0.82463
round_time             0 days 00:21:26.195954
episodes_test                             2.0
episode_length_test                     540.0
returns_test                       341.232637
return_std_test                    314.383332
average_reward_test                  0.597369
round_time_test        0 days 00:00:03.321341
round_time_total       0 days 00:21:26.198364
loss_total                         236.675751
loss_critic                        344.784777
loss_actor                        -195.760373
memory_size                         22061.386 

=== epoch 1/10 ===== round 15/50 =====================================
100%|██████████| 2000/2000 [21:42<00:00,  1.54it/s]
episodes                                   14
episode_length                      57.642857
returns                            -43.659142
return_std                          28.289783
average_reward                      -0.710855
round_time             0 days 00:21:43.421968
episodes_test                             3.0
episode_length_test                546.333333
returns_test                       160.294702
return_std_test                     91.405914
average_reward_test                  0.290891
round_time_test        0 days 00:00:03.400885
round_time_total       0 days 00:21:43.424180
loss_total                         253.240307
loss_critic                        362.735518
loss_actor                        -184.740557
memory_size                         23609.364 

=== epoch 1/10 ===== round 16/50 =====================================
100%|██████████| 2000/2000 [21:17<00:00,  1.57it/s]
episodes                                   30
episode_length                      64.666667
returns                            -44.709714
return_std                          43.166699
average_reward                      -0.685745
round_time             0 days 00:21:19.389356
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       538.326052
return_std_test                       70.9176
average_reward_test                  0.538326
round_time_test        0 days 00:00:03.535437
round_time_total       0 days 00:21:19.391647
loss_total                         269.358405
loss_critic                        380.484238
loss_actor                        -175.144951
memory_size                        24995.1435 

=== epoch 1/10 ===== round 17/50 =====================================
100%|██████████| 2000/2000 [21:50<00:00,  1.53it/s]
episodes                                   22
episode_length                      88.318182
returns                            -57.089851
return_std                         137.963805
average_reward                      -0.644756
round_time             0 days 00:21:51.805587
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       599.260321
return_std_test                    173.719709
average_reward_test                   0.59926
round_time_test        0 days 00:00:04.163941
round_time_total       0 days 00:21:51.807514
loss_total                         271.923783
loss_critic                        381.551362
loss_actor                        -166.586557
memory_size                        26307.2975 

=== epoch 1/10 ===== round 18/50 =====================================
100%|██████████| 2000/2000 [20:07<00:00,  1.66it/s]
episodes                                   16
episode_length                       118.5625
returns                            -77.069052
return_std                         156.483703
average_reward                      -0.646066
round_time             0 days 00:20:09.122360
episodes_test                             6.0
episode_length_test                270.166667
returns_test                       169.485813
return_std_test                     242.56571
average_reward_test                  0.602446
round_time_test        0 days 00:00:03.448679
round_time_total       0 days 00:20:09.123877
loss_total                         266.596561
loss_critic                        373.145052
loss_actor                        -159.597426
memory_size                        27679.8945 

=== epoch 1/10 ===== round 19/50 =====================================
100%|██████████| 2000/2000 [20:49<00:00,  1.60it/s]
episodes                                    4
episode_length                         251.25
returns                           -154.630313
return_std                         142.071235
average_reward                      -0.604964
round_time             0 days 00:20:50.650454
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       696.133232
return_std_test                     11.157392
average_reward_test                  0.696133
round_time_test        0 days 00:00:04.414122
round_time_total       0 days 00:20:50.652172
loss_total                         275.856918
loss_critic                        383.192576
loss_actor                        -153.485742
memory_size                         29200.917 

=== epoch 1/10 ===== round 20/50 =====================================
100%|██████████| 2000/2000 [20:27<00:00,  1.63it/s]
episodes                                   16
episode_length                       118.9375
returns                            -75.411637
return_std                         146.124915
average_reward                      -0.633992
round_time             0 days 00:20:28.971595
episodes_test                             4.0
episode_length_test                     347.5
returns_test                       133.641794
return_std_test                    180.820466
average_reward_test                  0.230531
round_time_test        0 days 00:00:04.480680
round_time_total       0 days 00:20:28.973739
loss_total                         269.530442
loss_critic                        373.648094
loss_actor                        -146.940189
memory_size                        30802.2445 

=== epoch 1/10 ===== round 21/50 =====================================
100%|██████████| 2000/2000 [20:51<00:00,  1.60it/s]
episodes                                   10
episode_length                          139.9
returns                           -103.574882
return_std                         197.004845
average_reward                      -0.689495
round_time             0 days 00:20:52.211326
episodes_test                             3.0
episode_length_test                374.333333
returns_test                       170.800808
return_std_test                    223.164008
average_reward_test                  0.602878
round_time_test        0 days 00:00:03.566379
round_time_total       0 days 00:20:52.213720
loss_total                         268.665203
loss_critic                        370.803394
loss_actor                        -139.887586
memory_size                         32447.196 

=== epoch 1/10 ===== round 22/50 =====================================
100%|██████████| 2000/2000 [20:29<00:00,  1.63it/s]
episodes                                   13
episode_length                     119.461538
returns                            -77.388912
return_std                         173.846101
average_reward                      -0.655101
round_time             0 days 00:20:30.586868
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       626.698133
return_std_test                     17.695904
average_reward_test                  0.626698
round_time_test        0 days 00:00:04.305424
round_time_total       0 days 00:20:30.588977
loss_total                         267.694632
loss_critic                        367.699345
loss_actor                        -132.324246
memory_size                        34148.1015 

=== epoch 1/10 ===== round 23/50 =====================================
100%|██████████| 2000/2000 [21:35<00:00,  1.54it/s]
episodes                                   16
episode_length                        121.375
returns                              -81.6476
return_std                         147.293616
average_reward                      -0.666139
round_time             0 days 00:21:36.560811
episodes_test                             4.0
episode_length_test                    398.25
returns_test                       216.435766
return_std_test                    283.820179
average_reward_test                  0.571557
round_time_test        0 days 00:00:03.480293
round_time_total       0 days 00:21:36.562853
loss_total                         273.636274
loss_critic                        373.388037
loss_actor                        -125.370809
memory_size                         35693.334 

=== epoch 1/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:27<00:00,  1.55it/s]
episodes                                    4
episode_length                           84.0
returns                            -51.760915
return_std                          54.499717
average_reward                      -0.632576
round_time             0 days 00:21:28.706680
episodes_test                             3.0
episode_length_test                531.333333
returns_test                       243.922904
return_std_test                    238.556156
average_reward_test                  0.518073
round_time_test        0 days 00:00:03.493600
round_time_total       0 days 00:21:28.708901
loss_total                         277.031193
loss_critic                         376.00774
loss_actor                        -118.875023
memory_size                         37292.895 

=== epoch 1/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:33<00:00,  1.62it/s]
episodes                                   16
episode_length                         48.625
returns                            -31.336121
return_std                          29.953349
average_reward                      -0.631608
round_time             0 days 00:20:35.008982
episodes_test                             2.0
episode_length_test                     524.0
returns_test                       355.804849
return_std_test                    347.253846
average_reward_test                  0.489381
round_time_test        0 days 00:00:03.372314
round_time_total       0 days 00:20:35.011427
loss_total                          281.59592
loss_critic                        380.361796
loss_actor                        -113.467609
memory_size                         38903.923 

=== epoch 1/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:54<00:00,  1.67it/s]
episodes                                    6
episode_length                     218.666667
returns                           -133.861759
return_std                         200.787526
average_reward                      -0.616789
round_time             0 days 00:19:56.170886
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       553.725978
return_std_test                      66.49786
average_reward_test                  0.553726
round_time_test        0 days 00:00:03.570242
round_time_total       0 days 00:19:56.172757
loss_total                          284.62339
loss_critic                        382.727266
loss_actor                        -107.792144
memory_size                        40581.4175 

=== epoch 1/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:54<00:00,  1.67it/s]
episodes                                    9
episode_length                     167.777778
returns                            -85.488895
return_std                         188.330023
average_reward                      -0.512611
round_time             0 days 00:19:55.877331
episodes_test                             4.0
episode_length_test                    387.75
returns_test                       208.286407
return_std_test                    262.685543
average_reward_test                  0.500233
round_time_test        0 days 00:00:03.942233
round_time_total       0 days 00:19:55.879241
loss_total                         282.678012
loss_critic                        379.397643
loss_actor                        -104.200542
memory_size                         42409.189 

=== epoch 1/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:18<00:00,  1.64it/s]
episodes                                   17
episode_length                     111.647059
returns                            -74.328804
return_std                         123.777933
average_reward                      -0.674067
round_time             0 days 00:20:19.759163
episodes_test                             4.0
episode_length_test                     292.5
returns_test                        189.53904
return_std_test                    306.849116
average_reward_test                  0.673191
round_time_test        0 days 00:00:03.399715
round_time_total       0 days 00:20:19.761096
loss_total                         278.747958
loss_critic                        373.873982
loss_actor                        -101.756165
memory_size                         44034.393 

=== epoch 1/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:41<00:00,  1.69it/s]
episodes                                   20
episode_length                          96.65
returns                            -57.201548
return_std                         130.231311
average_reward                      -0.575796
round_time             0 days 00:19:42.127543
episodes_test                             7.0
episode_length_test                283.571429
returns_test                       153.249643
return_std_test                    248.890741
average_reward_test                  0.531566
round_time_test        0 days 00:00:04.145923
round_time_total       0 days 00:19:42.128969
loss_total                         288.149998
loss_critic                        385.013831
loss_actor                          -99.30536
memory_size                         45296.515 

=== epoch 1/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:11<00:00,  1.65it/s]
episodes                                   18
episode_length                     103.055556
returns                            -62.751646
return_std                         126.466631
average_reward                      -0.616393
round_time             0 days 00:20:12.827818
episodes_test                             8.0
episode_length_test                     175.5
returns_test                        42.230961
return_std_test                     30.288311
average_reward_test                  0.322842
round_time_test        0 days 00:00:03.969306
round_time_total       0 days 00:20:12.829774
loss_total                         295.334666
loss_critic                        393.253253
loss_actor                         -96.339712
memory_size                         46727.306 

=== epoch 1/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:29<00:00,  1.63it/s]
episodes                                    3
episode_length                     352.666667
returns                            -209.33364
return_std                         283.649765
average_reward                      -0.640792
round_time             0 days 00:20:30.084608
episodes_test                             9.0
episode_length_test                129.666667
returns_test                        37.822248
return_std_test                      42.81712
average_reward_test                  0.432531
round_time_test        0 days 00:00:03.420566
round_time_total       0 days 00:20:30.086039
loss_total                         289.449611
loss_critic                         385.15315
loss_actor                         -93.364572
memory_size                         48466.803 

=== epoch 1/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:10<00:00,  1.65it/s]
episodes                                   14
episode_length                     127.642857
returns                            -86.900729
return_std                          153.56982
average_reward                      -0.686263
round_time             0 days 00:20:10.842729
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       739.253382
return_std_test                     48.123311
average_reward_test                  0.739253
round_time_test        0 days 00:00:04.145978
round_time_total       0 days 00:20:10.844581
loss_total                         293.744736
loss_critic                        389.989376
loss_actor                         -91.233849
memory_size                        50297.4515 

=== epoch 1/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:23<00:00,  1.63it/s]
episodes                                   15
episode_length                          123.0
returns                            -80.427393
return_std                         121.103865
average_reward                      -0.667195
round_time             0 days 00:20:24.619026
episodes_test                             4.0
episode_length_test                     402.5
returns_test                       228.781075
return_std_test                    313.536273
average_reward_test                  0.555798
round_time_test        0 days 00:00:03.695279
round_time_total       0 days 00:20:24.620490
loss_total                         302.656441
loss_critic                        400.518164
loss_actor                         -88.790478
memory_size                         51739.363 

=== epoch 1/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:19<00:00,  1.64it/s]
episodes                                   14
episode_length                           64.5
returns                            -43.459988
return_std                          35.516757
average_reward                      -0.622527
round_time             0 days 00:20:19.891961
episodes_test                             5.0
episode_length_test                     347.8
returns_test                       215.345998
return_std_test                    303.800577
average_reward_test                  0.590754
round_time_test        0 days 00:00:03.581563
round_time_total       0 days 00:20:19.893906
loss_total                         305.319552
loss_critic                        403.246675
loss_actor                         -86.388972
memory_size                        53350.3645 

=== epoch 1/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:39<00:00,  1.54it/s]
episodes                                   15
episode_length                     117.333333
returns                            -69.491768
return_std                         128.204031
average_reward                      -0.598322
round_time             0 days 00:21:40.504659
episodes_test                             2.0
episode_length_test                     568.0
returns_test                       309.861487
return_std_test                    326.967991
average_reward_test                  0.584326
round_time_test        0 days 00:00:04.584838
round_time_total       0 days 00:21:40.506605
loss_total                          309.20068
loss_critic                        407.717713
loss_actor                         -84.867483
memory_size                         54754.291 

=== epoch 1/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:31<00:00,  1.62it/s]
episodes                                    3
episode_length                     356.333333
returns                           -192.433532
return_std                           266.2047
average_reward                      -0.572013
round_time             0 days 00:20:32.666283
episodes_test                             3.0
episode_length_test                     358.0
returns_test                       197.859827
return_std_test                    275.358681
average_reward_test                  0.632706
round_time_test        0 days 00:00:04.322501
round_time_total       0 days 00:20:32.668193
loss_total                         295.545637
loss_critic                        390.332067
loss_actor                          -83.60011
memory_size                        56565.4285 

=== epoch 1/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:00<00:00,  1.59it/s]
episodes                                    5
episode_length                          312.2
returns                           -143.487716
return_std                           176.0292
average_reward                      -0.495022
round_time             0 days 00:21:01.150873
episodes_test                             2.0
episode_length_test                     512.0
returns_test                       352.392897
return_std_test                    344.194102
average_reward_test                   0.62534
round_time_test        0 days 00:00:03.499715
round_time_total       0 days 00:21:01.152926
loss_total                         289.922425
loss_critic                        383.286605
loss_actor                         -83.534325
memory_size                        58375.8595 

=== epoch 1/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:25<00:00,  1.56it/s]
episodes                                   10
episode_length                          141.9
returns                            -83.100943
return_std                         185.515876
average_reward                      -0.597711
round_time             0 days 00:21:26.982772
episodes_test                             4.0
episode_length_test                     279.5
returns_test                       189.407942
return_std_test                     320.76351
average_reward_test                  0.740367
round_time_test        0 days 00:00:04.310002
round_time_total       0 days 00:21:26.984615
loss_total                          282.22904
loss_critic                        373.874964
loss_actor                         -84.354679
memory_size                        60202.2335 

=== epoch 1/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:56<00:00,  1.59it/s]
episodes                                    3
episode_length                     349.333333
returns                           -192.887418
return_std                         259.821813
average_reward                      -0.556198
round_time             0 days 00:20:57.841345
episodes_test                             4.0
episode_length_test                     321.0
returns_test                       221.950595
return_std_test                    352.211545
average_reward_test                  0.651004
round_time_test        0 days 00:00:03.701894
round_time_total       0 days 00:20:57.843216
loss_total                         283.976512
loss_critic                        376.405678
loss_actor                         -85.740177
memory_size                        61902.8645 

=== epoch 1/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:13<00:00,  1.57it/s]
episodes                                   11
episode_length                     147.545455
returns                            -91.427723
return_std                         167.866642
average_reward                      -0.644277
round_time             0 days 00:21:14.785882
episodes_test                             4.0
episode_length_test                    300.25
returns_test                       197.452472
return_std_test                    300.140384
average_reward_test                  0.668802
round_time_test        0 days 00:00:03.588427
round_time_total       0 days 00:21:14.787354
loss_total                         283.737643
loss_critic                        376.490327
loss_actor                          -87.27312
memory_size                         63642.611 

=== epoch 1/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:42<00:00,  1.54it/s]
episodes                                    7
episode_length                      58.142857
returns                            -38.778298
return_std                          28.674526
average_reward                      -0.555134
round_time             0 days 00:21:43.341899
episodes_test                             2.0
episode_length_test                     510.5
returns_test                       444.836181
return_std_test                    448.028947
average_reward_test                  0.791968
round_time_test        0 days 00:00:03.752978
round_time_total       0 days 00:21:43.343806
loss_total                         283.891463
loss_critic                        377.001606
loss_actor                         -88.549137
memory_size                        65382.9795 

=== epoch 1/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [22:02<00:00,  1.51it/s]
episodes                                   21
episode_length                       82.52381
returns                             -46.06981
return_std                         115.885995
average_reward                      -0.565476
round_time             0 days 00:22:03.156020
episodes_test                             5.0
episode_length_test                     386.0
returns_test                       275.753839
return_std_test                    357.818597
average_reward_test                  0.699944
round_time_test        0 days 00:00:04.410976
round_time_total       0 days 00:22:03.157570
loss_total                         279.061614
loss_critic                        371.281792
loss_actor                         -89.819126
memory_size                        66908.6695 

=== epoch 1/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [22:01<00:00,  1.51it/s]
episodes                                    8
episode_length                        212.625
returns                           -104.933047
return_std                         160.943239
average_reward                      -0.507687
round_time             0 days 00:22:02.487220
episodes_test                             2.0
episode_length_test                     590.5
returns_test                       442.299023
return_std_test                    357.710409
average_reward_test                  0.803142
round_time_test        0 days 00:00:03.621433
round_time_total       0 days 00:22:02.489264
loss_total                         277.897508
loss_critic                         370.34912
loss_actor                         -91.908966
memory_size                        68493.0835 

=== epoch 1/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [22:16<00:00,  1.50it/s]
episodes                                   21
episode_length                      53.047619
returns                            -35.253005
return_std                          28.134875
average_reward                      -0.604144
round_time             0 days 00:22:17.782650
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       691.458899
return_std_test                      7.945166
average_reward_test                  0.691459
round_time_test        0 days 00:00:04.104438
round_time_total       0 days 00:22:17.784607
loss_total                         280.491933
loss_critic                        374.100584
loss_actor                         -93.942698
memory_size                          70179.91 

=== epoch 1/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:45<00:00,  1.53it/s]
episodes                                    1
episode_length                         1000.0
returns                           -558.686049
return_std                                0.0
average_reward                      -0.507885
round_time             0 days 00:21:46.848133
episodes_test                             4.0
episode_length_test                     356.0
returns_test                       200.742584
return_std_test                    292.502952
average_reward_test                  0.619979
round_time_test        0 days 00:00:03.963868
round_time_total       0 days 00:21:46.850027
loss_total                         276.585167
loss_critic                        369.667744
loss_actor                         -95.745169
memory_size                         71788.233 

=== epoch 1/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:50<00:00,  1.53it/s]
episodes                                   11
episode_length                      42.909091
returns                            -38.009476
return_std                          26.009862
average_reward                      -0.615743
round_time             0 days 00:21:51.938422
episodes_test                             7.0
episode_length_test                246.142857
returns_test                       130.284679
return_std_test                    253.438008
average_reward_test                  0.508006
round_time_test        0 days 00:00:03.737785
round_time_total       0 days 00:21:51.940194
loss_total                         272.665469
loss_critic                        365.338121
loss_actor                         -98.025163
memory_size                         73586.445 

=== epoch 1/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:10,  1.74it/s]
100%|██████████| 2000/2000 [22:05<00:00,  1.51it/s]
episodes                                   15
episode_length                     108.866667
returns                            -60.220097
return_std                         126.703391
average_reward                      -0.549181
round_time             0 days 00:22:06.390277
episodes_test                             4.0
episode_length_test                    415.25
returns_test                       263.735864
return_std_test                    308.763723
average_reward_test                   0.67784
round_time_test        0 days 00:00:04.093236
round_time_total       0 days 00:22:06.392128
loss_total                         274.032606
loss_critic                        367.695216
loss_actor                        -100.617864
memory_size                         75113.357 

=== epoch 1/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:12,  1.65it/s]
100%|██████████| 2000/2000 [21:57<00:00,  1.52it/s]
episodes                                    4
episode_length                           75.5
returns                             -55.22148
return_std                           39.96362
average_reward                      -0.568186
round_time             0 days 00:21:58.865443
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       793.754146
return_std_test                     44.350073
average_reward_test                  0.793754
round_time_test        0 days 00:00:04.036916
round_time_total       0 days 00:21:58.866932
loss_total                         265.053062
loss_critic                        357.173362
loss_actor                        -103.428166
memory_size                         76870.848 

=== epoch 1/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<30:11,  1.10it/s]
100%|██████████| 2000/2000 [21:51<00:00,  1.52it/s]
episodes                                   12
episode_length                         133.75
returns                            -74.400975
return_std                         145.667165
average_reward                      -0.543407
round_time             0 days 00:21:52.795718
episodes_test                             3.0
episode_length_test                     344.0
returns_test                       215.989223
return_std_test                    309.502306
average_reward_test                  0.745353
round_time_test        0 days 00:00:03.905162
round_time_total       0 days 00:21:52.797651
loss_total                         266.693949
loss_critic                        359.817458
loss_actor                        -105.800113
memory_size                        78650.9395 

=== epoch 1/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<24:28,  1.36it/s]
100%|██████████| 2000/2000 [21:49<00:00,  1.53it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
episodes                                   10
episode_length                          161.9
returns                           -112.206598
return_std                         182.955346
average_reward                      -0.647587
round_time             0 days 00:21:50.227038
episodes_test                             3.0
episode_length_test                     397.0
returns_test                       271.064637
return_std_test                    387.421479
average_reward_test                  0.757868
round_time_test        0 days 00:00:03.382250
round_time_total       0 days 00:21:50.228781
loss_total                         284.206794
loss_critic                        382.169837
loss_actor                        -107.645405
memory_size                         80198.163 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 2/10 ===== round 1/50 ======================================
  0%|          | 6/2000 [00:02<15:39,  2.12it/s]
100%|██████████| 2000/2000 [18:23<00:00,  1.81it/s]
episodes                                   36
episode_length                           52.0
returns                            -34.436951
return_std                           23.01815
average_reward                      -0.659381
round_time             0 days 00:18:23.264644
episodes_test                             2.0
episode_length_test                     514.0
returns_test                       377.302854
return_std_test                     395.06473
average_reward_test                  0.758031
round_time_test        0 days 00:00:04.032228
round_time_total       0 days 00:18:23.266592
loss_total                         288.724661
loss_critic                        388.299577
loss_actor                        -109.575034
memory_size                         81474.228 

=== epoch 2/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:59,  1.59it/s]
100%|██████████| 2000/2000 [18:35<00:00,  1.79it/s]
episodes                                   14
episode_length                     121.642857
returns                            -70.120173
return_std                         129.310339
average_reward                      -0.513573
round_time             0 days 00:18:36.490570
episodes_test                             3.0
episode_length_test                374.333333
returns_test                       298.982831
return_std_test                    376.267769
average_reward_test                  0.755988
round_time_test        0 days 00:00:04.111303
round_time_total       0 days 00:18:36.492505
loss_total                          289.92796
loss_critic                        390.020777
loss_actor                        -110.443332
memory_size                        82789.5135 

=== epoch 2/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:21,  1.63it/s]
100%|██████████| 2000/2000 [18:44<00:00,  1.78it/s]
episodes                                   12
episode_length                     133.083333
returns                            -76.506244
return_std                          132.76994
average_reward                      -0.582455
round_time             0 days 00:18:45.207081
episodes_test                             6.0
episode_length_test                324.833333
returns_test                       163.520323
return_std_test                    242.121343
average_reward_test                  0.497016
round_time_test        0 days 00:00:04.374524
round_time_total       0 days 00:18:45.208985
loss_total                         289.238521
loss_critic                        389.549004
loss_actor                        -112.003441
memory_size                         84331.149 

=== epoch 2/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:02,  1.66it/s]
100%|██████████| 2000/2000 [19:49<00:00,  1.68it/s]
episodes                                   10
episode_length                           79.3
returns                            -51.933716
return_std                           43.48839
average_reward                      -0.589016
round_time             0 days 00:19:50.602506
episodes_test                             7.0
episode_length_test                251.714286
returns_test                        85.019466
return_std_test                    203.901698
average_reward_test                  0.364192
round_time_test        0 days 00:00:03.386308
round_time_total       0 days 00:19:50.604392
loss_total                         287.762162
loss_critic                        388.055631
loss_actor                        -113.411742
memory_size                          85974.95 

=== epoch 2/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<21:40,  1.54it/s]
100%|██████████| 2000/2000 [19:03<00:00,  1.75it/s]
episodes                                   19
episode_length                      59.157895
returns                            -39.454273
return_std                          35.576992
average_reward                      -0.590852
round_time             0 days 00:19:04.178359
episodes_test                             9.0
episode_length_test                158.111111
returns_test                        94.201245
return_std_test                    262.069478
average_reward_test                  0.652475
round_time_test        0 days 00:00:03.670254
round_time_total       0 days 00:19:04.180335
loss_total                         296.630156
loss_critic                        399.528494
loss_actor                        -114.963224
memory_size                        87501.1095 

=== epoch 2/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<21:22,  1.56it/s]
100%|██████████| 2000/2000 [19:27<00:00,  1.71it/s]
episodes                                    9
episode_length                      74.888889
returns                            -48.359922
return_std                          61.294365
average_reward                      -0.554315
round_time             0 days 00:19:28.213833
episodes_test                             3.0
episode_length_test                663.666667
returns_test                        464.71748
return_std_test                    331.049503
average_reward_test                  0.696024
round_time_test        0 days 00:00:03.324225
round_time_total       0 days 00:19:28.215747
loss_total                         301.465953
loss_critic                         405.82258
loss_actor                        -115.960585
memory_size                        89014.4845 

=== epoch 2/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<29:23,  1.13it/s]
100%|██████████| 2000/2000 [19:25<00:00,  1.72it/s]
episodes                                   11
episode_length                     143.272727
returns                            -86.506465
return_std                         150.580033
average_reward                      -0.644307
round_time             0 days 00:19:26.380807
episodes_test                             3.0
episode_length_test                419.666667
returns_test                       277.402479
return_std_test                    367.576892
average_reward_test                  0.726445
round_time_test        0 days 00:00:03.341360
round_time_total       0 days 00:19:26.382677
loss_total                         302.054758
loss_critic                        406.642884
loss_actor                        -116.297778
memory_size                         90753.927 

=== epoch 2/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<22:22,  1.49it/s]
100%|██████████| 2000/2000 [19:52<00:00,  1.68it/s]
episodes                                   11
episode_length                     137.181818
returns                            -75.885025
return_std                         139.737665
average_reward                      -0.534695
round_time             0 days 00:19:53.410463
episodes_test                             4.0
episode_length_test                    347.25
returns_test                        239.39553
return_std_test                    326.873101
average_reward_test                  0.675864
round_time_test        0 days 00:00:04.012545
round_time_total       0 days 00:19:53.412331
loss_total                         314.130138
loss_critic                        421.769444
loss_actor                        -116.427113
memory_size                         92377.693 

=== epoch 2/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:02<20:41,  1.61it/s]
100%|██████████| 2000/2000 [19:43<00:00,  1.69it/s]
episodes                                   12
episode_length                      51.833333
returns                            -38.872771
return_std                          25.981992
average_reward                       -0.59248
round_time             0 days 00:19:44.664066
episodes_test                             5.0
episode_length_test                     397.6
returns_test                       184.174366
return_std_test                    197.792719
average_reward_test                  0.457425
round_time_test        0 days 00:00:03.851207
round_time_total       0 days 00:19:44.665966
loss_total                         313.973284
loss_critic                        421.767754
loss_actor                        -117.204627
memory_size                        93971.1825 

=== epoch 2/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:02<16:14,  2.05it/s]
100%|██████████| 2000/2000 [19:22<00:00,  1.72it/s]
episodes                                    6
episode_length                     192.833333
returns                            -99.830618
return_std                          194.52036
average_reward                      -0.515659
round_time             0 days 00:19:23.252345
episodes_test                             3.0
episode_length_test                478.666667
returns_test                       274.006398
return_std_test                    322.220929
average_reward_test                  0.569145
round_time_test        0 days 00:00:04.537393
round_time_total       0 days 00:19:23.254147
loss_total                         297.239761
loss_critic                        401.230308
loss_actor                        -118.722454
memory_size                         95662.434 

=== epoch 2/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<21:59,  1.51it/s]
100%|██████████| 2000/2000 [19:16<00:00,  1.73it/s]
episodes                                   18
episode_length                      62.055556
returns                            -43.996888
return_std                          50.744139
average_reward                      -0.692543
round_time             0 days 00:19:17.214330
episodes_test                             3.0
episode_length_test                364.666667
returns_test                       265.918259
return_std_test                    367.203304
average_reward_test                  0.743705
round_time_test        0 days 00:00:03.705623
round_time_total       0 days 00:19:17.216392
loss_total                         308.258654
loss_critic                        415.386903
loss_actor                        -120.254372
memory_size                        97264.1945 

=== epoch 2/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<22:21,  1.49it/s]
100%|██████████| 2000/2000 [19:55<00:00,  1.67it/s]
episodes                                   28
episode_length                      55.464286
returns                            -35.108978
return_std                          30.325216
average_reward                      -0.595091
round_time             0 days 00:19:56.165195
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       699.562332
return_std_test                     21.418658
average_reward_test                  0.699562
round_time_test        0 days 00:00:04.884596
round_time_total       0 days 00:19:56.167018
loss_total                         319.868385
loss_critic                        429.989272
loss_actor                        -120.615198
memory_size                        98705.5315 

=== epoch 2/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:35,  1.79it/s]
100%|██████████| 2000/2000 [19:36<00:00,  1.70it/s]
episodes                                   21
episode_length                      88.142857
returns                            -51.782065
return_std                         105.444654
average_reward                      -0.563652
round_time             0 days 00:19:37.794626
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                        708.36855
return_std_test                     31.927818
average_reward_test                  0.708369
round_time_test        0 days 00:00:04.143106
round_time_total       0 days 00:19:37.796530
loss_total                         330.011243
loss_critic                        442.763142
loss_actor                        -120.996382
memory_size                         99742.985 

=== epoch 2/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<21:23,  1.56it/s]
100%|██████████| 2000/2000 [18:54<00:00,  1.76it/s]
episodes                                    4
episode_length                           52.5
returns                            -26.582653
return_std                          14.069581
average_reward                      -0.555173
round_time             0 days 00:18:55.452994
episodes_test                             4.0
episode_length_test                    335.25
returns_test                       196.699696
return_std_test                    321.454856
average_reward_test                  0.658679
round_time_test        0 days 00:00:04.374979
round_time_total       0 days 00:18:55.454459
loss_total                         311.935084
loss_critic                        420.458319
loss_actor                        -122.157887
memory_size                       101497.1355 

=== epoch 2/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:03,  1.75it/s]
100%|██████████| 2000/2000 [19:52<00:00,  1.68it/s]
episodes                                    8
episode_length                          160.5
returns                            -84.030016
return_std                         157.813524
average_reward                      -0.516977
round_time             0 days 00:19:53.151449
episodes_test                             2.0
episode_length_test                     556.5
returns_test                        248.05312
return_std_test                    282.819066
average_reward_test                  0.565315
round_time_test        0 days 00:00:04.545038
round_time_total       0 days 00:19:53.153520
loss_total                         312.994903
loss_critic                        422.172753
loss_actor                        -123.716531
memory_size                        103217.608 

=== epoch 2/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:11,  1.65it/s]
100%|██████████| 2000/2000 [19:25<00:00,  1.72it/s]
episodes                                    7
episode_length                     190.428571
returns                            -98.543637
return_std                         171.758875
average_reward                      -0.502188
round_time             0 days 00:19:26.907272
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       669.486028
return_std_test                     15.682952
average_reward_test                  0.669486
round_time_test        0 days 00:00:03.375791
round_time_total       0 days 00:19:26.909313
loss_total                         305.544676
loss_critic                        413.246298
loss_actor                         -125.26184
memory_size                         105000.99 

=== epoch 2/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<22:52,  1.46it/s]
100%|██████████| 2000/2000 [21:10<00:00,  1.57it/s]
episodes                                    9
episode_length                     141.444444
returns                            -82.123363
return_std                         153.746812
average_reward                      -0.554899
round_time             0 days 00:21:11.393183
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       667.550825
return_std_test                     96.835359
average_reward_test                  0.667551
round_time_test        0 days 00:00:03.719785
round_time_total       0 days 00:21:11.396751
loss_total                         296.073005
loss_critic                        402.030246
loss_actor                        -127.755983
memory_size                        106813.591 

=== epoch 2/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<27:56,  1.19it/s]
100%|██████████| 2000/2000 [20:48<00:00,  1.60it/s]
episodes                                    2
episode_length                          517.5
returns                           -287.997592
return_std                         270.207022
average_reward                      -0.526884
round_time             0 days 00:20:51.717355
episodes_test                             3.0
episode_length_test                     353.0
returns_test                       205.465234
return_std_test                    297.981843
average_reward_test                   0.59723
round_time_test        0 days 00:00:04.593317
round_time_total       0 days 00:20:51.719307
loss_total                         293.215081
loss_critic                        399.015079
loss_actor                        -129.984942
memory_size                       108628.6225 

=== epoch 2/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<21:58,  1.52it/s]
100%|██████████| 2000/2000 [20:58<00:00,  1.59it/s]
episodes                                   17
episode_length                     114.941176
returns                            -58.235413
return_std                         119.453721
average_reward                       -0.49543
round_time             0 days 00:20:59.513854
episodes_test                             4.0
episode_length_test                     488.5
returns_test                       217.200884
return_std_test                    243.823749
average_reward_test                  0.422516
round_time_test        0 days 00:00:03.870066
round_time_total       0 days 00:20:59.516000
loss_total                         300.811033
loss_critic                         408.95643
loss_actor                        -131.770581
memory_size                       110279.6955 

=== epoch 2/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<21:01,  1.58it/s]
100%|██████████| 2000/2000 [20:27<00:00,  1.63it/s]
episodes                                   12
episode_length                     158.666667
returns                             -86.22954
return_std                         136.630483
average_reward                        -0.5521
round_time             0 days 00:20:28.827252
episodes_test                             2.0
episode_length_test                     560.5
returns_test                       308.194137
return_std_test                    303.355171
average_reward_test                  0.589426
round_time_test        0 days 00:00:03.733202
round_time_total       0 days 00:20:28.829099
loss_total                         306.221258
loss_critic                        416.162818
loss_actor                         -133.54501
memory_size                       111873.8135 

=== epoch 2/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:27,  1.71it/s]
100%|██████████| 2000/2000 [19:40<00:00,  1.69it/s]
episodes                                    2
episode_length                          524.5
returns                           -288.876914
return_std                         256.031556
average_reward                      -0.516326
round_time             0 days 00:19:41.587011
episodes_test                             5.0
episode_length_test                     254.2
returns_test                       131.745264
return_std_test                     236.12094
average_reward_test                  0.515318
round_time_test        0 days 00:00:03.973911
round_time_total       0 days 00:19:41.589060
loss_total                         300.374489
loss_critic                        409.273582
loss_actor                        -135.221916
memory_size                       113692.7565 

=== epoch 2/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:40,  1.61it/s]
100%|██████████| 2000/2000 [20:42<00:00,  1.61it/s]
episodes                                    6
episode_length                     220.666667
returns                           -117.869051
return_std                         180.977315
average_reward                      -0.546778
round_time             0 days 00:20:43.678089
episodes_test                             3.0
episode_length_test                415.333333
returns_test                         51.75473
return_std_test                     43.539074
average_reward_test                  0.328024
round_time_test        0 days 00:00:04.154461
round_time_total       0 days 00:20:43.680016
loss_total                          298.38575
loss_critic                        407.456283
loss_actor                        -137.896411
memory_size                        115484.478 

=== epoch 2/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<22:57,  1.45it/s]
100%|██████████| 2000/2000 [20:07<00:00,  1.66it/s]
episodes                                   15
episode_length                          109.2
returns                            -59.349705
return_std                         120.180673
average_reward                      -0.551831
round_time             0 days 00:20:09.030634
episodes_test                             9.0
episode_length_test                189.333333
returns_test                        52.407327
return_std_test                    185.398982
average_reward_test                  0.270147
round_time_test        0 days 00:00:04.642994
round_time_total       0 days 00:20:09.032572
loss_total                         289.945884
loss_critic                        397.533487
loss_actor                        -140.404557
memory_size                        117215.639 

=== epoch 2/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<21:09,  1.57it/s]
100%|██████████| 2000/2000 [20:30<00:00,  1.63it/s]
episodes                                    3
episode_length                     346.666667
returns                           -162.861665
return_std                         225.913704
average_reward                      -0.536816
round_time             0 days 00:20:31.359574
episodes_test                             3.0
episode_length_test                370.333333
returns_test                       195.072531
return_std_test                    260.374481
average_reward_test                  0.541104
round_time_test        0 days 00:00:04.063672
round_time_total       0 days 00:20:31.361504
loss_total                         288.034295
loss_critic                        395.683253
loss_actor                        -142.561566
memory_size                       118860.9625 

=== epoch 2/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<24:45,  1.34it/s]
100%|██████████| 2000/2000 [20:50<00:00,  1.60it/s]
episodes                                   11
episode_length                     139.272727
returns                            -71.834682
return_std                         125.582653
average_reward                      -0.522642
round_time             0 days 00:20:51.265542
episodes_test                            11.0
episode_length_test                141.636364
returns_test                        62.463094
return_std_test                    197.977122
average_reward_test                  0.454543
round_time_test        0 days 00:00:03.654017
round_time_total       0 days 00:20:51.267308
loss_total                         284.531693
loss_critic                        391.898716
loss_actor                        -144.936429
memory_size                          120679.9 

=== epoch 2/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<25:20,  1.31it/s]
100%|██████████| 2000/2000 [21:02<00:00,  1.58it/s]
episodes                                    9
episode_length                     151.666667
returns                            -73.185068
return_std                         141.136632
average_reward                       -0.50212
round_time             0 days 00:21:03.638760
episodes_test                             5.0
episode_length_test                     271.6
returns_test                       154.847269
return_std_test                    305.305167
average_reward_test                  0.620277
round_time_test        0 days 00:00:03.861037
round_time_total       0 days 00:21:03.640621
loss_total                         283.717719
loss_critic                        391.456546
loss_actor                        -147.237613
memory_size                        122427.599 

=== epoch 2/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<23:10,  1.44it/s]
100%|██████████| 2000/2000 [20:46<00:00,  1.60it/s]
episodes                                   14
episode_length                     138.928571
returns                            -72.649627
return_std                         117.150968
average_reward                      -0.541006
round_time             0 days 00:20:47.880825
episodes_test                             4.0
episode_length_test                     385.0
returns_test                        90.863939
return_std_test                    133.364448
average_reward_test                  0.317514
round_time_test        0 days 00:00:04.365665
round_time_total       0 days 00:20:47.882252
loss_total                         297.472664
loss_critic                        409.100682
loss_actor                        -149.039436
memory_size                        123958.376 

=== epoch 2/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<25:38,  1.30it/s]
100%|██████████| 2000/2000 [21:26<00:00,  1.55it/s]
episodes                                   15
episode_length                     129.466667
returns                            -72.707868
return_std                         116.120202
average_reward                      -0.563368
round_time             0 days 00:21:27.460160
episodes_test                             3.0
episode_length_test                     397.0
returns_test                        231.42743
return_std_test                     327.18153
average_reward_test                   0.41137
round_time_test        0 days 00:00:03.523312
round_time_total       0 days 00:21:27.461594
loss_total                         299.758908
loss_critic                        412.328465
loss_actor                        -150.519352
memory_size                       125592.6265 

=== epoch 2/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<33:45,  1.01s/it]
100%|██████████| 2000/2000 [21:20<00:00,  1.56it/s]
episodes                                    7
episode_length                     185.714286
returns                            -88.561825
return_std                         159.710085
average_reward                      -0.471337
round_time             0 days 00:21:21.910721
episodes_test                             7.0
episode_length_test                     219.0
returns_test                        83.864795
return_std_test                    176.951937
average_reward_test                  0.437374
round_time_test        0 days 00:00:04.760730
round_time_total       0 days 00:21:21.912208
loss_total                         299.654134
loss_critic                        412.727367
loss_actor                        -152.638831
memory_size                        127126.966 

=== epoch 2/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<22:12,  1.50it/s]
100%|██████████| 2000/2000 [20:46<00:00,  1.60it/s]
episodes                                   16
episode_length                        105.375
returns                            -54.169313
return_std                         124.943858
average_reward                       -0.52165
round_time             0 days 00:20:47.896661
episodes_test                             3.0
episode_length_test                635.333333
returns_test                       259.447435
return_std_test                    240.056322
average_reward_test                  0.400088
round_time_test        0 days 00:00:03.656058
round_time_total       0 days 00:20:47.898108
loss_total                         290.848991
loss_critic                         402.32612
loss_actor                        -155.059552
memory_size                        128933.102 

=== epoch 2/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<25:02,  1.33it/s]
100%|██████████| 2000/2000 [21:16<00:00,  1.57it/s]
episodes                                    8
episode_length                        196.375
returns                           -104.517375
return_std                         157.092304
average_reward                      -0.541691
round_time             0 days 00:21:17.493319
episodes_test                             2.0
episode_length_test                     508.5
returns_test                       238.203567
return_std_test                    233.379008
average_reward_test                  0.523921
round_time_test        0 days 00:00:03.358568
round_time_total       0 days 00:21:17.495009
loss_total                         297.908489
loss_critic                        411.655619
loss_actor                         -157.08006
memory_size                        130367.872 

=== epoch 2/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:44,  1.60it/s]
100%|██████████| 2000/2000 [21:08<00:00,  1.58it/s]
episodes                                   10
episode_length                          141.3
returns                            -76.850034
return_std                          154.67574
average_reward                      -0.541664
round_time             0 days 00:21:09.440144
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       538.652526
return_std_test                     56.227363
average_reward_test                  0.538653
round_time_test        0 days 00:00:03.646108
round_time_total       0 days 00:21:09.441549
loss_total                         294.991582
loss_critic                        408.567468
loss_actor                        -159.311992
memory_size                       132105.6275 

=== epoch 2/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<24:22,  1.37it/s]
100%|██████████| 2000/2000 [21:59<00:00,  1.52it/s]
episodes                                   19
episode_length                      98.263158
returns                             -57.55243
return_std                         103.704856
average_reward                      -0.565705
round_time             0 days 00:22:00.048805
episodes_test                             2.0
episode_length_test                     503.5
returns_test                       359.416852
return_std_test                    354.286096
average_reward_test                  0.620931
round_time_test        0 days 00:00:04.681785
round_time_total       0 days 00:22:00.050614
loss_total                         299.359048
loss_critic                        414.431663
loss_actor                         -160.93144
memory_size                       133662.4705 

=== epoch 2/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<25:04,  1.33it/s]
100%|██████████| 2000/2000 [21:17<00:00,  1.57it/s]
episodes                                    8
episode_length                         169.25
returns                            -77.100129
return_std                         152.190158
average_reward                      -0.456723
round_time             0 days 00:21:18.774855
episodes_test                             4.0
episode_length_test                    360.75
returns_test                       125.749052
return_std_test                    146.197336
average_reward_test                  0.373974
round_time_test        0 days 00:00:03.490228
round_time_total       0 days 00:21:18.776779
loss_total                          306.44165
loss_critic                        423.752248
loss_actor                        -162.800772
memory_size                       135255.8155 

=== epoch 2/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<23:26,  1.42it/s]
100%|██████████| 2000/2000 [21:44<00:00,  1.53it/s]
episodes                                   17
episode_length                      55.058824
returns                            -34.177256
return_std                           31.19109
average_reward                       -0.56662
round_time             0 days 00:21:45.749797
episodes_test                             4.0
episode_length_test                    391.25
returns_test                       221.514867
return_std_test                    282.779936
average_reward_test                  0.546918
round_time_test        0 days 00:00:04.953300
round_time_total       0 days 00:21:45.751703
loss_total                         309.170452
loss_critic                        427.612402
loss_actor                        -164.597379
memory_size                        136835.894 

=== epoch 2/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<21:14,  1.57it/s]
100%|██████████| 2000/2000 [21:56<00:00,  1.52it/s]
episodes                                   11
episode_length                      74.818182
returns                            -62.744394
return_std                          88.340266
average_reward                      -0.631105
round_time             0 days 00:21:57.012183
episodes_test                             3.0
episode_length_test                     377.0
returns_test                       207.257972
return_std_test                    257.797545
average_reward_test                  0.618495
round_time_test        0 days 00:00:03.524144
round_time_total       0 days 00:21:57.013964
loss_total                          314.11584
loss_critic                        434.251053
loss_actor                        -166.425039
memory_size                        138386.868 

=== epoch 2/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:43,  1.61it/s]
100%|██████████| 2000/2000 [21:41<00:00,  1.54it/s]
episodes                                    7
episode_length                     200.142857
returns                            -100.92287
return_std                         153.623688
average_reward                      -0.486602
round_time             0 days 00:21:42.295231
episodes_test                             4.0
episode_length_test                    321.75
returns_test                       191.838614
return_std_test                    248.375167
average_reward_test                  0.611398
round_time_test        0 days 00:00:04.048462
round_time_total       0 days 00:21:42.297108
loss_total                         306.575803
loss_critic                        425.173612
loss_actor                        -167.815464
memory_size                        140117.828 

=== epoch 2/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<23:32,  1.41it/s]
100%|██████████| 2000/2000 [21:15<00:00,  1.57it/s]
episodes                                    4
episode_length                          269.0
returns                           -120.497475
return_std                         177.973279
average_reward                      -0.501359
round_time             0 days 00:21:16.779416
episodes_test                             5.0
episode_length_test                     358.8
returns_test                       197.366986
return_std_test                    280.191614
average_reward_test                  0.546475
round_time_test        0 days 00:00:03.594652
round_time_total       0 days 00:21:16.780865
loss_total                         308.770767
loss_critic                        428.411115
loss_actor                        -169.790653
memory_size                       141938.4985 

=== epoch 2/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:14,  1.73it/s]
100%|██████████| 2000/2000 [16:29<00:00,  2.02it/s]
episodes                                    7
episode_length                     177.428571
returns                            -93.623128
return_std                         160.304132
average_reward                      -0.528416
round_time             0 days 00:16:29.907544
episodes_test                             2.0
episode_length_test                     514.5
returns_test                        396.75853
return_std_test                    377.548009
average_reward_test                  0.710873
round_time_test        0 days 00:00:03.379580
round_time_total       0 days 00:16:29.908882
loss_total                         304.357264
loss_critic                        423.329928
loss_actor                        -171.533419
memory_size                        143747.599 

=== epoch 2/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:52,  2.10it/s]
100%|██████████| 2000/2000 [15:01<00:00,  2.22it/s]
episodes                                    4
episode_length                          283.0
returns                             -132.6712
return_std                         211.691752
average_reward                      -0.475992
round_time             0 days 00:15:02.551380
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       722.556401
return_std_test                    101.882606
average_reward_test                  0.722556
round_time_test        0 days 00:00:03.368342
round_time_total       0 days 00:15:02.552644
loss_total                         300.249025
loss_critic                        418.854984
loss_actor                         -174.17484
memory_size                       145573.9845 

=== epoch 2/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:33,  2.01it/s]
100%|██████████| 2000/2000 [14:14<00:00,  2.34it/s]
episodes                                   16
episode_length                        113.875
returns                            -60.130126
return_std                         112.260024
average_reward                      -0.551814
round_time             0 days 00:14:15.429192
episodes_test                            13.0
episode_length_test                141.846154
returns_test                        75.235141
return_std_test                    181.192848
average_reward_test                  0.496282
round_time_test        0 days 00:00:03.151241
round_time_total       0 days 00:14:15.430388
loss_total                         303.562945
loss_critic                        423.573441
loss_actor                        -176.479066
memory_size                        147221.753 

=== epoch 2/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:36,  2.28it/s]
100%|██████████| 2000/2000 [13:22<00:00,  2.49it/s]
episodes                                    3
episode_length                     362.333333
returns                           -176.399359
return_std                         207.781903
average_reward                      -0.438822
round_time             0 days 00:13:23.418040
episodes_test                             5.0
episode_length_test                     244.2
returns_test                       143.410405
return_std_test                    278.074794
average_reward_test                  0.640897
round_time_test        0 days 00:00:03.001617
round_time_total       0 days 00:13:23.419221
loss_total                         304.763241
loss_critic                        425.705347
loss_actor                        -179.005211
memory_size                        148935.061 

=== epoch 2/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:08,  2.53it/s]
100%|██████████| 2000/2000 [12:11<00:00,  2.73it/s]
episodes                                   20
episode_length                           55.6
returns                             -37.15877
return_std                          39.637056
average_reward                      -0.602916
round_time             0 days 00:12:12.204602
episodes_test                             8.0
episode_length_test                     169.0
returns_test                        85.144245
return_std_test                    237.215082
average_reward_test                  0.567901
round_time_test        0 days 00:00:02.831446
round_time_total       0 days 00:12:12.205702
loss_total                         312.347429
loss_critic                        435.726253
loss_actor                        -181.167897
memory_size                       150524.6585 

=== epoch 2/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:41<00:00,  2.85it/s]
episodes                                   14
episode_length                     112.928571
returns                            -63.039637
return_std                         121.375709
average_reward                      -0.555097
round_time             0 days 00:11:42.016987
episodes_test                             3.0
episode_length_test                     368.0
returns_test                       263.227354
return_std_test                    351.495647
average_reward_test                  0.544953
round_time_test        0 days 00:00:02.851404
round_time_total       0 days 00:11:42.018106
loss_total                         316.110437
loss_critic                         440.88477
loss_actor                        -182.986924
memory_size                       152067.9405 

=== epoch 2/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:21<00:00,  2.93it/s]
episodes                                   17
episode_length                     116.117647
returns                            -61.919328
return_std                         110.549668
average_reward                      -0.541842
round_time             0 days 00:11:22.087815
episodes_test                             4.0
episode_length_test                    303.75
returns_test                       171.906883
return_std_test                    253.301304
average_reward_test                  0.460745
round_time_test        0 days 00:00:02.852263
round_time_total       0 days 00:11:22.088921
loss_total                         320.664962
loss_critic                        446.905687
loss_actor                        -184.297971
memory_size                        153529.518 

=== epoch 2/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:35<00:00,  2.88it/s]
episodes                                    6
episode_length                          201.0
returns                             -88.77101
return_std                         153.877284
average_reward                      -0.463488
round_time             0 days 00:11:36.014017
episodes_test                             3.0
episode_length_test                348.666667
returns_test                       216.830858
return_std_test                    306.490678
average_reward_test                  0.515843
round_time_test        0 days 00:00:02.821392
round_time_total       0 days 00:11:36.015118
loss_total                         321.020923
loss_critic                        447.684586
loss_actor                        -185.633756
memory_size                        155205.859 

=== epoch 2/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:28<00:00,  2.91it/s]
episodes                                    5
episode_length                          237.4
returns                           -119.190123
return_std                         187.535651
average_reward                      -0.490541
round_time             0 days 00:11:28.557527
episodes_test                             2.0
episode_length_test                     527.0
returns_test                       353.299699
return_std_test                    367.370367
average_reward_test                  0.439097
round_time_test        0 days 00:00:02.888342
round_time_total       0 days 00:11:28.558634
loss_total                         299.802696
loss_critic                        421.728387
loss_actor                        -187.900096
memory_size                       157042.2555 

=== epoch 2/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:28<00:00,  2.91it/s]
episodes                                    4
episode_length                         314.25
returns                            -140.57731
return_std                         162.892175
average_reward                       -0.50358
round_time             0 days 00:11:28.673376
episodes_test                             3.0
episode_length_test                     361.0
returns_test                         51.91806
return_std_test                     85.945689
average_reward_test                  0.412966
round_time_test        0 days 00:00:02.873989
round_time_total       0 days 00:11:28.674491
loss_total                         300.356713
loss_critic                        423.432115
loss_actor                        -191.944923
memory_size                         158880.09 

=== epoch 2/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:38<00:00,  2.86it/s]
episodes                                    9
episode_length                     147.888889
returns                            -83.172897
return_std                         148.414254
average_reward                      -0.533925
round_time             0 days 00:11:38.679962
episodes_test                             7.0
episode_length_test                238.142857
returns_test                        72.095698
return_std_test                     141.17677
average_reward_test                  0.282083
round_time_test        0 days 00:00:02.788564
round_time_total       0 days 00:11:38.681069
loss_total                         311.907967
loss_critic                        438.356947
loss_actor                        -193.887986
memory_size                        160727.288 

=== epoch 2/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:53<00:00,  2.80it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
episodes                                   20
episode_length                           95.9
returns                            -47.002473
return_std                         106.309737
average_reward                      -0.500577
round_time             0 days 00:11:53.679501
episodes_test                             2.0
episode_length_test                     611.5
returns_test                       380.132527
return_std_test                    378.129383
average_reward_test                  0.537173
round_time_test        0 days 00:00:02.818381
round_time_total       0 days 00:11:53.680614
loss_total                         316.060398
loss_critic                        444.025684
loss_actor                        -195.800774
memory_size                       162236.4185 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 3/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [10:30<00:00,  3.17it/s]
episodes                                    3
episode_length                     345.666667
returns                           -170.055906
return_std                         226.488289
average_reward                      -0.473089
round_time             0 days 00:10:30.088049
episodes_test                             6.0
episode_length_test                303.333333
returns_test                         97.86342
return_std_test                    161.964517
average_reward_test                  0.314328
round_time_test        0 days 00:00:02.776212
round_time_total       0 days 00:10:30.089174
loss_total                         305.704426
loss_critic                        431.675011
loss_actor                        -198.177943
memory_size                        163864.652 

=== epoch 3/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                    6
episode_length                          191.0
returns                            -96.045718
return_std                         176.858603
average_reward                      -0.487374
round_time             0 days 00:10:35.070135
episodes_test                             3.0
episode_length_test                     376.0
returns_test                        259.05361
return_std_test                    361.449958
average_reward_test                  0.556392
round_time_test        0 days 00:00:02.828936
round_time_total       0 days 00:10:35.071231
loss_total                         293.447915
loss_critic                        417.150019
loss_actor                        -201.360528
memory_size                       165709.4705 

=== epoch 3/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                   13
episode_length                          135.0
returns                            -81.641886
return_std                         131.515705
average_reward                      -0.601992
round_time             0 days 00:10:35.136840
episodes_test                             7.0
episode_length_test                233.571429
returns_test                       121.945045
return_std_test                    261.418317
average_reward_test                   0.52924
round_time_test        0 days 00:00:02.763234
round_time_total       0 days 00:10:35.137945
loss_total                         297.343834
loss_critic                        422.525273
loss_actor                        -203.381947
memory_size                       167449.0395 

=== epoch 3/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:35<00:00,  3.15it/s]
episodes                                    4
episode_length                           43.0
returns                            -25.932041
return_std                          28.595375
average_reward                      -0.503722
round_time             0 days 00:10:36.010114
episodes_test                             8.0
episode_length_test                   203.125
returns_test                        66.681664
return_std_test                    163.070166
average_reward_test                  0.403956
round_time_test        0 days 00:00:02.832071
round_time_total       0 days 00:10:36.011235
loss_total                         296.649182
loss_critic                        422.118791
loss_actor                        -205.229286
memory_size                        169180.444 

=== epoch 3/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                    3
episode_length                     371.333333
returns                           -188.754907
return_std                          184.91915
average_reward                      -0.502311
round_time             0 days 00:10:39.423967
episodes_test                             8.0
episode_length_test                     193.5
returns_test                        86.015622
return_std_test                    222.670197
average_reward_test                  0.504407
round_time_test        0 days 00:00:02.782176
round_time_total       0 days 00:10:39.425073
loss_total                         290.342856
loss_critic                        414.857676
loss_actor                        -207.716451
memory_size                       171035.4835 

=== epoch 3/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:36<00:00,  3.14it/s]
episodes                                   14
episode_length                      59.571429
returns                            -30.173699
return_std                          31.197259
average_reward                      -0.495149
round_time             0 days 00:10:36.740581
episodes_test                             7.0
episode_length_test                212.428571
returns_test                       101.541188
return_std_test                    229.516221
average_reward_test                  0.539934
round_time_test        0 days 00:00:02.840208
round_time_total       0 days 00:10:36.741689
loss_total                         297.975277
loss_critic                        424.965982
loss_actor                        -209.987572
memory_size                        172684.344 

=== epoch 3/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:37<00:00,  3.14it/s]
episodes                                   11
episode_length                     152.272727
returns                            -81.165606
return_std                          132.47487
average_reward                       -0.51906
round_time             0 days 00:10:37.803036
episodes_test                             2.0
episode_length_test                     546.0
returns_test                        352.17266
return_std_test                    342.439721
average_reward_test                  0.451717
round_time_test        0 days 00:00:02.823980
round_time_total       0 days 00:10:37.804128
loss_total                          303.57388
loss_critic                        432.246065
loss_actor                        -211.114888
memory_size                       174321.5545 

=== epoch 3/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                    5
episode_length                           71.0
returns                            -49.020922
return_std                          23.857408
average_reward                      -0.552581
round_time             0 days 00:10:38.644973
episodes_test                             2.0
episode_length_test                     514.0
returns_test                       223.393002
return_std_test                    227.413898
average_reward_test                  0.444954
round_time_test        0 days 00:00:02.818651
round_time_total       0 days 00:10:38.646070
loss_total                         305.690154
loss_critic                        435.185021
loss_actor                        -212.289342
memory_size                       176085.6645 

=== epoch 3/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:37<00:00,  3.14it/s]
episodes                                   16
episode_length                       118.5625
returns                            -64.618657
return_std                         106.842634
average_reward                       -0.54078
round_time             0 days 00:10:37.946196
episodes_test                             6.0
episode_length_test                211.833333
returns_test                         100.9611
return_std_test                    204.634604
average_reward_test                  0.575271
round_time_test        0 days 00:00:02.833715
round_time_total       0 days 00:10:37.947299
loss_total                         306.511508
loss_critic                        436.204958
loss_actor                        -212.262321
memory_size                       177700.1535 

=== epoch 3/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                    4
episode_length                          296.0
returns                           -151.467701
return_std                         189.185677
average_reward                       -0.49683
round_time             0 days 00:10:39.476563
episodes_test                             3.0
episode_length_test                     368.0
returns_test                       228.541231
return_std_test                     345.26515
average_reward_test                   0.66711
round_time_test        0 days 00:00:02.874076
round_time_total       0 days 00:10:39.477661
loss_total                         305.814508
loss_critic                        435.612346
loss_actor                        -213.376874
memory_size                       179411.4005 

=== epoch 3/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:43<00:00,  3.11it/s]
episodes                                    5
episode_length                          256.4
returns                           -125.441412
return_std                         164.945058
average_reward                      -0.519443
round_time             0 days 00:10:43.692576
episodes_test                            11.0
episode_length_test                164.545455
returns_test                        58.586468
return_std_test                    207.798084
average_reward_test                  0.315839
round_time_test        0 days 00:00:02.812328
round_time_total       0 days 00:10:43.693665
loss_total                         302.363493
loss_critic                        431.495878
loss_actor                        -214.166074
memory_size                       181233.0975 

=== epoch 3/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:45<00:00,  3.10it/s]
episodes                                    9
episode_length                     162.888889
returns                            -99.450297
return_std                         151.655816
average_reward                      -0.550977
round_time             0 days 00:10:46.193488
episodes_test                             7.0
episode_length_test                198.571429
returns_test                        93.914226
return_std_test                    231.897369
average_reward_test                  0.506648
round_time_test        0 days 00:00:02.819749
round_time_total       0 days 00:10:46.194583
loss_total                         311.655567
loss_critic                        443.430883
loss_actor                        -215.445732
memory_size                       183038.5115 

=== epoch 3/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:43<00:00,  3.11it/s]
episodes                                   15
episode_length                           55.0
returns                            -36.794249
return_std                          33.447899
average_reward                      -0.544189
round_time             0 days 00:10:43.510608
episodes_test                             5.0
episode_length_test                     268.0
returns_test                        75.174678
return_std_test                    132.122551
average_reward_test                  0.442326
round_time_test        0 days 00:00:02.911169
round_time_total       0 days 00:10:43.511712
loss_total                         309.176042
loss_critic                        440.512043
loss_actor                        -216.167991
memory_size                        184755.512 

=== epoch 3/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:36<00:00,  3.14it/s]
episodes                                    4
episode_length                         295.25
returns                           -149.580579
return_std                         196.538444
average_reward                      -0.507471
round_time             0 days 00:10:37.181914
episodes_test                             4.0
episode_length_test                     296.5
returns_test                       120.281196
return_std_test                    216.813195
average_reward_test                  0.523117
round_time_test        0 days 00:00:02.833540
round_time_total       0 days 00:10:37.183006
loss_total                         309.528115
loss_critic                        441.182763
loss_actor                        -217.090507
memory_size                       186338.4655 

=== epoch 3/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:35<00:00,  3.15it/s]
episodes                                    5
episode_length                          242.6
returns                           -109.702796
return_std                          158.33283
average_reward                      -0.444641
round_time             0 days 00:10:36.017308
episodes_test                             3.0
episode_length_test                     401.0
returns_test                       230.729861
return_std_test                    342.197148
average_reward_test                  0.596202
round_time_test        0 days 00:00:02.800096
round_time_total       0 days 00:10:36.018421
loss_total                         313.897179
loss_critic                        447.084736
loss_actor                        -218.853075
memory_size                        188121.305 

=== epoch 3/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                   33
episode_length                      49.575758
returns                            -24.236611
return_std                          36.359908
average_reward                       -0.53036
round_time             0 days 00:10:40.929127
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       717.448116
return_std_test                     35.452009
average_reward_test                  0.717448
round_time_test        0 days 00:00:02.835398
round_time_total       0 days 00:10:40.930232
loss_total                         315.068474
loss_critic                        448.918578
loss_actor                         -220.33197
memory_size                        189620.029 

=== epoch 3/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                   18
episode_length                     104.444444
returns                            -53.969737
return_std                         107.068227
average_reward                      -0.517414
round_time             0 days 00:10:40.788178
episodes_test                             6.0
episode_length_test                283.166667
returns_test                       134.722676
return_std_test                    247.610987
average_reward_test                  0.506258
round_time_test        0 days 00:00:02.898097
round_time_total       0 days 00:10:40.789279
loss_total                         337.774121
loss_critic                        477.153816
loss_actor                         -219.74469
memory_size                        190920.437 

=== epoch 3/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:43<00:00,  3.11it/s]
episodes                                   13
episode_length                     128.384615
returns                             -69.20522
return_std                         109.818136
average_reward                      -0.522269
round_time             0 days 00:10:43.780771
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       518.825554
return_std_test                     85.066787
average_reward_test                  0.518826
round_time_test        0 days 00:00:02.786147
round_time_total       0 days 00:10:43.781886
loss_total                         344.148998
loss_critic                        484.969621
loss_actor                        -219.133526
memory_size                        192320.175 

=== epoch 3/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                    5
episode_length                          254.8
returns                           -122.594113
return_std                         139.401751
average_reward                       -0.47737
round_time             0 days 00:10:40.191632
episodes_test                             6.0
episode_length_test                311.666667
returns_test                       159.783684
return_std_test                    301.840946
average_reward_test                  0.532628
round_time_test        0 days 00:00:02.757464
round_time_total       0 days 00:10:40.192730
loss_total                         338.494153
loss_critic                        477.866068
loss_actor                        -218.993537
memory_size                        194086.241 

=== epoch 3/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:43<00:00,  3.11it/s]
episodes                                   17
episode_length                     110.705882
returns                            -72.703022
return_std                          103.26981
average_reward                      -0.646526
round_time             0 days 00:10:43.570265
episodes_test                             3.0
episode_length_test                     357.0
returns_test                       187.769354
return_std_test                    268.676446
average_reward_test                  0.656369
round_time_test        0 days 00:00:02.787087
round_time_total       0 days 00:10:43.571366
loss_total                         350.867148
loss_critic                        493.308469
loss_actor                         -218.89817
memory_size                        195681.478 

=== epoch 3/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:42<00:00,  3.11it/s]
episodes                                   13
episode_length                           77.0
returns                            -58.667178
return_std                          54.255139
average_reward                      -0.614107
round_time             0 days 00:10:42.587196
episodes_test                             6.0
episode_length_test                     195.5
returns_test                       107.317615
return_std_test                    198.124317
average_reward_test                  0.474271
round_time_test        0 days 00:00:02.803706
round_time_total       0 days 00:10:42.588363
loss_total                         362.818819
loss_critic                        508.260007
loss_actor                        -218.945964
memory_size                        197092.704 

=== epoch 3/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                    9
episode_length                     166.444444
returns                             -90.87764
return_std                         147.779161
average_reward                      -0.552347
round_time             0 days 00:11:01.402537
episodes_test                             4.0
episode_length_test                     275.5
returns_test                       162.462062
return_std_test                    311.346275
average_reward_test                  0.598337
round_time_test        0 days 00:00:02.859266
round_time_total       0 days 00:11:01.403642
loss_total                         364.867743
loss_critic                        510.791965
loss_actor                        -218.829179
memory_size                        198773.611 

=== epoch 3/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                   13
episode_length                     112.615385
returns                            -58.839075
return_std                         128.920548
average_reward                      -0.525637
round_time             0 days 00:11:00.836031
episodes_test                             6.0
episode_length_test                     207.0
returns_test                       107.756391
return_std_test                    251.564726
average_reward_test                  0.559553
round_time_test        0 days 00:00:02.842966
round_time_total       0 days 00:11:00.837148
loss_total                         359.184545
loss_critic                        503.808408
loss_actor                         -219.31094
memory_size                       200563.6625 

=== epoch 3/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                    2
episode_length                           50.0
returns                             -47.85324
return_std                          18.789684
average_reward                      -0.474597
round_time             0 days 00:11:04.956712
episodes_test                             4.0
episode_length_test                    429.75
returns_test                       100.797268
return_std_test                     106.23581
average_reward_test                  0.276779
round_time_test        0 days 00:00:02.753223
round_time_total       0 days 00:11:04.957828
loss_total                         359.846934
loss_critic                        504.788198
loss_actor                        -219.918157
memory_size                        202294.794 

=== epoch 3/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                   15
episode_length                          122.2
returns                             -68.50117
return_std                         112.739702
average_reward                      -0.543993
round_time             0 days 00:11:02.467615
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       688.002644
return_std_test                      9.890653
average_reward_test                  0.688003
round_time_test        0 days 00:00:02.897799
round_time_total       0 days 00:11:02.468717
loss_total                         363.244576
loss_critic                         509.21125
loss_actor                        -220.622154
memory_size                       203912.2445 

=== epoch 3/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                    3
episode_length                          359.0
returns                           -183.603164
return_std                         230.557144
average_reward                      -0.472055
round_time             0 days 00:11:03.468388
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       609.706812
return_std_test                     96.267612
average_reward_test                  0.609707
round_time_test        0 days 00:00:02.829043
round_time_total       0 days 00:11:03.469507
loss_total                         358.268789
loss_critic                        503.201144
loss_actor                        -221.460661
memory_size                        205719.345 

=== epoch 3/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:21<00:00,  2.93it/s]
episodes                                   10
episode_length                          144.8
returns                            -66.830609
return_std                         129.012671
average_reward                      -0.477225
round_time             0 days 00:11:22.404935
episodes_test                             8.0
episode_length_test                   211.125
returns_test                         92.74637
return_std_test                    258.576289
average_reward_test                  0.463151
round_time_test        0 days 00:00:02.801739
round_time_total       0 days 00:11:22.406043
loss_total                         359.541512
loss_critic                        505.144546
loss_actor                        -222.870658
memory_size                       207402.6905 

=== epoch 3/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:29<00:00,  2.90it/s]
episodes                                    6
episode_length                           92.0
returns                            -50.971972
return_std                          40.612823
average_reward                      -0.489483
round_time             0 days 00:11:30.136982
episodes_test                             5.0
episode_length_test                     239.2
returns_test                        148.24517
return_std_test                    296.522563
average_reward_test                  0.439497
round_time_test        0 days 00:00:02.777992
round_time_total       0 days 00:11:30.138083
loss_total                         364.664946
loss_critic                        511.447934
loss_actor                        -222.467041
memory_size                        209192.122 

=== epoch 3/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:18<00:00,  2.95it/s]
episodes                                   12
episode_length                     133.666667
returns                            -64.289764
return_std                         129.460004
average_reward                      -0.471935
round_time             0 days 00:11:19.200762
episodes_test                             2.0
episode_length_test                     506.5
returns_test                      -251.785342
return_std_test                    249.295031
average_reward_test                 -0.511825
round_time_test        0 days 00:00:02.891336
round_time_total       0 days 00:11:19.201866
loss_total                          363.72392
loss_critic                        510.777112
loss_actor                        -224.488884
memory_size                       210935.2085 

=== epoch 3/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                    5
episode_length                          233.0
returns                           -118.257945
return_std                         170.660464
average_reward                      -0.470885
round_time             0 days 00:11:04.724023
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       714.288707
return_std_test                      4.842086
average_reward_test                  0.714289
round_time_test        0 days 00:00:02.836297
round_time_total       0 days 00:11:04.725118
loss_total                         358.005987
loss_critic                          503.7233
loss_actor                        -224.863295
memory_size                       212667.4045 

=== epoch 3/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                    1
episode_length                         1000.0
returns                           -449.597534
return_std                                0.0
average_reward                      -0.398653
round_time             0 days 00:11:00.555068
episodes_test                             3.0
episode_length_test                358.666667
returns_test                       -49.504841
return_std_test                     64.967594
average_reward_test                  0.289772
round_time_test        0 days 00:00:02.817227
round_time_total       0 days 00:11:00.556167
loss_total                          362.47053
loss_critic                        509.394592
loss_actor                        -225.225757
memory_size                       214514.8565 

=== epoch 3/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                    7
episode_length                     180.428571
returns                             -71.78646
return_std                         143.752913
average_reward                      -0.424917
round_time             0 days 00:11:03.282684
episodes_test                             8.0
episode_length_test                   127.125
returns_test                        -0.129639
return_std_test                     33.532687
average_reward_test                   0.27526
round_time_test        0 days 00:00:02.770268
round_time_total       0 days 00:11:03.283790
loss_total                         352.487293
loss_critic                        497.385828
loss_actor                        -227.106876
memory_size                       216345.6195 

=== epoch 3/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                    7
episode_length                     171.285714
returns                            -94.218593
return_std                          174.03605
average_reward                      -0.505338
round_time             0 days 00:11:08.215429
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       542.615507
return_std_test                     66.903197
average_reward_test                  0.542616
round_time_test        0 days 00:00:02.812079
round_time_total       0 days 00:11:08.216539
loss_total                         348.242221
loss_critic                        492.462092
loss_actor                        -228.637294
memory_size                        218161.169 

=== epoch 3/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:19<00:00,  2.94it/s]
episodes                                   13
episode_length                     104.846154
returns                             -54.22162
return_std                         113.097494
average_reward                      -0.500949
round_time             0 days 00:11:20.301264
episodes_test                             4.0
episode_length_test                     294.0
returns_test                        154.41899
return_std_test                    275.045556
average_reward_test                  0.612301
round_time_test        0 days 00:00:02.815013
round_time_total       0 days 00:11:20.302387
loss_total                         348.224221
loss_critic                        492.675945
loss_actor                        -229.582705
memory_size                       219944.3975 

=== epoch 3/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:36<00:00,  2.87it/s]
episodes                                    2
episode_length                          550.0
returns                           -230.990968
return_std                         241.926953
average_reward                      -0.485042
round_time             0 days 00:11:36.967305
episodes_test                             3.0
episode_length_test                360.333333
returns_test                       183.672295
return_std_test                    273.164014
average_reward_test                   0.58545
round_time_test        0 days 00:00:02.885284
round_time_total       0 days 00:11:36.968412
loss_total                         340.777613
loss_critic                        483.739113
loss_actor                         -231.06842
memory_size                       221604.7895 

=== epoch 3/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:19<00:00,  2.94it/s]
episodes                                    2
episode_length                          509.0
returns                           -195.196284
return_std                         178.026728
average_reward                      -0.458902
round_time             0 days 00:11:20.252384
episodes_test                             5.0
episode_length_test                     254.6
returns_test                        105.26823
return_std_test                    216.396678
average_reward_test                  0.547119
round_time_test        0 days 00:00:02.783336
round_time_total       0 days 00:11:20.253497
loss_total                         339.631065
loss_critic                        482.698978
loss_actor                        -232.640619
memory_size                       223513.5065 

=== epoch 3/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:24<00:00,  2.92it/s]
episodes                                   20
episode_length                           84.9
returns                            -41.111491
return_std                         100.120749
average_reward                      -0.499531
round_time             0 days 00:11:24.502228
episodes_test                             8.0
episode_length_test                   190.125
returns_test                        80.065972
return_std_test                    222.726248
average_reward_test                  0.482433
round_time_test        0 days 00:00:02.883908
round_time_total       0 days 00:11:24.503337
loss_total                          345.56431
loss_critic                        490.243256
loss_actor                        -233.151505
memory_size                        225089.614 

=== epoch 3/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:36<00:00,  2.87it/s]
episodes                                    4
episode_length                          278.5
returns                           -117.743684
return_std                          160.56173
average_reward                      -0.436562
round_time             0 days 00:11:37.162851
episodes_test                            14.0
episode_length_test                116.714286
returns_test                        46.575916
return_std_test                    157.713931
average_reward_test                  0.376264
round_time_test        0 days 00:00:02.773007
round_time_total       0 days 00:11:37.163954
loss_total                         338.134903
loss_critic                        481.447886
loss_actor                         -235.11706
memory_size                       226818.4775 

=== epoch 3/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:22<00:00,  2.93it/s]
episodes                                    5
episode_length                          236.8
returns                           -123.823492
return_std                         203.182135
average_reward                      -0.481728
round_time             0 days 00:11:23.172344
episodes_test                             8.0
episode_length_test                    177.75
returns_test                        77.044968
return_std_test                    177.739168
average_reward_test                  0.477702
round_time_test        0 days 00:00:02.801176
round_time_total       0 days 00:11:23.173452
loss_total                         347.304773
loss_critic                        493.054563
loss_actor                        -235.694421
memory_size                        228599.589 

=== epoch 3/10 ===== round 40/50 =====================================
100%|██████████| 2000/2000 [11:32<00:00,  2.89it/s]
episodes                                   12
episode_length                     130.833333
returns                            -75.637846
return_std                         123.760783
average_reward                      -0.546816
round_time             0 days 00:11:32.489341
episodes_test                             2.0
episode_length_test                     987.5
returns_test                       184.643074
return_std_test                      81.09349
average_reward_test                  0.185885
round_time_test        0 days 00:00:02.756816
round_time_total       0 days 00:11:32.490449
loss_total                         346.340935
loss_critic                        492.195489
loss_actor                        -237.077315
memory_size                       230329.6395 

=== epoch 3/10 ===== round 41/50 =====================================
100%|██████████| 2000/2000 [11:30<00:00,  2.90it/s]
episodes                                    4
episode_length                          300.0
returns                           -136.482564
return_std                         192.477091
average_reward                      -0.458241
round_time             0 days 00:11:30.493726
episodes_test                             8.0
episode_length_test                    185.75
returns_test                        91.028986
return_std_test                    229.103503
average_reward_test                  0.544999
round_time_test        0 days 00:00:02.794212
round_time_total       0 days 00:11:30.494830
loss_total                         345.308169
loss_critic                        491.330876
loss_actor                        -238.782691
memory_size                       232126.2445 

=== epoch 3/10 ===== round 42/50 =====================================
100%|██████████| 2000/2000 [11:57<00:00,  2.79it/s]
episodes                                    4
episode_length                          63.25
returns                            -40.224782
return_std                           26.97555
average_reward                      -0.499304
round_time             0 days 00:11:58.094083
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       683.273702
return_std_test                     46.829705
average_reward_test                  0.683274
round_time_test        0 days 00:00:02.887737
round_time_total       0 days 00:11:58.095195
loss_total                         336.771429
loss_critic                        481.008392
loss_actor                        -240.176453
memory_size                       233973.5265 

=== epoch 3/10 ===== round 43/50 =====================================
100%|██████████| 2000/2000 [11:39<00:00,  2.86it/s]
episodes                                   13
episode_length                      71.153846
returns                            -33.586117
return_std                          38.876217
average_reward                      -0.434362
round_time             0 days 00:11:40.451500
episodes_test                             7.0
episode_length_test                220.285714
returns_test                        109.38372
return_std_test                    241.868248
average_reward_test                  0.548085
round_time_test        0 days 00:00:02.835998
round_time_total       0 days 00:11:40.452606
loss_total                         356.736066
loss_critic                        505.991823
loss_actor                        -240.286999
memory_size                       235609.8915 

=== epoch 3/10 ===== round 44/50 =====================================
100%|██████████| 2000/2000 [11:33<00:00,  2.88it/s]
episodes                                   10
episode_length                          158.8
returns                            -83.992001
return_std                         141.499148
average_reward                      -0.503087
round_time             0 days 00:11:34.148415
episodes_test                             7.0
episode_length_test                     188.0
returns_test                       106.825377
return_std_test                    269.842149
average_reward_test                  0.647714
round_time_test        0 days 00:00:02.753808
round_time_total       0 days 00:11:34.149523
loss_total                         357.811411
loss_critic                        507.512215
loss_actor                        -240.991835
memory_size                        237313.175 

=== epoch 3/10 ===== round 45/50 =====================================
100%|██████████| 2000/2000 [11:35<00:00,  2.88it/s]
episodes                                    4
episode_length                          294.5
returns                           -144.707886
return_std                         177.247473
average_reward                      -0.475618
round_time             0 days 00:11:35.711469
episodes_test                             7.0
episode_length_test                193.428571
returns_test                       109.448371
return_std_test                    263.510644
average_reward_test                  0.641334
round_time_test        0 days 00:00:02.795877
round_time_total       0 days 00:11:35.712594
loss_total                         359.176286
loss_critic                        509.194696
loss_actor                         -240.89739
memory_size                        239071.199 

=== epoch 3/10 ===== round 46/50 =====================================
100%|██████████| 2000/2000 [11:41<00:00,  2.85it/s]
episodes                                   16
episode_length                        111.375
returns                            -64.049492
return_std                         109.406342
average_reward                      -0.598745
round_time             0 days 00:11:41.900991
episodes_test                             2.0
episode_length_test                     522.5
returns_test                       278.124521
return_std_test                    280.297114
average_reward_test                  0.608234
round_time_test        0 days 00:00:02.864033
round_time_total       0 days 00:11:41.902089
loss_total                          361.21145
loss_critic                        511.727185
loss_actor                        -240.851523
memory_size                        240792.712 

=== epoch 3/10 ===== round 47/50 =====================================
100%|██████████| 2000/2000 [11:32<00:00,  2.89it/s]
episodes                                    1
episode_length                         1000.0
returns                           -481.581699
return_std                                0.0
average_reward                      -0.504392
round_time             0 days 00:11:32.752340
episodes_test                             2.0
episode_length_test                     530.0
returns_test                        360.92694
return_std_test                    346.342182
average_reward_test                  0.666367
round_time_test        0 days 00:00:02.840108
round_time_total       0 days 00:11:32.753445
loss_total                         360.932894
loss_critic                         511.25108
loss_actor                        -240.339877
memory_size                        242508.927 

=== epoch 3/10 ===== round 48/50 =====================================
100%|██████████| 2000/2000 [11:43<00:00,  2.84it/s]
episodes                                    4
episode_length                         285.75
returns                           -146.194286
return_std                         173.927392
average_reward                      -0.492485
round_time             0 days 00:11:44.060562
episodes_test                             3.0
episode_length_test                368.666667
returns_test                       262.508705
return_std_test                    340.060642
average_reward_test                  0.750675
round_time_test        0 days 00:00:02.823650
round_time_total       0 days 00:11:44.061645
loss_total                         355.021823
loss_critic                        504.236111
loss_actor                        -241.835361
memory_size                        244410.079 

=== epoch 3/10 ===== round 49/50 =====================================
100%|██████████| 2000/2000 [11:39<00:00,  2.86it/s]
episodes                                   20
episode_length                           89.3
returns                            -36.895251
return_std                          91.172628
average_reward                      -0.394067
round_time             0 days 00:11:40.140365
episodes_test                             4.0
episode_length_test                     291.5
returns_test                       191.006784
return_std_test                     333.46898
average_reward_test                  0.698881
round_time_test        0 days 00:00:02.870455
round_time_total       0 days 00:11:40.141477
loss_total                         357.666309
loss_critic                        507.769126
loss_actor                        -242.744994
memory_size                       245908.7525 

=== epoch 3/10 ===== round 50/50 =====================================
100%|██████████| 2000/2000 [11:55<00:00,  2.79it/s]
episodes                                   10
episode_length                           52.4
returns                            -26.095106
return_std                          19.269609
average_reward                      -0.489345
round_time             0 days 00:11:56.111953
episodes_test                             4.0
episode_length_test                     309.0
returns_test                       162.068346
return_std_test                     253.29567
average_reward_test                  0.578029
round_time_test        0 days 00:00:02.815943
round_time_total       0 days 00:11:56.113070
loss_total                         358.615081
loss_critic                        509.093435
loss_actor                        -243.298371
memory_size                       247560.8675 


=== epoch 4/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                   20
episode_length                          93.55
returns                            -47.115522
return_std                          92.262218
average_reward                      -0.532166
round_time             0 days 00:10:26.339812
episodes_test                             3.0
episode_length_test                377.666667
returns_test                       200.875597
return_std_test                    247.936498
average_reward_test                  0.629365
round_time_test        0 days 00:00:02.799675
round_time_total       0 days 00:10:26.340927
loss_total                         368.367557
loss_critic                        521.127249
loss_actor                        -242.671244
memory_size                       248996.7655 

=== epoch 4/10 ===== round 2/50 ======================================
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                    7
episode_length                     175.285714
returns                            -75.951906
return_std                         122.472557
average_reward                      -0.492626
round_time             0 days 00:10:28.708479
episodes_test                             2.0
episode_length_test                     515.5
returns_test                       238.903081
return_std_test                    224.165939
average_reward_test                  0.380914
round_time_test        0 days 00:00:02.835187
round_time_total       0 days 00:10:28.709582
loss_total                          381.96679
loss_critic                         538.29085
loss_actor                        -243.329488
memory_size                       250675.8765 

=== epoch 4/10 ===== round 3/50 ======================================
100%|██████████| 2000/2000 [10:29<00:00,  3.17it/s]
episodes                                   16
episode_length                       105.5625
returns                            -38.486004
return_std                          88.411681
average_reward                      -0.393274
round_time             0 days 00:10:30.406781
episodes_test                             9.0
episode_length_test                193.444444
returns_test                        90.066803
return_std_test                    247.533331
average_reward_test                  0.436783
round_time_test        0 days 00:00:02.747675
round_time_total       0 days 00:10:30.407902
loss_total                            372.205
loss_critic                        526.194599
loss_actor                        -243.753432
memory_size                         252460.09 

=== epoch 4/10 ===== round 4/50 ======================================
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                   14
episode_length                     102.214286
returns                            -49.608579
return_std                          99.089064
average_reward                      -0.502771
round_time             0 days 00:10:33.472440
episodes_test                             9.0
episode_length_test                149.444444
returns_test                        90.257159
return_std_test                    235.118611
average_reward_test                  0.413564
round_time_test        0 days 00:00:02.807267
round_time_total       0 days 00:10:33.473538
loss_total                         381.641284
loss_critic                        538.307112
loss_actor                        -245.022066
memory_size                       253887.5195 

=== epoch 4/10 ===== round 5/50 ======================================
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                    7
episode_length                     167.142857
returns                            -80.030525
return_std                         138.680789
average_reward                      -0.496141
round_time             0 days 00:10:40.327877
episodes_test                             5.0
episode_length_test                     269.2
returns_test                       146.035946
return_std_test                    317.921683
average_reward_test                  0.476188
round_time_test        0 days 00:00:02.796706
round_time_total       0 days 00:10:40.329003
loss_total                         375.317878
loss_critic                        530.520921
loss_actor                        -245.494327
memory_size                       255642.8955 

=== epoch 4/10 ===== round 6/50 ======================================
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                    6
episode_length                      75.833333
returns                            -41.573642
return_std                          47.214273
average_reward                      -0.485252
round_time             0 days 00:10:38.617733
episodes_test                             7.0
episode_length_test                214.428571
returns_test                       130.377086
return_std_test                    271.834653
average_reward_test                  0.563978
round_time_test        0 days 00:00:02.763051
round_time_total       0 days 00:10:38.618837
loss_total                         382.379198
loss_critic                        539.377615
loss_actor                        -245.614506
memory_size                        257451.597 

=== epoch 4/10 ===== round 7/50 ======================================
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                    2
episode_length                          543.0
returns                           -255.783227
return_std                         150.412927
average_reward                      -0.450726
round_time             0 days 00:10:35.443450
episodes_test                             4.0
episode_length_test                     279.5
returns_test                       112.493528
return_std_test                    205.779817
average_reward_test                  0.538606
round_time_test        0 days 00:00:02.851059
round_time_total       0 days 00:10:35.444542
loss_total                         377.678246
loss_critic                        533.618766
loss_actor                        -246.083869
memory_size                       259301.9065 

=== epoch 4/10 ===== round 8/50 ======================================
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                    4
episode_length                         278.75
returns                           -115.898502
return_std                         172.115182
average_reward                      -0.427074
round_time             0 days 00:10:34.499825
episodes_test                             3.0
episode_length_test                426.333333
returns_test                        41.463178
return_std_test                     35.703735
average_reward_test                  0.265601
round_time_test        0 days 00:00:02.836964
round_time_total       0 days 00:10:34.500935
loss_total                         372.879116
loss_critic                        527.852999
loss_actor                         -247.01645
memory_size                       261155.7705 

=== epoch 4/10 ===== round 9/50 ======================================
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                    4
episode_length                          282.5
returns                            -157.42963
return_std                         227.244142
average_reward                      -0.505182
round_time             0 days 00:10:33.870001
episodes_test                             7.0
episode_length_test                     188.0
returns_test                        119.22279
return_std_test                    256.453429
average_reward_test                  0.680048
round_time_test        0 days 00:00:02.846301
round_time_total       0 days 00:10:33.871111
loss_total                         378.503104
loss_critic                        535.290981
loss_actor                        -248.648436
memory_size                       263030.8445 

=== epoch 4/10 ===== round 10/50 =====================================
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                   12
episode_length                         131.25
returns                            -78.469377
return_std                         123.254606
average_reward                      -0.577005
round_time             0 days 00:10:40.985355
episodes_test                             2.0
episode_length_test                     510.0
returns_test                       197.150963
return_std_test                    186.187481
average_reward_test                  0.578249
round_time_test        0 days 00:00:02.805661
round_time_total       0 days 00:10:40.986457
loss_total                         382.152955
loss_critic                        540.215007
loss_actor                        -250.095286
memory_size                        264762.504 

=== epoch 4/10 ===== round 11/50 =====================================
100%|██████████| 2000/2000 [10:37<00:00,  3.14it/s]
episodes                                   12
episode_length                     114.666667
returns                            -65.196572
return_std                         131.329696
average_reward                      -0.520225
round_time             0 days 00:10:37.580791
episodes_test                             3.0
episode_length_test                372.333333
returns_test                       208.535395
return_std_test                     296.77397
average_reward_test                  0.655857
round_time_test        0 days 00:00:02.857185
round_time_total       0 days 00:10:37.581900
loss_total                         389.234034
loss_critic                         549.07809
loss_actor                        -250.142226
memory_size                       266312.4955 

=== epoch 4/10 ===== round 12/50 =====================================
100%|██████████| 2000/2000 [10:36<00:00,  3.14it/s]
episodes                                   12
episode_length                          56.75
returns                             -37.13892
return_std                          38.481041
average_reward                      -0.517233
round_time             0 days 00:10:37.009006
episodes_test                             3.0
episode_length_test                     361.0
returns_test                       265.920542
return_std_test                    359.260183
average_reward_test                  0.538207
round_time_test        0 days 00:00:02.763874
round_time_total       0 days 00:10:37.010100
loss_total                         382.504054
loss_critic                        540.907761
loss_actor                        -251.110804
memory_size                       267989.8345 

=== epoch 4/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                   27
episode_length                      55.259259
returns                            -32.800415
return_std                          33.960163
average_reward                      -0.539008
round_time             0 days 00:10:40.529383
episodes_test                             8.0
episode_length_test                    159.25
returns_test                        93.499086
return_std_test                     275.96838
average_reward_test                  0.621243
round_time_test        0 days 00:00:02.862401
round_time_total       0 days 00:10:40.530492
loss_total                         396.659522
loss_critic                         558.43645
loss_actor                        -250.448226
memory_size                        269399.522 

=== epoch 4/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                    7
episode_length                     192.285714
returns                            -91.188424
return_std                         159.233028
average_reward                      -0.439397
round_time             0 days 00:10:35.334650
episodes_test                             4.0
episode_length_test                    302.75
returns_test                        185.21232
return_std_test                    344.191181
average_reward_test                  0.657237
round_time_test        0 days 00:00:02.841705
round_time_total       0 days 00:10:35.335758
loss_total                         401.522841
loss_critic                        564.243518
loss_actor                        -249.359909
memory_size                        270970.451 

=== epoch 4/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:37<00:00,  3.14it/s]
episodes                                   11
episode_length                      70.363636
returns                            -51.262232
return_std                          44.239796
average_reward                      -0.596812
round_time             0 days 00:10:38.329901
episodes_test                             4.0
episode_length_test                     294.0
returns_test                       183.477497
return_std_test                    286.864713
average_reward_test                  0.699149
round_time_test        0 days 00:00:02.808036
round_time_total       0 days 00:10:38.331008
loss_total                         403.364885
loss_critic                        566.614665
loss_actor                        -249.634273
memory_size                       272704.4855 

=== epoch 4/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                   21
episode_length                      48.952381
returns                            -22.773572
return_std                           31.00852
average_reward                      -0.490705
round_time             0 days 00:10:35.053030
episodes_test                             5.0
episode_length_test                     298.4
returns_test                        94.259204
return_std_test                     147.00724
average_reward_test                  0.319097
round_time_test        0 days 00:00:02.783360
round_time_total       0 days 00:10:35.054143
loss_total                         409.576933
loss_critic                        574.425798
loss_actor                        -249.818566
memory_size                       274192.7235 

=== epoch 4/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:37<00:00,  3.14it/s]
episodes                                    6
episode_length                     199.166667
returns                            -94.588498
return_std                          153.97359
average_reward                      -0.481578
round_time             0 days 00:10:37.521499
episodes_test                             5.0
episode_length_test                     240.4
returns_test                       146.008751
return_std_test                     254.59043
average_reward_test                  0.633172
round_time_test        0 days 00:00:02.903799
round_time_total       0 days 00:10:37.522602
loss_total                         414.732281
loss_critic                        580.691512
loss_actor                        -249.104681
memory_size                       275744.5215 

=== epoch 4/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                    9
episode_length                     145.888889
returns                            -70.388221
return_std                         136.792578
average_reward                      -0.492827
round_time             0 days 00:10:40.328067
episodes_test                             6.0
episode_length_test                     233.0
returns_test                        46.932052
return_std_test                     110.62533
average_reward_test                  0.334775
round_time_test        0 days 00:00:02.783142
round_time_total       0 days 00:10:40.329187
loss_total                          413.74787
loss_critic                        579.550893
loss_actor                        -249.464263
memory_size                        277336.697 

=== epoch 4/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:35<00:00,  3.15it/s]
episodes                                   15
episode_length                     108.333333
returns                            -48.411952
return_std                         113.800327
average_reward                      -0.432178
round_time             0 days 00:10:36.120654
episodes_test                             2.0
episode_length_test                     555.0
returns_test                       388.624291
return_std_test                    415.292497
average_reward_test                  0.705007
round_time_test        0 days 00:00:02.795048
round_time_total       0 days 00:10:36.121765
loss_total                         409.995562
loss_critic                        575.029619
loss_actor                        -250.140703
memory_size                       279161.7045 

=== epoch 4/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                    3
episode_length                           68.0
returns                             -3.684732
return_std                           18.84874
average_reward                      -0.400777
round_time             0 days 00:10:40.196464
episodes_test                             2.0
episode_length_test                     510.0
returns_test                       306.261997
return_std_test                    293.834646
average_reward_test                  0.658576
round_time_test        0 days 00:00:02.825224
round_time_total       0 days 00:10:40.197587
loss_total                          418.84311
loss_critic                        586.227926
loss_actor                        -250.696197
memory_size                        280772.989 

=== epoch 4/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:41<00:00,  3.12it/s]
episodes                                    7
episode_length                     176.571429
returns                            -72.925071
return_std                         135.501033
average_reward                      -0.433346
round_time             0 days 00:10:41.825012
episodes_test                             2.0
episode_length_test                     542.0
returns_test                       397.541935
return_std_test                    385.819803
average_reward_test                  0.647635
round_time_test        0 days 00:00:02.834672
round_time_total       0 days 00:10:41.826133
loss_total                         417.539255
loss_critic                        584.731687
loss_actor                        -251.230514
memory_size                       282568.2505 

=== epoch 4/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:50<00:00,  3.07it/s]
episodes                                   16
episode_length                        58.4375
returns                            -53.511879
return_std                          78.864611
average_reward                      -0.660884
round_time             0 days 00:10:51.089690
episodes_test                             8.0
episode_length_test                     162.0
returns_test                        78.639564
return_std_test                    199.757118
average_reward_test                  0.491094
round_time_test        0 days 00:00:02.796411
round_time_total       0 days 00:10:51.090800
loss_total                         429.069881
loss_critic                        599.295311
loss_actor                        -251.831882
memory_size                       284214.6345 

=== epoch 4/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:43<00:00,  3.11it/s]
episodes                                    6
episode_length                     197.333333
returns                           -101.598265
return_std                         162.112239
average_reward                      -0.469538
round_time             0 days 00:10:44.286299
episodes_test                             2.0
episode_length_test                     514.0
returns_test                       351.574387
return_std_test                    328.243656
average_reward_test                  0.560889
round_time_test        0 days 00:00:02.802840
round_time_total       0 days 00:10:44.287435
loss_total                          429.43793
loss_critic                        599.643524
loss_actor                        -251.384483
memory_size                        285936.128 

=== epoch 4/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:46<00:00,  3.09it/s]
episodes                                    8
episode_length                         163.75
returns                             -84.28226
return_std                         139.045264
average_reward                      -0.494246
round_time             0 days 00:10:46.882600
episodes_test                             7.0
episode_length_test                192.285714
returns_test                        85.091016
return_std_test                    229.006935
average_reward_test                  0.489805
round_time_test        0 days 00:00:02.819487
round_time_total       0 days 00:10:46.883713
loss_total                         423.633379
loss_critic                        592.271997
loss_actor                        -250.921141
memory_size                        287689.774 

=== epoch 4/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:59<00:00,  3.03it/s]
episodes                                    4
episode_length                          288.0
returns                           -143.907355
return_std                           182.5377
average_reward                      -0.472637
round_time             0 days 00:10:59.678190
episodes_test                             3.0
episode_length_test                348.666667
returns_test                       249.538765
return_std_test                    343.875907
average_reward_test                  0.706983
round_time_test        0 days 00:00:02.828493
round_time_total       0 days 00:10:59.679295
loss_total                         423.719494
loss_critic                        592.512227
loss_actor                        -251.451477
memory_size                         289517.92 

=== epoch 4/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:57<00:00,  3.04it/s]
episodes                                   11
episode_length                     134.272727
returns                            -69.261675
return_std                         116.065372
average_reward                      -0.527552
round_time             0 days 00:10:57.663476
episodes_test                             4.0
episode_length_test                    334.25
returns_test                       126.834675
return_std_test                    218.214836
average_reward_test                   0.47887
round_time_test        0 days 00:00:02.776583
round_time_total       0 days 00:10:57.664586
loss_total                         430.767047
loss_critic                         601.37721
loss_actor                        -251.673642
memory_size                       291154.4055 

=== epoch 4/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:03<00:00,  3.02it/s]
episodes                                   14
episode_length                           52.0
returns                            -39.023626
return_std                          40.246265
average_reward                      -0.556928
round_time             0 days 00:11:03.798821
episodes_test                             7.0
episode_length_test                     209.0
returns_test                        50.412036
return_std_test                    126.230744
average_reward_test                    0.3441
round_time_test        0 days 00:00:02.780337
round_time_total       0 days 00:11:03.799922
loss_total                         438.437989
loss_critic                        611.004873
loss_actor                        -251.829585
memory_size                        292784.282 

=== epoch 4/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:01<00:00,  3.03it/s]
episodes                                    5
episode_length                          229.4
returns                           -113.988801
return_std                         183.134311
average_reward                      -0.481833
round_time             0 days 00:11:01.599549
episodes_test                            13.0
episode_length_test                110.846154
returns_test                        50.075767
return_std_test                     170.50682
average_reward_test                  0.500492
round_time_test        0 days 00:00:02.838598
round_time_total       0 days 00:11:01.600656
loss_total                         431.731975
loss_critic                        602.692651
loss_actor                        -252.110766
memory_size                        294490.214 

=== epoch 4/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                    6
episode_length                     245.333333
returns                           -101.921514
return_std                         150.422447
average_reward                      -0.425063
round_time             0 days 00:11:00.601118
episodes_test                             2.0
episode_length_test                     517.5
returns_test                       200.808276
return_std_test                    194.635383
average_reward_test                   0.57383
round_time_test        0 days 00:00:02.789248
round_time_total       0 days 00:11:00.602226
loss_total                         429.865861
loss_critic                        600.548846
loss_actor                        -252.866115
memory_size                       296304.9585 

=== epoch 4/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                    5
episode_length                           50.0
returns                            -27.315495
return_std                          20.369627
average_reward                      -0.481862
round_time             0 days 00:11:04.728923
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       683.144185
return_std_test                      26.37553
average_reward_test                  0.683144
round_time_test        0 days 00:00:02.866737
round_time_total       0 days 00:11:04.730036
loss_total                         445.805075
loss_critic                        620.467361
loss_actor                        -252.844111
memory_size                        298098.404 

=== epoch 4/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
episodes                                   23
episode_length                      80.173913
returns                            -40.956104
return_std                           93.36483
average_reward                      -0.500929
round_time             0 days 00:11:06.155803
episodes_test                             3.0
episode_length_test                     356.0
returns_test                       254.569651
return_std_test                    358.215875
average_reward_test                  0.743939
round_time_test        0 days 00:00:02.795197
round_time_total       0 days 00:11:06.156909
loss_total                         453.049347
loss_critic                        629.391347
loss_actor                         -252.31869
memory_size                       299657.1615 

=== epoch 4/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                    3
episode_length                      35.666667
returns                            -23.326283
return_std                           5.856547
average_reward                       -0.40921
round_time             0 days 00:11:10.963590
episodes_test                             4.0
episode_length_test                     287.0
returns_test                       177.157185
return_std_test                    320.170074
average_reward_test                  0.635627
round_time_test        0 days 00:00:02.820681
round_time_total       0 days 00:11:10.964696
loss_total                         448.112467
loss_critic                        623.237746
loss_actor                         -252.38869
memory_size                       301301.8525 

=== epoch 4/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                   11
episode_length                     138.545455
returns                            -63.743075
return_std                          123.91262
average_reward                      -0.435778
round_time             0 days 00:11:02.927887
episodes_test                             2.0
episode_length_test                     505.5
returns_test                       358.201433
return_std_test                    366.440002
average_reward_test                  0.718134
round_time_test        0 days 00:00:02.812150
round_time_total       0 days 00:11:02.928984
loss_total                         448.823299
loss_critic                        624.487358
loss_actor                        -253.832975
memory_size                        303079.641 

=== epoch 4/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                    6
episode_length                          188.0
returns                            -81.094222
return_std                         151.693096
average_reward                      -0.411535
round_time             0 days 00:11:07.243066
episodes_test                             2.0
episode_length_test                     509.0
returns_test                        367.81241
return_std_test                    364.400307
average_reward_test                   0.40124
round_time_test        0 days 00:00:02.831854
round_time_total       0 days 00:11:07.244168
loss_total                         444.565626
loss_critic                        619.362727
loss_actor                         -254.62282
memory_size                       304817.5445 

=== epoch 4/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [32:18<00:00,  1.03it/s]
episodes                                   11
episode_length                      50.818182
returns                            -21.869452
return_std                          28.335318
average_reward                      -0.477912
round_time             0 days 00:32:18.732851
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       742.510657
return_std_test                     22.429089
average_reward_test                  0.742511
round_time_test        0 days 00:00:02.814508
round_time_total       0 days 00:32:18.733958
loss_total                         437.115308
loss_critic                        610.289274
loss_actor                        -255.580594
memory_size                       306516.3205 

=== epoch 4/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:45<00:00,  2.83it/s]
episodes                                   11
episode_length                     142.545455
returns                            -66.776368
return_std                          124.29014
average_reward                      -0.464824
round_time             0 days 00:11:46.122471
episodes_test                            10.0
episode_length_test                     187.9
returns_test                        67.858015
return_std_test                    205.463763
average_reward_test                  0.326836
round_time_test        0 days 00:00:02.803528
round_time_total       0 days 00:11:46.123592
loss_total                         448.096077
loss_critic                        624.040866
loss_actor                        -255.683122
memory_size                       308133.8475 

=== epoch 4/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:23<00:00,  2.93it/s]
episodes                                    5
episode_length                          224.2
returns                            -98.464351
return_std                         150.291812
average_reward                      -0.445887
round_time             0 days 00:11:23.685114
episodes_test                            13.0
episode_length_test                136.923077
returns_test                        -5.568597
return_std_test                     33.300324
average_reward_test                  0.041355
round_time_test        0 days 00:00:02.699131
round_time_total       0 days 00:11:23.686226
loss_total                          443.71153
loss_critic                        618.709898
loss_actor                        -256.281983
memory_size                       309898.2195 

=== epoch 4/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:20<00:00,  2.94it/s]
episodes                                   19
episode_length                           61.0
returns                            -44.146723
return_std                          48.747552
average_reward                      -0.623154
round_time             0 days 00:11:21.108379
episodes_test                             8.0
episode_length_test                     194.5
returns_test                        75.261293
return_std_test                    216.767406
average_reward_test                  0.329133
round_time_test        0 days 00:00:02.780561
round_time_total       0 days 00:11:21.109478
loss_total                          463.83074
loss_critic                        643.768289
loss_actor                        -255.919502
memory_size                        311397.299 

=== epoch 4/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:15<00:00,  2.96it/s]
episodes                                    8
episode_length                          181.0
returns                           -113.875523
return_std                         161.294352
average_reward                      -0.567712
round_time             0 days 00:11:15.508456
episodes_test                             7.0
episode_length_test                217.428571
returns_test                       103.757663
return_std_test                    255.493512
average_reward_test                  0.539661
round_time_test        0 days 00:00:02.882812
round_time_total       0 days 00:11:15.509568
loss_total                         473.561865
loss_critic                         655.74589
loss_actor                        -255.174284
memory_size                        313031.204 

=== epoch 4/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:28<00:00,  2.91it/s]
episodes                                    5
episode_length                          229.0
returns                           -105.824887
return_std                         170.897397
average_reward                      -0.480909
round_time             0 days 00:11:28.896329
episodes_test                             9.0
episode_length_test                215.666667
returns_test                        81.967455
return_std_test                    230.207035
average_reward_test                  0.368794
round_time_test        0 days 00:00:02.796251
round_time_total       0 days 00:11:28.897445
loss_total                         476.427185
loss_critic                         659.18349
loss_actor                        -254.598073
memory_size                        314861.213 

=== epoch 4/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:17<00:00,  2.95it/s]
episodes                                    7
episode_length                     169.428571
returns                            -85.816183
return_std                         169.398526
average_reward                      -0.482432
round_time             0 days 00:11:17.810718
episodes_test                             2.0
episode_length_test                     665.0
returns_test                       352.571307
return_std_test                    212.249422
average_reward_test                   0.52333
round_time_test        0 days 00:00:02.766840
round_time_total       0 days 00:11:17.811818
loss_total                         467.588185
loss_critic                        648.195761
loss_actor                        -254.842165
memory_size                        316694.808 

=== epoch 4/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:29<00:00,  2.90it/s]
episodes                                   11
episode_length                          141.0
returns                            -88.681366
return_std                         128.754732
average_reward                      -0.597805
round_time             0 days 00:11:29.702926
episodes_test                             2.0
episode_length_test                     948.5
returns_test                       378.150554
return_std_test                    365.146897
average_reward_test                  0.372212
round_time_test        0 days 00:00:02.777183
round_time_total       0 days 00:11:29.704036
loss_total                         475.997733
loss_critic                        658.928759
loss_actor                        -255.726413
memory_size                       318308.8255 

=== epoch 4/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:21<00:00,  2.93it/s]
episodes                                    9
episode_length                     150.888889
returns                            -58.161457
return_std                         117.171488
average_reward                      -0.405143
round_time             0 days 00:11:22.376405
episodes_test                             7.0
episode_length_test                     190.0
returns_test                       121.853647
return_std_test                    287.124754
average_reward_test                  0.647442
round_time_test        0 days 00:00:02.822139
round_time_total       0 days 00:11:22.377516
loss_total                         469.756347
loss_critic                        650.936106
loss_actor                         -254.96273
memory_size                       320129.0415 

=== epoch 4/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:39<00:00,  2.86it/s]
episodes                                   12
episode_length                          161.0
returns                            -86.514984
return_std                         121.767768
average_reward                      -0.543732
round_time             0 days 00:11:40.144786
episodes_test                             2.0
episode_length_test                     543.5
returns_test                       373.147383
return_std_test                    392.019602
average_reward_test                  0.693264
round_time_test        0 days 00:00:02.832262
round_time_total       0 days 00:11:40.145891
loss_total                         488.692457
loss_critic                        674.676874
loss_actor                        -255.245255
memory_size                       321680.8305 

=== epoch 4/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:23<00:00,  2.92it/s]
episodes                                   24
episode_length                      78.833333
returns                            -40.542155
return_std                          84.170439
average_reward                      -0.520784
round_time             0 days 00:11:24.400537
episodes_test                             5.0
episode_length_test                     300.8
returns_test                       121.161831
return_std_test                    292.351659
average_reward_test                  0.464007
round_time_test        0 days 00:00:02.779814
round_time_total       0 days 00:11:24.401665
loss_total                         494.218554
loss_critic                        681.357077
loss_actor                        -254.335588
memory_size                         323197.26 

=== epoch 4/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:25<00:00,  2.92it/s]
episodes                                    8
episode_length                        166.875
returns                            -79.553833
return_std                         136.501913
average_reward                      -0.523344
round_time             0 days 00:11:26.327750
episodes_test                             7.0
episode_length_test                233.428571
returns_test                       117.375185
return_std_test                    261.616153
average_reward_test                  0.432571
round_time_test        0 days 00:00:02.770687
round_time_total       0 days 00:11:26.328857
loss_total                         509.749003
loss_critic                        700.756717
loss_actor                        -254.281905
memory_size                        324763.611 

=== epoch 4/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:28<00:00,  2.91it/s]
episodes                                    7
episode_length                          207.0
returns                            -99.881282
return_std                         157.078139
average_reward                      -0.462474
round_time             0 days 00:11:28.669675
episodes_test                             2.0
episode_length_test                     535.5
returns_test                        355.41853
return_std_test                    366.831742
average_reward_test                  0.621177
round_time_test        0 days 00:00:02.769620
round_time_total       0 days 00:11:28.670791
loss_total                         520.820097
loss_critic                        714.610755
loss_actor                        -254.342579
memory_size                       326528.5965 

=== epoch 4/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:34<00:00,  2.88it/s]
episodes                                   16
episode_length                        55.1875
returns                            -34.499732
return_std                          32.772563
average_reward                      -0.497422
round_time             0 days 00:11:35.361224
episodes_test                             3.0
episode_length_test                     358.0
returns_test                       245.468525
return_std_test                    336.110802
average_reward_test                  0.702643
round_time_test        0 days 00:00:02.806317
round_time_total       0 days 00:11:35.362329
loss_total                         517.235163
loss_critic                        710.047582
loss_actor                        -254.014563
memory_size                       328174.5075 

=== epoch 4/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:38<00:00,  2.86it/s]
episodes                                    2
episode_length                          520.5
returns                           -244.786654
return_std                         219.945431
average_reward                      -0.464764
round_time             0 days 00:11:39.142393
episodes_test                             5.0
episode_length_test                     235.4
returns_test                       155.006514
return_std_test                    291.851479
average_reward_test                  0.709808
round_time_test        0 days 00:00:02.813964
round_time_total       0 days 00:11:39.143520
loss_total                          525.81383
loss_critic                        720.731102
loss_actor                         -253.85531
memory_size                        329804.393 

=== epoch 4/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:38<00:00,  2.86it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
episodes                                   15
episode_length                           43.8
returns                            -25.420894
return_std                          18.943116
average_reward                      -0.506557
round_time             0 days 00:11:38.549380
episodes_test                             8.0
episode_length_test                   166.125
returns_test                        91.930358
return_std_test                    226.575814
average_reward_test                  0.573471
round_time_test        0 days 00:00:02.805875
round_time_total       0 days 00:11:38.550484
loss_total                         520.434299
loss_critic                         714.17323
loss_actor                        -254.521472
memory_size                        331537.874 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 5/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                    4
episode_length                          278.5
returns                            -133.41403
return_std                         186.463489
average_reward                      -0.463442
round_time             0 days 00:10:26.549726
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       705.158729
return_std_test                     44.609701
average_reward_test                  0.705159
round_time_test        0 days 00:00:02.816460
round_time_total       0 days 00:10:26.550841
loss_total                         519.799073
loss_critic                        713.383514
loss_actor                        -254.538738
memory_size                       333143.7365 

=== epoch 5/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                    8
episode_length                        179.875
returns                            -92.655129
return_std                         156.097004
average_reward                      -0.476203
round_time             0 days 00:10:31.651318
episodes_test                             2.0
episode_length_test                     507.5
returns_test                       395.846973
return_std_test                    402.653513
average_reward_test                   0.75807
round_time_test        0 days 00:00:02.845554
round_time_total       0 days 00:10:31.652406
loss_total                         522.712218
loss_critic                        717.055899
loss_actor                        -254.662555
memory_size                        334913.043 

=== epoch 5/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:35<00:00,  3.15it/s]
episodes                                    3
episode_length                     382.666667
returns                           -196.490706
return_std                         178.566619
average_reward                      -0.502493
round_time             0 days 00:10:35.661574
episodes_test                             2.0
episode_length_test                     508.5
returns_test                       391.086867
return_std_test                    384.633555
average_reward_test                  0.766073
round_time_test        0 days 00:00:02.847643
round_time_total       0 days 00:10:35.662665
loss_total                         518.337893
loss_critic                        711.952171
loss_actor                        -256.119268
memory_size                       336795.8355 

=== epoch 5/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                    2
episode_length                          527.5
returns                           -227.928027
return_std                         196.458491
average_reward                      -0.476969
round_time             0 days 00:10:34.583020
episodes_test                             8.0
episode_length_test                   147.625
returns_test                        90.644234
return_std_test                    246.240139
average_reward_test                  0.634568
round_time_test        0 days 00:00:02.815501
round_time_total       0 days 00:10:34.584112
loss_total                         516.291378
loss_critic                        709.363831
loss_actor                        -255.998481
memory_size                       338678.5845 

=== epoch 5/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                    9
episode_length                      54.222222
returns                            -35.458872
return_std                          51.489693
average_reward                      -0.470227
round_time             0 days 00:10:35.204957
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       728.936812
return_std_test                     20.148836
average_reward_test                  0.728937
round_time_test        0 days 00:00:02.876157
round_time_total       0 days 00:10:35.206037
loss_total                         515.493282
loss_critic                        708.493707
loss_actor                        -256.508464
memory_size                       340476.4125 

=== epoch 5/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:36<00:00,  3.14it/s]
episodes                                    2
episode_length                          522.5
returns                            -235.62796
return_std                         188.989311
average_reward                      -0.431756
round_time             0 days 00:10:36.523100
episodes_test                             5.0
episode_length_test                     249.0
returns_test                       151.040644
return_std_test                    301.199977
average_reward_test                  0.335891
round_time_test        0 days 00:00:02.810762
round_time_total       0 days 00:10:36.524208
loss_total                         511.997205
loss_critic                        704.169527
loss_actor                        -256.692133
memory_size                        342280.906 

=== epoch 5/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                    4
episode_length                          284.5
returns                           -131.245193
return_std                         206.558337
average_reward                      -0.465026
round_time             0 days 00:10:32.263665
episodes_test                             6.0
episode_length_test                207.166667
returns_test                        142.71986
return_std_test                    285.117126
average_reward_test                  0.719853
round_time_test        0 days 00:00:02.799077
round_time_total       0 days 00:10:32.264767
loss_total                         520.814986
loss_critic                        715.279204
loss_actor                        -257.041936
memory_size                       344163.1705 

=== epoch 5/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:35<00:00,  3.15it/s]
episodes                                    2
episode_length                          549.5
returns                           -288.849278
return_std                         164.174293
average_reward                      -0.498176
round_time             0 days 00:10:35.941785
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       689.148506
return_std_test                      5.902063
average_reward_test                  0.689149
round_time_test        0 days 00:00:02.884138
round_time_total       0 days 00:10:35.942878
loss_total                         514.358015
loss_critic                        707.270879
loss_actor                        -257.293488
memory_size                       346008.2215 

=== epoch 5/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:35<00:00,  3.15it/s]
episodes                                    6
episode_length                      87.833333
returns                            -54.670935
return_std                          44.371817
average_reward                       -0.45907
round_time             0 days 00:10:35.861095
episodes_test                             6.0
episode_length_test                     210.0
returns_test                       105.684928
return_std_test                      251.9106
average_reward_test                  0.583842
round_time_test        0 days 00:00:02.809008
round_time_total       0 days 00:10:35.862204
loss_total                         518.433286
loss_critic                        712.704362
loss_actor                        -258.651069
memory_size                        347826.928 

=== epoch 5/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:36<00:00,  3.14it/s]
episodes                                    6
episode_length                     205.666667
returns                           -120.168484
return_std                         181.694727
average_reward                      -0.525193
round_time             0 days 00:10:37.026995
episodes_test                             2.0
episode_length_test                     507.5
returns_test                       382.129295
return_std_test                     380.93085
average_reward_test                  0.745497
round_time_test        0 days 00:00:02.832164
round_time_total       0 days 00:10:37.028091
loss_total                         519.479971
loss_critic                        714.035942
loss_actor                        -258.743964
memory_size                       349601.1715 

=== epoch 5/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                    6
episode_length                          191.5
returns                            -94.500265
return_std                         166.892815
average_reward                      -0.461042
round_time             0 days 00:10:35.060869
episodes_test                             4.0
episode_length_test                     291.0
returns_test                       183.199639
return_std_test                    310.260162
average_reward_test                  0.700932
round_time_test        0 days 00:00:02.818300
round_time_total       0 days 00:10:35.061967
loss_total                         520.474743
loss_critic                        715.312744
loss_actor                        -258.877308
memory_size                       351452.1685 

=== epoch 5/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                   14
episode_length                      44.928571
returns                            -21.922246
return_std                          21.245007
average_reward                      -0.484048
round_time             0 days 00:10:39.302474
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       787.881705
return_std_test                      7.537863
average_reward_test                  0.787882
round_time_test        0 days 00:00:02.813449
round_time_total       0 days 00:10:39.303589
loss_total                         517.564717
loss_critic                        712.074635
loss_actor                        -260.475005
memory_size                        353080.419 

=== epoch 5/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                   15
episode_length                          100.4
returns                            -56.609694
return_std                         113.498093
average_reward                      -0.547272
round_time             0 days 00:10:41.011134
episodes_test                             2.0
episode_length_test                     506.0
returns_test                       381.339912
return_std_test                     369.67127
average_reward_test                   0.74729
round_time_test        0 days 00:00:02.823914
round_time_total       0 days 00:10:41.012246
loss_total                         526.755017
loss_critic                        723.436129
loss_actor                        -259.969478
memory_size                       354665.0535 

=== epoch 5/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:46<00:00,  3.09it/s]
episodes                                   13
episode_length                      50.769231
returns                            -39.773885
return_std                          51.823684
average_reward                      -0.559933
round_time             0 days 00:10:46.808505
episodes_test                             3.0
episode_length_test                365.666667
returns_test                       241.604044
return_std_test                    349.682448
average_reward_test                  0.625546
round_time_test        0 days 00:00:02.839601
round_time_total       0 days 00:10:46.809605
loss_total                         542.493547
loss_critic                        743.254819
loss_actor                        -260.551592
memory_size                       356278.5485 

=== epoch 5/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:43<00:00,  3.11it/s]
episodes                                   18
episode_length                      93.388889
returns                            -45.935654
return_std                         109.805512
average_reward                      -0.484884
round_time             0 days 00:10:43.617215
episodes_test                             4.0
episode_length_test                    295.25
returns_test                       166.039476
return_std_test                    249.328786
average_reward_test                  0.655262
round_time_test        0 days 00:00:02.737451
round_time_total       0 days 00:10:43.618323
loss_total                         533.013988
loss_critic                        731.618842
loss_actor                        -261.405478
memory_size                        357823.854 

=== epoch 5/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:55<00:00,  3.05it/s]
episodes                                    2
episode_length                           48.0
returns                            -11.562488
return_std                           6.778377
average_reward                      -0.451056
round_time             0 days 00:10:56.206466
episodes_test                             3.0
episode_length_test                353.333333
returns_test                       260.681405
return_std_test                     360.83127
average_reward_test                  0.754105
round_time_test        0 days 00:00:02.796344
round_time_total       0 days 00:10:56.207582
loss_total                         522.625503
loss_critic                        718.954428
loss_actor                        -262.690247
memory_size                         359516.71 

=== epoch 5/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:52<00:00,  3.06it/s]
episodes                                    6
episode_length                          209.0
returns                            -95.090068
return_std                         168.233575
average_reward                      -0.468608
round_time             0 days 00:10:53.050393
episodes_test                             3.0
episode_length_test                374.666667
returns_test                       253.099557
return_std_test                    317.129784
average_reward_test                   0.71666
round_time_test        0 days 00:00:02.766474
round_time_total       0 days 00:10:53.051494
loss_total                          526.44852
loss_critic                        723.539015
loss_actor                        -261.913507
memory_size                       361350.7965 

=== epoch 5/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:52<00:00,  3.06it/s]
episodes                                    2
episode_length                          517.0
returns                           -260.746271
return_std                         237.922858
average_reward                      -0.454112
round_time             0 days 00:10:53.190855
episodes_test                             3.0
episode_length_test                407.333333
returns_test                       269.350966
return_std_test                    341.725391
average_reward_test                  0.423176
round_time_test        0 days 00:00:02.784125
round_time_total       0 days 00:10:53.191971
loss_total                          532.94523
loss_critic                        731.643793
loss_actor                        -261.849078
memory_size                       363199.7115 

=== epoch 5/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:57<00:00,  3.04it/s]
episodes                                    6
episode_length                          228.5
returns                            -98.518463
return_std                         152.152448
average_reward                       -0.46815
round_time             0 days 00:10:57.600157
episodes_test                             5.0
episode_length_test                     240.2
returns_test                       150.614066
return_std_test                    303.946629
average_reward_test                  0.698985
round_time_test        0 days 00:00:02.845939
round_time_total       0 days 00:10:57.601260
loss_total                         529.722204
loss_critic                        727.897455
loss_actor                         -262.97885
memory_size                       365085.6605 

=== epoch 5/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                    5
episode_length                          230.0
returns                            -92.904055
return_std                         158.417468
average_reward                      -0.421117
round_time             0 days 00:11:02.721640
episodes_test                             2.0
episode_length_test                     516.5
returns_test                       347.573338
return_std_test                    365.380373
average_reward_test                  0.736976
round_time_test        0 days 00:00:02.793039
round_time_total       0 days 00:11:02.722738
loss_total                         533.383936
loss_critic                        732.336996
loss_actor                        -262.428356
memory_size                       366891.3615 

=== epoch 5/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                    6
episode_length                      65.666667
returns                            -51.540502
return_std                          45.233826
average_reward                      -0.531737
round_time             0 days 00:11:00.463908
episodes_test                             5.0
episode_length_test                     222.0
returns_test                       109.330537
return_std_test                    232.165077
average_reward_test                  0.594738
round_time_test        0 days 00:00:02.807340
round_time_total       0 days 00:11:00.465010
loss_total                         537.308255
loss_critic                        737.294162
loss_actor                        -262.635428
memory_size                        368696.035 

=== epoch 5/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:03<00:00,  3.02it/s]
episodes                                   13
episode_length                     131.461538
returns                            -76.250295
return_std                         115.849404
average_reward                      -0.557613
round_time             0 days 00:11:03.552220
episodes_test                             3.0
episode_length_test                     346.0
returns_test                       233.917549
return_std_test                    323.821055
average_reward_test                  0.693158
round_time_test        0 days 00:00:02.808825
round_time_total       0 days 00:11:03.553318
loss_total                         545.005594
loss_critic                        746.934103
loss_actor                        -262.708494
memory_size                       370329.1655 

=== epoch 5/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   14
episode_length                           47.0
returns                            -29.865423
return_std                          33.457979
average_reward                      -0.515045
round_time             0 days 00:11:04.762676
episodes_test                             4.0
episode_length_test                     278.0
returns_test                       202.643791
return_std_test                    373.976915
average_reward_test                  0.691601
round_time_test        0 days 00:00:02.834255
round_time_total       0 days 00:11:04.763776
loss_total                         541.358435
loss_critic                        742.756272
loss_actor                        -264.232971
memory_size                        372048.751 

=== epoch 5/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                    3
episode_length                     351.333333
returns                           -166.010747
return_std                         210.508069
average_reward                      -0.454888
round_time             0 days 00:11:05.405838
episodes_test                             7.0
episode_length_test                194.714286
returns_test                       105.219286
return_std_test                    255.270744
average_reward_test                   0.54954
round_time_test        0 days 00:00:02.828650
round_time_total       0 days 00:11:05.406980
loss_total                         557.475438
loss_critic                        762.872251
loss_actor                        -264.111867
memory_size                        373741.819 

=== epoch 5/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                    8
episode_length                        169.125
returns                            -61.284349
return_std                         139.402251
average_reward                      -0.373886
round_time             0 days 00:11:08.942788
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       749.079336
return_std_test                     22.766179
average_reward_test                  0.749079
round_time_test        0 days 00:00:02.793166
round_time_total       0 days 00:11:08.943917
loss_total                         551.694156
loss_critic                        755.897646
loss_actor                        -265.119854
memory_size                       375584.8465 

=== epoch 5/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                    6
episode_length                     189.333333
returns                            -83.483753
return_std                         165.193625
average_reward                      -0.421455
round_time             0 days 00:11:12.271916
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       768.318966
return_std_test                      9.451059
average_reward_test                  0.768319
round_time_test        0 days 00:00:02.796573
round_time_total       0 days 00:11:12.273033
loss_total                         564.749079
loss_critic                        772.041228
loss_actor                        -264.419569
memory_size                       377280.9435 

=== epoch 5/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                    6
episode_length                     187.833333
returns                            -78.116894
return_std                          147.85244
average_reward                       -0.43636
round_time             0 days 00:11:10.507069
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                        779.99032
return_std_test                      5.177106
average_reward_test                   0.77999
round_time_test        0 days 00:00:02.851678
round_time_total       0 days 00:11:10.508168
loss_total                         551.173889
loss_critic                        755.586189
loss_actor                        -266.475357
memory_size                       379118.1505 

=== epoch 5/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                   14
episode_length                          121.0
returns                            -66.144382
return_std                         104.741757
average_reward                      -0.526128
round_time             0 days 00:11:07.243577
episodes_test                             4.0
episode_length_test                     339.0
returns_test                       168.742517
return_std_test                    322.122077
average_reward_test                  0.584601
round_time_test        0 days 00:00:02.869469
round_time_total       0 days 00:11:07.244680
loss_total                         570.904296
loss_critic                        779.908948
loss_actor                        -265.114367
memory_size                        380755.615 

=== epoch 5/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                   19
episode_length                      97.736842
returns                            -48.221279
return_std                         113.682154
average_reward                      -0.500646
round_time             0 days 00:11:09.611703
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       723.557716
return_std_test                     25.199621
average_reward_test                  0.723558
round_time_test        0 days 00:00:02.825653
round_time_total       0 days 00:11:09.612817
loss_total                         571.892102
loss_critic                        781.061475
loss_actor                        -264.785443
memory_size                       382391.4575 

=== epoch 5/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:39<00:00,  2.86it/s]
episodes                                   10
episode_length                          151.0
returns                             -78.22468
return_std                         143.979268
average_reward                      -0.489988
round_time             0 days 00:11:39.933904
episodes_test                             8.0
episode_length_test                   169.125
returns_test                       102.654989
return_std_test                    268.475838
average_reward_test                  0.658104
round_time_test        0 days 00:00:02.803255
round_time_total       0 days 00:11:39.935011
loss_total                         578.941823
loss_critic                        789.610059
loss_actor                        -263.731175
memory_size                       383770.1965 

=== epoch 5/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:15<00:00,  2.96it/s]
episodes                                    4
episode_length                          286.0
returns                           -137.099691
return_std                          179.68459
average_reward                      -0.473306
round_time             0 days 00:11:15.560712
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       747.541203
return_std_test                     41.507234
average_reward_test                  0.747541
round_time_test        0 days 00:00:02.803783
round_time_total       0 days 00:11:15.561810
loss_total                         577.258384
loss_critic                        787.542666
loss_actor                        -263.878803
memory_size                       385584.7105 

=== epoch 5/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:15<00:00,  2.96it/s]
episodes                                   25
episode_length                          44.52
returns                            -26.652602
return_std                          34.434928
average_reward                       -0.52057
round_time             0 days 00:11:16.065186
episodes_test                            19.0
episode_length_test                 98.421053
returns_test                         47.30452
return_std_test                    173.833851
average_reward_test                  0.457749
round_time_test        0 days 00:00:02.759727
round_time_total       0 days 00:11:16.066276
loss_total                         579.626669
loss_critic                        790.390405
loss_actor                         -263.42833
memory_size                        387220.892 

=== epoch 5/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:35<00:00,  2.88it/s]
episodes                                    5
episode_length                          232.4
returns                            -96.972749
return_std                         174.370816
average_reward                      -0.432756
round_time             0 days 00:11:35.581361
episodes_test                             4.0
episode_length_test                     275.0
returns_test                       187.228626
return_std_test                    330.954048
average_reward_test                  0.712438
round_time_test        0 days 00:00:02.870660
round_time_total       0 days 00:11:35.582469
loss_total                         594.145128
loss_critic                        808.569729
loss_actor                        -263.553333
memory_size                        388789.351 

=== epoch 5/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:16<00:00,  2.96it/s]
episodes                                   34
episode_length                      50.823529
returns                            -36.348644
return_std                          36.687495
average_reward                      -0.685512
round_time             0 days 00:11:16.774732
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       758.798483
return_std_test                     17.605844
average_reward_test                  0.758798
round_time_test        0 days 00:00:02.856237
round_time_total       0 days 00:11:16.775880
loss_total                         611.233439
loss_critic                        829.798439
loss_actor                        -263.026622
memory_size                        390087.013 

=== epoch 5/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:19<00:00,  2.94it/s]
episodes                                    5
episode_length                          230.8
returns                           -102.824405
return_std                         179.076685
average_reward                      -0.468163
round_time             0 days 00:11:19.623634
episodes_test                             5.0
episode_length_test                     249.2
returns_test                       121.356651
return_std_test                    287.743615
average_reward_test                  0.593871
round_time_test        0 days 00:00:02.830536
round_time_total       0 days 00:11:19.624742
loss_total                         634.684228
loss_critic                        858.817592
loss_actor                        -261.849289
memory_size                       391605.9085 

=== epoch 5/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:30<00:00,  2.90it/s]
episodes                                    5
episode_length                          212.8
returns                            -92.741681
return_std                         170.950074
average_reward                       -0.43741
round_time             0 days 00:11:30.698970
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       751.075156
return_std_test                      9.759521
average_reward_test                  0.751075
round_time_test        0 days 00:00:02.875083
round_time_total       0 days 00:11:30.700077
loss_total                         615.233075
loss_critic                        834.683929
loss_actor                        -262.570398
memory_size                        393454.554 

=== epoch 5/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:20<00:00,  2.94it/s]
episodes                                   14
episode_length                     115.071429
returns                            -46.130256
return_std                         105.840363
average_reward                      -0.388244
round_time             0 days 00:11:21.449282
episodes_test                             7.0
episode_length_test                166.714286
returns_test                       111.921045
return_std_test                    270.369319
average_reward_test                  0.702237
round_time_test        0 days 00:00:02.804842
round_time_total       0 days 00:11:21.450383
loss_total                         610.253166
loss_critic                        828.494318
loss_actor                        -262.711503
memory_size                        395148.928 

=== epoch 5/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:25<00:00,  2.92it/s]
episodes                                   12
episode_length                      52.416667
returns                            -38.641244
return_std                          38.740877
average_reward                      -0.516765
round_time             0 days 00:11:26.409240
episodes_test                             9.0
episode_length_test                     160.0
returns_test                        63.148606
return_std_test                    242.546401
average_reward_test                  0.437271
round_time_test        0 days 00:00:02.782443
round_time_total       0 days 00:11:26.410353
loss_total                          624.21721
loss_critic                        845.631729
loss_actor                        -261.440927
memory_size                        396739.381 

=== epoch 5/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:23<00:00,  2.93it/s]
episodes                                    1
episode_length                         1000.0
returns                           -417.577374
return_std                                0.0
average_reward                      -0.408149
round_time             0 days 00:11:23.963334
episodes_test                             2.0
episode_length_test                     505.0
returns_test                       408.199983
return_std_test                    401.415468
average_reward_test                  0.798517
round_time_test        0 days 00:00:02.848684
round_time_total       0 days 00:11:23.964442
loss_total                         612.830316
loss_critic                        831.524672
loss_actor                        -261.947168
memory_size                        398528.159 

=== epoch 5/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:32<00:00,  2.89it/s]
episodes                                    7
episode_length                     193.142857
returns                            -94.061567
return_std                         130.413912
average_reward                      -0.465937
round_time             0 days 00:11:32.553460
episodes_test                            13.0
episode_length_test                122.923077
returns_test                        68.766471
return_std_test                    209.052021
average_reward_test                  0.598009
round_time_test        0 days 00:00:02.763805
round_time_total       0 days 00:11:32.554553
loss_total                         608.180461
loss_critic                         826.07739
loss_actor                        -263.407316
memory_size                       400392.4425 

=== epoch 5/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:25<00:00,  2.92it/s]
episodes                                    9
episode_length                      40.555556
returns                            -16.648402
return_std                          18.444188
average_reward                      -0.422198
round_time             0 days 00:11:25.516519
episodes_test                            12.0
episode_length_test                137.666667
returns_test                        72.132928
return_std_test                    212.688779
average_reward_test                   0.55872
round_time_test        0 days 00:00:02.794356
round_time_total       0 days 00:11:25.517648
loss_total                         628.790842
loss_critic                        851.804897
loss_actor                        -263.265437
memory_size                       402113.7685 

=== epoch 5/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:47<00:00,  2.83it/s]
episodes                                    7
episode_length                     204.857143
returns                           -104.116482
return_std                         147.392351
average_reward                      -0.553034
round_time             0 days 00:11:47.892811
episodes_test                             4.0
episode_length_test                    307.75
returns_test                        207.85008
return_std_test                    314.348659
average_reward_test                  0.717929
round_time_test        0 days 00:00:02.835854
round_time_total       0 days 00:11:47.893917
loss_total                         634.922413
loss_critic                         859.22516
loss_actor                        -262.288638
memory_size                        403841.452 

=== epoch 5/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:37<00:00,  2.87it/s]
episodes                                    9
episode_length                     136.333333
returns                             -58.09296
return_std                         121.745203
average_reward                      -0.439345
round_time             0 days 00:11:37.904557
episodes_test                             2.0
episode_length_test                     548.0
returns_test                       379.571926
return_std_test                    387.436828
average_reward_test                  0.719809
round_time_test        0 days 00:00:02.911852
round_time_total       0 days 00:11:37.905655
loss_total                         614.617787
loss_critic                         834.35214
loss_actor                         -264.31968
memory_size                       405616.2115 

=== epoch 5/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:36<00:00,  2.87it/s]
episodes                                    4
episode_length                         286.75
returns                           -122.643228
return_std                         176.481251
average_reward                      -0.404608
round_time             0 days 00:11:37.187412
episodes_test                             4.0
episode_length_test                     277.0
returns_test                       194.075954
return_std_test                    325.534439
average_reward_test                  0.737519
round_time_test        0 days 00:00:02.933441
round_time_total       0 days 00:11:37.188514
loss_total                         638.138266
loss_critic                        863.497522
loss_actor                        -263.298824
memory_size                        407426.944 

=== epoch 5/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:46<00:00,  2.83it/s]
episodes                                    2
episode_length                          562.5
returns                           -290.568183
return_std                         137.930322
average_reward                      -0.464468
round_time             0 days 00:11:47.044437
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       397.228868
return_std_test                    284.420448
average_reward_test                  0.397229
round_time_test        0 days 00:00:02.837360
round_time_total       0 days 00:11:47.045546
loss_total                         633.476757
loss_critic                        857.994383
loss_actor                        -264.593802
memory_size                         409283.04 

=== epoch 5/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:36<00:00,  2.87it/s]
episodes                                    7
episode_length                          174.0
returns                            -93.641114
return_std                         169.195817
average_reward                       -0.53353
round_time             0 days 00:11:37.435738
episodes_test                             5.0
episode_length_test                     249.0
returns_test                       152.889296
return_std_test                    283.064753
average_reward_test                  0.659645
round_time_test        0 days 00:00:02.772551
round_time_total       0 days 00:11:37.436846
loss_total                         630.445625
loss_critic                        854.266452
loss_actor                        -264.837738
memory_size                        411112.725 

=== epoch 5/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:46<00:00,  2.83it/s]
episodes                                   23
episode_length                       46.26087
returns                            -31.086119
return_std                          30.650322
average_reward                      -0.563939
round_time             0 days 00:11:47.264368
episodes_test                             4.0
episode_length_test                    267.75
returns_test                       184.574832
return_std_test                     336.11704
average_reward_test                  0.724242
round_time_test        0 days 00:00:02.771062
round_time_total       0 days 00:11:47.265500
loss_total                         657.444623
loss_critic                        887.960077
loss_actor                         -264.61725
memory_size                        412637.288 

=== epoch 5/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:53<00:00,  2.80it/s]
episodes                                   13
episode_length                     127.846154
returns                             -56.72117
return_std                         100.535456
average_reward                       -0.44551
round_time             0 days 00:11:53.799509
episodes_test                            11.0
episode_length_test                123.818182
returns_test                        69.348154
return_std_test                    243.559527
average_reward_test                  0.576921
round_time_test        0 days 00:00:02.785086
round_time_total       0 days 00:11:53.800622
loss_total                         654.735977
loss_critic                          884.6083
loss_actor                        -264.753375
memory_size                        414211.015 

=== epoch 5/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:48<00:00,  2.82it/s]
episodes                                    5
episode_length                          222.2
returns                             -92.18621
return_std                         168.178706
average_reward                      -0.428483
round_time             0 days 00:11:49.078850
episodes_test                             5.0
episode_length_test                     221.0
returns_test                        84.717688
return_std_test                    171.409363
average_reward_test                  0.535796
round_time_test        0 days 00:00:02.845947
round_time_total       0 days 00:11:49.079955
loss_total                         672.611114
loss_critic                        906.824451
loss_actor                        -264.242293
memory_size                       415808.1655 

=== epoch 5/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:49<00:00,  2.82it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
episodes                                    6
episode_length                          209.0
returns                            -93.691309
return_std                         139.337932
average_reward                       -0.43029
round_time             0 days 00:11:50.026577
episodes_test                             3.0
episode_length_test                348.666667
returns_test                       243.609372
return_std_test                     356.28697
average_reward_test                  0.715539
round_time_test        0 days 00:00:02.841437
round_time_total       0 days 00:11:50.027676
loss_total                         677.862047
loss_critic                        913.581648
loss_actor                         -265.01642
memory_size                        417625.712 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 6/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [10:41<00:00,  3.12it/s]
episodes                                   17
episode_length                     112.764706
returns                            -64.823029
return_std                          96.643637
average_reward                      -0.559447
round_time             0 days 00:10:41.462679
episodes_test                             5.0
episode_length_test                     233.4
returns_test                       150.138648
return_std_test                    277.705901
average_reward_test                  0.307768
round_time_test        0 days 00:00:02.793309
round_time_total       0 days 00:10:41.463790
loss_total                         673.974755
loss_critic                        908.880896
loss_actor                        -265.649872
memory_size                       419377.1075 

=== epoch 6/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:36<00:00,  3.14it/s]
episodes                                   11
episode_length                      48.636364
returns                             -33.13948
return_std                          32.982023
average_reward                      -0.485132
round_time             0 days 00:10:37.366632
episodes_test                            13.0
episode_length_test                125.923077
returns_test                        64.440761
return_std_test                    202.971861
average_reward_test                  0.528897
round_time_test        0 days 00:00:02.785930
round_time_total       0 days 00:10:37.367734
loss_total                         684.755173
loss_critic                        922.286273
loss_actor                         -265.36929
memory_size                       420886.6905 

=== epoch 6/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:36<00:00,  3.14it/s]
episodes                                    3
episode_length                     349.666667
returns                           -154.468881
return_std                         214.719298
average_reward                       -0.44788
round_time             0 days 00:10:37.400030
episodes_test                             2.0
episode_length_test                     514.0
returns_test                       360.798236
return_std_test                     355.42998
average_reward_test                  0.751224
round_time_test        0 days 00:00:02.867359
round_time_total       0 days 00:10:37.401118
loss_total                         687.233266
loss_critic                        925.515518
loss_actor                        -265.895808
memory_size                        422620.757 

=== epoch 6/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:42<00:00,  3.11it/s]
episodes                                    5
episode_length                          231.2
returns                           -115.072272
return_std                         179.185039
average_reward                      -0.481482
round_time             0 days 00:10:43.061845
episodes_test                             2.0
episode_length_test                     518.5
returns_test                       155.125968
return_std_test                    134.542333
average_reward_test                  0.511107
round_time_test        0 days 00:00:02.810367
round_time_total       0 days 00:10:43.062944
loss_total                         694.320464
loss_critic                        934.346316
loss_actor                        -265.783009
memory_size                        424468.193 

=== epoch 6/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:35<00:00,  3.15it/s]
episodes                                   10
episode_length                           34.8
returns                            -23.080193
return_std                          28.397351
average_reward                      -0.488991
round_time             0 days 00:10:35.493488
episodes_test                             8.0
episode_length_test                     165.5
returns_test                       100.728753
return_std_test                    239.672082
average_reward_test                  0.671071
round_time_test        0 days 00:00:02.792458
round_time_total       0 days 00:10:35.494600
loss_total                         696.953135
loss_critic                        937.415257
loss_actor                         -264.89542
memory_size                        426236.401 

=== epoch 6/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                    3
episode_length                          356.0
returns                            -171.36406
return_std                         214.848809
average_reward                      -0.449374
round_time             0 days 00:10:38.919282
episodes_test                             5.0
episode_length_test                     235.2
returns_test                       158.814082
return_std_test                     305.50236
average_reward_test                  0.695288
round_time_test        0 days 00:00:02.793541
round_time_total       0 days 00:10:38.920375
loss_total                         677.968675
loss_critic                        914.127076
loss_actor                        -266.664992
memory_size                       428034.0995 

=== epoch 6/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:37<00:00,  3.14it/s]
episodes                                    3
episode_length                     377.333333
returns                           -185.134349
return_std                         204.395978
average_reward                      -0.456533
round_time             0 days 00:10:38.337767
episodes_test                            11.0
episode_length_test                     136.0
returns_test                        73.827336
return_std_test                    215.016084
average_reward_test                  0.607045
round_time_test        0 days 00:00:02.784181
round_time_total       0 days 00:10:38.338884
loss_total                         670.204296
loss_critic                        904.589696
loss_actor                        -267.337372
memory_size                       429915.4975 

=== epoch 6/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:35<00:00,  3.15it/s]
episodes                                   11
episode_length                     138.272727
returns                            -71.125215
return_std                         138.565659
average_reward                      -0.477144
round_time             0 days 00:10:36.218700
episodes_test                             3.0
episode_length_test                     378.0
returns_test                       275.127634
return_std_test                    370.285036
average_reward_test                  0.748544
round_time_test        0 days 00:00:02.803521
round_time_total       0 days 00:10:36.219806
loss_total                         690.078065
loss_critic                        929.590789
loss_actor                        -267.972893
memory_size                        431667.125 

=== epoch 6/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                    6
episode_length                          209.5
returns                            -79.328706
return_std                         152.596628
average_reward                      -0.406229
round_time             0 days 00:10:40.800469
episodes_test                             5.0
episode_length_test                     237.2
returns_test                       167.875518
return_std_test                    288.120926
average_reward_test                  0.707948
round_time_test        0 days 00:00:02.837479
round_time_total       0 days 00:10:40.801569
loss_total                          680.29506
loss_critic                        917.383386
loss_actor                        -268.058304
memory_size                        433438.295 

=== epoch 6/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:36<00:00,  3.14it/s]
episodes                                    3
episode_length                     371.333333
returns                           -138.702661
return_std                         182.991178
average_reward                      -0.473119
round_time             0 days 00:10:37.043526
episodes_test                             2.0
episode_length_test                     515.5
returns_test                       403.553504
return_std_test                    406.968813
average_reward_test                  0.789879
round_time_test        0 days 00:00:02.785419
round_time_total       0 days 00:10:37.044637
loss_total                         697.536251
loss_critic                        938.893861
loss_actor                        -267.894256
memory_size                       435242.7055 

=== epoch 6/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:36<00:00,  3.14it/s]
episodes                                    4
episode_length                          300.5
returns                           -164.779696
return_std                          159.03275
average_reward                      -0.519939
round_time             0 days 00:10:37.096983
episodes_test                             5.0
episode_length_test                     227.0
returns_test                        164.72919
return_std_test                    319.849837
average_reward_test                  0.546421
round_time_test        0 days 00:00:02.800033
round_time_total       0 days 00:10:37.098085
loss_total                         703.934924
loss_critic                        946.896213
loss_actor                        -267.910294
memory_size                       437126.2625 

=== epoch 6/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:52<00:00,  3.06it/s]
episodes                                    2
episode_length                          516.5
returns                           -251.145975
return_std                         214.789502
average_reward                      -0.409617
round_time             0 days 00:10:53.041930
episodes_test                             6.0
episode_length_test                     193.0
returns_test                       131.885183
return_std_test                    281.336141
average_reward_test                  0.635154
round_time_test        0 days 00:00:02.803517
round_time_total       0 days 00:10:53.043033
loss_total                         695.232073
loss_critic                        936.344571
loss_actor                         -269.21798
memory_size                       438992.5965 

=== epoch 6/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:42<00:00,  3.11it/s]
episodes                                    5
episode_length                          225.0
returns                            -98.275108
return_std                         150.682894
average_reward                      -0.428178
round_time             0 days 00:10:43.148229
episodes_test                             3.0
episode_length_test                     350.0
returns_test                       248.546657
return_std_test                    350.477296
average_reward_test                  0.719664
round_time_test        0 days 00:00:02.870364
round_time_total       0 days 00:10:43.149331
loss_total                         694.505769
loss_critic                        935.380012
loss_actor                        -268.991272
memory_size                       440844.9845 

=== epoch 6/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:44<00:00,  3.10it/s]
episodes                                    3
episode_length                     401.333333
returns                           -154.779613
return_std                         171.233388
average_reward                      -0.433385
round_time             0 days 00:10:45.262422
episodes_test                             6.0
episode_length_test                     182.5
returns_test                       140.661966
return_std_test                    302.570535
average_reward_test                  0.758481
round_time_test        0 days 00:00:02.803527
round_time_total       0 days 00:10:45.263546
loss_total                          694.23266
loss_critic                        935.174799
loss_actor                        -269.535965
memory_size                       442737.4405 

=== epoch 6/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:41<00:00,  3.12it/s]
episodes                                    5
episode_length                          116.4
returns                            -53.768957
return_std                          49.365602
average_reward                       -0.46399
round_time             0 days 00:10:41.610402
episodes_test                             4.0
episode_length_test                     288.0
returns_test                       196.478007
return_std_test                    321.611339
average_reward_test                  0.714326
round_time_test        0 days 00:00:02.798767
round_time_total       0 days 00:10:41.611502
loss_total                         707.013274
loss_critic                        951.041502
loss_actor                        -269.099706
memory_size                       444550.4585 

=== epoch 6/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:43<00:00,  3.11it/s]
episodes                                    6
episode_length                          211.5
returns                            -79.914203
return_std                         129.254222
average_reward                      -0.380291
round_time             0 days 00:10:43.665393
episodes_test                             8.0
episode_length_test                   155.625
returns_test                        90.236982
return_std_test                     249.73111
average_reward_test                  0.656314
round_time_test        0 days 00:00:02.812053
round_time_total       0 days 00:10:43.666501
loss_total                          708.30632
loss_critic                        952.670346
loss_actor                        -269.149851
memory_size                        446313.463 

=== epoch 6/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:56<00:00,  3.05it/s]
episodes                                   17
episode_length                      42.647059
returns                            -31.301609
return_std                          38.090999
average_reward                      -0.557454
round_time             0 days 00:10:57.216171
episodes_test                             2.0
episode_length_test                     519.0
returns_test                       338.500162
return_std_test                    347.642195
average_reward_test                   0.72084
round_time_test        0 days 00:00:02.820091
round_time_total       0 days 00:10:57.217283
loss_total                         695.935888
loss_critic                        937.231869
loss_actor                        -269.248094
memory_size                       447954.5655 

=== epoch 6/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
episodes                                   12
episode_length                         131.25
returns                            -72.735674
return_std                         113.836985
average_reward                      -0.525907
round_time             0 days 00:11:06.399244
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       758.991933
return_std_test                     39.328388
average_reward_test                  0.758992
round_time_test        0 days 00:00:02.814274
round_time_total       0 days 00:11:06.400346
loss_total                         718.596909
loss_critic                        965.500013
loss_actor                         -269.01557
memory_size                       449566.6245 

=== epoch 6/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                    5
episode_length                          219.8
returns                           -102.967413
return_std                         162.480122
average_reward                      -0.453032
round_time             0 days 00:11:07.750346
episodes_test                             6.0
episode_length_test                195.333333
returns_test                       125.416297
return_std_test                    286.426928
average_reward_test                  0.700286
round_time_test        0 days 00:00:02.795664
round_time_total       0 days 00:11:07.751444
loss_total                         723.385856
loss_critic                        971.223743
loss_actor                         -267.96576
memory_size                        451315.364 

=== epoch 6/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                   12
episode_length                     125.166667
returns                             -62.81558
return_std                         111.268644
average_reward                      -0.503502
round_time             0 days 00:11:04.366510
episodes_test                             3.0
episode_length_test                350.666667
returns_test                       261.699272
return_std_test                    374.509569
average_reward_test                   0.77811
round_time_test        0 days 00:00:02.821510
round_time_total       0 days 00:11:04.367624
loss_total                         728.211062
loss_critic                        977.753545
loss_actor                        -269.958938
memory_size                        452912.919 

=== epoch 6/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                    1
episode_length                         1000.0
returns                           -442.494026
return_std                                0.0
average_reward                      -0.396934
round_time             0 days 00:11:08.216025
episodes_test                             8.0
episode_length_test                   165.125
returns_test                       101.359794
return_std_test                    255.396554
average_reward_test                  0.676407
round_time_test        0 days 00:00:02.807249
round_time_total       0 days 00:11:08.217142
loss_total                         737.110692
loss_critic                        988.867325
loss_actor                        -269.915909
memory_size                        454774.081 

=== epoch 6/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                    4
episode_length                          319.5
returns                           -150.737027
return_std                         159.688845
average_reward                      -0.482879
round_time             0 days 00:11:11.973960
episodes_test                             2.0
episode_length_test                     584.5
returns_test                        366.20747
return_std_test                    393.903305
average_reward_test                  0.621094
round_time_test        0 days 00:00:02.855816
round_time_total       0 days 00:11:11.975077
loss_total                         720.517528
loss_critic                        968.168279
loss_actor                        -270.085542
memory_size                        456691.792 

=== epoch 6/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:13<00:00,  2.97it/s]
episodes                                    7
episode_length                      47.142857
returns                             -11.68805
return_std                          12.471185
average_reward                      -0.380146
round_time             0 days 00:11:14.430653
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       747.097826
return_std_test                      0.465749
average_reward_test                  0.747098
round_time_test        0 days 00:00:02.865522
round_time_total       0 days 00:11:14.431775
loss_total                          725.43077
loss_critic                        974.388064
loss_actor                        -270.398472
memory_size                       458455.7075 

=== epoch 6/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:22<00:00,  2.93it/s]
episodes                                   14
episode_length                     113.571429
returns                            -56.009673
return_std                          103.44433
average_reward                      -0.463736
round_time             0 days 00:11:23.308696
episodes_test                             2.0
episode_length_test                     527.0
returns_test                       386.540004
return_std_test                    399.130663
average_reward_test                  0.764385
round_time_test        0 days 00:00:02.799800
round_time_total       0 days 00:11:23.309791
loss_total                         746.536988
loss_critic                       1000.562771
loss_actor                        -269.566212
memory_size                        460062.783 

=== epoch 6/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                   12
episode_length                      37.666667
returns                            -26.571749
return_std                          28.338969
average_reward                      -0.516541
round_time             0 days 00:11:09.445421
episodes_test                            15.0
episode_length_test                107.266667
returns_test                        27.432814
return_std_test                     83.954941
average_reward_test                  0.349541
round_time_test        0 days 00:00:02.779476
round_time_total       0 days 00:11:09.446525
loss_total                         731.878822
loss_critic                        982.518626
loss_actor                        -270.680462
memory_size                       461726.2005 

=== epoch 6/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:26<00:00,  2.91it/s]
episodes                                   11
episode_length                     140.181818
returns                            -79.496939
return_std                         158.527256
average_reward                      -0.525512
round_time             0 days 00:11:27.112632
episodes_test                             5.0
episode_length_test                     221.6
returns_test                       154.197935
return_std_test                    327.834658
average_reward_test                  0.722556
round_time_test        0 days 00:00:02.824790
round_time_total       0 days 00:11:27.113736
loss_total                         746.565899
loss_critic                       1000.426778
loss_actor                        -268.877689
memory_size                       463401.5955 

=== epoch 6/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:20<00:00,  2.94it/s]
episodes                                    3
episode_length                     401.666667
returns                           -177.848498
return_std                         177.269584
average_reward                      -0.441079
round_time             0 days 00:11:20.521945
episodes_test                             3.0
episode_length_test                365.666667
returns_test                       267.045002
return_std_test                    360.491125
average_reward_test                  0.696394
round_time_test        0 days 00:00:02.805393
round_time_total       0 days 00:11:20.523037
loss_total                           762.4243
loss_critic                       1020.277928
loss_actor                        -268.990277
memory_size                       465208.7545 

=== epoch 6/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                    1
episode_length                         1000.0
returns                           -486.116706
return_std                                0.0
average_reward                      -0.475969
round_time             0 days 00:11:10.510123
episodes_test                             2.0
episode_length_test                     535.5
returns_test                       401.919255
return_std_test                    403.132234
average_reward_test                   0.78206
round_time_test        0 days 00:00:02.865558
round_time_total       0 days 00:11:10.511225
loss_total                         768.586463
loss_critic                       1028.021483
loss_actor                        -269.153689
memory_size                        467084.579 

=== epoch 6/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:56<00:00,  2.79it/s]
episodes                                    2
episode_length                          507.0
returns                           -262.144714
return_std                         260.549785
average_reward                       -0.47722
round_time             0 days 00:11:56.675037
episodes_test                             4.0
episode_length_test                     297.5
returns_test                       -32.528141
return_std_test                     76.908727
average_reward_test                  0.197795
round_time_test        0 days 00:00:02.835395
round_time_total       0 days 00:11:56.676135
loss_total                         787.451855
loss_critic                       1051.672989
loss_actor                        -269.432746
memory_size                       468975.1745 

=== epoch 6/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:37<00:00,  2.87it/s]
episodes                                    5
episode_length                          234.8
returns                            -95.027582
return_std                         141.613045
average_reward                      -0.410006
round_time             0 days 00:11:38.331934
episodes_test                             6.0
episode_length_test                206.333333
returns_test                        136.62584
return_std_test                    304.154974
average_reward_test                  0.685693
round_time_test        0 days 00:00:02.825235
round_time_total       0 days 00:11:38.333040
loss_total                         775.058903
loss_critic                       1036.243345
loss_actor                        -269.678934
memory_size                       470805.3625 

=== epoch 6/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:29<00:00,  2.90it/s]
episodes                                    4
episode_length                          292.0
returns                           -130.176867
return_std                         162.599219
average_reward                      -0.408657
round_time             0 days 00:11:29.582106
episodes_test                             3.0
episode_length_test                     385.0
returns_test                       258.322258
return_std_test                    373.162237
average_reward_test                  0.700499
round_time_test        0 days 00:00:02.831055
round_time_total       0 days 00:11:29.583212
loss_total                         754.423383
loss_critic                       1010.566653
loss_actor                        -270.149768
memory_size                        472684.056 

=== epoch 6/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:16<00:00,  2.96it/s]
episodes                                    2
episode_length                          511.0
returns                            -234.39048
return_std                         233.293576
average_reward                      -0.422371
round_time             0 days 00:11:17.233385
episodes_test                             2.0
episode_length_test                     530.5
returns_test                       412.671961
return_std_test                    384.206168
average_reward_test                  0.765782
round_time_test        0 days 00:00:02.822038
round_time_total       0 days 00:11:17.234479
loss_total                         764.051355
loss_critic                       1022.810739
loss_actor                        -270.986251
memory_size                       474565.4505 

=== epoch 6/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:33<00:00,  2.89it/s]
episodes                                   11
episode_length                     138.818182
returns                            -65.978454
return_std                         110.992202
average_reward                      -0.471329
round_time             0 days 00:11:33.593881
episodes_test                            13.0
episode_length_test                109.538462
returns_test                        49.674722
return_std_test                    187.628798
average_reward_test                   0.55101
round_time_test        0 days 00:00:02.798935
round_time_total       0 days 00:11:33.594977
loss_total                         762.697586
loss_critic                       1021.300422
loss_actor                        -271.713827
memory_size                       476394.1055 

=== epoch 6/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:28<00:00,  2.91it/s]
episodes                                    1
episode_length                           41.0
returns                            -39.753135
return_std                                0.0
average_reward                      -0.431455
round_time             0 days 00:11:28.667474
episodes_test                             5.0
episode_length_test                     223.0
returns_test                        24.936191
return_std_test                     37.989242
average_reward_test                  0.416773
round_time_test        0 days 00:00:02.819965
round_time_total       0 days 00:11:28.668574
loss_total                         773.250186
loss_critic                       1034.329994
loss_actor                         -271.06911
memory_size                       478149.2005 

=== epoch 6/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:26<00:00,  2.91it/s]
episodes                                   11
episode_length                     139.181818
returns                            -74.588891
return_std                         111.842544
average_reward                       -0.50757
round_time             0 days 00:11:27.002643
episodes_test                             2.0
episode_length_test                     516.0
returns_test                       396.964525
return_std_test                    411.912179
average_reward_test                  0.779518
round_time_test        0 days 00:00:02.822316
round_time_total       0 days 00:11:27.003740
loss_total                         767.569664
loss_critic                       1027.191775
loss_actor                        -270.918846
memory_size                        479941.547 

=== epoch 6/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:34<00:00,  2.88it/s]
episodes                                   15
episode_length                      92.066667
returns                            -41.141679
return_std                         102.872423
average_reward                       -0.44985
round_time             0 days 00:11:34.851675
episodes_test                             7.0
episode_length_test                     172.0
returns_test                       111.074524
return_std_test                    255.470994
average_reward_test                  0.697238
round_time_test        0 days 00:00:02.821959
round_time_total       0 days 00:11:34.852781
loss_total                         763.283324
loss_critic                       1022.093833
loss_actor                        -271.958777
memory_size                       481595.8315 

=== epoch 6/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:50<00:00,  2.81it/s]
episodes                                   11
episode_length                      50.909091
returns                            -48.365315
return_std                          55.134336
average_reward                      -0.582662
round_time             0 days 00:11:51.032412
episodes_test                             7.0
episode_length_test                174.285714
returns_test                        98.282874
return_std_test                      270.0205
average_reward_test                  0.436401
round_time_test        0 days 00:00:02.812351
round_time_total       0 days 00:11:51.033511
loss_total                         771.745077
loss_critic                       1032.648372
loss_actor                        -271.868173
memory_size                       483268.3295 

=== epoch 6/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:44<00:00,  2.84it/s]
episodes                                   11
episode_length                     133.454545
returns                            -66.830959
return_std                         111.147048
average_reward                      -0.495062
round_time             0 days 00:11:44.589353
episodes_test                             7.0
episode_length_test                186.428571
returns_test                       103.266427
return_std_test                    269.984362
average_reward_test                  0.636193
round_time_test        0 days 00:00:02.811729
round_time_total       0 days 00:11:44.590464
loss_total                         786.903354
loss_critic                       1051.449651
loss_actor                        -271.281907
memory_size                        484944.219 

=== epoch 6/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:41<00:00,  2.85it/s]
episodes                                    5
episode_length                          232.0
returns                           -103.130558
return_std                         169.219731
average_reward                      -0.419067
round_time             0 days 00:11:41.849148
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       794.258559
return_std_test                      4.513964
average_reward_test                  0.794259
round_time_test        0 days 00:00:02.847077
round_time_total       0 days 00:11:41.850261
loss_total                          785.01052
loss_critic                        1048.95015
loss_actor                        -270.748075
memory_size                       486586.4065 

=== epoch 6/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:50<00:00,  2.82it/s]
episodes                                    9
episode_length                     150.444444
returns                            -80.541616
return_std                         135.775903
average_reward                      -0.489582
round_time             0 days 00:11:50.687089
episodes_test                             4.0
episode_length_test                     265.0
returns_test                       189.291369
return_std_test                     350.32736
average_reward_test                  0.709048
round_time_test        0 days 00:00:02.806885
round_time_total       0 days 00:11:50.688193
loss_total                         788.545685
loss_critic                       1053.636084
loss_actor                        -271.815983
memory_size                       488363.8225 

=== epoch 6/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:37<00:00,  2.87it/s]
episodes                                   12
episode_length                     116.583333
returns                             -57.16828
return_std                         115.039326
average_reward                      -0.486242
round_time             0 days 00:11:38.440530
episodes_test                             8.0
episode_length_test                   153.875
returns_test                         88.53381
return_std_test                    267.030384
average_reward_test                  0.652244
round_time_test        0 days 00:00:02.840905
round_time_total       0 days 00:11:38.441639
loss_total                         802.783853
loss_critic                       1071.189139
loss_actor                        -270.837367
memory_size                        490146.235 

=== epoch 6/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:42<00:00,  2.85it/s]
episodes                                    1
episode_length                         1000.0
returns                           -397.706655
return_std                                0.0
average_reward                      -0.443116
round_time             0 days 00:11:43.233560
episodes_test                             3.0
episode_length_test                341.666667
returns_test                       251.595364
return_std_test                    370.000273
average_reward_test                  0.765735
round_time_test        0 days 00:00:02.822904
round_time_total       0 days 00:11:43.234665
loss_total                         799.344997
loss_critic                       1066.963164
loss_actor                        -271.127744
memory_size                        491863.597 

=== epoch 6/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:41<00:00,  2.85it/s]
episodes                                    7
episode_length                     181.571429
returns                            -86.147412
return_std                         143.433056
average_reward                      -0.439562
round_time             0 days 00:11:42.174772
episodes_test                             5.0
episode_length_test                     232.0
returns_test                       151.662116
return_std_test                    280.146157
average_reward_test                  0.692701
round_time_test        0 days 00:00:02.849524
round_time_total       0 days 00:11:42.175870
loss_total                         798.650884
loss_critic                       1065.967397
loss_actor                        -270.615236
memory_size                        493732.077 

=== epoch 6/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:52<00:00,  2.81it/s]
episodes                                   16
episode_length                        54.0625
returns                            -33.100698
return_std                          30.151668
average_reward                      -0.534707
round_time             0 days 00:11:52.881002
episodes_test                             3.0
episode_length_test                353.666667
returns_test                       258.066678
return_std_test                    389.517014
average_reward_test                  0.757261
round_time_test        0 days 00:00:02.828928
round_time_total       0 days 00:11:52.882109
loss_total                         811.166519
loss_critic                       1081.775115
loss_actor                        -271.267948
memory_size                       495317.7315 

=== epoch 6/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:54<00:00,  2.80it/s]
episodes                                   20
episode_length                          84.15
returns                            -46.450218
return_std                         102.514132
average_reward                       -0.53933
round_time             0 days 00:11:54.505808
episodes_test                            12.0
episode_length_test                    126.25
returns_test                        45.914307
return_std_test                    200.769484
average_reward_test                  0.469855
round_time_test        0 days 00:00:02.817825
round_time_total       0 days 00:11:54.506910
loss_total                         806.072711
loss_critic                       1075.309174
loss_actor                        -270.873217
memory_size                       496914.9795 

=== epoch 6/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:49<00:00,  2.82it/s]
episodes                                    6
episode_length                           43.0
returns                            -24.464474
return_std                          16.231369
average_reward                      -0.443704
round_time             0 days 00:11:50.284540
episodes_test                             2.0
episode_length_test                     520.5
returns_test                       397.486278
return_std_test                    406.222298
average_reward_test                  0.756212
round_time_test        0 days 00:00:02.812618
round_time_total       0 days 00:11:50.285649
loss_total                         806.490425
loss_critic                       1075.871902
loss_actor                        -271.035559
memory_size                       498446.5655 

=== epoch 6/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:49<00:00,  2.82it/s]
episodes                                   18
episode_length                      94.555556
returns                            -44.699599
return_std                          91.835706
average_reward                      -0.476124
round_time             0 days 00:11:49.873035
episodes_test                             8.0
episode_length_test                   184.875
returns_test                        70.789241
return_std_test                     227.54013
average_reward_test                  0.465751
round_time_test        0 days 00:00:02.790728
round_time_total       0 days 00:11:49.874176
loss_total                         826.686002
loss_critic                       1100.897903
loss_actor                        -270.161675
memory_size                       500047.9125 

=== epoch 6/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:57<00:00,  2.79it/s]
episodes                                    3
episode_length                     385.333333
returns                           -174.702284
return_std                          160.54546
average_reward                      -0.444915
round_time             0 days 00:11:58.068586
episodes_test                             7.0
episode_length_test                160.714286
returns_test                       112.277339
return_std_test                    295.665571
average_reward_test                  0.741804
round_time_test        0 days 00:00:02.828669
round_time_total       0 days 00:11:58.069701
loss_total                         831.764312
loss_critic                       1107.098541
loss_actor                        -269.572682
memory_size                       501779.8305 

=== epoch 6/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:55<00:00,  2.79it/s]
episodes                                   13
episode_length                      35.307692
returns                            -27.257632
return_std                          18.480269
average_reward                      -0.500862
round_time             0 days 00:11:56.175844
episodes_test                             4.0
episode_length_test                     281.5
returns_test                       175.180541
return_std_test                     286.38649
average_reward_test                  0.692726
round_time_test        0 days 00:00:02.846053
round_time_total       0 days 00:11:56.176957
loss_total                         817.228935
loss_critic                       1089.011352
loss_actor                        -269.900813
memory_size                        503523.329 

=== epoch 6/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:08<00:00,  2.75it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
episodes                                    2
episode_length                          516.5
returns                           -236.624462
return_std                         222.142597
average_reward                      -0.465041
round_time             0 days 00:12:08.682442
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       720.171191
return_std_test                     42.797905
average_reward_test                  0.720171
round_time_test        0 days 00:00:02.804216
round_time_total       0 days 00:12:08.683549
loss_total                         814.678374
loss_critic                       1086.126372
loss_actor                        -271.113694
memory_size                        505243.152 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 7/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [10:43<00:00,  3.11it/s]
episodes                                    7
episode_length                     189.285714
returns                            -80.145213
return_std                         119.725961
average_reward                      -0.425939
round_time             0 days 00:10:43.293960
episodes_test                             5.0
episode_length_test                     244.6
returns_test                       150.049584
return_std_test                    325.601373
average_reward_test                  0.700169
round_time_test        0 days 00:00:02.842370
round_time_total       0 days 00:10:43.295097
loss_total                         833.327252
loss_critic                       1109.684955
loss_actor                        -272.103632
memory_size                        507016.934 

=== epoch 7/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:41<00:00,  3.12it/s]
episodes                                    3
episode_length                          355.0
returns                            -145.85811
return_std                         205.802329
average_reward                      -0.426633
round_time             0 days 00:10:42.360805
episodes_test                             4.0
episode_length_test                    288.75
returns_test                       178.735827
return_std_test                    383.558278
average_reward_test                  0.683903
round_time_test        0 days 00:00:02.814441
round_time_total       0 days 00:10:42.361937
loss_total                         815.522725
loss_critic                       1087.545651
loss_actor                        -272.569059
memory_size                        508887.156 

=== epoch 7/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:48<00:00,  3.08it/s]
episodes                                    4
episode_length                          268.0
returns                           -140.847251
return_std                         195.618339
average_reward                      -0.478443
round_time             0 days 00:10:48.812653
episodes_test                             5.0
episode_length_test                     224.8
returns_test                       158.633449
return_std_test                    316.841439
average_reward_test                  0.717323
round_time_test        0 days 00:00:02.824335
round_time_total       0 days 00:10:48.813762
loss_total                          813.30447
loss_critic                       1084.789424
loss_actor                        -272.635421
memory_size                        510749.754 

=== epoch 7/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [27:43<00:00,  1.20it/s]
episodes                                    8
episode_length                        168.625
returns                            -80.710151
return_std                         116.878515
average_reward                      -0.440637
round_time             0 days 00:27:44.103925
episodes_test                             3.0
episode_length_test                350.666667
returns_test                       237.455553
return_std_test                    370.599554
average_reward_test                  0.748102
round_time_test        0 days 00:00:02.841977
round_time_total       0 days 00:27:44.105068
loss_total                         792.091964
loss_critic                       1058.796331
loss_actor                        -274.725573
memory_size                       512604.9405 

=== epoch 7/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                    3
episode_length                          387.0
returns                           -145.463861
return_std                         188.334661
average_reward                      -0.401821
round_time             0 days 00:11:06.876918
episodes_test                             8.0
episode_length_test                     154.0
returns_test                        94.451423
return_std_test                    256.466597
average_reward_test                  0.670106
round_time_test        0 days 00:00:02.881803
round_time_total       0 days 00:11:06.878342
loss_total                         811.374332
loss_critic                       1082.722327
loss_actor                        -274.017727
memory_size                        514356.855 

=== epoch 7/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:20<00:00,  2.94it/s]
episodes                                    1
episode_length                         1000.0
returns                           -428.532451
return_std                                0.0
average_reward                      -0.437141
round_time             0 days 00:11:21.330922
episodes_test                             8.0
episode_length_test                   180.375
returns_test                        94.963639
return_std_test                    274.652712
average_reward_test                  0.579999
round_time_test        0 days 00:00:03.352903
round_time_total       0 days 00:11:21.332028
loss_total                         823.355089
loss_critic                       1097.851879
loss_actor                        -274.632144
memory_size                        516245.109 

=== epoch 7/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:32<00:00,  2.89it/s]
episodes                                    6
episode_length                          189.5
returns                            -72.404403
return_std                         152.819743
average_reward                      -0.404432
round_time             0 days 00:11:32.515052
episodes_test                             7.0
episode_length_test                175.714286
returns_test                        95.381143
return_std_test                    248.996968
average_reward_test                  0.626893
round_time_test        0 days 00:00:02.843690
round_time_total       0 days 00:11:32.516173
loss_total                         798.454121
loss_critic                       1066.945501
loss_actor                        -275.511476
memory_size                       518042.9445 

=== epoch 7/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:33<00:00,  2.89it/s]
episodes                                   15
episode_length                     111.866667
returns                            -61.148655
return_std                         109.339096
average_reward                      -0.535558
round_time             0 days 00:11:33.641530
episodes_test                             2.0
episode_length_test                     512.0
returns_test                       407.238882
return_std_test                    411.031382
average_reward_test                   0.79689
round_time_test        0 days 00:00:02.850261
round_time_total       0 days 00:11:33.642646
loss_total                         795.378061
loss_critic                       1063.159745
loss_actor                        -275.748747
memory_size                        519853.791 

=== epoch 7/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:20<00:00,  2.94it/s]
episodes                                   13
episode_length                     122.923077
returns                            -76.860931
return_std                         131.718688
average_reward                      -0.589019
round_time             0 days 00:11:21.102528
episodes_test                             2.0
episode_length_test                     540.0
returns_test                       380.401565
return_std_test                    366.002287
average_reward_test                  0.619267
round_time_test        0 days 00:00:02.823853
round_time_total       0 days 00:11:21.103635
loss_total                         824.552321
loss_critic                       1099.607881
loss_actor                        -275.670003
memory_size                        521385.849 

=== epoch 7/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:42<00:00,  3.11it/s]
episodes                                   11
episode_length                     124.181818
returns                             -58.52459
return_std                         119.962251
average_reward                      -0.465535
round_time             0 days 00:10:43.386359
episodes_test                            11.0
episode_length_test                120.272727
returns_test                        69.007948
return_std_test                      225.7397
average_reward_test                  0.658665
round_time_test        0 days 00:00:02.805888
round_time_total       0 days 00:10:43.387460
loss_total                         822.724169
loss_critic                       1097.257989
loss_actor                        -275.411182
memory_size                       523135.5975 

=== epoch 7/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:43<00:00,  3.11it/s]
episodes                                    8
episode_length                         156.25
returns                            -77.119713
return_std                         154.576042
average_reward                      -0.458469
round_time             0 days 00:10:44.448900
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       794.429919
return_std_test                     12.341186
average_reward_test                   0.79443
round_time_test        0 days 00:00:02.861665
round_time_total       0 days 00:10:44.450001
loss_total                         827.935631
loss_critic                       1103.652423
loss_actor                        -274.931612
memory_size                       524712.7905 

=== epoch 7/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:48<00:00,  3.09it/s]
episodes                                    5
episode_length                          221.6
returns                           -106.049658
return_std                         157.415894
average_reward                        -0.4497
round_time             0 days 00:10:48.741377
episodes_test                             4.0
episode_length_test                     278.5
returns_test                       195.379066
return_std_test                    339.844443
average_reward_test                  0.716738
round_time_test        0 days 00:00:02.820302
round_time_total       0 days 00:10:48.742481
loss_total                          843.58204
loss_critic                       1123.220389
loss_actor                        -274.971432
memory_size                        526508.995 

=== epoch 7/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:46<00:00,  2.83it/s]
episodes                                    5
episode_length                          257.2
returns                           -111.145881
return_std                         145.535553
average_reward                       -0.43403
round_time             0 days 00:11:46.531295
episodes_test                             7.0
episode_length_test                170.571429
returns_test                       105.138125
return_std_test                    273.136052
average_reward_test                  0.663143
round_time_test        0 days 00:00:02.812497
round_time_total       0 days 00:11:46.532401
loss_total                         831.022304
loss_critic                        1107.64369
loss_actor                         -275.46331
memory_size                       528332.1705 

=== epoch 7/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                    6
episode_length                     197.666667
returns                             -77.73436
return_std                         143.859911
average_reward                      -0.419205
round_time             0 days 00:11:07.195948
episodes_test                             4.0
episode_length_test                     275.0
returns_test                       194.895087
return_std_test                    344.701248
average_reward_test                  0.745111
round_time_test        0 days 00:00:02.812206
round_time_total       0 days 00:11:07.197042
loss_total                         836.609547
loss_critic                       1114.571503
loss_actor                        -275.238357
memory_size                       530221.3885 

=== epoch 7/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                    4
episode_length                         275.75
returns                           -140.061535
return_std                         204.621397
average_reward                      -0.479822
round_time             0 days 00:11:12.417268
episodes_test                             5.0
episode_length_test                     230.6
returns_test                       168.210068
return_std_test                    333.205385
average_reward_test                  0.670456
round_time_test        0 days 00:00:02.833100
round_time_total       0 days 00:11:12.418375
loss_total                         844.889366
loss_critic                        1124.85023
loss_actor                        -274.954157
memory_size                       532027.7835 

=== epoch 7/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                    5
episode_length                          237.0
returns                           -130.218382
return_std                         169.049048
average_reward                      -0.510914
round_time             0 days 00:11:01.492167
episodes_test                             4.0
episode_length_test                     294.0
returns_test                       171.295422
return_std_test                    375.974761
average_reward_test                  0.656922
round_time_test        0 days 00:00:02.847103
round_time_total       0 days 00:11:01.493340
loss_total                         843.573004
loss_critic                       1123.699705
loss_actor                        -276.933886
memory_size                        533798.413 

=== epoch 7/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:30<00:00,  2.15it/s]
episodes                                    7
episode_length                     169.285714
returns                            -79.895834
return_std                         140.255128
average_reward                       -0.50122
round_time             0 days 00:15:31.095512
episodes_test                            14.0
episode_length_test                104.142857
returns_test                        49.911398
return_std_test                    212.343318
average_reward_test                  0.565125
round_time_test        0 days 00:00:02.804265
round_time_total       0 days 00:15:31.097565
loss_total                         842.133397
loss_critic                       1121.681823
loss_actor                        -276.060386
memory_size                        535638.509 

=== epoch 7/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [22:03<00:00,  1.51it/s]
episodes                                   10
episode_length                          152.4
returns                            -93.937536
return_std                         120.669465
average_reward                      -0.571535
round_time             0 days 00:22:04.943139
episodes_test                             6.0
episode_length_test                218.666667
returns_test                        84.523681
return_std_test                    232.490154
average_reward_test                  0.521536
round_time_test        0 days 00:00:03.830492
round_time_total       0 days 00:22:04.945083
loss_total                         842.231019
loss_critic                       1121.808927
loss_actor                        -276.080687
memory_size                       537390.1065 

=== epoch 7/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [23:05<00:00,  1.44it/s]
episodes                                    4
episode_length                         301.75
returns                           -121.166567
return_std                         178.698649
average_reward                      -0.424136
round_time             0 days 00:23:06.225581
episodes_test                             4.0
episode_length_test                     295.0
returns_test                       181.910295
return_std_test                    320.533388
average_reward_test                  0.702859
round_time_test        0 days 00:00:03.472216
round_time_total       0 days 00:23:06.227299
loss_total                         854.213077
loss_critic                       1136.860594
loss_actor                        -276.377064
memory_size                        539057.414 

=== epoch 7/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [24:32<00:00,  1.36it/s]
episodes                                   12
episode_length                     144.416667
returns                            -56.091787
return_std                         100.718719
average_reward                      -0.401151
round_time             0 days 00:24:34.173719
episodes_test                            12.0
episode_length_test                137.333333
returns_test                        60.982338
return_std_test                    220.665427
average_reward_test                  0.507715
round_time_test        0 days 00:00:04.250652
round_time_total       0 days 00:24:34.175719
loss_total                         851.597325
loss_critic                       1133.595755
loss_actor                        -276.396464
memory_size                        540874.562 

=== epoch 7/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [23:57<00:00,  1.39it/s]
episodes                                    7
episode_length                     167.571429
returns                            -76.344499
return_std                         136.413538
average_reward                      -0.434043
round_time             0 days 00:23:59.276578
episodes_test                             2.0
episode_length_test                     508.0
returns_test                       385.395708
return_std_test                    399.712955
average_reward_test                  0.738505
round_time_test        0 days 00:00:04.537867
round_time_total       0 days 00:23:59.278249
loss_total                         871.732255
loss_critic                       1158.793151
loss_actor                        -276.511404
memory_size                       542524.2185 

=== epoch 7/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [24:15<00:00,  1.37it/s]
episodes                                   15
episode_length                     100.066667
returns                            -55.812496
return_std                         106.062565
average_reward                      -0.519132
round_time             0 days 00:24:16.240015
episodes_test                             3.0
episode_length_test                351.333333
returns_test                       277.281822
return_std_test                    384.422916
average_reward_test                  0.764127
round_time_test        0 days 00:00:04.890992
round_time_total       0 days 00:24:16.241541
loss_total                         869.551734
loss_critic                       1156.032787
loss_actor                        -276.372557
memory_size                        544158.937 

=== epoch 7/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [23:47<00:00,  1.40it/s]
episodes                                   11
episode_length                      36.363636
returns                            -20.232078
return_std                          26.258019
average_reward                      -0.466743
round_time             0 days 00:23:48.492486
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       760.931358
return_std_test                     14.928671
average_reward_test                  0.760931
round_time_test        0 days 00:00:03.956186
round_time_total       0 days 00:23:48.493980
loss_total                         870.620249
loss_critic                       1157.446325
loss_actor                         -276.68414
memory_size                        545818.398 

=== epoch 7/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [24:51<00:00,  1.34it/s]
episodes                                   14
episode_length                      92.642857
returns                            -40.438054
return_std                          99.242475
average_reward                      -0.453501
round_time             0 days 00:24:52.046412
episodes_test                             2.0
episode_length_test                     518.5
returns_test                        399.59502
return_std_test                    394.814288
average_reward_test                  0.765802
round_time_test        0 days 00:00:04.928694
round_time_total       0 days 00:24:52.047922
loss_total                         859.519108
loss_critic                       1143.625377
loss_actor                        -276.906054
memory_size                       547525.7395 

=== epoch 7/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [26:20<00:00,  1.27it/s]
episodes                                    5
episode_length                          230.4
returns                           -119.956444
return_std                         145.661094
average_reward                      -0.478575
round_time             0 days 00:26:21.296854
episodes_test                             5.0
episode_length_test                     237.0
returns_test                       152.498621
return_std_test                    317.471665
average_reward_test                  0.701462
round_time_test        0 days 00:00:04.174998
round_time_total       0 days 00:26:21.298845
loss_total                         857.330583
loss_critic                       1140.896999
loss_actor                        -276.935157
memory_size                       549250.7495 

=== epoch 7/10 ===== round 26/50 =====================================
100%|██████████| 2000/2000 [26:46<00:00,  1.25it/s]
episodes                                    5
episode_length                          231.6
returns                             -94.00223
return_std                         150.047762
average_reward                      -0.421721
round_time             0 days 00:26:47.120411
episodes_test                             7.0
episode_length_test                177.142857
returns_test                       106.477019
return_std_test                    280.012272
average_reward_test                   0.63282
round_time_test        0 days 00:00:03.761443
round_time_total       0 days 00:26:47.123032
loss_total                         870.650769
loss_critic                       1157.734457
loss_actor                        -277.684064
memory_size                        551067.152 

=== epoch 7/10 ===== round 27/50 =====================================
100%|██████████| 2000/2000 [26:46<00:00,  1.24it/s]
episodes                                    7
episode_length                     173.285714
returns                            -77.098822
return_std                         148.624883
average_reward                      -0.427944
round_time             0 days 00:26:49.146570
episodes_test                             5.0
episode_length_test                     239.6
returns_test                       160.958768
return_std_test                    346.043489
average_reward_test                  0.702026
round_time_test        0 days 00:00:04.841858
round_time_total       0 days 00:26:49.150044
loss_total                         861.245499
loss_critic                       1146.188401
loss_actor                         -278.52619
memory_size                        552923.595 

=== epoch 7/10 ===== round 28/50 =====================================
100%|██████████| 2000/2000 [25:57<00:00,  1.28it/s]
episodes                                    4
episode_length                          272.0
returns                           -119.218394
return_std                         193.976296
average_reward                      -0.430345
round_time             0 days 00:25:59.745405
episodes_test                             5.0
episode_length_test                     233.6
returns_test                       140.078769
return_std_test                    326.264027
average_reward_test                  0.697534
round_time_test        0 days 00:00:04.152415
round_time_total       0 days 00:25:59.748040
loss_total                         854.896364
loss_critic                       1138.566345
loss_actor                        -279.783641
memory_size                         554738.09 

=== epoch 7/10 ===== round 29/50 =====================================
100%|██████████| 2000/2000 [26:06<00:00,  1.28it/s]
episodes                                    5
episode_length                          224.0
returns                            -84.655125
return_std                         166.493928
average_reward                      -0.399831
round_time             0 days 00:26:08.989594
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                        824.80074
return_std_test                      4.840585
average_reward_test                  0.824801
round_time_test        0 days 00:00:05.214068
round_time_total       0 days 00:26:08.992510
loss_total                         840.661652
loss_critic                       1120.971441
loss_actor                        -280.577581
memory_size                       556539.2595 

=== epoch 7/10 ===== round 30/50 =====================================
100%|██████████| 2000/2000 [24:22<00:00,  1.37it/s]
episodes                                    4
episode_length                         276.75
returns                           -118.647055
return_std                         159.658065
average_reward                      -0.430849
round_time             0 days 00:24:24.857179
episodes_test                             8.0
episode_length_test                   158.375
returns_test                        89.231619
return_std_test                    261.244782
average_reward_test                  0.612113
round_time_test        0 days 00:00:04.595068
round_time_total       0 days 00:24:24.859247
loss_total                         851.644861
loss_critic                       1135.044033
loss_actor                        -281.951909
memory_size                       558372.3805 

=== epoch 7/10 ===== round 31/50 =====================================
100%|██████████| 2000/2000 [22:42<00:00,  1.47it/s]
episodes                                   18
episode_length                      91.388889
returns                            -52.949666
return_std                         106.319323
average_reward                      -0.538248
round_time             0 days 00:22:43.109292
episodes_test                             8.0
episode_length_test                    160.75
returns_test                         92.94538
return_std_test                    271.613209
average_reward_test                  0.657335
round_time_test        0 days 00:00:03.353215
round_time_total       0 days 00:22:43.110789
loss_total                         855.854208
loss_critic                       1140.204038
loss_actor                        -281.545186
memory_size                        560126.106 

=== epoch 7/10 ===== round 32/50 =====================================
100%|██████████| 2000/2000 [23:32<00:00,  1.42it/s]
episodes                                    2
episode_length                          524.0
returns                           -238.629124
return_std                         214.408836
average_reward                      -0.431976
round_time             0 days 00:23:32.913642
episodes_test                             5.0
episode_length_test                     229.0
returns_test                       156.987666
return_std_test                    325.828173
average_reward_test                  0.717302
round_time_test        0 days 00:00:04.832946
round_time_total       0 days 00:23:32.915600
loss_total                         865.950888
loss_critic                       1152.705743
loss_actor                        -281.068616
memory_size                        561787.743 

=== epoch 7/10 ===== round 33/50 =====================================
100%|██████████| 2000/2000 [22:36<00:00,  1.47it/s]
episodes                                    6
episode_length                      60.666667
returns                            -55.660269
return_std                          49.308735
average_reward                      -0.512503
round_time             0 days 00:22:37.883468
episodes_test                             6.0
episode_length_test                206.166667
returns_test                       118.885036
return_std_test                    299.521055
average_reward_test                  0.662326
round_time_test        0 days 00:00:04.297198
round_time_total       0 days 00:22:37.885363
loss_total                         848.135644
loss_critic                       1130.896851
loss_actor                        -282.909265
memory_size                       563631.2485 

=== epoch 7/10 ===== round 34/50 =====================================
100%|██████████| 2000/2000 [22:42<00:00,  1.47it/s]
episodes                                   20
episode_length                          84.55
returns                            -43.855682
return_std                          83.889424
average_reward                       -0.50957
round_time             0 days 00:22:43.604970
episodes_test                             2.0
episode_length_test                     514.0
returns_test                       383.038646
return_std_test                    395.550899
average_reward_test                  0.763477
round_time_test        0 days 00:00:03.933101
round_time_total       0 days 00:22:43.606666
loss_total                         857.016015
loss_critic                       1142.013394
loss_actor                        -282.973573
memory_size                       565169.4065 

=== epoch 7/10 ===== round 35/50 =====================================
100%|██████████| 2000/2000 [21:53<00:00,  1.52it/s]
episodes                                    4
episode_length                         275.75
returns                           -109.944952
return_std                         164.679653
average_reward                      -0.368545
round_time             0 days 00:21:54.782401
episodes_test                             8.0
episode_length_test                     154.5
returns_test                        86.620694
return_std_test                    243.274671
average_reward_test                  0.617321
round_time_test        0 days 00:00:03.586744
round_time_total       0 days 00:21:54.784249
loss_total                         852.502682
loss_critic                       1136.210348
loss_actor                        -282.328063
memory_size                        566828.828 

=== epoch 7/10 ===== round 36/50 =====================================
100%|██████████| 2000/2000 [22:54<00:00,  1.46it/s]
episodes                                    4
episode_length                          286.0
returns                           -132.114235
return_std                         173.194786
average_reward                      -0.443623
round_time             0 days 00:22:55.236616
episodes_test                             4.0
episode_length_test                    298.75
returns_test                       164.073757
return_std_test                    344.859393
average_reward_test                  0.635037
round_time_test        0 days 00:00:03.703198
round_time_total       0 days 00:22:55.238621
loss_total                         833.703288
loss_critic                       1112.934327
loss_actor                        -283.220942
memory_size                         568663.28 

=== epoch 7/10 ===== round 37/50 =====================================
100%|██████████| 2000/2000 [22:38<00:00,  1.47it/s]
episodes                                   18
episode_length                      85.555556
returns                             -42.64696
return_std                          96.982269
average_reward                      -0.490745
round_time             0 days 00:22:39.789729
episodes_test                             2.0
episode_length_test                     528.0
returns_test                         356.6988
return_std_test                    400.062765
average_reward_test                  0.727865
round_time_test        0 days 00:00:03.673558
round_time_total       0 days 00:22:39.793052
loss_total                         845.928484
loss_critic                       1128.157624
loss_actor                        -282.988153
memory_size                       570354.1235 

=== epoch 7/10 ===== round 38/50 =====================================
100%|██████████| 2000/2000 [23:47<00:00,  1.40it/s]
episodes                                    6
episode_length                     191.333333
returns                            -79.759622
return_std                         158.125333
average_reward                      -0.436176
round_time             0 days 00:23:49.696248
episodes_test                             2.0
episode_length_test                     542.0
returns_test                       347.623813
return_std_test                    439.884864
average_reward_test                  0.690953
round_time_test        0 days 00:00:03.659678
round_time_total       0 days 00:23:49.698153
loss_total                         838.191191
loss_critic                       1118.608867
loss_actor                        -283.479588
memory_size                       572089.7605 

=== epoch 7/10 ===== round 39/50 =====================================
100%|██████████| 2000/2000 [23:13<00:00,  1.44it/s]
episodes                                   16
episode_length                       114.0625
returns                            -63.402395
return_std                         103.499447
average_reward                       -0.53081
round_time             0 days 00:23:14.171002
episodes_test                            13.0
episode_length_test                110.076923
returns_test                        53.056466
return_std_test                    201.642426
average_reward_test                  0.533029
round_time_test        0 days 00:00:03.330336
round_time_total       0 days 00:23:14.172464
loss_total                         856.661143
loss_critic                       1141.796881
loss_actor                        -283.881892
memory_size                       573710.0685 

=== epoch 7/10 ===== round 40/50 =====================================
100%|██████████| 2000/2000 [22:52<00:00,  1.46it/s]
episodes                                    4
episode_length                          263.5
returns                           -105.939044
return_std                         160.494549
average_reward                      -0.413083
round_time             0 days 00:22:53.974341
episodes_test                             3.0
episode_length_test                342.666667
returns_test                       271.428481
return_std_test                    398.076179
average_reward_test                  0.779815
round_time_test        0 days 00:00:04.467279
round_time_total       0 days 00:22:53.975838
loss_total                         876.110532
loss_critic                       1166.279376
loss_actor                        -284.564924
memory_size                        575468.317 

=== epoch 7/10 ===== round 41/50 =====================================
100%|██████████| 2000/2000 [24:08<00:00,  1.38it/s]
episodes                                    2
episode_length                          529.0
returns                           -214.092833
return_std                         179.901672
average_reward                      -0.421448
round_time             0 days 00:24:09.513478
episodes_test                            12.0
episode_length_test                    120.25
returns_test                        61.083686
return_std_test                     195.79368
average_reward_test                  0.583426
round_time_test        0 days 00:00:04.333735
round_time_total       0 days 00:24:09.515294
loss_total                         878.636476
loss_critic                       1169.237435
loss_actor                        -283.767442
memory_size                       577337.2845 

=== epoch 7/10 ===== round 42/50 =====================================
100%|██████████| 2000/2000 [24:58<00:00,  1.33it/s]
episodes                                    5
episode_length                          246.0
returns                           -104.603297
return_std                         160.626318
average_reward                      -0.430083
round_time             0 days 00:24:59.765433
episodes_test                             2.0
episode_length_test                     528.5
returns_test                       355.711445
return_std_test                    368.925178
average_reward_test                  0.753988
round_time_test        0 days 00:00:03.743930
round_time_total       0 days 00:24:59.767307
loss_total                         884.787129
loss_critic                       1176.973947
loss_actor                        -283.960215
memory_size                       579225.2435 

=== epoch 7/10 ===== round 43/50 =====================================
100%|██████████| 2000/2000 [25:45<00:00,  1.29it/s]
episodes                                    9
episode_length                     146.444444
returns                             -63.79322
return_std                         117.944437
average_reward                      -0.417175
round_time             0 days 00:25:46.054457
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       746.793638
return_std_test                     20.464993
average_reward_test                  0.746794
round_time_test        0 days 00:00:04.623935
round_time_total       0 days 00:25:46.057783
loss_total                         877.804017
loss_critic                       1168.262038
loss_actor                        -284.028143
memory_size                        580961.969 

=== epoch 7/10 ===== round 44/50 =====================================
100%|██████████| 2000/2000 [26:13<00:00,  1.27it/s]
episodes                                    5
episode_length                          255.8
returns                           -123.785507
return_std                         174.712362
average_reward                      -0.468166
round_time             0 days 00:26:16.280601
episodes_test                             3.0
episode_length_test                354.666667
returns_test                       227.452001
return_std_test                    336.197757
average_reward_test                  0.698221
round_time_test        0 days 00:00:04.565940
round_time_total       0 days 00:26:16.282420
loss_total                         857.054037
loss_critic                       1142.522748
loss_actor                        -284.820888
memory_size                       582796.7565 

=== epoch 7/10 ===== round 45/50 =====================================
100%|██████████| 2000/2000 [26:30<00:00,  1.26it/s]
episodes                                    7
episode_length                      34.857143
returns                            -28.943887
return_std                          18.425369
average_reward                      -0.449648
round_time             0 days 00:26:31.937434
episodes_test                             3.0
episode_length_test                369.666667
returns_test                       250.630541
return_std_test                    356.149362
average_reward_test                  0.747198
round_time_test        0 days 00:00:04.690807
round_time_total       0 days 00:26:31.939219
loss_total                          868.74018
loss_critic                       1157.070958
loss_actor                         -284.58301
memory_size                       584561.8355 

=== epoch 7/10 ===== round 46/50 =====================================
100%|██████████| 2000/2000 [24:33<00:00,  1.36it/s]
episodes                                   13
episode_length                     113.153846
returns                            -70.492062
return_std                         144.607224
average_reward                      -0.561012
round_time             0 days 00:24:34.595168
episodes_test                             8.0
episode_length_test                     181.0
returns_test                        93.152877
return_std_test                    265.415113
average_reward_test                   0.58098
round_time_test        0 days 00:00:04.067453
round_time_total       0 days 00:24:34.596858
loss_total                         848.742799
loss_critic                       1132.225974
loss_actor                        -285.189972
memory_size                       586179.7865 

=== epoch 7/10 ===== round 47/50 =====================================
100%|██████████| 2000/2000 [23:13<00:00,  1.44it/s]
episodes                                    9
episode_length                     147.444444
returns                             -73.25271
return_std                         130.681247
average_reward                      -0.460728
round_time             0 days 00:23:14.180337
episodes_test                             5.0
episode_length_test                     216.6
returns_test                       155.133853
return_std_test                      312.4173
average_reward_test                  0.762243
round_time_test        0 days 00:00:04.015520
round_time_total       0 days 00:23:14.182141
loss_total                         873.505229
loss_critic                       1163.019522
loss_actor                        -284.552022
memory_size                        587950.369 

=== epoch 7/10 ===== round 48/50 =====================================
100%|██████████| 2000/2000 [21:37<00:00,  1.54it/s]
episodes                                    8
episode_length                           28.0
returns                            -16.058459
return_std                          18.890127
average_reward                      -0.471492
round_time             0 days 00:21:37.969165
episodes_test                             2.0
episode_length_test                     543.5
returns_test                       368.853548
return_std_test                    365.254534
average_reward_test                  0.732486
round_time_test        0 days 00:00:03.918163
round_time_total       0 days 00:21:37.971051
loss_total                         874.780856
loss_critic                       1164.607607
loss_actor                        -284.526219
memory_size                        589687.974 

=== epoch 7/10 ===== round 49/50 =====================================
100%|██████████| 2000/2000 [22:59<00:00,  1.45it/s]
episodes                                   11
episode_length                     128.545455
returns                            -68.429226
return_std                         140.829395
average_reward                      -0.489267
round_time             0 days 00:23:00.086899
episodes_test                             5.0
episode_length_test                     260.4
returns_test                       131.242975
return_std_test                    295.979604
average_reward_test                  0.601969
round_time_test        0 days 00:00:03.446387
round_time_total       0 days 00:23:00.088664
loss_total                         858.394439
loss_critic                       1144.423428
loss_actor                        -285.721595
memory_size                        591333.678 

=== epoch 7/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:04<00:00,  1.95it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
episodes                                    6
episode_length                     199.666667
returns                            -95.678906
return_std                          147.33085
average_reward                      -0.480508
round_time             0 days 00:17:05.614519
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       784.654681
return_std_test                     31.354302
average_reward_test                  0.784655
round_time_test        0 days 00:00:03.407761
round_time_total       0 days 00:17:05.615840
loss_total                         887.017854
loss_critic                       1180.010296
loss_actor                        -284.951999
memory_size                        593161.682 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 8/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [13:18<00:00,  2.51it/s]
episodes                                    4
episode_length                         278.25
returns                            -122.38749
return_std                         176.769582
average_reward                      -0.426019
round_time             0 days 00:13:18.059763
episodes_test                             7.0
episode_length_test                177.428571
returns_test                       110.052396
return_std_test                     292.51926
average_reward_test                  0.668564
round_time_test        0 days 00:00:03.181842
round_time_total       0 days 00:13:18.061038
loss_total                          881.20833
loss_critic                       1172.901185
loss_actor                        -285.563175
memory_size                        594921.891 

=== epoch 8/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:27<00:00,  2.68it/s]
episodes                                    5
episode_length                          216.4
returns                           -110.149894
return_std                         175.066979
average_reward                      -0.481703
round_time             0 days 00:12:28.155447
episodes_test                             9.0
episode_length_test                139.555556
returns_test                        93.988628
return_std_test                    259.663431
average_reward_test                  0.705108
round_time_test        0 days 00:00:03.096202
round_time_total       0 days 00:12:28.156728
loss_total                         896.322826
loss_critic                       1191.617259
loss_actor                        -284.854986
memory_size                       596811.3455 

=== epoch 8/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:40<00:00,  2.85it/s]
episodes                                    7
episode_length                     166.714286
returns                            -83.249067
return_std                         156.322825
average_reward                      -0.478619
round_time             0 days 00:11:41.513231
episodes_test                             3.0
episode_length_test                575.666667
returns_test                       209.358166
return_std_test                    450.291216
average_reward_test                  0.425717
round_time_test        0 days 00:00:02.814408
round_time_total       0 days 00:11:41.514424
loss_total                         895.501218
loss_critic                       1190.662784
loss_actor                        -285.145127
memory_size                       598625.1565 

=== epoch 8/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:17<00:00,  2.95it/s]
episodes                                   11
episode_length                     128.454545
returns                            -64.032757
return_std                         122.069816
average_reward                      -0.473016
round_time             0 days 00:11:18.381463
episodes_test                             4.0
episode_length_test                     272.5
returns_test                       209.707205
return_std_test                    343.693744
average_reward_test                  0.793918
round_time_test        0 days 00:00:02.818139
round_time_total       0 days 00:11:18.382575
loss_total                         884.189634
loss_critic                       1176.774676
loss_actor                        -286.150622
memory_size                       600364.6315 

=== epoch 8/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:47<00:00,  3.09it/s]
episodes                                    3
episode_length                      85.333333
returns                            -42.158237
return_std                            38.9048
average_reward                      -0.434796
round_time             0 days 00:10:47.605700
episodes_test                             4.0
episode_length_test                    268.75
returns_test                       204.918569
return_std_test                    346.872916
average_reward_test                  0.744927
round_time_test        0 days 00:00:02.827428
round_time_total       0 days 00:10:47.606805
loss_total                         891.844009
loss_critic                       1186.321271
loss_actor                        -286.065116
memory_size                        602145.546 

=== epoch 8/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:41<00:00,  3.12it/s]
episodes                                    4
episode_length                         278.25
returns                           -141.928919
return_std                         155.369703
average_reward                       -0.47496
round_time             0 days 00:10:41.736204
episodes_test                             7.0
episode_length_test                178.571429
returns_test                        98.871191
return_std_test                    276.056184
average_reward_test                  0.597065
round_time_test        0 days 00:00:02.831905
round_time_total       0 days 00:10:41.737309
loss_total                         889.567492
loss_critic                       1183.847116
loss_actor                        -287.551081
memory_size                       603978.8165 

=== epoch 8/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:43<00:00,  3.11it/s]
episodes                                    5
episode_length                          237.0
returns                            -82.794853
return_std                         147.633418
average_reward                      -0.343537
round_time             0 days 00:10:43.702613
episodes_test                            15.0
episode_length_test                114.533333
returns_test                        41.611302
return_std_test                    208.160609
average_reward_test                  0.421796
round_time_test        0 days 00:00:02.786445
round_time_total       0 days 00:10:43.703720
loss_total                         865.938426
loss_critic                       1154.273111
loss_actor                        -287.400387
memory_size                       605838.6075 

=== epoch 8/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:49<00:00,  3.08it/s]
episodes                                    3
episode_length                          385.0
returns                           -149.282611
return_std                         190.261024
average_reward                      -0.399807
round_time             0 days 00:10:49.977574
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       773.766518
return_std_test                     36.224989
average_reward_test                  0.773767
round_time_test        0 days 00:00:02.831960
round_time_total       0 days 00:10:49.978675
loss_total                         886.228327
loss_critic                       1179.339914
loss_actor                        -286.218092
memory_size                       607666.2045 

=== epoch 8/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:50<00:00,  3.07it/s]
episodes                                    1
episode_length                         1000.0
returns                           -404.058637
return_std                                0.0
average_reward                      -0.438809
round_time             0 days 00:10:51.377388
episodes_test                             3.0
episode_length_test                353.666667
returns_test                        284.78364
return_std_test                    388.056275
average_reward_test                  0.793106
round_time_test        0 days 00:00:02.761443
round_time_total       0 days 00:10:51.378505
loss_total                         890.577239
loss_critic                       1185.152331
loss_actor                        -287.723214
memory_size                        609586.517 

=== epoch 8/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:45<00:00,  3.10it/s]
episodes                                    9
episode_length                     137.333333
returns                            -65.581731
return_std                         130.626769
average_reward                      -0.466748
round_time             0 days 00:10:46.338464
episodes_test                             3.0
episode_length_test                365.666667
returns_test                       273.538908
return_std_test                    386.260444
average_reward_test                  0.778294
round_time_test        0 days 00:00:02.829713
round_time_total       0 days 00:10:46.339565
loss_total                         871.507557
loss_critic                        1161.40273
loss_actor                        -288.073214
memory_size                        611432.584 

=== epoch 8/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:46<00:00,  3.09it/s]
episodes                                    2
episode_length                          523.0
returns                             -206.6811
return_std                         176.169521
average_reward                      -0.419255
round_time             0 days 00:10:46.778811
episodes_test                            10.0
episode_length_test                     147.2
returns_test                        72.859112
return_std_test                    259.332889
average_reward_test                  0.559973
round_time_test        0 days 00:00:02.772595
round_time_total       0 days 00:10:46.779920
loss_total                          873.68671
loss_critic                       1164.139002
loss_actor                        -288.122532
memory_size                       613249.2425 

=== epoch 8/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:52<00:00,  3.07it/s]
episodes                                    9
episode_length                          135.0
returns                             -59.72844
return_std                         123.242872
average_reward                      -0.436086
round_time             0 days 00:10:52.755852
episodes_test                             2.0
episode_length_test                     506.5
returns_test                       427.032037
return_std_test                    435.892848
average_reward_test                  0.844684
round_time_test        0 days 00:00:02.808817
round_time_total       0 days 00:10:52.756977
loss_total                         863.576507
loss_critic                       1151.812164
loss_actor                         -289.36619
memory_size                        615149.715 

=== epoch 8/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:54<00:00,  3.06it/s]
episodes                                    2
episode_length                           57.5
returns                            -46.543059
return_std                          28.687273
average_reward                      -0.435036
round_time             0 days 00:10:54.991755
episodes_test                            12.0
episode_length_test                122.166667
returns_test                        64.853405
return_std_test                    235.747683
average_reward_test                  0.607246
round_time_test        0 days 00:00:02.793579
round_time_total       0 days 00:10:54.992852
loss_total                         873.215868
loss_critic                       1163.703986
loss_actor                        -288.736684
memory_size                        616900.117 

=== epoch 8/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:56<00:00,  3.04it/s]
episodes                                    7
episode_length                     171.571429
returns                            -85.975489
return_std                          167.93017
average_reward                       -0.44002
round_time             0 days 00:10:57.331129
episodes_test                             3.0
episode_length_test                348.333333
returns_test                       270.082083
return_std_test                    384.657503
average_reward_test                  0.797285
round_time_test        0 days 00:00:02.827192
round_time_total       0 days 00:10:57.332229
loss_total                         876.919362
loss_critic                         1168.4948
loss_actor                        -289.382472
memory_size                        618750.096 

=== epoch 8/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:54<00:00,  3.06it/s]
episodes                                    8
episode_length                         158.25
returns                            -81.623539
return_std                         125.371877
average_reward                      -0.503491
round_time             0 days 00:10:54.594981
episodes_test                             8.0
episode_length_test                   164.375
returns_test                        84.958657
return_std_test                    270.592995
average_reward_test                  0.561371
round_time_test        0 days 00:00:02.775592
round_time_total       0 days 00:10:54.596079
loss_total                         876.580979
loss_critic                       1168.479669
loss_actor                        -291.013861
memory_size                       620478.3345 

=== epoch 8/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:57<00:00,  3.04it/s]
episodes                                    2
episode_length                          505.5
returns                           -213.992627
return_std                         208.546247
average_reward                      -0.419594
round_time             0 days 00:10:58.104389
episodes_test                             3.0
episode_length_test                     357.0
returns_test                       271.097226
return_std_test                    379.524595
average_reward_test                    0.7502
round_time_test        0 days 00:00:02.824911
round_time_total       0 days 00:10:58.105486
loss_total                         880.003424
loss_critic                       1173.000523
loss_actor                        -291.985045
memory_size                        622364.014 

=== epoch 8/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:58<00:00,  3.04it/s]
episodes                                    4
episode_length                          270.0
returns                           -126.048685
return_std                         174.989307
average_reward                      -0.433677
round_time             0 days 00:10:58.807799
episodes_test                             5.0
episode_length_test                     239.6
returns_test                         144.8978
return_std_test                    296.258794
average_reward_test                  0.697626
round_time_test        0 days 00:00:02.799232
round_time_total       0 days 00:10:58.808891
loss_total                         871.792819
loss_critic                        1163.12095
loss_actor                        -293.519783
memory_size                       624255.3785 

=== epoch 8/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                    8
episode_length                          147.5
returns                            -71.419269
return_std                         121.696447
average_reward                      -0.452046
round_time             0 days 00:11:11.233477
episodes_test                             6.0
episode_length_test                211.833333
returns_test                       107.494059
return_std_test                    294.219411
average_reward_test                  0.621235
round_time_test        0 days 00:00:02.790486
round_time_total       0 days 00:11:11.234580
loss_total                         857.530216
loss_critic                       1145.088357
loss_actor                        -292.702432
memory_size                        626131.081 

=== epoch 8/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:13<00:00,  2.97it/s]
episodes                                    7
episode_length                          164.0
returns                            -83.085239
return_std                          157.55273
average_reward                      -0.484985
round_time             0 days 00:11:13.690209
episodes_test                             5.0
episode_length_test                     245.6
returns_test                       150.938795
return_std_test                     301.36261
average_reward_test                   0.68507
round_time_test        0 days 00:00:02.824746
round_time_total       0 days 00:11:13.691298
loss_total                         861.791413
loss_critic                       1150.414045
loss_actor                        -292.699192
memory_size                         627876.57 

=== epoch 8/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                    6
episode_length                          190.0
returns                            -86.447711
return_std                         164.734975
average_reward                      -0.457317
round_time             0 days 00:11:11.986225
episodes_test                             9.0
episode_length_test                148.222222
returns_test                        70.075034
return_std_test                    243.384288
average_reward_test                  0.604236
round_time_test        0 days 00:00:02.795954
round_time_total       0 days 00:11:11.987338
loss_total                         850.625502
loss_critic                       1136.287623
loss_actor                        -292.023062
memory_size                       629608.3295 

=== epoch 8/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:16<00:00,  2.95it/s]
episodes                                    6
episode_length                     211.666667
returns                            -97.890293
return_std                         153.792384
average_reward                      -0.449787
round_time             0 days 00:11:17.400331
episodes_test                             6.0
episode_length_test                195.333333
returns_test                       128.329707
return_std_test                     322.97222
average_reward_test                  0.737991
round_time_test        0 days 00:00:02.771840
round_time_total       0 days 00:11:17.401434
loss_total                         865.311687
loss_critic                       1154.690581
loss_actor                        -292.203967
memory_size                       631414.5045 

=== epoch 8/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                    3
episode_length                          360.0
returns                           -160.203375
return_std                         198.418153
average_reward                      -0.439198
round_time             0 days 00:11:11.577069
episodes_test                             5.0
episode_length_test                     213.8
returns_test                         155.5138
return_std_test                    311.655946
average_reward_test                  0.762511
round_time_test        0 days 00:00:02.817269
round_time_total       0 days 00:11:11.578180
loss_total                         861.649452
loss_critic                       1150.660847
loss_actor                        -294.396207
memory_size                        633315.169 

=== epoch 8/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:46,  2.82it/s]
100%|██████████| 2000/2000 [11:15<00:00,  2.96it/s]
episodes                                    2
episode_length                          534.0
returns                           -189.822419
return_std                         172.050345
average_reward                      -0.369995
round_time             0 days 00:11:16.440606
episodes_test                             3.0
episode_length_test                345.333333
returns_test                       266.258284
return_std_test                    380.762102
average_reward_test                  0.806784
round_time_test        0 days 00:00:02.833466
round_time_total       0 days 00:11:16.441719
loss_total                         861.387591
loss_critic                       1150.595873
loss_actor                        -295.445614
memory_size                        635193.718 

=== epoch 8/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:31,  2.66it/s]
100%|██████████| 2000/2000 [11:27<00:00,  2.91it/s]
episodes                                   13
episode_length                      41.461538
returns                            -28.347139
return_std                          24.716237
average_reward                      -0.514667
round_time             0 days 00:11:28.014362
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       859.636539
return_std_test                     19.694159
average_reward_test                  0.859637
round_time_test        0 days 00:00:02.829416
round_time_total       0 days 00:11:28.015467
loss_total                         853.054518
loss_critic                       1140.352554
loss_actor                        -296.137698
memory_size                        636924.248 

=== epoch 8/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:27,  2.67it/s]
100%|██████████| 2000/2000 [11:16<00:00,  2.95it/s]
episodes                                    2
episode_length                          534.5
returns                           -212.513927
return_std                         175.727927
average_reward                      -0.395404
round_time             0 days 00:11:17.492435
episodes_test                             3.0
episode_length_test                342.333333
returns_test                       290.062091
return_std_test                    417.819197
average_reward_test                   0.83874
round_time_test        0 days 00:00:02.758260
round_time_total       0 days 00:11:17.493539
loss_total                         850.305635
loss_critic                       1136.964943
loss_actor                        -296.331668
memory_size                        638683.905 

=== epoch 8/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:04,  2.54it/s]
100%|██████████| 2000/2000 [11:27<00:00,  2.91it/s]
episodes                                    3
episode_length                     365.666667
returns                           -154.328059
return_std                         192.734748
average_reward                      -0.426182
round_time             0 days 00:11:28.066851
episodes_test                             4.0
episode_length_test                     283.5
returns_test                       184.837932
return_std_test                    335.708346
average_reward_test                  0.694219
round_time_test        0 days 00:00:02.804202
round_time_total       0 days 00:11:28.067951
loss_total                         847.894746
loss_critic                       1133.869718
loss_actor                        -296.005222
memory_size                       640558.1885 

=== epoch 8/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:02,  2.55it/s]
100%|██████████| 2000/2000 [11:17<00:00,  2.95it/s]
episodes                                    9
episode_length                     155.333333
returns                            -72.462104
return_std                         113.551733
average_reward                      -0.445372
round_time             0 days 00:11:18.268452
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       809.154472
return_std_test                     31.374495
average_reward_test                  0.809154
round_time_test        0 days 00:00:02.803745
round_time_total       0 days 00:11:18.269567
loss_total                         855.075031
loss_critic                       1142.818536
loss_actor                        -295.899068
memory_size                        642388.106 

=== epoch 8/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:21,  2.49it/s]
100%|██████████| 2000/2000 [11:33<00:00,  2.88it/s]
episodes                                    4
episode_length                          274.5
returns                           -132.163406
return_std                         189.085028
average_reward                      -0.442685
round_time             0 days 00:11:34.291670
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       770.226069
return_std_test                     45.157994
average_reward_test                  0.770226
round_time_test        0 days 00:00:02.849259
round_time_total       0 days 00:11:34.292776
loss_total                         875.733053
loss_critic                       1168.594657
loss_actor                        -295.713439
memory_size                        644201.149 

=== epoch 8/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:34,  2.65it/s]
100%|██████████| 2000/2000 [11:20<00:00,  2.94it/s]
episodes                                    4
episode_length                          276.5
returns                           -110.861901
return_std                          140.23637
average_reward                      -0.418562
round_time             0 days 00:11:20.562883
episodes_test                             2.0
episode_length_test                     517.5
returns_test                       363.389842
return_std_test                    391.311211
average_reward_test                  0.763792
round_time_test        0 days 00:00:02.831946
round_time_total       0 days 00:11:20.563986
loss_total                         848.275289
loss_critic                       1134.663636
loss_actor                        -297.278179
memory_size                        646088.447 

=== epoch 8/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:30,  2.47it/s]
100%|██████████| 2000/2000 [11:37<00:00,  2.87it/s]
episodes                                    5
episode_length                          224.6
returns                           -100.176671
return_std                         172.225891
average_reward                      -0.424331
round_time             0 days 00:11:38.371484
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       813.181842
return_std_test                      9.814658
average_reward_test                  0.813182
round_time_test        0 days 00:00:02.835609
round_time_total       0 days 00:11:38.372586
loss_total                          842.10114
loss_critic                       1126.938171
loss_actor                        -297.247053
memory_size                        647908.073 

=== epoch 8/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:49,  2.41it/s]
100%|██████████| 2000/2000 [11:31<00:00,  2.89it/s]
episodes                                    2
episode_length                           78.5
returns                            -68.691742
return_std                          30.367114
average_reward                      -0.434619
round_time             0 days 00:11:32.095355
episodes_test                             5.0
episode_length_test                     235.6
returns_test                       179.908606
return_std_test                    325.002281
average_reward_test                  0.752856
round_time_test        0 days 00:00:02.826560
round_time_total       0 days 00:11:32.096473
loss_total                          847.14678
loss_critic                       1133.294211
loss_actor                        -297.443023
memory_size                        649755.667 

=== epoch 8/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:39,  2.44it/s]
100%|██████████| 2000/2000 [11:56<00:00,  2.79it/s]
episodes                                    8
episode_length                          145.0
returns                            -59.567983
return_std                         118.779915
average_reward                      -0.422115
round_time             0 days 00:11:56.800902
episodes_test                             2.0
episode_length_test                     543.5
returns_test                       435.823525
return_std_test                    441.750695
average_reward_test                   0.82119
round_time_test        0 days 00:00:02.837847
round_time_total       0 days 00:11:56.802026
loss_total                         841.267983
loss_critic                        1125.97982
loss_actor                        -297.579447
memory_size                        651557.688 

=== epoch 8/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:24,  2.48it/s]
100%|██████████| 2000/2000 [11:43<00:00,  2.84it/s]
episodes                                    7
episode_length                     169.571429
returns                            -66.515054
return_std                         118.195979
average_reward                      -0.435548
round_time             0 days 00:11:44.507352
episodes_test                             3.0
episode_length_test                348.333333
returns_test                       257.094099
return_std_test                    371.170487
average_reward_test                  0.787539
round_time_test        0 days 00:00:02.821915
round_time_total       0 days 00:11:44.508463
loss_total                         851.001834
loss_critic                       1138.055075
loss_actor                        -297.211208
memory_size                        653395.412 

=== epoch 8/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:21,  2.49it/s]
100%|██████████| 2000/2000 [11:54<00:00,  2.80it/s]
episodes                                   11
episode_length                     131.454545
returns                            -77.065559
return_std                          121.19921
average_reward                      -0.555528
round_time             0 days 00:11:55.222753
episodes_test                             2.0
episode_length_test                     512.5
returns_test                       404.152579
return_std_test                    404.980254
average_reward_test                  0.789052
round_time_test        0 days 00:00:02.827070
round_time_total       0 days 00:11:55.223850
loss_total                         834.678802
loss_critic                       1117.970237
loss_actor                        -298.487017
memory_size                       655122.7365 

=== epoch 8/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:09,  2.53it/s]
100%|██████████| 2000/2000 [11:58<00:00,  2.78it/s]
episodes                                    3
episode_length                     346.333333
returns                           -143.209791
return_std                         175.510365
average_reward                      -0.406443
round_time             0 days 00:11:59.236110
episodes_test                             3.0
episode_length_test                348.333333
returns_test                       265.481808
return_std_test                    408.180337
average_reward_test                  0.782014
round_time_test        0 days 00:00:02.811955
round_time_total       0 days 00:11:59.237225
loss_total                          838.09314
loss_critic                       1122.334565
loss_actor                        -298.872638
memory_size                       656876.2385 

=== epoch 8/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:05,  2.54it/s]
100%|██████████| 2000/2000 [11:39<00:00,  2.86it/s]
episodes                                    4
episode_length                           16.0
returns                              -5.66747
return_std                          10.482948
average_reward                      -0.439008
round_time             0 days 00:11:39.768900
episodes_test                             3.0
episode_length_test                     351.0
returns_test                       291.158406
return_std_test                    401.527062
average_reward_test                  0.832273
round_time_test        0 days 00:00:02.792375
round_time_total       0 days 00:11:39.769994
loss_total                         826.555765
loss_critic                        1108.28456
loss_actor                        -300.359493
memory_size                        658759.169 

=== epoch 8/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:14,  2.51it/s]
100%|██████████| 2000/2000 [11:57<00:00,  2.79it/s]
episodes                                   10
episode_length                          139.8
returns                            -73.002029
return_std                         114.964582
average_reward                      -0.483901
round_time             0 days 00:11:57.669223
episodes_test                             4.0
episode_length_test                    303.25
returns_test                       204.885773
return_std_test                    374.853013
average_reward_test                  0.726568
round_time_test        0 days 00:00:02.811427
round_time_total       0 days 00:11:57.670324
loss_total                         829.567168
loss_critic                        1111.97146
loss_actor                        -300.050069
memory_size                        660493.881 

=== epoch 8/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:03,  2.55it/s]
100%|██████████| 2000/2000 [13:28<00:00,  2.47it/s]
episodes                                    3
episode_length                          365.0
returns                           -189.238387
return_std                         248.218101
average_reward                      -0.442401
round_time             0 days 00:13:28.687026
episodes_test                             6.0
episode_length_test                190.333333
returns_test                       113.789402
return_std_test                    260.719532
average_reward_test                  0.686947
round_time_test        0 days 00:00:02.784329
round_time_total       0 days 00:13:28.688855
loss_total                         861.867505
loss_critic                       1152.191051
loss_actor                        -299.426755
memory_size                       662307.6105 

=== epoch 8/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<26:16,  1.27it/s]
100%|██████████| 2000/2000 [28:32<00:00,  1.17it/s]
episodes                                   10
episode_length                          135.0
returns                             -70.81976
return_std                         122.781141
average_reward                      -0.508863
round_time             0 days 00:28:33.129879
episodes_test                             5.0
episode_length_test                     231.2
returns_test                        159.52267
return_std_test                    331.067449
average_reward_test                  0.736391
round_time_test        0 days 00:00:03.554524
round_time_total       0 days 00:28:33.134454
loss_total                         840.756452
loss_critic                       1125.910928
loss_actor                        -299.861534
memory_size                         664125.28 

=== epoch 8/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 1/2000 [00:01<34:12,  1.03s/it]
100%|██████████| 2000/2000 [31:21<00:00,  1.06it/s]
episodes                                    5
episode_length                           32.6
returns                            -17.253708
return_std                          18.696858
average_reward                      -0.408076
round_time             0 days 00:31:24.080162
episodes_test                             5.0
episode_length_test                     223.6
returns_test                       162.679274
return_std_test                    332.550269
average_reward_test                  0.726394
round_time_test        0 days 00:00:04.575199
round_time_total       0 days 00:31:24.083175
loss_total                         856.734941
loss_critic                       1145.940268
loss_actor                        -300.086445
memory_size                       665862.6855 

=== epoch 8/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 1/2000 [00:01<35:54,  1.08s/it]
100%|██████████| 2000/2000 [30:57<00:00,  1.08it/s]
episodes                                   23
episode_length                      77.521739
returns                            -32.582258
return_std                          77.256541
average_reward                      -0.389679
round_time             0 days 00:30:59.865830
episodes_test                             2.0
episode_length_test                     519.5
returns_test                       397.262062
return_std_test                    402.169698
average_reward_test                  0.752464
round_time_test        0 days 00:00:03.782420
round_time_total       0 days 00:30:59.868321
loss_total                         853.115071
loss_critic                       1141.616234
loss_actor                        -300.889653
memory_size                       667506.8145 

=== epoch 8/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<31:41,  1.05it/s]
100%|██████████| 2000/2000 [31:23<00:00,  1.06it/s]
episodes                                    6
episode_length                     187.666667
returns                            -91.917795
return_std                         153.942908
average_reward                      -0.465581
round_time             0 days 00:31:25.289546
episodes_test                            13.0
episode_length_test                130.307692
returns_test                        55.925985
return_std_test                    231.370238
average_reward_test                  0.501487
round_time_test        0 days 00:00:03.822191
round_time_total       0 days 00:31:25.292836
loss_total                         860.328445
loss_critic                       1150.556495
loss_actor                        -300.583836
memory_size                       669093.1065 

=== epoch 8/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:02<33:35,  1.01s/it]
100%|██████████| 2000/2000 [32:17<00:00,  1.03it/s]
episodes                                    4
episode_length                         284.75
returns                           -116.944907
return_std                         158.208595
average_reward                      -0.407032
round_time             0 days 00:32:20.140293
episodes_test                             4.0
episode_length_test                    293.75
returns_test                       161.643012
return_std_test                     375.83477
average_reward_test                  0.673173
round_time_test        0 days 00:00:04.511975
round_time_total       0 days 00:32:20.142934
loss_total                         859.206753
loss_critic                       1148.944684
loss_actor                        -299.745043
memory_size                        670869.929 

=== epoch 8/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 1/2000 [00:01<34:22,  1.03s/it]
100%|██████████| 2000/2000 [30:22<00:00,  1.10it/s]
episodes                                    3
episode_length                     348.666667
returns                           -142.678717
return_std                          196.98232
average_reward                      -0.412712
round_time             0 days 00:30:24.732412
episodes_test                             7.0
episode_length_test                188.714286
returns_test                       116.117841
return_std_test                    305.977287
average_reward_test                  0.557801
round_time_test        0 days 00:00:04.655755
round_time_total       0 days 00:30:24.735255
loss_total                         864.680324
loss_critic                       1155.801679
loss_actor                        -299.805175
memory_size                       672764.9935 

=== epoch 8/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<30:08,  1.10it/s]
100%|██████████| 2000/2000 [30:50<00:00,  1.08it/s]
episodes                                   12
episode_length                          106.5
returns                            -37.993089
return_std                          98.550083
average_reward                      -0.387521
round_time             0 days 00:30:52.259743
episodes_test                             2.0
episode_length_test                     511.0
returns_test                       425.538494
return_std_test                    412.952861
average_reward_test                  0.843097
round_time_test        0 days 00:00:03.999856
round_time_total       0 days 00:30:52.261775
loss_total                         855.026243
loss_critic                       1143.731497
loss_actor                        -299.794851
memory_size                        674486.745 

=== epoch 8/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [30:58<00:00,  1.08it/s]
episodes                                    7
episode_length                     170.142857
returns                             -82.24243
return_std                         147.140592
average_reward                      -0.472678
round_time             0 days 00:30:59.988492
episodes_test                             5.0
episode_length_test                     219.0
returns_test                       158.068879
return_std_test                    315.478624
average_reward_test                  0.780309
round_time_test        0 days 00:00:04.055204
round_time_total       0 days 00:30:59.992516
loss_total                         862.946001
loss_critic                       1153.736123
loss_actor                        -300.214568
memory_size                       676316.5745 

=== epoch 8/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [30:37<00:00,  1.09it/s]
episodes                                    4
episode_length                          265.5
returns                           -119.127264
return_std                         176.428928
average_reward                      -0.438707
round_time             0 days 00:30:40.359133
episodes_test                            18.0
episode_length_test                 89.833333
returns_test                        44.304282
return_std_test                    203.592115
average_reward_test                  0.522748
round_time_test        0 days 00:00:03.973622
round_time_total       0 days 00:30:40.361809
loss_total                         854.014745
loss_critic                       1142.481615
loss_actor                        -299.852821
memory_size                       678123.3775 

=== epoch 8/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [34:38<00:00,  1.04s/it]
episodes                                    4
episode_length                          304.0
returns                           -151.883007
return_std                         191.844402
average_reward                      -0.451952
round_time             0 days 00:34:40.443716
episodes_test                             3.0
episode_length_test                     355.0
returns_test                       274.317736
return_std_test                    388.950362
average_reward_test                  0.799391
round_time_test        0 days 00:00:04.323520
round_time_total       0 days 00:34:40.445695
loss_total                         869.815357
loss_critic                       1162.399066
loss_actor                        -300.519558
memory_size                        679963.868 

=== epoch 8/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [38:30<00:00,  1.16s/it]
episodes                                    3
episode_length                      76.666667
returns                            -38.036622
return_std                          28.683773
average_reward                      -0.403778
round_time             0 days 00:38:31.430443
episodes_test                             2.0
episode_length_test                     506.0
returns_test                       434.254057
return_std_test                     431.36137
average_reward_test                   0.81812
round_time_test        0 days 00:00:04.641906
round_time_total       0 days 00:38:31.433389
loss_total                         856.438324
loss_critic                       1146.016318
loss_actor                        -301.873734
memory_size                        681820.142 

=== epoch 8/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [30:41<00:00,  1.09it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
episodes                                   14
episode_length                     108.214286
returns                            -62.660293
return_std                         113.137793
average_reward                      -0.524177
round_time             0 days 00:30:43.645562
episodes_test                             2.0
episode_length_test                     533.0
returns_test                        421.68006
return_std_test                    426.341167
average_reward_test                  0.796198
round_time_test        0 days 00:00:04.342306
round_time_total       0 days 00:30:43.647412
loss_total                         866.619937
loss_critic                       1158.745624
loss_actor                        -301.882893
memory_size                       683509.0135 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 9/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [19:40<00:00,  1.69it/s]
episodes                                    7
episode_length                     190.428571
returns                            -94.401204
return_std                         135.446731
average_reward                      -0.455427
round_time             0 days 00:19:40.672781
episodes_test                             3.0
episode_length_test                360.333333
returns_test                       276.935751
return_std_test                    412.787058
average_reward_test                   0.78965
round_time_test        0 days 00:00:03.821710
round_time_total       0 days 00:19:40.674591
loss_total                         881.002197
loss_critic                       1176.618466
loss_actor                        -301.462961
memory_size                        685157.579 

=== epoch 9/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:53<00:00,  1.60it/s]
episodes                                    8
episode_length                         167.75
returns                            -91.201526
return_std                         125.443052
average_reward                      -0.491904
round_time             0 days 00:20:54.464448
episodes_test                             4.0
episode_length_test                     268.5
returns_test                       192.810271
return_std_test                    359.414981
average_reward_test                  0.780896
round_time_test        0 days 00:00:03.894693
round_time_total       0 days 00:20:54.465945
loss_total                         875.494845
loss_critic                       1169.725131
loss_actor                        -301.426378
memory_size                       686988.8285 

=== epoch 9/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:28<00:00,  1.63it/s]
episodes                                    4
episode_length                          260.5
returns                           -103.903138
return_std                         181.944649
average_reward                      -0.424958
round_time             0 days 00:20:29.274550
episodes_test                             3.0
episode_length_test                     358.0
returns_test                       264.863391
return_std_test                    409.731506
average_reward_test                  0.807479
round_time_test        0 days 00:00:03.681980
round_time_total       0 days 00:20:29.276547
loss_total                         889.383032
loss_critic                       1187.235646
loss_actor                        -302.027508
memory_size                       688789.5005 

=== epoch 9/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:44<00:00,  1.61it/s]
episodes                                   13
episode_length                      48.769231
returns                            -24.388102
return_std                          25.820236
average_reward                      -0.480611
round_time             0 days 00:20:45.842831
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       842.579197
return_std_test                      0.883873
average_reward_test                  0.842579
round_time_test        0 days 00:00:03.873789
round_time_total       0 days 00:20:45.844806
loss_total                         877.965303
loss_critic                       1173.090379
loss_actor                        -302.535084
memory_size                       690536.9755 

=== epoch 9/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:21<00:00,  1.64it/s]
episodes                                    2
episode_length                          508.5
returns                           -221.962341
return_std                         210.192304
average_reward                      -0.425719
round_time             0 days 00:20:22.427305
episodes_test                             3.0
episode_length_test                     342.0
returns_test                        281.66185
return_std_test                    403.339659
average_reward_test                  0.826225
round_time_test        0 days 00:00:04.227411
round_time_total       0 days 00:20:22.429188
loss_total                          900.25254
loss_critic                       1200.907604
loss_actor                        -302.367787
memory_size                       692282.8355 

=== epoch 9/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:05<00:00,  1.66it/s]
episodes                                    3
episode_length                     373.666667
returns                           -164.073234
return_std                         175.331947
average_reward                      -0.419292
round_time             0 days 00:20:06.501159
episodes_test                             5.0
episode_length_test                     257.6
returns_test                        104.42227
return_std_test                    258.940417
average_reward_test                  0.561203
round_time_test        0 days 00:00:04.590495
round_time_total       0 days 00:20:06.503065
loss_total                         896.576819
loss_critic                       1196.222489
loss_actor                        -302.005952
memory_size                        694186.102 

=== epoch 9/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:34<00:00,  1.62it/s]
episodes                                   19
episode_length                      61.578947
returns                            -42.144638
return_std                          40.829165
average_reward                      -0.577763
round_time             0 days 00:20:35.945465
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       831.133115
return_std_test                      19.84876
average_reward_test                  0.831133
round_time_test        0 days 00:00:04.133076
round_time_total       0 days 00:20:35.947010
loss_total                         915.532479
loss_critic                        1220.07125
loss_actor                        -302.622684
memory_size                       695864.3495 

=== epoch 9/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:13<00:00,  1.65it/s]
episodes                                    3
episode_length                     383.666667
returns                            -159.48287
return_std                         169.333765
average_reward                      -0.418755
round_time             0 days 00:20:14.923334
episodes_test                             2.0
episode_length_test                     520.0
returns_test                       409.305253
return_std_test                    421.261693
average_reward_test                  0.789556
round_time_test        0 days 00:00:04.174552
round_time_total       0 days 00:20:14.926944
loss_total                          918.92892
loss_critic                       1224.182295
loss_actor                        -302.084657
memory_size                       697564.9265 

=== epoch 9/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:31<00:00,  1.62it/s]
episodes                                   10
episode_length                           34.0
returns                            -14.973564
return_std                          25.060915
average_reward                      -0.460013
round_time             0 days 00:20:33.364024
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                        807.55171
return_std_test                      1.920258
average_reward_test                  0.807552
round_time_test        0 days 00:00:03.652575
round_time_total       0 days 00:20:33.365814
loss_total                         907.128872
loss_critic                       1209.529208
loss_actor                        -302.472554
memory_size                       699328.8905 

=== epoch 9/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:56<00:00,  1.67it/s]
episodes                                    5
episode_length                          232.6
returns                           -101.674341
return_std                         149.201874
average_reward                      -0.440312
round_time             0 days 00:19:57.241766
episodes_test                             4.0
episode_length_test                     286.0
returns_test                       185.093752
return_std_test                    352.452302
average_reward_test                  0.719792
round_time_test        0 days 00:00:03.578568
round_time_total       0 days 00:19:57.243659
loss_total                         935.151649
loss_critic                       1244.316209
loss_actor                        -301.506671
memory_size                       701108.2825 

=== epoch 9/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:35<00:00,  1.62it/s]
episodes                                    7
episode_length                     180.857143
returns                             -84.07417
return_std                         131.525987
average_reward                      -0.460503
round_time             0 days 00:20:36.476567
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       748.251379
return_std_test                     57.060618
average_reward_test                  0.748251
round_time_test        0 days 00:00:04.211224
round_time_total       0 days 00:20:36.478028
loss_total                         927.890237
loss_critic                       1235.511913
loss_actor                        -302.596548
memory_size                        702922.944 

=== epoch 9/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:37<00:00,  1.62it/s]
episodes                                   11
episode_length                          129.0
returns                            -55.491119
return_std                         128.425321
average_reward                        -0.4085
round_time             0 days 00:20:38.930649
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       815.739966
return_std_test                     20.968151
average_reward_test                   0.81574
round_time_test        0 days 00:00:03.483067
round_time_total       0 days 00:20:38.932527
loss_total                         931.157667
loss_critic                       1239.354335
loss_actor                        -301.629082
memory_size                       704706.7725 

=== epoch 9/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:12<00:00,  1.65it/s]
episodes                                    2
episode_length                          509.0
returns                           -205.342104
return_std                         204.468906
average_reward                      -0.445378
round_time             0 days 00:20:13.657771
episodes_test                            10.0
episode_length_test                     129.4
returns_test                        74.851756
return_std_test                    227.164474
average_reward_test                  0.677457
round_time_test        0 days 00:00:04.394296
round_time_total       0 days 00:20:13.659725
loss_total                         939.165773
loss_critic                        1249.21175
loss_actor                         -301.01822
memory_size                        706447.689 

=== epoch 9/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:48<00:00,  1.60it/s]
episodes                                   19
episode_length                      85.578947
returns                            -41.414165
return_std                          74.500453
average_reward                      -0.430548
round_time             0 days 00:20:49.249447
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       745.781502
return_std_test                     89.106246
average_reward_test                  0.745782
round_time_test        0 days 00:00:03.612129
round_time_total       0 days 00:20:49.252701
loss_total                         947.147883
loss_critic                       1259.319095
loss_actor                        -301.537049
memory_size                        708144.944 

=== epoch 9/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:05<00:00,  1.66it/s]
episodes                                    7
episode_length                     167.428571
returns                            -74.701086
return_std                         136.019143
average_reward                      -0.447098
round_time             0 days 00:20:07.737753
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       800.148996
return_std_test                     42.455036
average_reward_test                  0.800149
round_time_test        0 days 00:00:04.943487
round_time_total       0 days 00:20:07.739182
loss_total                         936.932492
loss_critic                       1246.581236
loss_actor                        -301.662579
memory_size                       709819.7835 

=== epoch 9/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:07<00:00,  1.66it/s]
episodes                                    5
episode_length                          224.4
returns                            -90.242582
return_std                         167.103007
average_reward                      -0.428309
round_time             0 days 00:20:08.729485
episodes_test                             3.0
episode_length_test                343.666667
returns_test                       288.644776
return_std_test                    411.213075
average_reward_test                  0.814671
round_time_test        0 days 00:00:03.315304
round_time_total       0 days 00:20:08.731480
loss_total                         939.445975
loss_critic                       1249.671346
loss_actor                        -301.455594
memory_size                         711589.59 

=== epoch 9/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:56<00:00,  1.59it/s]
episodes                                   11
episode_length                     153.818182
returns                            -88.051894
return_std                         131.524573
average_reward                      -0.567611
round_time             0 days 00:20:57.465753
episodes_test                             3.0
episode_length_test                     348.0
returns_test                       246.537007
return_std_test                    340.701201
average_reward_test                  0.772871
round_time_test        0 days 00:00:03.378919
round_time_total       0 days 00:20:57.467184
loss_total                         957.244485
loss_critic                        1271.93966
loss_actor                        -301.536301
memory_size                       713285.7225 

=== epoch 9/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:12<00:00,  1.57it/s]
episodes                                    8
episode_length                          159.5
returns                            -77.378961
return_std                         143.966807
average_reward                       -0.44949
round_time             0 days 00:21:13.531567
episodes_test                             6.0
episode_length_test                202.833333
returns_test                       131.634082
return_std_test                    300.397495
average_reward_test                  0.729696
round_time_test        0 days 00:00:03.389134
round_time_total       0 days 00:21:13.533496
loss_total                         977.048676
loss_critic                        1296.73978
loss_actor                        -301.715823
memory_size                        715081.258 

=== epoch 9/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:57<00:00,  1.59it/s]
episodes                                    7
episode_length                      32.571429
returns                            -16.459744
return_std                          14.016923
average_reward                      -0.457028
round_time             0 days 00:20:58.645158
episodes_test                             7.0
episode_length_test                162.714286
returns_test                        98.153573
return_std_test                    280.361029
average_reward_test                  0.682676
round_time_test        0 days 00:00:03.676781
round_time_total       0 days 00:20:58.646962
loss_total                         973.475239
loss_critic                       1292.442461
loss_actor                        -302.393733
memory_size                          716777.4 

=== epoch 9/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:52<00:00,  1.60it/s]
episodes                                    4
episode_length                         269.25
returns                           -137.796752
return_std                         204.616154
average_reward                      -0.477728
round_time             0 days 00:20:53.537630
episodes_test                             2.0
episode_length_test                     538.0
returns_test                       392.828987
return_std_test                    425.720782
average_reward_test                  0.754654
round_time_test        0 days 00:00:04.015601
round_time_total       0 days 00:20:53.539497
loss_total                          958.44159
loss_critic                        1273.38091
loss_actor                        -301.315773
memory_size                       718578.4935 

=== epoch 9/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:47<00:00,  2.11it/s]
episodes                                    9
episode_length                     143.555556
returns                            -75.211776
return_std                         132.294727
average_reward                      -0.508997
round_time             0 days 00:15:48.121589
episodes_test                             3.0
episode_length_test                     368.0
returns_test                       239.502318
return_std_test                    390.323678
average_reward_test                  0.751872
round_time_test        0 days 00:00:03.292175
round_time_total       0 days 00:15:48.123339
loss_total                         990.401309
loss_critic                       1313.160148
loss_actor                        -300.634137
memory_size                        720313.278 

=== epoch 9/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:27<00:00,  2.30it/s]
episodes                                    6
episode_length                          181.5
returns                            -68.745038
return_std                         147.586367
average_reward                      -0.377109
round_time             0 days 00:14:28.549729
episodes_test                             3.0
episode_length_test                392.666667
returns_test                       224.995917
return_std_test                    377.865441
average_reward_test                  0.673877
round_time_test        0 days 00:00:03.074878
round_time_total       0 days 00:14:28.550975
loss_total                         966.022392
loss_critic                       1283.078314
loss_actor                        -302.201379
memory_size                        722162.267 

=== epoch 9/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:59<00:00,  2.57it/s]
episodes                                    3
episode_length                     347.666667
returns                           -152.464171
return_std                         187.422944
average_reward                      -0.390779
round_time             0 days 00:12:59.992239
episodes_test                             5.0
episode_length_test                     247.2
returns_test                       153.385104
return_std_test                    316.025542
average_reward_test                  0.685265
round_time_test        0 days 00:00:02.870839
round_time_total       0 days 00:12:59.993449
loss_total                         969.918913
loss_critic                        1287.79568
loss_actor                        -301.588247
memory_size                        724041.429 

=== epoch 9/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:31<00:00,  2.66it/s]
episodes                                    6
episode_length                          208.0
returns                           -101.244327
return_std                         148.187441
average_reward                       -0.50213
round_time             0 days 00:12:32.345212
episodes_test                             7.0
episode_length_test                174.571429
returns_test                       119.943686
return_std_test                    298.287503
average_reward_test                  0.715124
round_time_test        0 days 00:00:02.824118
round_time_total       0 days 00:12:32.346330
loss_total                         964.870421
loss_critic                       1281.693081
loss_actor                        -302.420305
memory_size                        725853.945 

=== epoch 9/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:34<00:00,  2.88it/s]
episodes                                    3
episode_length                     351.666667
returns                            -144.33283
return_std                         164.349642
average_reward                      -0.421812
round_time             0 days 00:11:34.632804
episodes_test                             3.0
episode_length_test                     363.0
returns_test                       250.121896
return_std_test                    388.230283
average_reward_test                  0.771667
round_time_test        0 days 00:00:02.828088
round_time_total       0 days 00:11:34.633901
loss_total                         991.650555
loss_critic                       1314.949611
loss_actor                        -301.545753
memory_size                        727668.584 

=== epoch 9/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:15<00:00,  2.96it/s]
episodes                                    6
episode_length                     199.666667
returns                              -93.1308
return_std                         126.141489
average_reward                      -0.432901
round_time             0 days 00:11:16.408586
episodes_test                             3.0
episode_length_test                353.333333
returns_test                       279.810345
return_std_test                    401.545636
average_reward_test                  0.815751
round_time_test        0 days 00:00:02.789094
round_time_total       0 days 00:11:16.409689
loss_total                         986.102433
loss_critic                       1308.087379
loss_actor                        -301.837433
memory_size                        729500.414 

=== epoch 9/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:12<00:00,  2.97it/s]
episodes                                    4
episode_length                         268.25
returns                           -122.010036
return_std                         178.183395
average_reward                      -0.417749
round_time             0 days 00:11:12.947130
episodes_test                             6.0
episode_length_test                202.833333
returns_test                       151.938472
return_std_test                    316.473366
average_reward_test                  0.793181
round_time_test        0 days 00:00:02.805791
round_time_total       0 days 00:11:12.948241
loss_total                         956.541714
loss_critic                       1271.426625
loss_actor                        -302.998022
memory_size                       731363.1265 

=== epoch 9/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:14<00:00,  2.97it/s]
episodes                                    9
episode_length                     145.333333
returns                            -65.382986
return_std                         118.431665
average_reward                      -0.482218
round_time             0 days 00:11:14.831022
episodes_test                             4.0
episode_length_test                    280.25
returns_test                        207.74747
return_std_test                    356.297143
average_reward_test                  0.795341
round_time_test        0 days 00:00:02.794598
round_time_total       0 days 00:11:14.832134
loss_total                         952.206946
loss_critic                        1265.86495
loss_actor                         -302.42516
memory_size                       733144.1935 

=== epoch 9/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:39<00:00,  2.86it/s]
episodes                                    7
episode_length                     167.571429
returns                            -76.049963
return_std                            136.142
average_reward                      -0.441845
round_time             0 days 00:11:39.899535
episodes_test                             3.0
episode_length_test                355.666667
returns_test                       277.034495
return_std_test                    397.879873
average_reward_test                  0.786845
round_time_test        0 days 00:00:02.855014
round_time_total       0 days 00:11:39.900632
loss_total                         978.530396
loss_critic                       1298.524833
loss_actor                        -301.447454
memory_size                       734952.6765 

=== epoch 9/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:16<00:00,  2.96it/s]
episodes                                    6
episode_length                      71.333333
returns                             -35.79941
return_std                          27.434297
average_reward                      -0.441225
round_time             0 days 00:11:16.740310
episodes_test                             3.0
episode_length_test                     370.0
returns_test                       275.498171
return_std_test                    397.258251
average_reward_test                  0.798836
round_time_test        0 days 00:00:02.828299
round_time_total       0 days 00:11:16.741419
loss_total                         981.320192
loss_critic                       1301.936292
loss_actor                        -301.144297
memory_size                        736693.504 

=== epoch 9/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:27<00:00,  2.91it/s]
episodes                                    6
episode_length                     190.833333
returns                            -93.994069
return_std                         160.982262
average_reward                      -0.470137
round_time             0 days 00:11:27.906817
episodes_test                             2.0
episode_length_test                     531.0
returns_test                       429.959333
return_std_test                    401.669458
average_reward_test                    0.8325
round_time_test        0 days 00:00:02.850661
round_time_total       0 days 00:11:27.907965
loss_total                         988.778267
loss_critic                       1311.419639
loss_actor                        -301.787314
memory_size                         738484.56 

=== epoch 9/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:23<00:00,  2.92it/s]
episodes                                    3
episode_length                     344.333333
returns                           -143.248139
return_std                         178.977913
average_reward                      -0.404844
round_time             0 days 00:11:24.322858
episodes_test                             3.0
episode_length_test                     348.0
returns_test                       266.121141
return_std_test                    381.125864
average_reward_test                  0.793773
round_time_test        0 days 00:00:02.826878
round_time_total       0 days 00:11:24.323957
loss_total                         977.292587
loss_critic                       1297.244021
loss_actor                        -302.513247
memory_size                       740335.5685 

=== epoch 9/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:29<00:00,  2.90it/s]
episodes                                   10
episode_length                          148.0
returns                            -66.078325
return_std                         110.190806
average_reward                      -0.436159
round_time             0 days 00:11:30.268601
episodes_test                             5.0
episode_length_test                     232.6
returns_test                       172.461139
return_std_test                    351.537473
average_reward_test                  0.771608
round_time_test        0 days 00:00:02.772039
round_time_total       0 days 00:11:30.269707
loss_total                         959.096526
loss_critic                       1274.763631
loss_actor                         -303.57198
memory_size                       742225.6375 

=== epoch 9/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:28<00:00,  2.90it/s]
episodes                                    5
episode_length                          218.2
returns                           -102.348477
return_std                         169.269406
average_reward                       -0.46835
round_time             0 days 00:11:29.073881
episodes_test                             3.0
episode_length_test                382.333333
returns_test                       272.130458
return_std_test                    385.043545
average_reward_test                  0.736757
round_time_test        0 days 00:00:02.797123
round_time_total       0 days 00:11:29.074991
loss_total                         977.197515
loss_critic                       1297.218969
loss_actor                        -302.888383
memory_size                       743884.2185 

=== epoch 9/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:19<00:00,  2.94it/s]
episodes                                    9
episode_length                          148.0
returns                             -83.42407
return_std                         125.579949
average_reward                      -0.501905
round_time             0 days 00:11:20.338668
episodes_test                             2.0
episode_length_test                     529.0
returns_test                       439.810992
return_std_test                     418.96938
average_reward_test                  0.846946
round_time_test        0 days 00:00:02.835191
round_time_total       0 days 00:11:20.339774
loss_total                        1011.694126
loss_critic                       1340.370483
loss_actor                        -303.011395
memory_size                        745691.484 

=== epoch 9/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:41<00:00,  2.85it/s]
episodes                                    6
episode_length                          203.0
returns                            -87.864039
return_std                         124.758046
average_reward                      -0.461639
round_time             0 days 00:11:42.422412
episodes_test                             7.0
episode_length_test                182.714286
returns_test                       119.876574
return_std_test                    290.751478
average_reward_test                  0.687356
round_time_test        0 days 00:00:02.793520
round_time_total       0 days 00:11:42.423503
loss_total                         997.405506
loss_critic                       1322.535338
loss_actor                        -303.113912
memory_size                       747448.8605 

=== epoch 9/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:26<00:00,  2.91it/s]
episodes                                    6
episode_length                     208.666667
returns                            -96.514742
return_std                         135.105142
average_reward                      -0.448412
round_time             0 days 00:11:26.718708
episodes_test                             3.0
episode_length_test                356.333333
returns_test                       277.425788
return_std_test                    392.231147
average_reward_test                   0.76788
round_time_test        0 days 00:00:02.874929
round_time_total       0 days 00:11:26.719812
loss_total                         989.147033
loss_critic                       1312.264556
loss_actor                        -303.323149
memory_size                       749305.2175 

=== epoch 9/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:47<00:00,  2.83it/s]
episodes                                    3
episode_length                          354.0
returns                           -154.188551
return_std                         178.632802
average_reward                      -0.426109
round_time             0 days 00:11:48.120191
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                        816.93014
return_std_test                     13.540457
average_reward_test                   0.81693
round_time_test        0 days 00:00:02.793021
round_time_total       0 days 00:11:48.121290
loss_total                         985.886069
loss_critic                       1308.345347
loss_actor                        -303.951137
memory_size                       751083.3955 

=== epoch 9/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:39<00:00,  2.86it/s]
episodes                                    7
episode_length                     168.428571
returns                            -91.708411
return_std                         143.744282
average_reward                      -0.477978
round_time             0 days 00:11:40.477758
episodes_test                             3.0
episode_length_test                348.333333
returns_test                       281.481103
return_std_test                    400.493357
average_reward_test                  0.844687
round_time_test        0 days 00:00:02.816182
round_time_total       0 days 00:11:40.478872
loss_total                         987.469983
loss_critic                        1310.50084
loss_actor                        -304.653541
memory_size                       752899.6245 

=== epoch 9/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:35<00:00,  2.88it/s]
episodes                                   10
episode_length                          149.1
returns                            -65.205217
return_std                         122.535115
average_reward                      -0.428044
round_time             0 days 00:11:35.992987
episodes_test                             2.0
episode_length_test                     524.0
returns_test                       386.133774
return_std_test                    426.090942
average_reward_test                  0.749889
round_time_test        0 days 00:00:02.812736
round_time_total       0 days 00:11:35.994075
loss_total                        1006.050111
loss_critic                        1333.89083
loss_actor                        -305.312854
memory_size                       754701.4755 

=== epoch 9/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:40<00:00,  2.86it/s]
episodes                                   13
episode_length                      32.076923
returns                            -18.753392
return_std                          11.928038
average_reward                      -0.465545
round_time             0 days 00:11:40.936035
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                        811.69424
return_std_test                     27.950854
average_reward_test                  0.811694
round_time_test        0 days 00:00:02.830900
round_time_total       0 days 00:11:40.937130
loss_total                         989.893504
loss_critic                       1313.669342
loss_actor                        -305.209937
memory_size                        756328.097 

=== epoch 9/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:41<00:00,  2.85it/s]
episodes                                    2
episode_length                          515.0
returns                           -198.723999
return_std                         175.552817
average_reward                      -0.392265
round_time             0 days 00:11:41.988942
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       834.805322
return_std_test                     56.855997
average_reward_test                  0.834805
round_time_test        0 days 00:00:02.836409
round_time_total       0 days 00:11:41.990062
loss_total                         986.602935
loss_critic                       1309.259021
loss_actor                        -304.021502
memory_size                       758102.9425 

=== epoch 9/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:40<00:00,  2.85it/s]
episodes                                    5
episode_length                          245.8
returns                             -92.22254
return_std                         143.530119
average_reward                      -0.367074
round_time             0 days 00:11:41.205647
episodes_test                             6.0
episode_length_test                     272.5
returns_test                       150.064861
return_std_test                    321.151742
average_reward_test                  0.583548
round_time_test        0 days 00:00:02.785855
round_time_total       0 days 00:11:41.206758
loss_total                         993.761493
loss_critic                       1318.328489
loss_actor                        -304.506585
memory_size                       759994.0315 

=== epoch 9/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:54<00:00,  2.80it/s]
episodes                                    3
episode_length                          368.0
returns                            -157.07992
return_std                         174.886364
average_reward                      -0.413436
round_time             0 days 00:11:55.152375
episodes_test                             3.0
episode_length_test                350.666667
returns_test                       210.462795
return_std_test                    309.405035
average_reward_test                  0.702825
round_time_test        0 days 00:00:02.825356
round_time_total       0 days 00:11:55.153473
loss_total                        1010.540397
loss_critic                       1339.481294
loss_actor                        -305.223281
memory_size                         761845.61 

=== epoch 9/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:52<00:00,  2.81it/s]
episodes                                    6
episode_length                     190.833333
returns                            -94.127334
return_std                         147.836123
average_reward                      -0.466994
round_time             0 days 00:11:53.131522
episodes_test                             2.0
episode_length_test                     517.5
returns_test                       376.770796
return_std_test                    407.041296
average_reward_test                  0.729746
round_time_test        0 days 00:00:02.821094
round_time_total       0 days 00:11:53.132618
loss_total                         1001.45014
loss_critic                       1328.230116
loss_actor                        -305.669849
memory_size                       763667.5135 

=== epoch 9/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:56<00:00,  2.79it/s]
episodes                                   10
episode_length                          132.7
returns                            -67.920361
return_std                         117.045724
average_reward                      -0.488495
round_time             0 days 00:11:57.403044
episodes_test                             6.0
episode_length_test                     211.5
returns_test                       111.300414
return_std_test                    320.936512
average_reward_test                  0.620174
round_time_test        0 days 00:00:02.794533
round_time_total       0 days 00:11:57.404178
loss_total                        1023.121068
loss_critic                       1355.252535
loss_actor                        -305.404883
memory_size                       765411.4895 

=== epoch 9/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:03<00:00,  2.76it/s]
episodes                                    1
episode_length                         1000.0
returns                           -440.281054
return_std                                0.0
average_reward                      -0.436634
round_time             0 days 00:12:04.051095
episodes_test                             7.0
episode_length_test                183.142857
returns_test                        98.052331
return_std_test                    280.767499
average_reward_test                  0.635761
round_time_test        0 days 00:00:02.807956
round_time_total       0 days 00:12:04.052198
loss_total                        1006.247411
loss_critic                       1334.140331
loss_actor                        -305.324364
memory_size                        767242.755 

=== epoch 9/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:48<00:00,  2.82it/s]
episodes                                    5
episode_length                          223.2
returns                            -93.921455
return_std                         143.863885
average_reward                      -0.435645
round_time             0 days 00:11:49.215613
episodes_test                             7.0
episode_length_test                176.428571
returns_test                        103.83506
return_std_test                    312.981199
average_reward_test                  0.697773
round_time_test        0 days 00:00:02.789182
round_time_total       0 days 00:11:49.216734
loss_total                        1018.138699
loss_critic                       1349.186131
loss_actor                        -306.051123
memory_size                       769085.9515 

=== epoch 9/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:58<00:00,  2.78it/s]
episodes                                    2
episode_length                          525.5
returns                           -196.878331
return_std                         177.385023
average_reward                      -0.405007
round_time             0 days 00:11:59.187151
episodes_test                             5.0
episode_length_test                     253.6
returns_test                        135.75941
return_std_test                    303.061865
average_reward_test                  0.663932
round_time_test        0 days 00:00:02.791902
round_time_total       0 days 00:11:59.188265
loss_total                        1008.786945
loss_critic                       1337.549315
loss_actor                        -306.262623
memory_size                         770987.11 

=== epoch 9/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:59<00:00,  2.78it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
episodes                                   13
episode_length                     103.615385
returns                            -50.901908
return_std                         115.563837
average_reward                      -0.472286
round_time             0 days 00:12:00.123237
episodes_test                             2.0
episode_length_test                     518.5
returns_test                       408.916942
return_std_test                     448.90807
average_reward_test                  0.790185
round_time_test        0 days 00:00:02.772288
round_time_total       0 days 00:12:00.124325
loss_total                         991.063811
loss_critic                       1315.817708
loss_actor                        -307.951875
memory_size                        772774.589 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 10/10 ==== round 1/50 ======================================
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                    4
episode_length                          276.0
returns                           -120.516709
return_std                         163.626601
average_reward                      -0.394896
round_time             0 days 00:10:34.554415
episodes_test                             7.0
episode_length_test                182.285714
returns_test                        90.993233
return_std_test                    267.908126
average_reward_test                  0.625485
round_time_test        0 days 00:00:02.777810
round_time_total       0 days 00:10:34.555525
loss_total                        1007.848378
loss_critic                       1336.656036
loss_actor                        -307.382357
memory_size                         774523.43 

=== epoch 10/10 ==== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                    7
episode_length                     179.857143
returns                             -87.91804
return_std                         145.307887
average_reward                        -0.4874
round_time             0 days 00:10:33.493357
episodes_test                             2.0
episode_length_test                     515.0
returns_test                       421.004271
return_std_test                    443.123767
average_reward_test                  0.833785
round_time_test        0 days 00:00:02.800779
round_time_total       0 days 00:10:33.494468
loss_total                          1031.3231
loss_critic                       1366.012029
loss_actor                        -307.432707
memory_size                       776360.2725 

=== epoch 10/10 ==== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                    8
episode_length                          155.0
returns                            -76.957828
return_std                         131.573252
average_reward                        -0.4582
round_time             0 days 00:10:41.115519
episodes_test                             8.0
episode_length_test                     152.5
returns_test                        90.466914
return_std_test                    258.604907
average_reward_test                  0.708888
round_time_test        0 days 00:00:02.796377
round_time_total       0 days 00:10:41.116624
loss_total                        1009.438367
loss_critic                       1338.595989
loss_actor                        -307.192207
memory_size                        778141.151 

=== epoch 10/10 ==== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:37<00:00,  3.14it/s]
episodes                                   12
episode_length                          108.0
returns                            -60.974684
return_std                         132.712559
average_reward                      -0.484653
round_time             0 days 00:10:38.333230
episodes_test                             9.0
episode_length_test                175.111111
returns_test                        62.563245
return_std_test                    270.845873
average_reward_test                  0.464823
round_time_test        0 days 00:00:02.774175
round_time_total       0 days 00:10:38.334366
loss_total                        1017.623779
loss_critic                       1348.921557
loss_actor                        -307.567418
memory_size                         779974.38 

=== epoch 10/10 ==== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                    4
episode_length                         266.25
returns                           -125.411944
return_std                         184.629769
average_reward                      -0.466191
round_time             0 days 00:10:41.210493
episodes_test                             3.0
episode_length_test                347.666667
returns_test                        254.40592
return_std_test                    357.814769
average_reward_test                  0.791213
round_time_test        0 days 00:00:02.790965
round_time_total       0 days 00:10:41.211587
loss_total                        1024.725901
loss_critic                       1357.697372
loss_actor                        -307.160074
memory_size                       781704.4895 

=== epoch 10/10 ==== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:42<00:00,  3.11it/s]
episodes                                    3
episode_length                     354.666667
returns                           -151.759507
return_std                         178.369738
average_reward                      -0.412472
round_time             0 days 00:10:43.007956
episodes_test                             7.0
episode_length_test                172.285714
returns_test                        93.998641
return_std_test                    263.847246
average_reward_test                  0.680961
round_time_test        0 days 00:00:02.788888
round_time_total       0 days 00:10:43.009042
loss_total                         1052.29655
loss_critic                       1392.036605
loss_actor                        -306.663772
memory_size                       783515.6035 

=== epoch 10/10 ==== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:42<00:00,  3.11it/s]
episodes                                    8
episode_length                        172.375
returns                            -88.018404
return_std                         125.055475
average_reward                      -0.494647
round_time             0 days 00:10:43.363057
episodes_test                             7.0
episode_length_test                     160.0
returns_test                       127.546423
return_std_test                    306.326857
average_reward_test                  0.828454
round_time_test        0 days 00:00:02.765274
round_time_total       0 days 00:10:43.364177
loss_total                        1018.757355
loss_critic                       1350.319821
loss_actor                        -307.492604
memory_size                       785275.3645 

=== epoch 10/10 ==== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                    4
episode_length                          271.5
returns                           -117.651104
return_std                         163.097775
average_reward                      -0.437387
round_time             0 days 00:10:41.104258
episodes_test                             6.0
episode_length_test                     214.5
returns_test                        117.78129
return_std_test                    286.035803
average_reward_test                  0.639222
round_time_test        0 days 00:00:02.767802
round_time_total       0 days 00:10:41.105353
loss_total                        1034.362825
loss_critic                       1369.806775
loss_actor                        -307.413068
memory_size                       787169.0095 

=== epoch 10/10 ==== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                    2
episode_length                          511.0
returns                           -214.409662
return_std                         202.058998
average_reward                      -0.406185
round_time             0 days 00:10:41.315843
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       740.280481
return_std_test                      8.688718
average_reward_test                   0.74028
round_time_test        0 days 00:00:02.821479
round_time_total       0 days 00:10:41.317007
loss_total                        1035.392592
loss_critic                       1371.383966
loss_actor                           -308.573
memory_size                       789048.7605 

=== epoch 10/10 ==== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:44<00:00,  3.11it/s]
episodes                                    4
episode_length                         279.75
returns                           -117.151154
return_std                         167.973532
average_reward                      -0.401728
round_time             0 days 00:10:44.601107
episodes_test                             2.0
episode_length_test                     507.0
returns_test                        365.14461
return_std_test                    363.676586
average_reward_test                   0.76872
round_time_test        0 days 00:00:02.817456
round_time_total       0 days 00:10:44.602204
loss_total                        1045.841524
loss_critic                       1384.282457
loss_actor                        -307.922307
memory_size                        790927.757 

=== epoch 10/10 ==== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:53<00:00,  3.06it/s]
episodes                                    8
episode_length                        152.375
returns                            -71.495949
return_std                         128.096033
average_reward                       -0.43605
round_time             0 days 00:10:53.873643
episodes_test                             8.0
episode_length_test                    155.25
returns_test                        77.941218
return_std_test                    247.813158
average_reward_test                  0.642033
round_time_test        0 days 00:00:02.817006
round_time_total       0 days 00:10:53.874772
loss_total                        1062.603511
loss_critic                       1405.259985
loss_actor                        -308.022474
memory_size                       792751.9165 

=== epoch 10/10 ==== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:42<00:00,  3.11it/s]
episodes                                    7
episode_length                      45.428571
returns                            -43.078533
return_std                          36.953784
average_reward                      -0.543101
round_time             0 days 00:10:42.619896
episodes_test                             2.0
episode_length_test                     544.0
returns_test                       441.347448
return_std_test                    432.832123
average_reward_test                  0.713933
round_time_test        0 days 00:00:02.811311
round_time_total       0 days 00:10:42.621031
loss_total                        1027.590913
loss_critic                       1361.634004
loss_actor                        -308.581547
memory_size                        794535.611 

=== epoch 10/10 ==== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:45<00:00,  3.10it/s]
episodes                                    2
episode_length                          504.5
returns                           -215.365455
return_std                         209.814215
average_reward                      -0.422544
round_time             0 days 00:10:46.384332
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       835.651917
return_std_test                     21.588213
average_reward_test                  0.835652
round_time_test        0 days 00:00:02.791798
round_time_total       0 days 00:10:46.385445
loss_total                        1057.617748
loss_critic                       1399.152047
loss_actor                        -308.519545
memory_size                        796362.701 

=== epoch 10/10 ==== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:48<00:00,  3.08it/s]
episodes                                    3
episode_length                     353.333333
returns                           -146.423198
return_std                         188.001977
average_reward                      -0.415826
round_time             0 days 00:10:48.879611
episodes_test                             6.0
episode_length_test                192.166667
returns_test                       124.995585
return_std_test                     296.77947
average_reward_test                  0.724073
round_time_test        0 days 00:00:02.768195
round_time_total       0 days 00:10:48.880697
loss_total                        1050.093664
loss_critic                       1389.911718
loss_actor                        -309.178646
memory_size                       798254.5725 

=== epoch 10/10 ==== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:47<00:00,  3.09it/s]
episodes                                    5
episode_length                          223.0
returns                            -93.427651
return_std                         143.953279
average_reward                      -0.404299
round_time             0 days 00:10:47.992673
episodes_test                             2.0
episode_length_test                     509.0
returns_test                       422.859091
return_std_test                    427.185777
average_reward_test                  0.819503
round_time_test        0 days 00:00:02.883583
round_time_total       0 days 00:10:47.993891
loss_total                         1044.10114
loss_critic                       1382.365504
loss_actor                        -308.956403
memory_size                       800104.0725 

=== epoch 10/10 ==== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:51<00:00,  3.07it/s]
episodes                                    5
episode_length                          263.4
returns                           -120.722986
return_std                         143.116399
average_reward                      -0.412408
round_time             0 days 00:10:52.016523
episodes_test                             4.0
episode_length_test                     274.0
returns_test                       207.691756
return_std_test                    390.823041
average_reward_test                    0.8082
round_time_test        0 days 00:00:02.792628
round_time_total       0 days 00:10:52.017648
loss_total                        1038.986403
loss_critic                       1376.019517
loss_actor                         -309.14615
memory_size                       801947.3165 

=== epoch 10/10 ==== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:55<00:00,  3.05it/s]
episodes                                    6
episode_length                     191.166667
returns                            -82.506507
return_std                         139.150121
average_reward                      -0.429504
round_time             0 days 00:10:55.675558
episodes_test                             4.0
episode_length_test                     265.5
returns_test                       182.307987
return_std_test                    353.020634
average_reward_test                  0.782144
round_time_test        0 days 00:00:02.778757
round_time_total       0 days 00:10:55.676667
loss_total                        1048.793453
loss_critic                       1388.452815
loss_actor                        -309.844081
memory_size                       803779.8435 

=== epoch 10/10 ==== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:03<00:00,  3.02it/s]
episodes                                    2
episode_length                          534.0
returns                           -235.472094
return_std                         152.683375
average_reward                      -0.433211
round_time             0 days 00:11:03.658573
episodes_test                             3.0
episode_length_test                363.666667
returns_test                        250.63698
return_std_test                    378.595349
average_reward_test                    0.6798
round_time_test        0 days 00:00:02.767265
round_time_total       0 days 00:11:03.659668
loss_total                        1035.890824
loss_critic                       1372.501048
loss_actor                         -310.55016
memory_size                       805612.2515 

=== epoch 10/10 ==== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                    2
episode_length                          510.5
returns                           -189.407075
return_std                         170.123463
average_reward                      -0.401764
round_time             0 days 00:11:06.817766
episodes_test                             5.0
episode_length_test                     223.4
returns_test                       156.554697
return_std_test                    312.116673
average_reward_test                  0.781231
round_time_test        0 days 00:00:02.865271
round_time_total       0 days 00:11:06.818871
loss_total                        1020.328354
loss_critic                       1353.098529
loss_actor                        -310.752436
memory_size                        807529.497 

=== epoch 10/10 ==== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                    8
episode_length                          159.0
returns                            -78.731414
return_std                         155.591963
average_reward                      -0.468299
round_time             0 days 00:11:10.214726
episodes_test                             2.0
episode_length_test                     513.0
returns_test                        431.99075
return_std_test                    442.074751
average_reward_test                  0.818868
round_time_test        0 days 00:00:02.772658
round_time_total       0 days 00:11:10.215825
loss_total                        1009.540695
loss_critic                       1339.724605
loss_actor                        -311.195033
memory_size                       809259.0365 

=== epoch 10/10 ==== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                    3
episode_length                     350.666667
returns                           -142.675569
return_std                         177.368828
average_reward                      -0.402936
round_time             0 days 00:11:09.081776
episodes_test                             2.0
episode_length_test                     527.0
returns_test                       417.338198
return_std_test                    453.883669
average_reward_test                  0.820046
round_time_test        0 days 00:00:02.783456
round_time_total       0 days 00:11:09.082872
loss_total                         1042.60849
loss_critic                       1381.063228
loss_actor                        -311.210549
memory_size                        811130.943 

=== epoch 10/10 ==== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:12<00:00,  2.97it/s]
episodes                                    4
episode_length                          279.0
returns                           -110.711908
return_std                         154.922403
average_reward                      -0.419796
round_time             0 days 00:11:13.018185
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       792.180374
return_std_test                      7.402019
average_reward_test                   0.79218
round_time_test        0 days 00:00:02.829077
round_time_total       0 days 00:11:13.019278
loss_total                        1050.838236
loss_critic                       1391.282339
loss_actor                        -310.938267
memory_size                         813010.59 

=== epoch 10/10 ==== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:16<00:00,  2.96it/s]
episodes                                   12
episode_length                     121.416667
returns                            -51.791301
return_std                         106.189272
average_reward                      -0.426844
round_time             0 days 00:11:16.611245
episodes_test                             3.0
episode_length_test                348.666667
returns_test                       286.641901
return_std_test                    405.877219
average_reward_test                  0.831339
round_time_test        0 days 00:00:02.796834
round_time_total       0 days 00:11:16.612356
loss_total                        1011.329184
loss_critic                       1342.296675
loss_actor                        -312.540869
memory_size                       814826.1935 

=== epoch 10/10 ==== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:12<00:00,  2.98it/s]
episodes                                    9
episode_length                      65.666667
returns                            -23.514642
return_std                          36.089989
average_reward                      -0.397707
round_time             0 days 00:11:12.581913
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       831.738527
return_std_test                     17.307635
average_reward_test                  0.831739
round_time_test        0 days 00:00:02.806874
round_time_total       0 days 00:11:12.583025
loss_total                        1023.906219
loss_critic                       1357.523627
loss_actor                        -310.563511
memory_size                        816489.259 

=== epoch 10/10 ==== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:14<00:00,  2.96it/s]
episodes                                    8
episode_length                        167.375
returns                            -72.633888
return_std                         130.288676
average_reward                       -0.47747
round_time             0 days 00:11:15.285166
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       814.357387
return_std_test                      23.71633
average_reward_test                  0.814357
round_time_test        0 days 00:00:02.825619
round_time_total       0 days 00:11:15.286273
loss_total                        1053.966467
loss_critic                       1395.118082
loss_actor                        -310.640086
memory_size                       818273.2105 

=== epoch 10/10 ==== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:29<00:00,  2.90it/s]
episodes                                    2
episode_length                          517.0
returns                           -223.689255
return_std                         192.302157
average_reward                       -0.42543
round_time             0 days 00:11:30.437175
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       836.593766
return_std_test                     30.692751
average_reward_test                  0.836594
round_time_test        0 days 00:00:02.815674
round_time_total       0 days 00:11:30.438269
loss_total                        1056.814519
loss_critic                       1398.652109
loss_actor                        -310.535943
memory_size                       820021.2945 

=== epoch 10/10 ==== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:22<00:00,  2.93it/s]
episodes                                    2
episode_length                          526.5
returns                           -210.562901
return_std                         206.211985
average_reward                      -0.404639
round_time             0 days 00:11:23.223680
episodes_test                             7.0
episode_length_test                157.142857
returns_test                        98.816886
return_std_test                     244.20838
average_reward_test                  0.643802
round_time_test        0 days 00:00:02.780144
round_time_total       0 days 00:11:23.224780
loss_total                        1058.726552
loss_critic                       1401.149457
loss_actor                        -310.965168
memory_size                        821938.247 

=== epoch 10/10 ==== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:13<00:00,  2.97it/s]
episodes                                    4
episode_length                          304.0
returns                           -150.781748
return_std                         145.026445
average_reward                       -0.45366
round_time             0 days 00:11:14.240035
episodes_test                             6.0
episode_length_test                200.666667
returns_test                       141.586304
return_std_test                    307.504346
average_reward_test                  0.738972
round_time_test        0 days 00:00:02.774889
round_time_total       0 days 00:11:14.241127
loss_total                        1076.507353
loss_critic                       1423.434446
loss_actor                        -311.201121
memory_size                       823738.3895 

=== epoch 10/10 ==== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:15<00:00,  2.96it/s]
episodes                                   12
episode_length                     115.333333
returns                            -49.702205
return_std                         102.186519
average_reward                      -0.435476
round_time             0 days 00:11:15.751920
episodes_test                             3.0
episode_length_test                     353.0
returns_test                       284.690434
return_std_test                    421.746508
average_reward_test                  0.837199
round_time_test        0 days 00:00:02.748991
round_time_total       0 days 00:11:15.753004
loss_total                        1042.249494
loss_critic                       1380.678124
loss_actor                        -311.465126
memory_size                        825439.546 

=== epoch 10/10 ==== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:17<00:00,  2.95it/s]
episodes                                    6
episode_length                          186.0
returns                             -90.09168
return_std                         169.685993
average_reward                      -0.449405
round_time             0 days 00:11:18.024452
episodes_test                             4.0
episode_length_test                    271.25
returns_test                       206.809591
return_std_test                    365.177522
average_reward_test                  0.764986
round_time_test        0 days 00:00:02.780426
round_time_total       0 days 00:11:18.025550
loss_total                        1067.781567
loss_critic                       1412.626144
loss_actor                        -311.596842
memory_size                       827273.9975 

=== epoch 10/10 ==== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:24<00:00,  2.92it/s]
episodes                                    9
episode_length                      51.222222
returns                            -32.144496
return_std                          30.837692
average_reward                      -0.462135
round_time             0 days 00:11:25.065082
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       821.186959
return_std_test                       2.21892
average_reward_test                  0.821187
round_time_test        0 days 00:00:02.763694
round_time_total       0 days 00:11:25.066190
loss_total                        1054.369213
loss_critic                        1395.69434
loss_actor                        -310.931386
memory_size                        828998.915 

=== epoch 10/10 ==== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:20<00:00,  2.94it/s]
episodes                                    7
episode_length                     173.142857
returns                            -73.073174
return_std                         134.311207
average_reward                      -0.430765
round_time             0 days 00:11:20.963622
episodes_test                             4.0
episode_length_test                    271.25
returns_test                       169.894999
return_std_test                    299.039102
average_reward_test                  0.730408
round_time_test        0 days 00:00:02.853062
round_time_total       0 days 00:11:20.964764
loss_total                        1072.368887
loss_critic                       1418.403178
loss_actor                        -311.768368
memory_size                        830746.973 

=== epoch 10/10 ==== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:34<00:00,  2.88it/s]
episodes                                    3
episode_length                     341.666667
returns                           -144.450296
return_std                         197.536635
average_reward                      -0.417262
round_time             0 days 00:11:35.393698
episodes_test                             5.0
episode_length_test                     216.4
returns_test                       166.969165
return_std_test                    341.673981
average_reward_test                  0.768653
round_time_test        0 days 00:00:02.774056
round_time_total       0 days 00:11:35.394806
loss_total                         1053.92347
loss_critic                       1395.452769
loss_actor                        -312.193826
memory_size                        832589.643 

=== epoch 10/10 ==== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:28<00:00,  2.91it/s]
episodes                                    7
episode_length                     175.285714
returns                            -92.086004
return_std                         137.091138
average_reward                       -0.50439
round_time             0 days 00:11:28.500606
episodes_test                             6.0
episode_length_test                195.833333
returns_test                       137.985861
return_std_test                    304.611157
average_reward_test                  0.744452
round_time_test        0 days 00:00:02.782493
round_time_total       0 days 00:11:28.501724
loss_total                        1040.312078
loss_critic                       1378.428305
loss_actor                        -312.152927
memory_size                        834428.549 

=== epoch 10/10 ==== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:21<00:00,  2.93it/s]
episodes                                   14
episode_length                      37.142857
returns                            -19.188465
return_std                          18.526146
average_reward                      -0.439281
round_time             0 days 00:11:22.192112
episodes_test                             8.0
episode_length_test                   163.875
returns_test                       106.645196
return_std_test                    300.135115
average_reward_test                  0.718646
round_time_test        0 days 00:00:02.775270
round_time_total       0 days 00:11:22.193219
loss_total                        1035.969855
loss_critic                       1373.222595
loss_actor                        -313.041206
memory_size                        836086.771 

=== epoch 10/10 ==== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:24<00:00,  2.92it/s]
episodes                                    7
episode_length                     180.571429
returns                            -74.324535
return_std                         137.220605
average_reward                       -0.39759
round_time             0 days 00:11:24.640815
episodes_test                             4.0
episode_length_test                    290.25
returns_test                       214.556871
return_std_test                    357.379153
average_reward_test                  0.788315
round_time_test        0 days 00:00:02.800519
round_time_total       0 days 00:11:24.641907
loss_total                        1023.649957
loss_critic                       1357.745155
loss_actor                         -312.73092
memory_size                       837715.7685 

=== epoch 10/10 ==== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:20<00:00,  2.94it/s]
episodes                                    9
episode_length                     158.333333
returns                            -85.317676
return_std                          133.16878
average_reward                      -0.490006
round_time             0 days 00:11:20.861700
episodes_test                             3.0
episode_length_test                346.333333
returns_test                       247.344194
return_std_test                    362.123483
average_reward_test                  0.753846
round_time_test        0 days 00:00:02.778523
round_time_total       0 days 00:11:20.862792
loss_total                        1034.875126
loss_critic                       1371.941218
loss_actor                        -313.389342
memory_size                       839477.8485 

=== epoch 10/10 ==== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:24<00:00,  2.92it/s]
episodes                                    2
episode_length                          508.5
returns                           -199.002788
return_std                         208.126074
average_reward                       -0.38049
round_time             0 days 00:11:25.200123
episodes_test                             2.0
episode_length_test                     512.0
returns_test                       372.060066
return_std_test                    401.357023
average_reward_test                  0.759504
round_time_test        0 days 00:00:02.760338
round_time_total       0 days 00:11:25.201210
loss_total                        1036.900485
loss_critic                       1374.485129
loss_actor                        -313.438174
memory_size                       841313.2145 

=== epoch 10/10 ==== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:28<00:00,  2.90it/s]
episodes                                    2
episode_length                          536.0
returns                           -222.315928
return_std                         186.777422
average_reward                      -0.418081
round_time             0 days 00:11:29.095378
episodes_test                             5.0
episode_length_test                     227.6
returns_test                       170.944884
return_std_test                    337.086422
average_reward_test                   0.81685
round_time_test        0 days 00:00:02.771952
round_time_total       0 days 00:11:29.096469
loss_total                        1039.210951
loss_critic                       1377.410578
loss_actor                        -313.587652
memory_size                       843237.2675 

=== epoch 10/10 ==== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:30<00:00,  2.90it/s]
episodes                                    3
episode_length                           24.0
returns                             -13.70054
return_std                           1.105916
average_reward                      -0.408605
round_time             0 days 00:11:30.896990
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       894.691207
return_std_test                      4.972443
average_reward_test                  0.894691
round_time_test        0 days 00:00:02.766515
round_time_total       0 days 00:11:30.898101
loss_total                        1032.011166
loss_critic                       1368.652305
loss_actor                        -314.553478
memory_size                       845106.0265 

=== epoch 10/10 ==== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:29<00:00,  2.90it/s]
episodes                                    7
episode_length                          174.0
returns                            -82.296801
return_std                         122.405248
average_reward                      -0.430511
round_time             0 days 00:11:29.962541
episodes_test                             5.0
episode_length_test                     220.0
returns_test                       156.432307
return_std_test                    313.717143
average_reward_test                  0.777358
round_time_test        0 days 00:00:02.779140
round_time_total       0 days 00:11:29.963632
loss_total                        1038.432521
loss_critic                       1376.653131
loss_actor                        -314.450015
memory_size                       846922.9755 

=== epoch 10/10 ==== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:29<00:00,  2.90it/s]
episodes                                   20
episode_length                          85.95
returns                            -43.351214
return_std                          86.116846
average_reward                       -0.48661
round_time             0 days 00:11:29.858550
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       805.621355
return_std_test                     10.298625
average_reward_test                  0.805621
round_time_test        0 days 00:00:02.789026
round_time_total       0 days 00:11:29.859672
loss_total                         1046.49733
loss_critic                       1386.285414
loss_actor                        -312.655096
memory_size                       848473.5275 

=== epoch 10/10 ==== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:36<00:00,  2.87it/s]
episodes                                    5
episode_length                          231.2
returns                           -106.285885
return_std                         154.189419
average_reward                      -0.435008
round_time             0 days 00:11:36.724551
episodes_test                             2.0
episode_length_test                     507.5
returns_test                       438.866917
return_std_test                    430.372907
average_reward_test                  0.841358
round_time_test        0 days 00:00:02.798075
round_time_total       0 days 00:11:36.725646
loss_total                        1075.519439
loss_critic                       1422.485936
loss_actor                        -312.346663
memory_size                        850138.204 

=== epoch 10/10 ==== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:33<00:00,  2.88it/s]
episodes                                    2
episode_length                          530.5
returns                           -220.628419
return_std                         193.168842
average_reward                      -0.375679
round_time             0 days 00:11:34.238642
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       694.470484
return_std_test                     41.239209
average_reward_test                   0.69447
round_time_test        0 days 00:00:02.793299
round_time_total       0 days 00:11:34.239736
loss_total                        1043.397476
loss_critic                        1382.37061
loss_actor                        -312.495161
memory_size                       852025.2915 

=== epoch 10/10 ==== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:37<00:00,  2.87it/s]
episodes                                    8
episode_length                        147.875
returns                            -62.788122
return_std                         127.550455
average_reward                      -0.430406
round_time             0 days 00:11:38.031012
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       873.457479
return_std_test                      0.003121
average_reward_test                  0.873457
round_time_test        0 days 00:00:02.740860
round_time_total       0 days 00:11:38.032106
loss_total                        1051.843208
loss_critic                        1393.14632
loss_actor                        -313.369334
memory_size                       853856.8505 

=== epoch 10/10 ==== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:38<00:00,  2.86it/s]
episodes                                   10
episode_length                          135.6
returns                            -56.255621
return_std                         124.888873
average_reward                      -0.399746
round_time             0 days 00:11:39.354417
episodes_test                             3.0
episode_length_test                352.333333
returns_test                       260.758717
return_std_test                    388.051806
average_reward_test                  0.774355
round_time_test        0 days 00:00:02.738945
round_time_total       0 days 00:11:39.355502
loss_total                         1042.59768
loss_critic                       1381.725437
loss_actor                         -313.91343
memory_size                       855622.3885 

=== epoch 10/10 ==== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:42<00:00,  2.85it/s]
episodes                                    3
episode_length                     375.333333
returns                           -199.307883
return_std                         195.630108
average_reward                       -0.48158
round_time             0 days 00:11:43.470494
episodes_test                             2.0
episode_length_test                    1000.0
returns_test                       833.244079
return_std_test                      5.177758
average_reward_test                  0.833244
round_time_test        0 days 00:00:02.798256
round_time_total       0 days 00:11:43.471589
loss_total                        1050.224971
loss_critic                       1391.063356
loss_actor                         -313.12866
memory_size                        857416.453 

=== epoch 10/10 ==== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:46<00:00,  2.83it/s]
episodes                                    3
episode_length                           33.0
returns                            -31.647508
return_std                          14.207415
average_reward                      -0.421941
round_time             0 days 00:11:47.157138
episodes_test                             3.0
episode_length_test                     344.0
returns_test                       269.064046
return_std_test                    391.291032
average_reward_test                  0.778813
round_time_test        0 days 00:00:02.782627
round_time_total       0 days 00:11:47.158256
loss_total                        1052.995709
loss_critic                       1394.754976
loss_actor                        -314.041463
memory_size                       859245.5695 

=== epoch 10/10 ==== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:52<00:00,  2.81it/s]
episodes                                    1
episode_length                         1000.0
returns                           -435.885396
return_std                                0.0
average_reward                      -0.430889
round_time             0 days 00:11:52.916969
episodes_test                             6.0
episode_length_test                186.166667
returns_test                       126.146265
return_std_test                    299.899726
average_reward_test                  0.741117
round_time_test        0 days 00:00:02.735086
round_time_total       0 days 00:11:52.918069
loss_total                        1059.863017
loss_critic                       1403.421394
loss_actor                        -314.370598
memory_size                        861134.469 

=== epoch 10/10 ==== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:02<00:00,  2.77it/s]
episodes                                    6
episode_length                     196.333333
returns                            -78.945738
return_std                         125.408818
average_reward                      -0.420061
round_time             0 days 00:12:03.468371
episodes_test                             2.0
episode_length_test                     507.0
returns_test                       405.233796
return_std_test                     411.42791
average_reward_test                  0.790507
round_time_test        0 days 00:00:02.768603
round_time_total       0 days 00:12:03.469462
loss_total                         1053.96994
loss_critic                       1396.301047
loss_actor                        -315.354584
memory_size                       863003.6595 


