/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
=== specification ====================================================
+: rlrd.training:Training
epochs: 10
rounds: 50
steps: 2000
stats_window: null
seed: 0
tag: ''
Env:
   +: rlrd.envs:RandomDelayEnv
   seed_val: 0
   id: Humanoid-v4
   frame_skip: 0
   min_observation_delay: 0
   sup_observation_delay: 1
   min_action_delay: 0
   sup_action_delay: 1
   real_world_sampler: 7
   action_noise: 0.05
Test:
   +: rlrd.testing:Test
   workers: 1
   number: 1
   device: cpu
Agent:
   +: rlrd.dcac:Agent
   batchsize: 128
   memory_size: 1000000
   lr: 0.0003
   discount: 0.99
   target_update: 0.005
   reward_scale: 5.0
   entropy_scale: 1.0
   start_training: 10000
   device: cpu
   training_steps: 1.0
   loss_alpha: 0.2
   rtac: false
   Model:
      +: rlrd.dcac_models:Mlp
      hidden_units: 256
      num_critics: 2
      act_delay: true
      obs_delay: true
   OutputNorm:
      +: rlrd.nn:PopArt
      beta: 0.0003
      zero_debias: true
      start_pop: 8
__format_version__: '3'
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>

<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 1/10 ===== round 1/50 ======================================
 96%|█████████▌| 1911/2000 [00:03<00:00, 479.73it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:03<00:00, 514.80it/s]
episodes                                   91
episode_length                       21.67033
returns                            108.541585
return_std                          19.131263
average_reward                       5.008781
round_time             0 days 00:00:03.981324
episodes_test                            88.0
episode_length_test                 22.556818
returns_test                       115.125543
return_std_test                     17.037065
average_reward_test                  5.103255
round_time_test        0 days 00:00:04.944061
round_time_total       0 days 00:00:07.725530 

=== epoch 1/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
 56%|█████▌    | 1123/2000 [00:02<00:01, 562.20it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:03<00:00, 561.13it/s]
episodes                                   92
episode_length                           21.5
returns                            107.120324
return_std                          25.536769
average_reward                       4.981333
round_time             0 days 00:00:05.898324
episodes_test                            85.0
episode_length_test                 23.470588
returns_test                       119.634452
return_std_test                     18.954335
average_reward_test                  5.096831
round_time_test        0 days 00:00:04.371293
round_time_total       0 days 00:00:07.783349 

=== epoch 1/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
 57%|█████▋    | 1149/2000 [00:02<00:01, 522.49it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:03<00:00, 537.89it/s]
episodes                                   93
episode_length                      21.258065
returns                            105.935582
return_std                          20.626549
average_reward                       4.982636
round_time             0 days 00:00:05.574392
episodes_test                            86.0
episode_length_test                 23.023256
returns_test                       117.469002
return_std_test                     18.688085
average_reward_test                  5.101446
round_time_test        0 days 00:00:04.526722
round_time_total       0 days 00:00:07.570651 

=== epoch 1/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
 59%|█████▉    | 1188/2000 [00:02<00:01, 694.41it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:03<00:00, 627.83it/s]
episodes                                   91
episode_length                      21.769231
returns                            108.490782
return_std                          24.488801
average_reward                       4.983018
round_time             0 days 00:00:04.993800
episodes_test                            88.0
episode_length_test                 22.522727
returns_test                       114.271586
return_std_test                     17.622122
average_reward_test                  5.072835
round_time_test        0 days 00:00:04.354228
round_time_total       0 days 00:00:07.252277 

=== epoch 1/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
 57%|█████▋    | 1144/2000 [00:02<00:01, 557.90it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:03<00:00, 563.25it/s]
episodes                                   93
episode_length                      21.258065
returns                            106.213151
return_std                          21.522969
average_reward                       4.993597
round_time             0 days 00:00:05.391885
episodes_test                            89.0
episode_length_test                 22.325843
returns_test                       113.459442
return_std_test                     18.579383
average_reward_test                  5.081301
round_time_test        0 days 00:00:05.056883
round_time_total       0 days 00:00:07.863297 

=== epoch 1/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 0/2000 [00:00<?, ?it/s]/home/anon/20260123-icml-dcac/dcac/rlrd/nn.py:41: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly.  To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage()
  assert b.storage().data_ptr() == a.storage().data_ptr()
  0%|          | 1/2000 [00:00<27:40,  1.20it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [25:51<00:00,  1.29it/s]
starting training
episodes                                   85
episode_length                           23.2
returns                            116.228287
return_std                          35.473768
average_reward                       5.009507
round_time             0 days 00:25:53.077191
episodes_test                            88.0
episode_length_test                 22.602273
returns_test                        115.04642
return_std_test                     18.135602
average_reward_test                  5.089341
round_time_test        0 days 00:00:03.936460
round_time_total       0 days 00:25:53.079738
loss_total                       12702.380333
loss_critic                      16009.183994
loss_actor                        -524.835382
memory_size                            51.141 

=== epoch 1/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<28:35,  1.16it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [33:26<00:00,  1.00s/it]
episodes                                  102
episode_length                      19.509804
returns                             96.890316
return_std                          19.371672
average_reward                       4.966423
round_time             0 days 00:33:28.354031
episodes_test                           100.0
episode_length_test                     19.88
returns_test                       100.494826
return_std_test                     17.761123
average_reward_test                    5.0546
round_time_test        0 days 00:00:04.036247
round_time_total       0 days 00:33:28.357128
loss_total                        -407.508196
loss_critic                        665.587588
loss_actor                       -4699.891313
memory_size                              76.0 

=== epoch 1/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 1/2000 [00:00<31:44,  1.05it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [42:56<00:00,  1.29s/it]
episodes                                  105
episode_length                      18.971429
returns                             93.735998
return_std                          19.164596
average_reward                       4.940577
round_time             0 days 00:42:58.267575
episodes_test                           109.0
episode_length_test                 18.311927
returns_test                        91.378595
return_std_test                      13.36419
average_reward_test                  4.989917
round_time_test        0 days 00:00:03.815830
round_time_total       0 days 00:42:58.270116
loss_total                      183025.368907
loss_critic                     281277.491883
loss_actor                     -209983.140016
memory_size                            76.834 

=== epoch 1/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<31:49,  1.05it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [35:10<00:00,  1.06s/it]
episodes                                  109
episode_length                      18.174312
returns                             89.189748
return_std                          15.568887
average_reward                       4.907426
round_time             0 days 00:35:12.586111
episodes_test                            99.0
episode_length_test                  20.10101
returns_test                       100.788605
return_std_test                     18.836174
average_reward_test                  5.013858
round_time_test        0 days 00:00:04.704820
round_time_total       0 days 00:35:12.588443
loss_total               1485073030409.370605
loss_critic              1856343542228.440186
loss_actor                     -9136257.07375
memory_size                              77.0 

=== epoch 1/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 1/2000 [00:00<32:26,  1.03it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [35:48<00:00,  1.07s/it]
episodes                                  108
episode_length                      18.324074
returns                             89.571577
return_std                          15.822045
average_reward                       4.888535
round_time             0 days 00:35:50.221092
episodes_test                           105.0
episode_length_test                 18.885714
returns_test                        93.023357
return_std_test                     14.771522
average_reward_test                  4.925651
round_time_test        0 days 00:00:03.765246
round_time_total       0 days 00:35:50.223833
loss_total               357847361975746.5625
loss_critic                 447309202870763.5
loss_actor                       -66333358.85
memory_size                              77.0 

=== epoch 1/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 1/2000 [00:00<31:43,  1.05it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [32:33<00:00,  1.02it/s]
episodes                                  106
episode_length                      18.783019
returns                             91.739481
return_std                          17.918047
average_reward                       4.884485
round_time             0 days 00:32:35.176475
episodes_test                           120.0
episode_length_test                 16.641667
returns_test                         80.59097
return_std_test                      8.193717
average_reward_test                  4.842775
round_time_test        0 days 00:00:03.839426
round_time_total       0 days 00:32:35.179072
loss_total                 2823493403310817.5
loss_critic                3529366693732155.5
loss_actor                      -164643789.78
memory_size                              77.0 

=== epoch 1/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<32:19,  1.03it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [27:43<00:00,  1.20it/s]
episodes                                  102
episode_length                      19.490196
returns                             95.461673
return_std                          17.921553
average_reward                       4.897334
round_time             0 days 00:27:45.628786
episodes_test                           101.0
episode_length_test                 19.772277
returns_test                        96.932488
return_std_test                     16.264068
average_reward_test                  4.902393
round_time_test        0 days 00:00:05.136954
round_time_total       0 days 00:27:45.630932
loss_total                10013438852242342.0
loss_critic               12516798364211216.0
loss_actor                      -298371582.88
memory_size                              77.0 

=== epoch 1/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<26:16,  1.27it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [24:15<00:00,  1.37it/s]
episodes                                   90
episode_length                      21.822222
returns                            105.905111
return_std                          44.003816
average_reward                       4.855951
round_time             0 days 00:24:16.649153
episodes_test                           111.0
episode_length_test                 17.954955
returns_test                        87.473816
return_std_test                     14.310063
average_reward_test                  4.871564
round_time_test        0 days 00:00:04.809714
round_time_total       0 days 00:24:16.651552
loss_total                46173798373517168.0
loss_critic               57717246869263880.0
loss_actor                     -491884786.464
memory_size                           86.4435 

=== epoch 1/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:02<21:01,  1.58it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [24:49<00:00,  1.34it/s]
episodes                                  102
episode_length                      19.303922
returns                             93.722651
return_std                          57.714414
average_reward                       4.861238
round_time             0 days 00:24:50.581203
episodes_test                            80.0
episode_length_test                     24.75
returns_test                       120.246901
return_std_test                     83.809634
average_reward_test                   4.85916
round_time_test        0 days 00:00:04.955858
round_time_total       0 days 00:24:50.583425
loss_total                89419324914768288.0
loss_critic              111774154364270144.0
loss_actor                     -756837691.424
memory_size                          307.0895 

=== epoch 1/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<22:17,  1.49it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [24:21<00:00,  1.37it/s]
episodes                                  115
episode_length                      17.269565
returns                             82.910687
return_std                          37.782831
average_reward                       4.800812
round_time             0 days 00:24:22.998806
episodes_test                           117.0
episode_length_test                 17.017094
returns_test                        81.495467
return_std_test                      19.29192
average_reward_test                  4.789052
round_time_test        0 days 00:00:05.025604
round_time_total       0 days 00:24:23.001085
loss_total                44939381768212968.0
loss_critic               56174226243361704.0
loss_actor                     -556550585.456
memory_size                           369.022 

=== epoch 1/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 1/2000 [00:00<21:56,  1.52it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:49<00:00,  1.46it/s]
episodes                                  122
episode_length                      16.262295
returns                             77.588725
return_std                           2.135877
average_reward                       4.771072
round_time             0 days 00:22:50.957419
episodes_test                           120.0
episode_length_test                 16.666667
returns_test                        79.805356
return_std_test                     13.284934
average_reward_test                  4.788321
round_time_test        0 days 00:00:03.809280
round_time_total       0 days 00:22:50.959301
loss_total                28022384193011976.0
loss_critic               35027979629366344.0
loss_actor                      -386642662.24
memory_size                             401.0 

=== epoch 1/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<24:49,  1.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:54<00:00,  1.46it/s]
episodes                                  115
episode_length                      17.269565
returns                             83.255104
return_std                          12.471636
average_reward                       4.820529
round_time             0 days 00:22:55.425730
episodes_test                           125.0
episode_length_test                    15.992
returns_test                        76.337353
return_std_test                       1.86931
average_reward_test                  4.773544
round_time_test        0 days 00:00:04.656807
round_time_total       0 days 00:22:55.427263
loss_total                24167982563713876.0
loss_critic               30209977671932184.0
loss_actor                     -408545546.752
memory_size                             401.0 

=== epoch 1/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<22:35,  1.47it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [23:21<00:00,  1.43it/s]
episodes                                  118
episode_length                      16.762712
returns                             80.452896
return_std                            7.33709
average_reward                       4.800194
round_time             0 days 00:23:22.594915
episodes_test                           118.0
episode_length_test                 16.940678
returns_test                        81.678526
return_std_test                      8.133398
average_reward_test                  4.821477
round_time_test        0 days 00:00:03.856370
round_time_total       0 days 00:23:22.597190
loss_total                26079542496892812.0
loss_critic               32599427567870540.0
loss_actor                     -509451535.856
memory_size                             401.0 

=== epoch 1/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<26:29,  1.26it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:15<00:00,  1.50it/s]
episodes                                  124
episode_length                      16.080645
returns                             76.770522
return_std                           1.551888
average_reward                       4.774198
round_time             0 days 00:22:16.476541
episodes_test                           118.0
episode_length_test                 16.864407
returns_test                         81.12056
return_std_test                      8.100076
average_reward_test                   4.81003
round_time_test        0 days 00:00:03.872703
round_time_total       0 days 00:22:16.478600
loss_total                31808714045969660.0
loss_critic               39760891868522480.0
loss_actor                     -611980763.328
memory_size                             401.0 

=== epoch 1/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<21:44,  1.53it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:34<00:00,  1.48it/s]
episodes                                  123
episode_length                      16.121951
returns                             76.933989
return_std                           2.869642
average_reward                       4.772068
round_time             0 days 00:22:35.369606
episodes_test                           124.0
episode_length_test                  16.08871
returns_test                        76.773091
return_std_test                      1.718629
average_reward_test                  4.772013
round_time_test        0 days 00:00:04.048453
round_time_total       0 days 00:22:35.371773
loss_total                39719319227038960.0
loss_critic               49649148182724080.0
loss_actor                     -729735759.104
memory_size                             401.0 

=== epoch 1/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<23:55,  1.39it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:35<00:00,  1.48it/s]
episodes                                  124
episode_length                      16.008065
returns                             76.432942
return_std                           3.033564
average_reward                       4.774626
round_time             0 days 00:22:36.339810
episodes_test                           125.0
episode_length_test                    15.928
returns_test                        76.095958
return_std_test                      2.389206
average_reward_test                    4.7776
round_time_test        0 days 00:00:04.010233
round_time_total       0 days 00:22:36.341533
loss_total                51879601416242728.0
loss_critic               64849500734947856.0
loss_actor                     -869297985.248
memory_size                             401.0 

=== epoch 1/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:40,  1.61it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:41<00:00,  1.47it/s]
episodes                                  124
episode_length                      15.919355
returns                             75.963567
return_std                           1.325965
average_reward                       4.771774
round_time             0 days 00:22:43.078294
episodes_test                           125.0
episode_length_test                    15.912
returns_test                        75.948043
return_std_test                      1.348966
average_reward_test                  4.773107
round_time_test        0 days 00:00:03.945328
round_time_total       0 days 00:22:43.080326
loss_total                67203623086062568.0
loss_critic               84004527371519520.0
loss_actor                    -1025977907.008
memory_size                             401.0 

=== epoch 1/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:02<18:31,  1.80it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:41<00:00,  1.54it/s]
episodes                                  124
episode_length                      16.104839
returns                             76.933707
return_std                           9.030579
average_reward                       4.777022
round_time             0 days 00:21:42.842801
episodes_test                           126.0
episode_length_test                 15.753968
returns_test                        75.227546
return_std_test                      2.138685
average_reward_test                  4.775069
round_time_test        0 days 00:00:03.883954
round_time_total       0 days 00:21:42.844216
loss_total                90501005339422352.0
loss_critic              113126254732415856.0
loss_actor                    -1190276377.664
memory_size                           401.717 

=== epoch 1/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:48,  1.68it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [20:33<00:00,  1.62it/s]
episodes                                  125
episode_length                           16.0
returns                             76.367016
return_std                           2.862849
average_reward                       4.772939
round_time             0 days 00:20:34.750632
episodes_test                           124.0
episode_length_test                 16.032258
returns_test                         76.50953
return_std_test                      1.276454
average_reward_test                  4.772324
round_time_test        0 days 00:00:04.590888
round_time_total       0 days 00:20:34.752500
loss_total               105480663084495280.0
loss_critic              131850826838058208.0
loss_actor                    -1360395900.096
memory_size                             402.0 

=== epoch 1/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<23:08,  1.44it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:04<00:00,  1.58it/s]
episodes                                  124
episode_length                      16.040323
returns                             76.531294
return_std                           1.781937
average_reward                       4.771159
round_time             0 days 00:21:05.349776
episodes_test                           125.0
episode_length_test                     15.96
returns_test                        76.153152
return_std_test                      1.100898
average_reward_test                  4.771557
round_time_test        0 days 00:00:04.474197
round_time_total       0 days 00:21:05.351610
loss_total               137768808974672464.0
loss_critic              172211008090530656.0
loss_actor                    -1580744367.872
memory_size                             402.0 

=== epoch 1/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<23:07,  1.44it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:11<00:00,  1.57it/s]
episodes                                  124
episode_length                      16.040323
returns                             76.558248
return_std                           1.472936
average_reward                       4.772863
round_time             0 days 00:21:12.966472
episodes_test                           124.0
episode_length_test                 16.024194
returns_test                        76.441751
return_std_test                      1.951871
average_reward_test                  4.770404
round_time_test        0 days 00:00:04.301602
round_time_total       0 days 00:21:12.968199
loss_total               186610065696899456.0
loss_critic              233262578158480128.0
loss_actor                    -1850530785.792
memory_size                             402.0 

=== epoch 1/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<26:52,  1.24it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [20:57<00:00,  1.59it/s]
episodes                                  123
episode_length                      16.113821
returns                             76.981382
return_std                           7.511659
average_reward                       4.777283
round_time             0 days 00:20:58.765416
episodes_test                           124.0
episode_length_test                  16.08871
returns_test                        76.835763
return_std_test                      5.675232
average_reward_test                  4.775816
round_time_test        0 days 00:00:04.250380
round_time_total       0 days 00:20:58.766845
loss_total               244142753973163520.0
loss_critic              305178437390876800.0
loss_actor                    -2151306056.896
memory_size                             402.0 

=== epoch 1/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:02<17:28,  1.90it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:06<00:00,  1.58it/s]
episodes                                  124
episode_length                      15.959677
returns                             76.154935
return_std                           1.204844
average_reward                       4.771718
round_time             0 days 00:21:07.301577
episodes_test                           125.0
episode_length_test                    15.936
returns_test                         76.12092
return_std_test                      2.676948
average_reward_test                  4.776725
round_time_test        0 days 00:00:04.121635
round_time_total       0 days 00:21:07.303008
loss_total               319562311963584256.0
loss_critic              399452883033140480.0
loss_actor                    -2472745146.368
memory_size                             402.0 

=== epoch 1/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:17,  1.73it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:25<00:00,  1.56it/s]
episodes                                  125
episode_length                         15.912
returns                             75.904122
return_std                           1.358081
average_reward                       4.770353
round_time             0 days 00:21:25.997579
episodes_test                           125.0
episode_length_test                    15.968
returns_test                        76.178909
return_std_test                      0.838944
average_reward_test                  4.770834
round_time_test        0 days 00:00:04.059696
round_time_total       0 days 00:21:25.999405
loss_total               407945577735902976.0
loss_critic              509931963631483776.0
loss_actor                    -2824364598.784
memory_size                             402.0 

=== epoch 1/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:38,  1.61it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:03<00:00,  1.58it/s]
episodes                                  125
episode_length                         15.936
returns                             76.029001
return_std                           1.107488
average_reward                       4.770828
round_time             0 days 00:21:04.292922
episodes_test                           123.0
episode_length_test                 16.154472
returns_test                        77.138223
return_std_test                      7.988224
average_reward_test                  4.774976
round_time_test        0 days 00:00:03.848925
round_time_total       0 days 00:21:04.294667
loss_total               528850527162787200.0
loss_critic              661063146906100736.0
loss_actor                     -3223758442.88
memory_size                             402.0 

=== epoch 1/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:02<19:09,  1.74it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:46<00:00,  1.53it/s]
episodes                                  125
episode_length                         15.952
returns                             76.092482
return_std                            1.02361
average_reward                       4.770207
round_time             0 days 00:21:47.747850
episodes_test                           124.0
episode_length_test                 16.120968
returns_test                        77.042161
return_std_test                      7.422352
average_reward_test                  4.779076
round_time_test        0 days 00:00:04.070153
round_time_total       0 days 00:21:47.749701
loss_total               668095846512576128.0
loss_critic              835119793600108416.0
loss_actor                    -3661464896.256
memory_size                             402.0 

=== epoch 1/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<25:21,  1.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:05<00:00,  1.58it/s]
episodes                                  124
episode_length                      15.959677
returns                             76.115305
return_std                           0.870452
average_reward                       4.769119
round_time             0 days 00:21:06.317882
episodes_test                           124.0
episode_length_test                 16.096774
returns_test                        76.844112
return_std_test                      7.861286
average_reward_test                  4.774015
round_time_test        0 days 00:00:04.194939
round_time_total       0 days 00:21:06.319413
loss_total               844680311498728320.0
loss_critic             1055850371764044416.0
loss_actor                    -4130363814.912
memory_size                             402.0 

=== epoch 1/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<21:46,  1.53it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:12<00:00,  1.57it/s]
episodes                                  124
episode_length                      15.975806
returns                             76.207796
return_std                           2.594141
average_reward                       4.770154
round_time             0 days 00:21:13.540203
episodes_test                           125.0
episode_length_test                    15.952
returns_test                        76.088585
return_std_test                      1.406663
average_reward_test                  4.769947
round_time_test        0 days 00:00:04.086387
round_time_total       0 days 00:21:13.541646
loss_total              1054312007765306880.0
loss_critic             1317889986629774336.0
loss_actor                    -4638679433.856
memory_size                             402.0 

=== epoch 1/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:14,  1.73it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:14<00:00,  1.57it/s]
episodes                                  124
episode_length                           16.0
returns                             76.379942
return_std                           5.570418
average_reward                       4.773806
round_time             0 days 00:21:15.630935
episodes_test                           125.0
episode_length_test                     15.96
returns_test                         76.14675
return_std_test                      1.255247
average_reward_test                   4.77118
round_time_test        0 days 00:00:04.934024
round_time_total       0 days 00:21:15.632760
loss_total              1283328803317655552.0
loss_critic             1604160974735133440.0
loss_actor                    -5198803788.288
memory_size                             402.0 

=== epoch 1/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<24:56,  1.33it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [20:27<00:00,  1.63it/s]
episodes                                  125
episode_length                         15.912
returns                             75.894303
return_std                           1.671758
average_reward                         4.7697
round_time             0 days 00:20:28.450507
episodes_test                           125.0
episode_length_test                    15.952
returns_test                        76.083985
return_std_test                      1.169591
average_reward_test                  4.769681
round_time_test        0 days 00:00:04.351628
round_time_total       0 days 00:20:28.451938
loss_total              1593296277785016320.0
loss_critic             1991620313490007296.0
loss_actor                    -5797582334.464
memory_size                             402.0 

=== epoch 1/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<22:03,  1.51it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [20:53<00:00,  1.60it/s]
episodes                                  124
episode_length                      15.951613
returns                             76.072799
return_std                           1.059588
average_reward                       4.768948
round_time             0 days 00:20:54.506841
episodes_test                           125.0
episode_length_test                    15.984
returns_test                        76.327253
return_std_test                      3.832158
average_reward_test                  4.775357
round_time_test        0 days 00:00:03.892215
round_time_total       0 days 00:20:54.508673
loss_total              1927961937017732864.0
loss_critic             2409952380092019712.0
loss_actor                    -6396169825.536
memory_size                             402.0 

=== epoch 1/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<23:02,  1.44it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [20:54<00:00,  1.59it/s]
episodes                                  125
episode_length                          15.96
returns                             76.133837
return_std                           0.938577
average_reward                       4.770417
round_time             0 days 00:20:55.296637
episodes_test                           125.0
episode_length_test                    15.928
returns_test                        75.971051
return_std_test                       1.34551
average_reward_test                  4.769626
round_time_test        0 days 00:00:03.981849
round_time_total       0 days 00:20:55.298458
loss_total              2256594099516710400.0
loss_critic             2820742575519160320.0
loss_actor                    -7061668985.344
memory_size                             402.0 

=== epoch 1/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:23,  1.72it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [20:20<00:00,  1.64it/s]
episodes                                  124
episode_length                      15.983871
returns                             76.348165
return_std                           3.117018
average_reward                       4.776519
round_time             0 days 00:20:21.358334
episodes_test                           125.0
episode_length_test                    15.952
returns_test                        76.127099
return_std_test                      2.039201
average_reward_test                  4.772355
round_time_test        0 days 00:00:03.779622
round_time_total       0 days 00:20:21.360349
loss_total              2763626493549427712.0
loss_critic             3454533058027013120.0
loss_actor                    -7823335465.984
memory_size                             402.0 

=== epoch 1/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:48,  1.77it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [20:37<00:00,  1.62it/s]
episodes                                  124
episode_length                      15.927419
returns                             75.980871
return_std                           1.243503
average_reward                       4.770359
round_time             0 days 00:20:38.906123
episodes_test                           125.0
episode_length_test                    15.984
returns_test                        76.230272
return_std_test                      0.616734
average_reward_test                  4.769312
round_time_test        0 days 00:00:04.135161
round_time_total       0 days 00:20:38.907594
loss_total              3334428838852273664.0
loss_critic             4168035974039069696.0
loss_actor                 -8620651464.959999
memory_size                             402.0 

=== epoch 1/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:37,  1.61it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [20:30<00:00,  1.62it/s]
episodes                                  125
episode_length                          15.92
returns                             75.975418
return_std                           1.325862
average_reward                       4.772349
round_time             0 days 00:20:31.949058
episodes_test                           124.0
episode_length_test                 16.008065
returns_test                        76.425233
return_std_test                      3.370365
average_reward_test                  4.774097
round_time_test        0 days 00:00:03.745545
round_time_total       0 days 00:20:31.950476
loss_total              4096122837824528384.0
loss_critic             5120153455248101376.0
loss_actor                    -9494953536.768
memory_size                             402.0 

=== epoch 1/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<23:40,  1.41it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [20:35<00:00,  1.62it/s]
episodes                                  124
episode_length                      15.975806
returns                             76.342314
return_std                           5.254813
average_reward                       4.778585
round_time             0 days 00:20:36.786100
episodes_test                           124.0
episode_length_test                 16.032258
returns_test                        76.543602
return_std_test                      3.973787
average_reward_test                  4.774344
round_time_test        0 days 00:00:03.775766
round_time_total       0 days 00:20:36.788003
loss_total              5176991687743277056.0
loss_critic             6471239496669916160.0
loss_actor                   -10467343791.104
memory_size                             402.0 

=== epoch 1/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<21:45,  1.53it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [20:58<00:00,  1.59it/s]
episodes                                  124
episode_length                      16.056452
returns                             76.582504
return_std                           3.766274
average_reward                       4.769622
round_time             0 days 00:21:00.058758
episodes_test                           124.0
episode_length_test                 16.056452
returns_test                         76.60616
return_std_test                      7.231412
average_reward_test                  4.771062
round_time_test        0 days 00:00:04.648433
round_time_total       0 days 00:21:00.060597
loss_total              6196121377359011840.0
loss_critic             7745151591612795904.0
loss_actor                -11506961618.431999
memory_size                             402.0 

=== epoch 1/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<25:26,  1.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [20:51<00:00,  1.60it/s]
episodes                                  124
episode_length                      15.959677
returns                             76.131907
return_std                           1.215416
average_reward                       4.770013
round_time             0 days 00:20:53.024176
episodes_test                           125.0
episode_length_test                    15.928
returns_test                        75.969311
return_std_test                      1.244231
average_reward_test                  4.769576
round_time_test        0 days 00:00:04.479144
round_time_total       0 days 00:20:53.026361
loss_total              7456265264003821568.0
loss_critic             9320331418479646720.0
loss_actor                    -12560356433.92
memory_size                             402.0 

=== epoch 1/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:02<22:57,  1.45it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [20:25<00:00,  1.63it/s]
episodes                                  125
episode_length                         15.952
returns                             76.162474
return_std                           4.441413
average_reward                        4.77438
round_time             0 days 00:20:26.886951
episodes_test                           125.0
episode_length_test                    15.888
returns_test                        75.787784
return_std_test                      1.489631
average_reward_test                  4.770089
round_time_test        0 days 00:00:04.842082
round_time_total       0 days 00:20:26.888844
loss_total              8784067674667569152.0
loss_critic            10980084405695930368.0
loss_actor                -13671552108.544001
memory_size                             402.0 

=== epoch 1/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<24:03,  1.38it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:02<00:00,  1.58it/s]
episodes                                  124
episode_length                      16.104839
returns                             76.886342
return_std                           9.253685
average_reward                       4.774226
round_time             0 days 00:21:03.117906
episodes_test                           126.0
episode_length_test                 15.849206
returns_test                        75.595522
return_std_test                      1.695285
average_reward_test                  4.769797
round_time_test        0 days 00:00:04.711446
round_time_total       0 days 00:21:03.119618
loss_total             10265930641815990272.0
loss_critic            12832413072472057856.0
loss_actor                -14804433032.191999
memory_size                           404.331 

=== epoch 1/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<24:55,  1.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [20:22<00:00,  1.64it/s]
episodes                                  124
episode_length                      15.903226
returns                             75.850239
return_std                           1.425024
average_reward                       4.769271
round_time             0 days 00:20:23.447213
episodes_test                           124.0
episode_length_test                 16.048387
returns_test                        76.620916
return_std_test                      5.691489
average_reward_test                  4.774461
round_time_test        0 days 00:00:05.335937
round_time_total       0 days 00:20:23.448987
loss_total             12142512240282650624.0
loss_critic            15178140037088997376.0
loss_actor                -15952622239.743999
memory_size                             405.0 

=== epoch 1/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<18:05,  1.84it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [20:47<00:00,  1.60it/s]
episodes                                  126
episode_length                      15.857143
returns                             75.650084
return_std                           1.662806
average_reward                       4.770785
round_time             0 days 00:20:48.776272
episodes_test                           123.0
episode_length_test                 16.195122
returns_test                        77.530987
return_std_test                     17.553042
average_reward_test                  4.787322
round_time_test        0 days 00:00:03.786545
round_time_total       0 days 00:20:48.777984
loss_total             14320051730770477056.0
loss_critic            17900064374497697792.0
loss_actor                -17220169538.560001
memory_size                             405.0 

=== epoch 1/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<22:01,  1.51it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:24<00:00,  1.56it/s]
episodes                                  125
episode_length                         15.856
returns                             75.641586
return_std                           1.693843
average_reward                       4.770496
round_time             0 days 00:21:24.970175
episodes_test                           125.0
episode_length_test                     15.88
returns_test                        75.776525
return_std_test                      4.057015
average_reward_test                  4.771761
round_time_test        0 days 00:00:03.875342
round_time_total       0 days 00:21:24.972346
loss_total             17054808327649462272.0
loss_critic            21318510029749280768.0
loss_actor                -18589210443.776001
memory_size                             405.0 

=== epoch 1/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<24:47,  1.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:11<00:00,  1.57it/s]
episodes                                  125
episode_length                         15.896
returns                             75.785151
return_std                           1.469961
average_reward                       4.767449
round_time             0 days 00:21:12.296494
episodes_test                           125.0
episode_length_test                    15.952
returns_test                         76.18401
return_std_test                      7.472468
average_reward_test                  4.775866
round_time_test        0 days 00:00:04.025452
round_time_total       0 days 00:21:12.298423
loss_total             19934678918587699200.0
loss_critic            24918348184240717824.0
loss_actor                   -20036930166.784
memory_size                             405.0 

=== epoch 1/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<21:49,  1.53it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:05<00:00,  1.58it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                  124
episode_length                      16.064516
returns                             76.672449
return_std                           8.418077
average_reward                       4.772816
round_time             0 days 00:21:06.330317
episodes_test                           126.0
episode_length_test                 15.801587
returns_test                        75.362032
return_std_test                      1.873877
average_reward_test                  4.769294
round_time_test        0 days 00:00:03.940459
round_time_total       0 days 00:21:06.331740
loss_total             24024547940638027776.0
loss_critic            30030684357212581888.0
loss_actor                -21744096844.799999
memory_size                             405.0 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 2/10 ===== round 1/50 ======================================
  0%|          | 4/2000 [00:02<21:32,  1.54it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:22<00:00,  1.56it/s]
episodes                                  126
episode_length                      15.833333
returns                              75.52568
return_std                           1.767333
average_reward                       4.770142
round_time             0 days 00:21:22.710511
episodes_test                           124.0
episode_length_test                 16.008065
returns_test                        76.398919
return_std_test                      8.505445
average_reward_test                  4.772456
round_time_test        0 days 00:00:04.173835
round_time_total       0 days 00:21:22.712368
loss_total             29729607675329540096.0
loss_critic            37162008950810189824.0
loss_actor                -23550978056.192001
memory_size                             405.0 

=== epoch 2/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:22,  1.81it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:18<00:00,  1.56it/s]
episodes                                  126
episode_length                      15.753968
returns                             75.105202
return_std                           2.133165
average_reward                       4.767421
round_time             0 days 00:21:19.058929
episodes_test                           125.0
episode_length_test                    15.904
returns_test                        75.883909
return_std_test                      4.914837
average_reward_test                  4.771415
round_time_test        0 days 00:00:05.167702
round_time_total       0 days 00:21:19.060461
loss_total             34378955017603268608.0
loss_critic            42973692974033518592.0
loss_actor                   -25460561870.848
memory_size                             405.0 

=== epoch 2/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<24:09,  1.38it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:33<00:00,  1.55it/s]
episodes                                  125
episode_length                         15.904
returns                             76.029199
return_std                          12.009789
average_reward                        4.78036
round_time             0 days 00:21:34.542909
episodes_test                           126.0
episode_length_test                 15.825397
returns_test                        75.490598
return_std_test                      1.968524
average_reward_test                  4.770297
round_time_test        0 days 00:00:04.536593
round_time_total       0 days 00:21:34.544777
loss_total             51624567836924551168.0
loss_critic            64530708699667718144.0
loss_actor                   -27742799405.056
memory_size                           411.615 

=== epoch 2/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<27:06,  1.23it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:44<00:00,  1.53it/s]
episodes                                  126
episode_length                      15.857143
returns                             75.685526
return_std                           4.013239
average_reward                       4.773093
round_time             0 days 00:21:45.379717
episodes_test                           126.0
episode_length_test                 15.833333
returns_test                         75.58364
return_std_test                      5.720098
average_reward_test                    4.7738
round_time_test        0 days 00:00:03.901749
round_time_total       0 days 00:21:45.381185
loss_total             62409044732039938048.0
loss_critic            78011304639478988800.0
loss_actor                   -30142872398.848
memory_size                             412.0 

=== epoch 2/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<23:05,  1.44it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:53<00:00,  1.52it/s]
episodes                                  125
episode_length                         15.824
returns                             75.498756
return_std                           1.826744
average_reward                       4.771093
round_time             0 days 00:21:54.396541
episodes_test                           126.0
episode_length_test                 15.857143
returns_test                        75.652351
return_std_test                        5.2452
average_reward_test                  4.771007
round_time_test        0 days 00:00:03.932730
round_time_total       0 days 00:21:54.397971
loss_total             67625428726637953024.0
loss_critic            84531784548201562112.0
loss_actor                -32333975137.279999
memory_size                             412.0 

=== epoch 2/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<24:32,  1.36it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:34<00:00,  1.55it/s]
episodes                                  126
episode_length                      15.730159
returns                              75.03169
return_std                           2.283723
average_reward                       4.769903
round_time             0 days 00:21:35.467535
episodes_test                           126.0
episode_length_test                 15.801587
returns_test                        75.404251
return_std_test                      6.462131
average_reward_test                  4.771992
round_time_test        0 days 00:00:04.236881
round_time_total       0 days 00:21:35.468966
loss_total             78193941074497470464.0
loss_critic            97742424650423681024.0
loss_actor                    -35032915000.32
memory_size                             412.0 

=== epoch 2/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<23:37,  1.41it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:57<00:00,  1.52it/s]
episodes                                   125
episode_length                          15.944
returns                              76.132717
return_std                             5.89304
average_reward                         4.77511
round_time              0 days 00:21:58.682751
episodes_test                            126.0
episode_length_test                  15.833333
returns_test                         75.521538
return_std_test                       1.738714
average_reward_test                   4.769868
round_time_test         0 days 00:00:04.199066
round_time_total        0 days 00:21:58.684604
loss_total              93877038726627819520.0
loss_critic            117346296531418415104.0
loss_actor                 -38100343426.047997
memory_size                              412.0 

=== epoch 2/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<21:27,  1.55it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:44<00:00,  1.53it/s]
episodes                                   125
episode_length                          15.816
returns                              75.528991
return_std                            4.648936
average_reward                        4.775341
round_time              0 days 00:21:45.624288
episodes_test                            125.0
episode_length_test                      15.92
returns_test                         76.112169
return_std_test                       7.274207
average_reward_test                   4.780923
round_time_test         0 days 00:00:04.186218
round_time_total        0 days 00:21:45.626161
loss_total             110399351596298338304.0
loss_critic            137999187222682386432.0
loss_actor                 -41266807087.103996
memory_size                              412.0 

=== epoch 2/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<25:17,  1.32it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:15<00:00,  1.50it/s]
episodes                                   124
episode_length                       15.983871
returns                              76.243955
return_std                            7.430473
average_reward                        4.769984
round_time              0 days 00:22:16.295757
episodes_test                            125.0
episode_length_test                     15.912
returns_test                         75.915458
return_std_test                       3.439141
average_reward_test                   4.771032
round_time_test         0 days 00:00:04.430615
round_time_total        0 days 00:22:16.297645
loss_total             128913122883999531008.0
loss_critic            161141400969470050304.0
loss_actor                 -44546526867.456001
memory_size                              412.0 

=== epoch 2/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:12,  1.73it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:55<00:00,  1.52it/s]
episodes                                   126
episode_length                       15.833333
returns                              75.507858
return_std                             1.77894
average_reward                        4.768822
round_time              0 days 00:21:56.654079
episodes_test                            125.0
episode_length_test                     15.976
returns_test                         76.316513
return_std_test                       6.067697
average_reward_test                    4.77708
round_time_test         0 days 00:00:04.481627
round_time_total        0 days 00:21:56.655828
loss_total             151597547701188820992.0
loss_critic            189496931331249700864.0
loss_actor                 -48033718706.176003
memory_size                              412.0 

=== epoch 2/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<24:57,  1.33it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:30<00:00,  1.55it/s]
episodes                                   126
episode_length                        15.84127
returns                              75.552599
return_std                              1.7709
average_reward                         4.76948
round_time              0 days 00:21:31.657609
episodes_test                            125.0
episode_length_test                      15.88
returns_test                         75.738451
return_std_test                       1.552877
average_reward_test                    4.76964
round_time_test         0 days 00:00:04.991706
round_time_total        0 days 00:21:31.659565
loss_total             177945264431464873984.0
loss_critic            222431576537658523648.0
loss_actor                 -51577074370.559998
memory_size                              412.0 

=== epoch 2/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<22:12,  1.50it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:24<00:00,  1.49it/s]
episodes                                   125
episode_length                          15.848
returns                              75.590981
return_std                            1.695418
average_reward                        4.769627
round_time              0 days 00:22:25.296069
episodes_test                            126.0
episode_length_test                  15.849206
returns_test                         75.617356
return_std_test                       1.729347
average_reward_test                   4.771215
round_time_test         0 days 00:00:04.614467
round_time_total        0 days 00:22:25.297960
loss_total             202098790730383982592.0
loss_critic            252623484077605617664.0
loss_actor                 -55221191753.727997
memory_size                              412.0 

=== epoch 2/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<23:48,  1.40it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [23:50<00:00,  1.40it/s]
episodes                                   123
episode_length                       16.073171
returns                              76.664006
return_std                            7.170001
average_reward                        4.769556
round_time              0 days 00:23:51.052637
episodes_test                            124.0
episode_length_test                  16.016129
returns_test                         76.385338
return_std_test                       6.483862
average_reward_test                   4.769308
round_time_test         0 days 00:00:04.144819
round_time_total        0 days 00:23:51.054180
loss_total             232339954310979420160.0
loss_critic            290424938130037964800.0
loss_actor                 -58817633052.671997
memory_size                              412.0 

=== epoch 2/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<23:11,  1.44it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [23:10<00:00,  1.44it/s]
episodes                                   126
episode_length                       15.857143
returns                              75.627946
return_std                            2.241532
average_reward                        4.769404
round_time              0 days 00:23:11.655537
episodes_test                            125.0
episode_length_test                     15.984
returns_test                          76.18802
return_std_test                       4.500117
average_reward_test                   4.766662
round_time_test         0 days 00:00:03.767134
round_time_total        0 days 00:23:11.657413
loss_total             263019582009312608256.0
loss_critic            328774471958007513088.0
loss_actor                    -62894539214.848
memory_size                              412.0 

=== epoch 2/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<21:12,  1.57it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:34<00:00,  1.48it/s]
episodes                                   124
episode_length                       15.959677
returns                              76.160016
return_std                            5.682624
average_reward                        4.772022
round_time              0 days 00:22:35.803125
episodes_test                            126.0
episode_length_test                  15.849206
returns_test                         75.642494
return_std_test                       2.385974
average_reward_test                   4.772757
round_time_test         0 days 00:00:03.759199
round_time_total        0 days 00:22:35.805102
loss_total             294152100354723807232.0
loss_critic            367690118678109749248.0
loss_actor                 -66656475191.295998
memory_size                              412.0 

=== epoch 2/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<23:25,  1.42it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:37<00:00,  1.54it/s]
episodes                                   125
episode_length                          15.832
returns                              75.490894
return_std                            1.777321
average_reward                         4.76816
round_time              0 days 00:21:38.689647
episodes_test                            125.0
episode_length_test                     15.912
returns_test                         75.935339
return_std_test                       2.267733
average_reward_test                   4.772215
round_time_test         0 days 00:00:04.319423
round_time_total        0 days 00:21:38.691463
loss_total             329402830112140754944.0
loss_critic            411753530123914444800.0
loss_actor                 -70618848636.927994
memory_size                              412.0 

=== epoch 2/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<25:08,  1.32it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [20:59<00:00,  1.59it/s]
episodes                                   123
episode_length                       16.105691
returns                              76.863924
return_std                            7.674985
average_reward                        4.772389
round_time              0 days 00:21:00.703149
episodes_test                            126.0
episode_length_test                   15.81746
returns_test                         75.422065
return_std_test                       2.097978
average_reward_test                   4.768408
round_time_test         0 days 00:00:03.869157
round_time_total        0 days 00:21:00.704600
loss_total             363734137932992348160.0
loss_critic            454667664840605302784.0
loss_actor                 -74950916907.007996
memory_size                              412.0 

=== epoch 2/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<19:46,  1.68it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [20:51<00:00,  1.60it/s]
episodes                                   125
episode_length                           15.92
returns                              76.002841
return_std                            8.426644
average_reward                        4.774053
round_time              0 days 00:20:51.984782
episodes_test                            126.0
episode_length_test                  15.873016
returns_test                         75.732407
return_std_test                       3.875355
average_reward_test                   4.771142
round_time_test         0 days 00:00:04.766943
round_time_total        0 days 00:20:51.986559
loss_total             412583260446179065856.0
loss_critic            515729067691817631744.0
loss_actor                 -79184706166.783997
memory_size                              412.0 

=== epoch 2/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:51,  1.60it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:18<00:00,  1.56it/s]
episodes                                   126
episode_length                       15.761905
returns                              75.213555
return_std                             3.85893
average_reward                        4.771845
round_time              0 days 00:21:19.758267
episodes_test                            125.0
episode_length_test                     15.896
returns_test                         75.828305
return_std_test                       4.821833
average_reward_test                   4.770296
round_time_test         0 days 00:00:04.349253
round_time_total        0 days 00:21:19.760049
loss_total             452903492065467367424.0
loss_critic            566129355709597089792.0
loss_actor                 -83386685591.552002
memory_size                              412.0 

=== epoch 2/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<22:11,  1.50it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:10<00:00,  1.57it/s]
episodes                                   125
episode_length                          15.904
returns                              75.864562
return_std                            2.495231
average_reward                        4.770153
round_time              0 days 00:21:11.475162
episodes_test                            125.0
episode_length_test                     15.904
returns_test                         75.877801
return_std_test                          5.211
average_reward_test                   4.770912
round_time_test         0 days 00:00:04.270586
round_time_total        0 days 00:21:11.476620
loss_total             491555932207748546560.0
loss_critic            614444905152974815232.0
loss_actor                 -87933274935.296005
memory_size                              412.0 

=== epoch 2/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<24:38,  1.35it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:45<00:00,  1.53it/s]
episodes                                   124
episode_length                       15.919355
returns                              75.948461
return_std                            3.499442
average_reward                        4.770601
round_time              0 days 00:21:46.295159
episodes_test                            126.0
episode_length_test                  15.857143
returns_test                         75.670314
return_std_test                       4.792571
average_reward_test                   4.772155
round_time_test         0 days 00:00:03.935590
round_time_total        0 days 00:21:46.296933
loss_total             549243317004740132864.0
loss_critic            686554134306432745472.0
loss_actor                    -92740560617.472
memory_size                              412.0 

=== epoch 2/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<24:48,  1.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:12<00:00,  1.57it/s]
episodes                                   126
episode_length                       15.777778
returns                              75.274597
return_std                            2.119629
average_reward                        4.770962
round_time              0 days 00:21:13.488370
episodes_test                            126.0
episode_length_test                  15.761905
returns_test                          75.22226
return_std_test                       2.751234
average_reward_test                   4.772345
round_time_test         0 days 00:00:04.406455
round_time_total        0 days 00:21:13.490375
loss_total             607170598930955894784.0
loss_critic            758963235825797169152.0
loss_actor                 -97760108236.800003
memory_size                              412.0 

=== epoch 2/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<26:03,  1.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:43<00:00,  1.53it/s]
episodes                                   125
episode_length                          15.832
returns                              75.526209
return_std                            1.968811
average_reward                         4.77043
round_time              0 days 00:21:44.373255
episodes_test                            126.0
episode_length_test                  15.761905
returns_test                         75.166331
return_std_test                       2.993268
average_reward_test                   4.768921
round_time_test         0 days 00:00:04.480009
round_time_total        0 days 00:21:44.374664
loss_total             663836647464867332096.0
loss_critic            829795795367286538240.0
loss_actor                -102622884937.727997
memory_size                              412.0 

=== epoch 2/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<23:52,  1.39it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:12<00:00,  1.57it/s]
episodes                                   125
episode_length                          15.856
returns                               75.68249
return_std                            2.606428
average_reward                        4.773051
round_time              0 days 00:21:13.458090
episodes_test                            125.0
episode_length_test                      15.88
returns_test                         75.867585
return_std_test                        4.02799
average_reward_test                   4.777516
round_time_test         0 days 00:00:04.838832
round_time_total        0 days 00:21:13.459958
loss_total             745914045490917015552.0
loss_critic            932392541092251435008.0
loss_actor                -107697739554.815994
memory_size                              412.0 

=== epoch 2/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:23,  1.72it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:11<00:00,  1.57it/s]
episodes                                    126
episode_length                         15.81746
returns                               75.497901
return_std                              3.59725
average_reward                         4.773137
round_time               0 days 00:21:12.543008
episodes_test                             126.0
episode_length_test                   15.857143
returns_test                          75.656921
return_std_test                        4.720455
average_reward_test                    4.771291
round_time_test          0 days 00:00:04.227821
round_time_total         0 days 00:21:12.544387
loss_total              801975893420988956672.0
loss_critic            1002469849104885219328.0
loss_actor                 -112618496053.248001
memory_size                               412.0 

=== epoch 2/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<28:00,  1.19it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:18<00:00,  1.56it/s]
episodes                                    126
episode_length                        15.706349
returns                               74.927818
return_std                             2.167558
average_reward                         4.770332
round_time               0 days 00:21:19.332053
episodes_test                             126.0
episode_length_test                    15.81746
returns_test                          75.459058
return_std_test                        1.825279
average_reward_test                    4.770688
round_time_test          0 days 00:00:04.556447
round_time_total         0 days 00:21:19.333449
loss_total              891177467826922192896.0
loss_critic            1113971816931982049280.0
loss_actor                 -117786666930.175995
memory_size                               412.0 

=== epoch 2/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<23:34,  1.41it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:56<00:00,  1.52it/s]
episodes                                    126
episode_length                        15.801587
returns                               75.356723
return_std                             2.012572
average_reward                         4.768969
round_time               0 days 00:21:57.190840
episodes_test                             124.0
episode_length_test                   16.008065
returns_test                          76.473814
return_std_test                       14.128826
average_reward_test                    4.777038
round_time_test          0 days 00:00:03.920971
round_time_total         0 days 00:21:57.192832
loss_total              982693152003805544448.0
loss_critic            1228366418951308312576.0
loss_actor                 -123190117920.768005
memory_size                               412.0 

=== epoch 2/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:06,  1.66it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:58<00:00,  1.52it/s]
episodes                                    126
episode_length                        15.761905
returns                               75.160724
return_std                             2.024119
average_reward                         4.768545
round_time               0 days 00:21:58.935899
episodes_test                             124.0
episode_length_test                   16.032258
returns_test                          76.641531
return_std_test                       10.992367
average_reward_test                    4.780481
round_time_test          0 days 00:00:03.935624
round_time_total         0 days 00:21:58.937625
loss_total             1068757490796999344128.0
loss_critic            1335946840705572012032.0
loss_actor                 -128782187855.871994
memory_size                               412.0 

=== epoch 2/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<27:16,  1.22it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:55<00:00,  1.52it/s]
episodes                                    125
episode_length                           15.832
returns                               75.597367
return_std                             7.066286
average_reward                         4.774805
round_time               0 days 00:21:56.528662
episodes_test                             127.0
episode_length_test                   15.716535
returns_test                          75.063891
return_std_test                        3.999209
average_reward_test                    4.776215
round_time_test          0 days 00:00:05.774248
round_time_total         0 days 00:21:56.530357
loss_total             1163285759463554678784.0
loss_critic            1454107174770751569920.0
loss_actor                 -134808135217.151993
memory_size                               412.0 

=== epoch 2/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<25:34,  1.30it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:00<00:00,  1.59it/s]
episodes                                    126
episode_length                        15.769841
returns                               75.210546
return_std                             2.023116
average_reward                         4.769286
round_time               0 days 00:21:01.718743
episodes_test                             125.0
episode_length_test                       15.88
returns_test                          75.700241
return_std_test                        5.510227
average_reward_test                    4.766955
round_time_test          0 days 00:00:03.888428
round_time_total         0 days 00:21:01.720499
loss_total             1262425390188431409152.0
loss_critic            1578031710859077091328.0
loss_actor                 -140776545734.656006
memory_size                               412.0 

=== epoch 2/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:30,  1.71it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:07<00:00,  1.58it/s]
episodes                                    125
episode_length                            15.88
returns                                75.71672
return_std                             4.289319
average_reward                         4.768115
round_time               0 days 00:21:08.588273
episodes_test                             126.0
episode_length_test                   15.809524
returns_test                          75.436776
return_std_test                        2.491484
average_reward_test                    4.771705
round_time_test          0 days 00:00:03.898360
round_time_total         0 days 00:21:08.589646
loss_total             1381443286025526312960.0
loss_critic            1726804078768683745280.0
loss_actor                 -147040553328.640015
memory_size                               412.0 

=== epoch 2/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<21:17,  1.56it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:48<00:00,  1.53it/s]
episodes                                    125
episode_length                           15.904
returns                               75.844367
return_std                             3.302077
average_reward                         4.768925
round_time               0 days 00:21:49.592928
episodes_test                             126.0
episode_length_test                    15.84127
returns_test                          75.562497
return_std_test                         1.74007
average_reward_test                    4.770084
round_time_test          0 days 00:00:03.939559
round_time_total         0 days 00:21:49.594699
loss_total             1521128891292217245696.0
loss_critic            1901411079401490350080.0
loss_actor                 -153870124769.279999
memory_size                               412.0 

=== epoch 2/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<18:47,  1.77it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [20:17<00:00,  1.64it/s]
episodes                                    125
episode_length                           15.816
returns                                75.46962
return_std                             2.317756
average_reward                         4.771763
round_time               0 days 00:20:18.952577
episodes_test                             127.0
episode_length_test                   15.724409
returns_test                           74.99108
return_std_test                        2.118148
average_reward_test                    4.769241
round_time_test          0 days 00:00:04.384226
round_time_total         0 days 00:20:18.954262
loss_total             1633229215071183831040.0
loss_critic            2041536484780507660288.0
loss_actor                 -160152486264.832001
memory_size                               412.0 

=== epoch 2/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<20:05,  1.66it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:08<00:00,  2.07it/s]
episodes                                    125
episode_length                            15.84
returns                               75.528074
return_std                             1.753919
average_reward                         4.768259
round_time               0 days 00:16:09.074458
episodes_test                             125.0
episode_length_test                      15.984
returns_test                          76.280269
return_std_test                        8.084221
average_reward_test                    4.772424
round_time_test          0 days 00:00:03.869290
round_time_total         0 days 00:16:09.075775
loss_total             1781617187417717211136.0
loss_critic            2227021449430822289408.0
loss_actor                 -167136586276.864014
memory_size                               412.0 

=== epoch 2/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:23,  1.91it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:04<00:00,  2.21it/s]
episodes                                    126
episode_length                        15.777778
returns                                75.24335
return_std                             1.974728
average_reward                         4.769003
round_time               0 days 00:15:05.682325
episodes_test                             126.0
episode_length_test                   15.793651
returns_test                          75.356511
return_std_test                        1.941195
average_reward_test                    4.771314
round_time_test          0 days 00:00:03.664616
round_time_total         0 days 00:15:05.683536
loss_total             1895628216343852482560.0
loss_critic            2369535230310835290112.0
loss_actor                 -173618569568.256012
memory_size                               412.0 

=== epoch 2/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:39,  2.00it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                    124
episode_length                        15.967742
returns                               76.206797
return_std                            11.953445
average_reward                          4.77262
round_time               0 days 00:14:09.486242
episodes_test                             125.0
episode_length_test                       15.88
returns_test                          75.851689
return_std_test                        5.919431
average_reward_test                    4.776442
round_time_test          0 days 00:00:03.478918
round_time_total         0 days 00:14:09.487370
loss_total             2027805954815179882496.0
loss_critic            2534757402555569405952.0
loss_actor                 -180161696759.808014
memory_size                            412.0925 

=== epoch 2/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:46,  2.11it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [13:18<00:00,  2.50it/s]
episodes                                    125
episode_length                           15.928
returns                               75.956615
return_std                             3.697272
average_reward                          4.76883
round_time               0 days 00:13:19.269537
episodes_test                             126.0
episode_length_test                   15.849206
returns_test                          75.572258
return_std_test                        1.700995
average_reward_test                    4.768357
round_time_test          0 days 00:00:03.227839
round_time_total         0 days 00:13:19.270616
loss_total             2548504564630980395008.0
loss_critic            3185630649194663378944.0
loss_actor                 -187945441075.200012
memory_size                               422.0 

=== epoch 2/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:33,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:37<00:00,  2.64it/s]
episodes                                    125
episode_length                           15.896
returns                               75.900671
return_std                              4.75035
average_reward                         4.774834
round_time               0 days 00:12:37.883388
episodes_test                             126.0
episode_length_test                    15.84127
returns_test                          75.616689
return_std_test                        3.544176
average_reward_test                    4.773531
round_time_test          0 days 00:00:03.324054
round_time_total         0 days 00:12:37.884488
loss_total             2795164692065136148480.0
loss_critic            3493955808645687541760.0
loss_actor                    -196456342437.888
memory_size                               422.0 

=== epoch 2/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:41,  2.43it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                    125
episode_length                           15.848
returns                               75.582157
return_std                             1.712858
average_reward                         4.769247
round_time               0 days 00:12:20.333583
episodes_test                             125.0
episode_length_test                      15.896
returns_test                          75.871265
return_std_test                        1.635003
average_reward_test                    4.773023
round_time_test          0 days 00:00:03.235327
round_time_total         0 days 00:12:20.334670
loss_total             3070913353249327153152.0
loss_critic            3838641623057686855680.0
loss_actor                 -205299305373.696014
memory_size                               422.0 

=== epoch 2/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:22,  2.03it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:26<00:00,  2.68it/s]
episodes                                    125
episode_length                            15.84
returns                               75.553512
return_std                             2.780738
average_reward                         4.769831
round_time               0 days 00:12:27.438420
episodes_test                             124.0
episode_length_test                   16.024194
returns_test                          76.571035
return_std_test                        6.550194
average_reward_test                     4.77851
round_time_test          0 days 00:00:03.236249
round_time_total         0 days 00:12:27.439502
loss_total             3320638702154884317184.0
loss_critic            4150798304263820804096.0
loss_actor                 -215003909931.007996
memory_size                               422.0 

=== epoch 2/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:57,  2.23it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:26<00:00,  2.68it/s]
episodes                                    126
episode_length                        15.769841
returns                               75.359763
return_std                              5.44251
average_reward                         4.778709
round_time               0 days 00:12:27.208056
episodes_test                             126.0
episode_length_test                   15.777778
returns_test                          75.235299
return_std_test                        1.998649
average_reward_test                    4.768493
round_time_test          0 days 00:00:03.150235
round_time_total         0 days 00:12:27.209129
loss_total             3738592016648084389888.0
loss_critic            4673239942771168509952.0
loss_actor                 -224576202645.503998
memory_size                               422.0 

=== epoch 2/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:26,  2.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                    127
episode_length                        15.637795
returns                               74.599941
return_std                             2.513139
average_reward                         4.770566
round_time               0 days 00:12:26.320774
episodes_test                             126.0
episode_length_test                   15.825397
returns_test                           75.50467
return_std_test                        3.628045
average_reward_test                    4.771193
round_time_test          0 days 00:00:03.171503
round_time_total         0 days 00:12:26.321845
loss_total             4052541624564865040384.0
loss_critic            5065676945806191493120.0
loss_actor                      -234503111680.0
memory_size                               422.0 

=== epoch 2/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:43,  2.42it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                    126
episode_length                        15.746032
returns                               75.079409
return_std                             2.138561
average_reward                         4.768197
round_time               0 days 00:12:21.394481
episodes_test                             126.0
episode_length_test                   15.769841
returns_test                          75.274708
return_std_test                        9.111407
average_reward_test                    4.773464
round_time_test          0 days 00:00:03.171367
round_time_total         0 days 00:12:21.395564
loss_total             4452955166187587633152.0
loss_critic            5566193868366178942976.0
loss_actor                 -244861543088.127991
memory_size                               422.0 

=== epoch 2/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:06,  2.36it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                    126
episode_length                        15.730159
returns                               75.004133
return_std                             2.115937
average_reward                          4.76812
round_time               0 days 00:12:25.702746
episodes_test                             127.0
episode_length_test                   15.748031
returns_test                          75.144167
return_std_test                        2.542059
average_reward_test                    4.771655
round_time_test          0 days 00:00:03.205229
round_time_total         0 days 00:12:25.703824
loss_total             4773013802149466865664.0
loss_critic            5966267143193067782144.0
loss_actor                 -255140236386.303986
memory_size                               422.0 

=== epoch 2/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:09,  2.20it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                    126
episode_length                        15.801587
returns                               75.375099
return_std                             1.934681
average_reward                         4.770094
round_time               0 days 00:12:25.543081
episodes_test                             127.0
episode_length_test                   15.740157
returns_test                          75.150598
return_std_test                        4.710845
average_reward_test                      4.7745
round_time_test          0 days 00:00:03.185837
round_time_total         0 days 00:12:25.544157
loss_total             5175659387349840691200.0
loss_critic            6469574120242712412160.0
loss_actor                 -265350425067.519989
memory_size                               422.0 

=== epoch 2/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:10,  2.35it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:24<00:00,  2.68it/s]
episodes                                    124
episode_length                        15.983871
returns                               76.350136
return_std                            11.076137
average_reward                          4.77655
round_time               0 days 00:12:25.387966
episodes_test                             127.0
episode_length_test                   15.740157
returns_test                          75.071813
return_std_test                        2.108349
average_reward_test                    4.769514
round_time_test          0 days 00:00:03.160770
round_time_total         0 days 00:12:25.389037
loss_total             5841946463796488830976.0
loss_critic            7302432953732782096384.0
loss_actor                  -275596866314.23999
memory_size                             424.785 

=== epoch 2/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:05,  2.36it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                    126
episode_length                        15.801587
returns                               75.353654
return_std                             2.102538
average_reward                         4.768754
round_time               0 days 00:12:26.008098
episodes_test                             125.0
episode_length_test                      15.912
returns_test                          75.961393
return_std_test                        8.271254
average_reward_test                    4.773836
round_time_test          0 days 00:00:03.189814
round_time_total         0 days 00:12:26.009168
loss_total             6406630271900984541184.0
loss_critic            8008287705049716490240.0
loss_actor                 -286084488249.343994
memory_size                               427.0 

=== epoch 2/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:25,  2.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:22<00:00,  2.69it/s]
episodes                                    125
episode_length                           15.816
returns                               75.434925
return_std                             1.822537
average_reward                         4.769481
round_time               0 days 00:12:22.803133
episodes_test                             126.0
episode_length_test                   15.865079
returns_test                          75.778123
return_std_test                        9.212625
average_reward_test                    4.776469
round_time_test          0 days 00:00:03.140410
round_time_total         0 days 00:12:22.804214
loss_total             6829898276010933092352.0
loss_critic            8537372688513578827776.0
loss_actor                 -296498871459.840027
memory_size                               427.0 

=== epoch 2/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:11,  2.35it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                    125
episode_length                           15.872
returns                                75.67216
return_std                             1.605172
average_reward                         4.767617
round_time               0 days 00:12:20.832881
episodes_test                             126.0
episode_length_test                   15.857143
returns_test                          75.609356
return_std_test                        1.670718
average_reward_test                    4.768322
round_time_test          0 days 00:00:03.112908
round_time_total         0 days 00:12:20.833963
loss_total             7182765557711850438656.0
loss_critic            8978456796656253796352.0
loss_actor                 -306838517497.856018
memory_size                               427.0 

=== epoch 2/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:31,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                    124
episode_length                        15.919355
returns                               75.986469
return_std                             4.307724
average_reward                         4.774847
round_time               0 days 00:12:25.366261
episodes_test                             124.0
episode_length_test                   16.040323
returns_test                          76.523458
return_std_test                       11.538664
average_reward_test                    4.770666
round_time_test          0 days 00:00:03.206697
round_time_total         0 days 00:12:25.367327
loss_total             7841353706529397669888.0
loss_critic            9801691969941445541888.0
loss_actor                 -318179460431.872009
memory_size                               427.0 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 3/10 ===== round 1/50 ======================================
  0%|          | 4/2000 [00:01<14:52,  2.24it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                     123
episode_length                         16.146341
returns                                77.034001
return_std                             12.902344
average_reward                          4.770975
round_time                0 days 00:12:23.924142
episodes_test                              125.0
episode_length_test                       15.928
returns_test                           75.997971
return_std_test                          5.78018
average_reward_test                      4.77143
round_time_test           0 days 00:00:03.207165
round_time_total          0 days 00:12:23.925246
loss_total              8508952083835384758272.0
loss_critic            10636189914481962254336.0
loss_actor                  -330683780907.007996
memory_size                             429.3975 

=== epoch 3/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:52,  2.40it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:26<00:00,  2.68it/s]
episodes                                     126
episode_length                         15.825397
returns                                75.481464
return_std                              1.820066
average_reward                           4.76973
round_time                0 days 00:12:27.128213
episodes_test                              125.0
episode_length_test                         16.0
returns_test                           76.403449
return_std_test                         5.535597
average_reward_test                     4.775216
round_time_test           0 days 00:00:03.185267
round_time_total          0 days 00:12:27.129400
loss_total              9612830785319739588608.0
loss_critic            12016038281309860134912.0
loss_actor                  -343114759372.799988
memory_size                                437.0 

=== epoch 3/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:45,  2.42it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:28<00:00,  2.67it/s]
episodes                                     125
episode_length                            15.872
returns                                75.706086
return_std                              1.698903
average_reward                          4.769831
round_time                0 days 00:12:28.865530
episodes_test                              126.0
episode_length_test                     15.81746
returns_test                           75.453311
return_std_test                         1.894937
average_reward_test                     4.770371
round_time_test           0 days 00:00:03.165083
round_time_total          0 days 00:12:28.866634
loss_total             10389290242765058211840.0
loss_critic            12986612571204283793408.0
loss_actor                  -355815171145.728027
memory_size                                437.0 

=== epoch 3/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:23,  2.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                     125
episode_length                            15.904
returns                                 75.88105
return_std                              1.901702
average_reward                          4.771073
round_time                0 days 00:12:25.930550
episodes_test                              125.0
episode_length_test                       15.976
returns_test                           76.229637
return_std_test                         5.186619
average_reward_test                     4.771634
round_time_test           0 days 00:00:03.140419
round_time_total          0 days 00:12:25.931637
loss_total             11229710141733128372224.0
loss_critic            14037137426125917519872.0
loss_actor                  -369305512820.736023
memory_size                                437.0 

=== epoch 3/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:17,  2.18it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                     125
episode_length                            15.968
returns                                76.255809
return_std                              7.641743
average_reward                          4.775686
round_time                0 days 00:12:23.733116
episodes_test                              125.0
episode_length_test                       15.976
returns_test                           76.200397
return_std_test                         4.767054
average_reward_test                     4.769838
round_time_test           0 days 00:00:03.120899
round_time_total          0 days 00:12:23.734203
loss_total             11839339225952280379392.0
loss_critic            14799173762787322101760.0
loss_actor                  -382224350412.799988
memory_size                                437.0 

=== epoch 3/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:20,  2.32it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                     125
episode_length                            15.848
returns                                75.610133
return_std                               1.70261
average_reward                           4.77097
round_time                0 days 00:12:21.239663
episodes_test                              125.0
episode_length_test                        15.92
returns_test                           76.108562
return_std_test                         7.310658
average_reward_test                     4.780641
round_time_test           0 days 00:00:03.137436
round_time_total          0 days 00:12:21.240754
loss_total             12717154757389347454976.0
loss_critic            15896443179194717831168.0
loss_actor                  -396170985406.463989
memory_size                                437.0 

=== epoch 3/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:54,  2.39it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:21<00:00,  2.70it/s]
episodes                                     125
episode_length                            15.864
returns                                 75.68938
return_std                              4.991614
average_reward                          4.771116
round_time                0 days 00:12:21.769581
episodes_test                              124.0
episode_length_test                    16.008065
returns_test                           76.419175
return_std_test                         9.199317
average_reward_test                     4.773676
round_time_test           0 days 00:00:03.125865
round_time_total          0 days 00:12:21.770677
loss_total             13475739392729260490752.0
loss_critic            16844673959542151708672.0
loss_actor                  -408812455329.791992
memory_size                                437.0 

=== epoch 3/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:32,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:18<00:00,  2.71it/s]
episodes                                     124
episode_length                         15.983871
returns                                76.302153
return_std                               6.25471
average_reward                          4.773665
round_time                0 days 00:12:18.907897
episodes_test                              125.0
episode_length_test                       15.952
returns_test                           76.134772
return_std_test                         6.528162
average_reward_test                     4.772828
round_time_test           0 days 00:00:03.169124
round_time_total          0 days 00:12:18.908991
loss_total             14304794219769094995968.0
loss_critic            17880992479373747552256.0
loss_actor                  -422721925382.143982
memory_size                                437.0 

=== epoch 3/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:15,  2.33it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                     124
episode_length                         15.959677
returns                                76.199359
return_std                              7.655067
average_reward                          4.774397
round_time                0 days 00:12:26.421191
episodes_test                              126.0
episode_length_test                    15.865079
returns_test                           75.670529
return_std_test                         1.635791
average_reward_test                     4.769708
round_time_test           0 days 00:00:03.223911
round_time_total          0 days 00:12:26.422282
loss_total             15129661282793837035520.0
loss_critic            18912076298725265244160.0
loss_actor                  -435301372510.208008
memory_size                                437.0 

=== epoch 3/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:49,  2.24it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:21<00:00,  2.70it/s]
episodes                                     125
episode_length                             15.92
returns                                75.961355
return_std                              4.769199
average_reward                          4.771487
round_time                0 days 00:12:22.182968
episodes_test                              125.0
episode_length_test                        15.88
returns_test                           75.700458
return_std_test                         1.661175
average_reward_test                     4.766914
round_time_test           0 days 00:00:03.135792
round_time_total          0 days 00:12:22.184053
loss_total             16505666570855924105216.0
loss_critic            20632082860318809653248.0
loss_actor                   -450057351725.05603
memory_size                                437.0 

=== epoch 3/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:10,  2.35it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                     125
episode_length                            15.912
returns                                 75.87562
return_std                              2.067066
average_reward                          4.768479
round_time                0 days 00:12:23.516463
episodes_test                              124.0
episode_length_test                    16.024194
returns_test                           76.587614
return_std_test                         6.993417
average_reward_test                     4.779496
round_time_test           0 days 00:00:03.132937
round_time_total          0 days 00:12:23.517572
loss_total             17339445613528487034880.0
loss_critic            21674306650570928160768.0
loss_actor                  -462121844326.400024
memory_size                                437.0 

=== epoch 3/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:19,  2.17it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                     125
episode_length                            15.864
returns                                75.694106
return_std                              1.750123
average_reward                          4.771461
round_time                0 days 00:12:20.419241
episodes_test                              125.0
episode_length_test                        15.92
returns_test                           75.941029
return_std_test                         2.084793
average_reward_test                      4.77022
round_time_test           0 days 00:00:03.165429
round_time_total          0 days 00:12:20.420329
loss_total             18068578971630847393792.0
loss_critic            22585723332225172242432.0
loss_actor                  -475438228799.487976
memory_size                                437.0 

=== epoch 3/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:47,  2.25it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                     125
episode_length                             15.84
returns                                75.543762
return_std                              1.844454
average_reward                          4.769251
round_time                0 days 00:12:25.728238
episodes_test                              126.0
episode_length_test                     15.84127
returns_test                           75.579974
return_std_test                         1.797549
average_reward_test                     4.771125
round_time_test           0 days 00:00:03.159550
round_time_total          0 days 00:12:25.729311
loss_total             18802483392794423459840.0
loss_critic            23503103831939519348736.0
loss_actor                  -487586144944.127991
memory_size                                437.0 

=== epoch 3/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:02,  2.37it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                     125
episode_length                             15.92
returns                                75.950346
return_std                              3.136334
average_reward                          4.770828
round_time                0 days 00:12:24.562078
episodes_test                              126.0
episode_length_test                    15.833333
returns_test                           75.549824
return_std_test                         2.255961
average_reward_test                       4.7717
round_time_test           0 days 00:00:03.181913
round_time_total          0 days 00:12:24.563155
loss_total             20020089531114165633024.0
loss_critic            25025111480421243158528.0
loss_actor                  -503083295113.216003
memory_size                                437.0 

=== epoch 3/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:50,  2.40it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                     124
episode_length                         15.927419
returns                                 75.98886
return_std                              4.583947
average_reward                          4.770902
round_time                0 days 00:12:24.284835
episodes_test                              125.0
episode_length_test                       15.952
returns_test                           76.109005
return_std_test                         1.565197
average_reward_test                     4.771211
round_time_test           0 days 00:00:03.158086
round_time_total          0 days 00:12:24.285909
loss_total             21269522922689557168128.0
loss_critic            26586903193361464688640.0
loss_actor                  -520259640590.335999
memory_size                                437.0 

=== epoch 3/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:59,  2.38it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:17<00:00,  2.71it/s]
episodes                                     126
episode_length                          15.84127
returns                                 75.59633
return_std                              2.762052
average_reward                          4.772086
round_time                0 days 00:12:17.534618
episodes_test                              125.0
episode_length_test                        15.88
returns_test                           75.698586
return_std_test                         6.716025
average_reward_test                     4.767007
round_time_test           0 days 00:00:03.167309
round_time_total          0 days 00:12:17.535682
loss_total             22678167766731608555520.0
loss_critic            28347709237295772073984.0
loss_actor                  -536165804785.664001
memory_size                                437.0 

=== epoch 3/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:33,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                     126
episode_length                         15.865079
returns                                75.694683
return_std                              1.959094
average_reward                          4.771237
round_time                0 days 00:12:20.422428
episodes_test                              126.0
episode_length_test                    15.849206
returns_test                           75.611763
return_std_test                         1.875423
average_reward_test                     4.770842
round_time_test           0 days 00:00:03.146214
round_time_total          0 days 00:12:20.423520
loss_total             23579100639003114733568.0
loss_critic            29473875298995073449984.0
loss_actor                  -549740948127.744019
memory_size                                437.0 

=== epoch 3/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:18,  2.33it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:14<00:00,  2.72it/s]
episodes                                     123
episode_length                          16.04878
returns                                76.699013
return_std                              8.075763
average_reward                          4.778943
round_time                0 days 00:12:15.026032
episodes_test                              125.0
episode_length_test                       15.896
returns_test                           75.826104
return_std_test                         1.708698
average_reward_test                     4.770143
round_time_test           0 days 00:00:03.187213
round_time_total          0 days 00:12:15.027105
loss_total             25031416341725901225984.0
loss_critic            31289269884966202769408.0
loss_actor                  -567982463238.144043
memory_size                                437.0 

=== epoch 3/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:24,  2.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                     125
episode_length                            15.928
returns                                75.996425
return_std                              3.160535
average_reward                          4.771311
round_time                0 days 00:12:25.189646
episodes_test                              124.0
episode_length_test                     16.08871
returns_test                           76.985156
return_std_test                         9.459067
average_reward_test                     4.785105
round_time_test           0 days 00:00:03.123515
round_time_total          0 days 00:12:25.190719
loss_total             26633450367423585714176.0
loss_critic            33291812395062891905024.0
loss_actor                  -582120045133.823975
memory_size                                437.0 

=== epoch 3/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:37,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:26<00:00,  2.68it/s]
episodes                                     123
episode_length                         16.073171
returns                                76.759511
return_std                              5.919218
average_reward                          4.775497
round_time                0 days 00:12:26.762356
episodes_test                              125.0
episode_length_test                       15.952
returns_test                           76.140528
return_std_test                         6.597188
average_reward_test                     4.773175
round_time_test           0 days 00:00:03.151625
round_time_total          0 days 00:12:26.763437
loss_total             27441201999401115975680.0
loss_critic            34301501913079755046912.0
loss_actor                  -596516691869.696045
memory_size                                437.0 

=== epoch 3/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:26,  2.30it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:16<00:00,  2.72it/s]
episodes                                     125
episode_length                             15.92
returns                                75.918713
return_std                              2.565594
average_reward                          4.768816
round_time                0 days 00:12:17.100193
episodes_test                              124.0
episode_length_test                    16.056452
returns_test                              76.715
return_std_test                         5.490616
average_reward_test                     4.777858
round_time_test           0 days 00:00:03.130529
round_time_total          0 days 00:12:17.101270
loss_total             28998107237502798528512.0
loss_critic            36247633428055762927616.0
loss_actor                  -613449019801.599976
memory_size                                437.0 

=== epoch 3/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:59,  2.38it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:22<00:00,  2.70it/s]
episodes                                     124
episode_length                         15.991935
returns                                76.354027
return_std                              4.240434
average_reward                           4.77443
round_time                0 days 00:12:22.570507
episodes_test                              123.0
episode_length_test                    16.162602
returns_test                           77.209927
return_std_test                         8.371073
average_reward_test                     4.777064
round_time_test           0 days 00:00:03.141959
round_time_total          0 days 00:12:22.571583
loss_total             31038018129068612911104.0
loss_critic            38797522004795384135680.0
loss_actor                  -632643652911.104004
memory_size                                437.0 

=== epoch 3/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:23,  2.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                     124
episode_length                         16.008065
returns                                76.433345
return_std                              4.552259
average_reward                            4.7746
round_time                0 days 00:12:26.093270
episodes_test                              124.0
episode_length_test                    16.120968
returns_test                           76.969348
return_std_test                          3.16922
average_reward_test                     4.774546
round_time_test           0 days 00:00:03.174759
round_time_total          0 days 00:12:26.094349
loss_total             32516142160097482637312.0
loss_critic            40645177031196575531008.0
loss_actor                  -647906568208.384033
memory_size                                437.0 

=== epoch 3/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:24,  2.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:17<00:00,  2.71it/s]
episodes                                     124
episode_length                         16.040323
returns                                76.499865
return_std                              6.084931
average_reward                          4.769126
round_time                0 days 00:12:18.212085
episodes_test                              124.0
episode_length_test                    16.016129
returns_test                           76.438896
return_std_test                         5.020563
average_reward_test                     4.772544
round_time_test           0 days 00:00:03.200154
round_time_total          0 days 00:12:18.213175
loss_total             34493232985038804484096.0
loss_critic            43116540501996844613632.0
loss_actor                  -664859393753.088013
memory_size                                437.0 

=== epoch 3/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:10,  2.35it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:22<00:00,  2.69it/s]
episodes                                     124
episode_length                          15.91129
returns                                75.942554
return_std                              3.273618
average_reward                          4.772757
round_time                0 days 00:12:22.738864
episodes_test                              124.0
episode_length_test                    16.024194
returns_test                            76.44222
return_std_test                         3.475911
average_reward_test                     4.770385
round_time_test           0 days 00:00:03.132956
round_time_total          0 days 00:12:22.739967
loss_total             36721734177590039543808.0
loss_critic            45902166908947079888896.0
loss_actor                  -684582928547.839966
memory_size                                437.0 

=== epoch 3/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:50,  2.24it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:18<00:00,  2.71it/s]
episodes                                     125
episode_length                            15.864
returns                                75.693949
return_std                              5.350747
average_reward                          4.771304
round_time                0 days 00:12:18.737828
episodes_test                              126.0
episode_length_test                     15.81746
returns_test                           75.466819
return_std_test                         3.076425
average_reward_test                     4.771253
round_time_test           0 days 00:00:03.127355
round_time_total          0 days 00:12:18.738917
loss_total             38941661490917129846784.0
loss_critic            48677076023302867124224.0
loss_actor                  -702953260580.864014
memory_size                                437.0 

=== epoch 3/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:13,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:19<00:00,  2.71it/s]
episodes                                     125
episode_length                            15.952
returns                                76.037351
return_std                              7.904686
average_reward                          4.766695
round_time                0 days 00:12:19.761472
episodes_test                              126.0
episode_length_test                    15.825397
returns_test                           75.485223
return_std_test                         2.001076
average_reward_test                      4.76996
round_time_test           0 days 00:00:03.173769
round_time_total          0 days 00:12:19.762549
loss_total             41479221881385689022464.0
loss_critic            51849026415827817070592.0
loss_actor                  -720146531024.895996
memory_size                             437.9735 

=== epoch 3/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:36,  2.45it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                     124
episode_length                          15.91129
returns                                75.897983
return_std                              5.730263
average_reward                          4.769922
round_time                0 days 00:12:23.872970
episodes_test                              126.0
episode_length_test                    15.809524
returns_test                            75.39662
return_std_test                         2.128742
average_reward_test                     4.769157
round_time_test           0 days 00:00:03.132561
round_time_total          0 days 00:12:23.874077
loss_total             44268845856753769775104.0
loss_critic            55336056374341866094592.0
loss_actor                  -738495758598.144043
memory_size                                438.0 

=== epoch 3/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:23,  2.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:21<00:00,  2.70it/s]
episodes                                     126
episode_length                         15.793651
returns                                75.335938
return_std                              1.915063
average_reward                          4.770573
round_time                0 days 00:12:21.554288
episodes_test                              126.0
episode_length_test                    15.857143
returns_test                           75.722016
return_std_test                         3.390029
average_reward_test                     4.775406
round_time_test           0 days 00:00:03.185840
round_time_total          0 days 00:12:21.555369
loss_total             45820825673450282549248.0
loss_critic            57276031059362639249408.0
loss_actor                  -757775890546.687988
memory_size                                438.0 

=== epoch 3/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:19,  2.32it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                     125
episode_length                             15.84
returns                                75.576688
return_std                              2.759315
average_reward                          4.771183
round_time                0 days 00:12:24.822950
episodes_test                              126.0
episode_length_test                    15.809524
returns_test                           75.414399
return_std_test                         1.936738
average_reward_test                     4.770174
round_time_test           0 days 00:00:03.152895
round_time_total          0 days 00:12:24.824044
loss_total             48330191850971793981440.0
loss_critic            60412738787456975896576.0
loss_actor                  -774907513798.656006
memory_size                                438.0 

=== epoch 3/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:53,  2.40it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                     126
episode_length                         15.833333
returns                                75.517915
return_std                              1.731837
average_reward                          4.769553
round_time                0 days 00:12:24.823661
episodes_test                              126.0
episode_length_test                    15.849206
returns_test                           75.595456
return_std_test                         5.376345
average_reward_test                     4.769823
round_time_test           0 days 00:00:03.189855
round_time_total          0 days 00:12:24.824733
loss_total             49017942033200094117888.0
loss_critic            61272426555211794874368.0
loss_actor                  -793066214522.880005
memory_size                                438.0 

=== epoch 3/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:50,  2.24it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:26<00:00,  2.68it/s]
episodes                                     125
episode_length                            15.848
returns                                75.556498
return_std                              1.688915
average_reward                           4.76762
round_time                0 days 00:12:27.130538
episodes_test                              125.0
episode_length_test                       15.936
returns_test                           76.088893
return_std_test                         3.023294
average_reward_test                     4.774696
round_time_test           0 days 00:00:03.147521
round_time_total          0 days 00:12:27.131631
loss_total             52500316493552637968384.0
loss_critic            65625394478374509871104.0
loss_actor                  -813545725067.264038
memory_size                                438.0 

=== epoch 3/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:10,  2.35it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:18<00:00,  2.71it/s]
episodes                                     125
episode_length                            15.832
returns                                75.506365
return_std                               1.78543
average_reward                           4.76915
round_time                0 days 00:12:18.862163
episodes_test                              124.0
episode_length_test                    16.008065
returns_test                           76.452414
return_std_test                         9.502558
average_reward_test                     4.775794
round_time_test           0 days 00:00:03.124124
round_time_total          0 days 00:12:18.863240
loss_total             55580122737802022486016.0
loss_critic            69475152297478527647744.0
loss_actor                  -835433416523.776001
memory_size                                438.0 

=== epoch 3/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:56,  2.39it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:13<00:00,  2.73it/s]
episodes                                     126
episode_length                         15.785714
returns                                75.298677
return_std                              2.498449
average_reward                          4.770078
round_time                0 days 00:12:13.750813
episodes_test                              126.0
episode_length_test                    15.777778
returns_test                           75.204132
return_std_test                         4.634398
average_reward_test                     4.766484
round_time_test           0 days 00:00:03.204664
round_time_total          0 days 00:12:13.751920
loss_total             59671393224003285942272.0
loss_critic            74589240306432387055616.0
loss_actor                  -857530023477.248047
memory_size                                438.0 

=== epoch 3/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:59,  2.22it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:16<00:00,  2.71it/s]
episodes                                     125
episode_length                             15.88
returns                                75.779665
return_std                              7.274637
average_reward                          4.771896
round_time                0 days 00:12:17.482034
episodes_test                              126.0
episode_length_test                    15.801587
returns_test                           75.366372
return_std_test                         1.999796
average_reward_test                     4.769567
round_time_test           0 days 00:00:03.236641
round_time_total          0 days 00:12:17.483104
loss_total             61290562645486806761472.0
loss_critic            76613202039939646095360.0
loss_actor                  -875633337597.952026
memory_size                                438.0 

=== epoch 3/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:30,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                     125
episode_length                            15.784
returns                                75.295209
return_std                               1.96597
average_reward                          4.770312
round_time                0 days 00:12:20.747970
episodes_test                              126.0
episode_length_test                    15.761905
returns_test                            75.21486
return_std_test                         3.077859
average_reward_test                     4.771978
round_time_test           0 days 00:00:03.083067
round_time_total          0 days 00:12:20.749051
loss_total             64220910640069630492672.0
loss_critic            80276136976028746645504.0
loss_actor                  -896638540939.264038
memory_size                                438.0 

=== epoch 3/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:35,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                     124
episode_length                         16.008065
returns                                76.373189
return_std                               7.90593
average_reward                          4.770847
round_time                0 days 00:12:24.402326
episodes_test                              125.0
episode_length_test                       15.896
returns_test                           75.791235
return_std_test                         6.109952
average_reward_test                     4.768035
round_time_test           0 days 00:00:03.156862
round_time_total          0 days 00:12:24.403401
loss_total             68084654105215560056832.0
loss_critic            85105816143924205453312.0
loss_actor                  -918223398961.151978
memory_size                                438.0 

=== epoch 3/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:18,  2.50it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                     126
episode_length                          15.81746
returns                                75.482111
return_std                              4.289832
average_reward                          4.772137
round_time                0 days 00:12:26.271969
episodes_test                              126.0
episode_length_test                     15.84127
returns_test                           75.564912
return_std_test                         3.895941
average_reward_test                     4.770283
round_time_test           0 days 00:00:03.164475
round_time_total          0 days 00:12:26.273038
loss_total             70429292416404314652672.0
loss_critic            88036614013206887661568.0
loss_actor                  -939816843051.008057
memory_size                                438.0 

=== epoch 3/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:27,  2.30it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:15<00:00,  2.72it/s]
episodes                                     126
episode_length                         15.738095
returns                                75.052672
return_std                              2.225747
average_reward                           4.76886
round_time                0 days 00:12:15.986703
episodes_test                              126.0
episode_length_test                     15.81746
returns_test                           75.449433
return_std_test                         2.144457
average_reward_test                     4.770077
round_time_test           0 days 00:00:03.174665
round_time_total          0 days 00:12:15.987773
loss_total             73976750314631337082880.0
loss_critic            92470936325755018149888.0
loss_actor                  -964159520669.696045
memory_size                                438.0 

=== epoch 3/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:28,  2.30it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                     125
episode_length                            15.872
returns                                75.735363
return_std                              2.191917
average_reward                          4.771596
round_time                0 days 00:12:21.353147
episodes_test                              125.0
episode_length_test                       15.984
returns_test                           76.305266
return_std_test                        11.126135
average_reward_test                     4.774023
round_time_test           0 days 00:00:03.175949
round_time_total          0 days 00:12:21.354273
loss_total             76590819434644281229312.0
loss_critic            95738522544501333950464.0
loss_actor                   -985651977682.94397
memory_size                                438.0 

=== epoch 3/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:56,  2.39it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:22<00:00,  2.69it/s]
episodes                                      124
episode_length                          15.943548
returns                                 76.057911
return_std                               4.175276
average_reward                           4.770351
round_time                 0 days 00:12:23.126431
episodes_test                               126.0
episode_length_test                     15.849206
returns_test                            75.588701
return_std_test                          2.553481
average_reward_test                      4.769305
round_time_test            0 days 00:00:03.159960
round_time_total           0 days 00:12:23.127506
loss_total              80273022020814070874112.0
loss_critic            100341275634787225174016.0
loss_actor                  -1006903836803.072021
memory_size                                 438.0 

=== epoch 3/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:55,  2.23it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:15<00:00,  2.72it/s]
episodes                                      126
episode_length                          15.785714
returns                                 75.298693
return_std                               1.979279
average_reward                           4.770073
round_time                 0 days 00:12:16.066440
episodes_test                               124.0
episode_length_test                     16.008065
returns_test                            76.456874
return_std_test                          7.615343
average_reward_test                      4.776011
round_time_test            0 days 00:00:03.144190
round_time_total           0 days 00:12:16.067501
loss_total              84696044984292914233344.0
loss_critic            105870054253004438110208.0
loss_actor                  -1030646433906.687988
memory_size                                 438.0 

=== epoch 3/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:32,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                      122
episode_length                          16.262295
returns                                 77.833468
return_std                              15.780284
average_reward                           4.785997
round_time                 0 days 00:12:24.783351
episodes_test                               126.0
episode_length_test                     15.833333
returns_test                            75.513136
return_std_test                          1.902119
average_reward_test                      4.769354
round_time_test            0 days 00:00:03.200621
round_time_total           0 days 00:12:24.784426
loss_total              88921208675240312307712.0
loss_critic            111151508940435111280640.0
loss_actor                  -1052607187582.975952
memory_size                               439.306 

=== epoch 3/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:33,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:27<00:00,  2.68it/s]
episodes                                      123
episode_length                          16.065041
returns                                 76.766028
return_std                               8.702351
average_reward                           4.778428
round_time                 0 days 00:12:27.992556
episodes_test                               126.0
episode_length_test                      15.81746
returns_test                            75.466553
return_std_test                          1.958113
average_reward_test                      4.771176
round_time_test            0 days 00:00:03.143428
round_time_total           0 days 00:12:27.993628
loss_total              95263061530550340681728.0
loss_critic            119078824855605858009088.0
loss_actor                  -1068294085378.047974
memory_size                                 446.0 

=== epoch 3/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:42,  2.26it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:28<00:00,  2.67it/s]
episodes                                      123
episode_length                          16.130081
returns                                 77.025826
return_std                               6.733108
average_reward                           4.775152
round_time                 0 days 00:12:28.974224
episodes_test                               125.0
episode_length_test                        15.952
returns_test                            76.092195
return_std_test                          2.206458
average_reward_test                      4.770157
round_time_test            0 days 00:00:03.169261
round_time_total           0 days 00:12:28.975303
loss_total              97847023476816576774144.0
loss_critic            122308777273238990356480.0
loss_actor                  -1083996042985.472046
memory_size                                 446.0 

=== epoch 3/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:10,  2.35it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                      125
episode_length                             15.992
returns                                 76.337236
return_std                               4.816034
average_reward                           4.773448
round_time                 0 days 00:12:24.636746
episodes_test                               125.0
episode_length_test                         15.92
returns_test                            75.922251
return_std_test                          1.551012
average_reward_test                      4.769057
round_time_test            0 days 00:00:03.184718
round_time_total           0 days 00:12:24.637825
loss_total             100921608874784246988800.0
loss_critic            126152008908108585762816.0
loss_actor                  -1106482473926.656006
memory_size                                 446.0 

=== epoch 3/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:33,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:27<00:00,  2.68it/s]
episodes                                      125
episode_length                              15.92
returns                                 75.951113
return_std                               1.576837
average_reward                           4.770869
round_time                 0 days 00:12:27.691128
episodes_test                               125.0
episode_length_test                        15.912
returns_test                            75.887059
return_std_test                           2.01285
average_reward_test                      4.769272
round_time_test            0 days 00:00:03.141682
round_time_total           0 days 00:12:27.692189
loss_total             104566293922517044166656.0
loss_critic            130707865225092942266368.0
loss_actor                  -1126193545543.679932
memory_size                                 446.0 

=== epoch 3/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:36,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:16<00:00,  2.71it/s]
episodes                                      123
episode_length                          16.138211
returns                                 77.023115
return_std                               9.762329
average_reward                           4.772661
round_time                 0 days 00:12:17.450568
episodes_test                               124.0
episode_length_test                     16.104839
returns_test                            76.901638
return_std_test                         15.120077
average_reward_test                      4.775189
round_time_test            0 days 00:00:03.129442
round_time_total           0 days 00:12:17.451653
loss_total             108044650364731748515840.0
loss_critic            135055810642753315930112.0
loss_actor                  -1144862214651.904053
memory_size                               446.149 

=== epoch 3/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:35,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:27<00:00,  2.68it/s]
episodes                                      123
episode_length                          16.081301
returns                                 76.777453
return_std                               7.864353
average_reward                           4.774123
round_time                 0 days 00:12:27.983557
episodes_test                               124.0
episode_length_test                     16.064516
returns_test                            76.680317
return_std_test                          4.393205
average_reward_test                      4.773292
round_time_test            0 days 00:00:03.220309
round_time_total           0 days 00:12:27.984646
loss_total             111634465473138117312512.0
loss_critic            139543079600881825480704.0
loss_actor                  -1152971301322.751953
memory_size                                 450.0 

=== epoch 3/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:12,  2.19it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                      126
episode_length                          15.761905
returns                                 75.199594
return_std                               2.690134
average_reward                           4.770892
round_time                 0 days 00:12:24.795918
episodes_test                               127.0
episode_length_test                     15.669291
returns_test                            74.758059
return_std_test                          2.262427
average_reward_test                      4.771057
round_time_test            0 days 00:00:03.159220
round_time_total           0 days 00:12:24.797032
loss_total             114134788539701453127680.0
loss_critic            142668483301792769114112.0
loss_actor                  -1165774079033.343994
memory_size                                 450.0 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 4/10 ===== round 1/50 ======================================
  0%|          | 4/2000 [00:01<14:06,  2.36it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                      126
episode_length                          15.833333
returns                                 75.572127
return_std                               3.384154
average_reward                           4.773054
round_time                 0 days 00:12:25.227467
episodes_test                               126.0
episode_length_test                     15.785714
returns_test                            75.319985
return_std_test                          1.924478
average_reward_test                      4.771373
round_time_test            0 days 00:00:03.212609
round_time_total           0 days 00:12:25.228589
loss_total             118587809824924011331584.0
loss_critic            148234759795730982895616.0
loss_actor                  -1192236562415.615967
memory_size                                 450.0 

=== epoch 4/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:44,  2.26it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:30<00:00,  2.67it/s]
episodes                                      124
episode_length                          15.975806
returns                                 76.349871
return_std                               9.999147
average_reward                           4.778952
round_time                 0 days 00:12:30.533260
episodes_test                               126.0
episode_length_test                     15.825397
returns_test                             75.45322
return_std_test                          1.890641
average_reward_test                       4.76803
round_time_test            0 days 00:00:03.152139
round_time_total           0 days 00:12:30.534345
loss_total             127189952987307496701952.0
loss_critic            158987438627113159098368.0
loss_actor                  -1216960102825.983887
memory_size                              451.4325 

=== epoch 4/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:22,  2.32it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                      124
episode_length                          15.967742
returns                                 76.158377
return_std                               3.951962
average_reward                           4.769401
round_time                 0 days 00:12:26.035207
episodes_test                               124.0
episode_length_test                     16.072581
returns_test                            76.742082
return_std_test                         11.503991
average_reward_test                      4.774766
round_time_test            0 days 00:00:03.161983
round_time_total           0 days 00:12:26.036335
loss_total             129983976542662350077952.0
loss_critic            162479967856822769942528.0
loss_actor                  -1231797389328.384033
memory_size                                 452.0 

=== epoch 4/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:27,  2.30it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                      125
episode_length                             15.952
returns                                  76.08742
return_std                               3.227663
average_reward                            4.76989
round_time                 0 days 00:12:20.420158
episodes_test                               126.0
episode_length_test                     15.857143
returns_test                            75.728924
return_std_test                          4.647886
average_reward_test                       4.77585
round_time_test            0 days 00:00:03.134423
round_time_total           0 days 00:12:20.421250
loss_total             134257560669240824954880.0
loss_critic            167821948151279734751232.0
loss_actor                  -1253201498767.360107
memory_size                                 452.0 

=== epoch 4/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:06,  2.36it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                      124
episode_length                          15.991935
returns                                 76.279615
return_std                               3.431301
average_reward                           4.769797
round_time                 0 days 00:12:23.825623
episodes_test                               125.0
episode_length_test                        15.896
returns_test                            75.852967
return_std_test                          3.543631
average_reward_test                      4.771769
round_time_test            0 days 00:00:03.128680
round_time_total           0 days 00:12:23.826696
loss_total             136222966976443572027392.0
loss_critic            170278705894545697013760.0
loss_actor                  -1270326384656.384033
memory_size                                 452.0 

=== epoch 4/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:31,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:27<00:00,  2.68it/s]
episodes                                      125
episode_length                             15.888
returns                                 75.793022
return_std                               1.501439
average_reward                           4.770507
round_time                 0 days 00:12:27.799322
episodes_test                               126.0
episode_length_test                     15.849206
returns_test                            75.590944
return_std_test                          1.713236
average_reward_test                      4.769507
round_time_test            0 days 00:00:03.147459
round_time_total           0 days 00:12:27.800420
loss_total             140660138545339156135936.0
loss_critic            175825170069686608461824.0
loss_actor                  -1296606792318.976074
memory_size                                 452.0 

=== epoch 4/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:02,  2.21it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                      123
episode_length                           16.03252
returns                                 76.504356
return_std                               7.586537
average_reward                           4.771575
round_time                 0 days 00:12:23.759612
episodes_test                               125.0
episode_length_test                        15.968
returns_test                            76.157638
return_std_test                          4.510759
average_reward_test                      4.769534
round_time_test            0 days 00:00:03.185029
round_time_total           0 days 00:12:23.760696
loss_total             146703746850765353779200.0
loss_critic            183379680279769628278784.0
loss_actor                  -1318357976875.008057
memory_size                                 452.0 

=== epoch 4/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:15,  2.33it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:21<00:00,  2.70it/s]
episodes                                      126
episode_length                          15.833333
returns                                 75.473341
return_std                               1.777287
average_reward                           4.766829
round_time                 0 days 00:12:21.839126
episodes_test                               125.0
episode_length_test                        15.896
returns_test                            75.823006
return_std_test                          2.211263
average_reward_test                      4.769964
round_time_test            0 days 00:00:03.096008
round_time_total           0 days 00:12:21.840210
loss_total             149956575336482729885696.0
loss_critic            187445715675247139618816.0
loss_actor                  -1340443290632.191895
memory_size                                 452.0 

=== epoch 4/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:31,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:30<00:00,  2.67it/s]
episodes                                      124
episode_length                          15.943548
returns                                 76.080271
return_std                               2.884116
average_reward                           4.771667
round_time                 0 days 00:12:30.574530
episodes_test                               125.0
episode_length_test                        15.888
returns_test                            75.756352
return_std_test                          1.483133
average_reward_test                      4.768119
round_time_test            0 days 00:00:03.141146
round_time_total           0 days 00:12:30.575622
loss_total             154440719269347773644800.0
loss_critic            193050895690407669137408.0
loss_actor                  -1360247771561.983887
memory_size                                 452.0 

=== epoch 4/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:23,  2.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:21<00:00,  2.70it/s]
episodes                                      125
episode_length                             15.912
returns                                 75.907356
return_std                               3.024721
average_reward                           4.770409
round_time                 0 days 00:12:22.156932
episodes_test                               124.0
episode_length_test                     16.072581
returns_test                             76.73115
return_std_test                          6.850819
average_reward_test                      4.774071
round_time_test            0 days 00:00:03.170085
round_time_total           0 days 00:12:22.158016
loss_total             162404474772873919070208.0
loss_critic            203005589648165786615808.0
loss_actor                   -1390456395333.63208
memory_size                                 452.0 

=== epoch 4/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:16,  2.18it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                      125
episode_length                             15.864
returns                                 75.674337
return_std                               2.303099
average_reward                           4.770125
round_time                 0 days 00:12:24.584327
episodes_test                               125.0
episode_length_test                        15.896
returns_test                             75.80261
return_std_test                          4.040362
average_reward_test                      4.768667
round_time_test            0 days 00:00:03.158191
round_time_total           0 days 00:12:24.585414
loss_total             168109377936036500340736.0
loss_critic            210136718492906735796224.0
loss_actor                  -1411323104198.656006
memory_size                                 452.0 

=== epoch 4/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:40,  2.27it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:22<00:00,  2.69it/s]
episodes                                      123
episode_length                          16.065041
returns                                   76.5952
return_std                               4.078969
average_reward                           4.767625
round_time                 0 days 00:12:23.064127
episodes_test                               124.0
episode_length_test                     16.040323
returns_test                            76.546934
return_std_test                          5.002394
average_reward_test                      4.772247
round_time_test            0 days 00:00:03.121665
round_time_total           0 days 00:12:23.065217
loss_total             176854022865173939748864.0
loss_critic            221067524696549783240704.0
loss_actor                  -1439976333901.823975
memory_size                                 452.0 

=== epoch 4/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:20,  2.32it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:21<00:00,  2.70it/s]
episodes                                      124
episode_length                          16.080645
returns                                 76.754809
return_std                               3.151023
average_reward                           4.773165
round_time                 0 days 00:12:22.425663
episodes_test                               125.0
episode_length_test                        15.912
returns_test                            75.899579
return_std_test                          1.464317
average_reward_test                      4.769973
round_time_test            0 days 00:00:03.088509
round_time_total           0 days 00:12:22.426744
loss_total             180267939239120225697792.0
loss_critic            225334919850419535478784.0
loss_actor                   -1467328514490.36792
memory_size                                 452.0 

=== epoch 4/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:10,  2.35it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                      123
episode_length                          16.065041
returns                                 76.700969
return_std                               4.825138
average_reward                            4.77427
round_time                 0 days 00:12:26.008474
episodes_test                               124.0
episode_length_test                     16.129032
returns_test                            76.981244
return_std_test                          6.528551
average_reward_test                      4.772837
round_time_test            0 days 00:00:03.179895
round_time_total           0 days 00:12:26.009554
loss_total             188288062266208681984000.0
loss_critic            235360073660175797452800.0
loss_actor                  -1498123697324.031982
memory_size                                 452.0 

=== epoch 4/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:04,  2.37it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:28<00:00,  2.67it/s]
episodes                                      123
episode_length                          16.154472
returns                                 77.175021
return_std                               8.808365
average_reward                           4.777332
round_time                 0 days 00:12:29.454710
episodes_test                               123.0
episode_length_test                     16.203252
returns_test                            77.328175
return_std_test                          8.703131
average_reward_test                       4.77248
round_time_test            0 days 00:00:03.150724
round_time_total           0 days 00:12:29.455798
loss_total             195104890306282194468864.0
loss_critic            243881108686623792431104.0
loss_actor                  -1526243364044.800049
memory_size                                 452.0 

=== epoch 4/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:40,  2.12it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:26<00:00,  2.68it/s]
episodes                                      124
episode_length                          16.048387
returns                                 76.581852
return_std                               6.159843
average_reward                           4.771956
round_time                 0 days 00:12:27.430416
episodes_test                               124.0
episode_length_test                     16.024194
returns_test                            76.459299
return_std_test                          3.767934
average_reward_test                      4.771402
round_time_test            0 days 00:00:03.236978
round_time_total           0 days 00:12:27.431504
loss_total             206697380129978080821248.0
loss_critic            258371720489987997696000.0
loss_actor                  -1556326683770.879883
memory_size                                 452.0 

=== epoch 4/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:00,  2.38it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:27<00:00,  2.68it/s]
episodes                                      123
episode_length                           16.03252
returns                                 76.496922
return_std                               4.786292
average_reward                           4.771087
round_time                 0 days 00:12:27.475155
episodes_test                               124.0
episode_length_test                     16.016129
returns_test                            76.381025
return_std_test                          4.670509
average_reward_test                      4.769054
round_time_test            0 days 00:00:03.106942
round_time_total           0 days 00:12:27.476241
loss_total             213717790058543176482816.0
loss_critic            267147233124748433031168.0
loss_actor                  -1588589360513.023926
memory_size                                 452.0 

=== epoch 4/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:32,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:27<00:00,  2.68it/s]
episodes                                      124
episode_length                          15.991935
returns                                 76.293599
return_std                               3.411195
average_reward                           4.770748
round_time                 0 days 00:12:27.880057
episodes_test                               126.0
episode_length_test                     15.873016
returns_test                            75.689866
return_std_test                          1.934623
average_reward_test                      4.768462
round_time_test            0 days 00:00:03.154968
round_time_total           0 days 00:12:27.881159
loss_total             220621656925822248812544.0
loss_critic            275777066547843587637248.0
loss_actor                  -1612006383091.711914
memory_size                                 452.0 

=== epoch 4/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:09,  2.35it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [24:01<00:00,  1.39it/s]
episodes                                      124
episode_length                          16.040323
returns                                 76.553117
return_std                               5.048057
average_reward                           4.772562
round_time                 0 days 00:24:02.117212
episodes_test                               125.0
episode_length_test                        15.904
returns_test                             75.88328
return_std_test                          4.202415
average_reward_test                      4.771449
round_time_test            0 days 00:00:03.210334
round_time_total           0 days 00:24:02.119076
loss_total             226594853475442566889472.0
loss_critic            283243562243876185964544.0
loss_actor                  -1644831899910.144043
memory_size                                 452.0 

=== epoch 4/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:06<44:37,  1.34s/it]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:18<00:00,  1.56it/s]
episodes                                      125
episode_length                             15.928
returns                                 75.986582
return_std                               2.243842
average_reward                           4.770692
round_time                 0 days 00:21:19.236547
episodes_test                               124.0
episode_length_test                     16.040323
returns_test                            76.575445
return_std_test                          6.342404
average_reward_test                      4.774033
round_time_test            0 days 00:00:12.577283
round_time_total           0 days 00:21:19.237654
loss_total             238984760930878621745152.0
loss_critic            298730946142084683268096.0
loss_actor                  -1679115509497.855957
memory_size                                 452.0 

=== epoch 4/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:06,  2.36it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:45<00:00,  2.61it/s]
episodes                                      124
episode_length                          15.975806
returns                                 76.300717
return_std                               4.418251
average_reward                           4.776032
round_time                 0 days 00:12:46.018233
episodes_test                               125.0
episode_length_test                        15.968
returns_test                            76.198487
return_std_test                          2.060095
average_reward_test                      4.772032
round_time_test            0 days 00:00:03.186986
round_time_total           0 days 00:12:46.019307
loss_total             252456119593296617013248.0
loss_critic            315570144240423558709248.0
loss_actor                  -1718132800028.672119
memory_size                                 452.0 

=== epoch 4/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:57,  2.38it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                      124
episode_length                          15.975806
returns                                 76.271082
return_std                               6.083919
average_reward                           4.774212
round_time                 0 days 00:12:24.969060
episodes_test                               126.0
episode_length_test                     15.809524
returns_test                               75.415
return_std_test                          2.578677
average_reward_test                      4.770347
round_time_test            0 days 00:00:03.129918
round_time_total           0 days 00:12:24.970158
loss_total             261318818930956380930048.0
loss_critic            326648518235731989102592.0
loss_actor                  -1740609139703.808105
memory_size                                 452.0 

=== epoch 4/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:55,  2.23it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:26<00:00,  2.68it/s]
episodes                                      125
episode_length                              15.84
returns                                 75.555325
return_std                               1.982115
average_reward                           4.769933
round_time                 0 days 00:12:26.788216
episodes_test                               124.0
episode_length_test                     16.040323
returns_test                            76.532765
return_std_test                          4.674366
average_reward_test                      4.771387
round_time_test            0 days 00:00:03.191439
round_time_total           0 days 00:12:26.789317
loss_total             268739634629479319470080.0
loss_critic            335924537595425128972288.0
loss_actor                  -1773226642571.263916
memory_size                                 452.0 

=== epoch 4/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:13,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:26<00:00,  2.68it/s]
episodes                                      126
episode_length                          15.825397
returns                                 75.469269
return_std                               2.115621
average_reward                           4.769011
round_time                 0 days 00:12:26.895565
episodes_test                               125.0
episode_length_test                        15.968
returns_test                            76.214543
return_std_test                          5.564872
average_reward_test                      4.772996
round_time_test            0 days 00:00:03.166274
round_time_total           0 days 00:12:26.896648
loss_total             281001737876343944642560.0
loss_critic            351252166159735864688640.0
loss_actor                  -1810929579458.560059
memory_size                                 452.0 

=== epoch 4/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:57,  2.23it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:26<00:00,  2.68it/s]
episodes                                      123
episode_length                          16.154472
returns                                 77.002294
return_std                               7.954096
average_reward                           4.766585
round_time                 0 days 00:12:27.434990
episodes_test                               126.0
episode_length_test                     15.833333
returns_test                            75.529072
return_std_test                          3.247028
average_reward_test                      4.770388
round_time_test            0 days 00:00:03.132548
round_time_total           0 days 00:12:27.436053
loss_total             300913225428553286287360.0
loss_critic            376141525411972253745152.0
loss_actor                  -1845062247055.360107
memory_size                              453.1715 

=== epoch 4/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:11,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                      124
episode_length                          16.064516
returns                                 76.760285
return_std                               8.370023
average_reward                           4.778336
round_time                 0 days 00:12:24.094195
episodes_test                               124.0
episode_length_test                      16.08871
returns_test                            76.835789
return_std_test                          5.516569
average_reward_test                      4.775823
round_time_test            0 days 00:00:03.144244
round_time_total           0 days 00:12:24.095287
loss_total             321396926829610257088512.0
loss_critic            401746151256944017408000.0
loss_actor                  -1872182760505.343994
memory_size                                 454.0 

=== epoch 4/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:20,  2.32it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                      125
episode_length                             15.928
returns                                 75.995066
return_std                               2.826587
average_reward                           4.771194
round_time                 0 days 00:12:24.388342
episodes_test                               125.0
episode_length_test                        15.912
returns_test                            75.899749
return_std_test                           2.19946
average_reward_test                      4.769965
round_time_test            0 days 00:00:03.122966
round_time_total           0 days 00:12:24.389429
loss_total             331925751782548552286208.0
loss_critic            414907181839005043916800.0
loss_actor                  -1902175454560.256104
memory_size                                 454.0 

=== epoch 4/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:58,  2.38it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:15<00:00,  2.72it/s]
episodes                                      124
episode_length                          16.008065
returns                                 76.394304
return_std                               7.200116
average_reward                           4.772242
round_time                 0 days 00:12:16.303874
episodes_test                               124.0
episode_length_test                     16.080645
returns_test                            76.769777
return_std_test                          8.457921
average_reward_test                      4.774153
round_time_test            0 days 00:00:03.159811
round_time_total           0 days 00:12:16.304956
loss_total             338598101527322643398656.0
loss_critic            423247619593055730925568.0
loss_actor                  -1934099532283.904053
memory_size                                 454.0 

=== epoch 4/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:12,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:28<00:00,  2.67it/s]
episodes                                      125
episode_length                             15.888
returns                                  75.77098
return_std                               2.148764
average_reward                           4.769052
round_time                 0 days 00:12:28.900388
episodes_test                               126.0
episode_length_test                     15.809524
returns_test                            75.402486
return_std_test                          1.971581
average_reward_test                      4.769524
round_time_test            0 days 00:00:03.171655
round_time_total           0 days 00:12:28.901469
loss_total             350432581409809371234304.0
loss_critic            438040718858444406259712.0
loss_actor                   -1963764174356.47998
memory_size                                 454.0 

=== epoch 4/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:42,  2.12it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:27<00:00,  2.68it/s]
episodes                                      124
episode_length                          15.991935
returns                                 76.328978
return_std                               3.644313
average_reward                           4.772829
round_time                 0 days 00:12:27.688053
episodes_test                               125.0
episode_length_test                         15.88
returns_test                            75.691114
return_std_test                          3.886303
average_reward_test                      4.766392
round_time_test            0 days 00:00:03.164809
round_time_total           0 days 00:12:27.689130
loss_total             348862227849011414433792.0
loss_critic            436077777033547720163328.0
loss_actor                  -1980484971593.728027
memory_size                                 454.0 

=== epoch 4/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:12,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                      125
episode_length                              15.96
returns                                 76.166756
return_std                               3.284527
average_reward                           4.772423
round_time                 0 days 00:12:24.495849
episodes_test                               124.0
episode_length_test                      16.08871
returns_test                            76.782152
return_std_test                          6.189455
average_reward_test                      4.772802
round_time_test            0 days 00:00:03.149313
round_time_total           0 days 00:12:24.496927
loss_total             359673208961614831681536.0
loss_critic            449591503498611401949184.0
loss_actor                  -1997306846969.855957
memory_size                                 454.0 

=== epoch 4/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:09,  2.35it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                      125
episode_length                             15.864
returns                                  75.66397
return_std                               2.003825
average_reward                           4.769442
round_time                 0 days 00:12:25.909636
episodes_test                               125.0
episode_length_test                        15.888
returns_test                            75.832607
return_std_test                          2.038364
average_reward_test                       4.77286
round_time_test            0 days 00:00:03.165680
round_time_total           0 days 00:12:25.910726
loss_total             365781574667946413785088.0
loss_critic            457226959692525333381120.0
loss_actor                  -2026908661907.456055
memory_size                                 454.0 

=== epoch 4/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:10,  2.35it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                      125
episode_length                             15.848
returns                                 75.567506
return_std                               1.866719
average_reward                           4.768206
round_time                 0 days 00:12:23.913028
episodes_test                               125.0
episode_length_test                        15.904
returns_test                            75.797091
return_std_test                          2.781952
average_reward_test                      4.765846
round_time_test            0 days 00:00:03.132609
round_time_total           0 days 00:12:23.914095
loss_total             366431300753607062716416.0
loss_critic            458039118529083767521280.0
loss_actor                  -2047621917573.120117
memory_size                                 454.0 

=== epoch 4/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:25,  2.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:26<00:00,  2.68it/s]
episodes                                      126
episode_length                          15.753968
returns                                 75.156888
return_std                               3.134213
average_reward                           4.770655
round_time                 0 days 00:12:27.207065
episodes_test                               124.0
episode_length_test                     16.120968
returns_test                            76.725672
return_std_test                          11.15979
average_reward_test                       4.75945
round_time_test            0 days 00:00:03.205199
round_time_total           0 days 00:12:27.208151
loss_total             377505603418291135578112.0
loss_critic            471881996085319797047296.0
loss_actor                  -2067616607240.191895
memory_size                                 454.0 

=== epoch 4/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:40,  2.27it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                      124
episode_length                          15.975806
returns                                 76.169977
return_std                               6.932182
average_reward                           4.767753
round_time                 0 days 00:12:21.037017
episodes_test                               127.0
episode_length_test                     15.700787
returns_test                            74.899583
return_std_test                           2.56996
average_reward_test                      4.770511
round_time_test            0 days 00:00:03.148763
round_time_total           0 days 00:12:21.038103
loss_total             383204470960815901507584.0
loss_critic            479005580117158989398016.0
loss_actor                  -2081532208283.647949
memory_size                                 454.0 

=== epoch 4/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:34,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:26<00:00,  2.68it/s]
episodes                                      126
episode_length                          15.809524
returns                                 75.387998
return_std                                2.13985
average_reward                           4.768598
round_time                 0 days 00:12:27.503873
episodes_test                               125.0
episode_length_test                        15.912
returns_test                            75.874764
return_std_test                          1.354475
average_reward_test                      4.768545
round_time_test            0 days 00:00:03.148450
round_time_total           0 days 00:12:27.504952
loss_total             392997394696720955211776.0
loss_critic            491246735370256432234496.0
loss_actor                  -2106846802935.808105
memory_size                                 454.0 

=== epoch 4/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:16,  2.33it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                      126
episode_length                          15.730159
returns                                 75.030328
return_std                               2.217886
average_reward                           4.769817
round_time                 0 days 00:12:24.620031
episodes_test                               126.0
episode_length_test                     15.769841
returns_test                            75.212456
return_std_test                           2.08859
average_reward_test                      4.769423
round_time_test            0 days 00:00:03.178879
round_time_total           0 days 00:12:24.621107
loss_total             394415554647539486556160.0
loss_critic            493019434599462694027264.0
loss_actor                  -2121063069122.560059
memory_size                                 454.0 

=== epoch 4/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:14,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:22<00:00,  2.69it/s]
episodes                                      125
episode_length                             15.816
returns                                 75.429019
return_std                               2.000203
average_reward                             4.7691
round_time                 0 days 00:12:22.770188
episodes_test                               125.0
episode_length_test                        15.936
returns_test                            76.059916
return_std_test                          4.273602
average_reward_test                      4.772903
round_time_test            0 days 00:00:03.097002
round_time_total           0 days 00:12:22.771269
loss_total             394558031511639283466240.0
loss_critic            493197530824828520497152.0
loss_actor                  -2145398193782.783936
memory_size                                 454.0 

=== epoch 4/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:56,  2.23it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:27<00:00,  2.67it/s]
episodes                                      124
episode_length                          16.032258
returns                                 76.493793
return_std                               7.600032
average_reward                           4.771224
round_time                 0 days 00:12:28.403293
episodes_test                               125.0
episode_length_test                          16.0
returns_test                            76.340462
return_std_test                          3.774096
average_reward_test                      4.771279
round_time_test            0 days 00:00:03.195777
round_time_total           0 days 00:12:28.404406
loss_total             408065384198783549571072.0
loss_critic            510081721329100318048256.0
loss_actor                  -2168284246179.840088
memory_size                                 454.0 

=== epoch 4/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:34,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:28<00:00,  2.67it/s]
episodes                                      125
episode_length                             15.888
returns                                 75.898151
return_std                               3.088953
average_reward                           4.776995
round_time                 0 days 00:12:29.448163
episodes_test                               125.0
episode_length_test                        15.936
returns_test                              76.0163
return_std_test                          2.325716
average_reward_test                      4.770208
round_time_test            0 days 00:00:03.154167
round_time_total           0 days 00:12:29.449233
loss_total             411891946651345120919552.0
loss_critic            514864924930730700570624.0
loss_actor                  -2194639186296.832031
memory_size                                 454.0 

=== epoch 4/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:02,  2.37it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                      123
episode_length                          16.105691
returns                                 76.853823
return_std                               6.606978
average_reward                           4.771844
round_time                 0 days 00:12:25.644318
episodes_test                               126.0
episode_length_test                     15.849206
returns_test                             75.61499
return_std_test                          2.071277
average_reward_test                      4.771003
round_time_test            0 days 00:00:03.171343
round_time_total           0 days 00:12:25.645389
loss_total             418829997213393820319744.0
loss_critic            523537487646902819225600.0
loss_actor                  -2209459368558.591797
memory_size                                 454.0 

=== epoch 4/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:02,  2.21it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:22<00:00,  2.70it/s]
episodes                                      124
episode_length                          16.048387
returns                                 76.479308
return_std                               4.048935
average_reward                           4.765813
round_time                 0 days 00:12:22.483650
episodes_test                               125.0
episode_length_test                        15.952
returns_test                            76.076423
return_std_test                          1.685879
average_reward_test                      4.769196
round_time_test            0 days 00:00:03.141696
round_time_total           0 days 00:12:22.484767
loss_total             430402526259382476341248.0
loss_critic            538003148891338234134528.0
loss_actor                   -2236632494768.12793
memory_size                                 454.0 

=== epoch 4/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:19,  2.32it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:30<00:00,  2.67it/s]
episodes                                      124
episode_length                          15.903226
returns                                 75.853142
return_std                               1.840866
average_reward                           4.773466
round_time                 0 days 00:12:30.880013
episodes_test                               124.0
episode_length_test                     16.129032
returns_test                            76.970587
return_std_test                          5.513333
average_reward_test                      4.772176
round_time_test            0 days 00:00:03.140018
round_time_total           0 days 00:12:30.881105
loss_total             432918542328264772288512.0
loss_critic            541148168777030917160960.0
loss_actor                  -2257775626813.439941
memory_size                                 454.0 

=== epoch 4/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:58,  2.22it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:31<00:00,  2.66it/s]
episodes                                      125
episode_length                             15.912
returns                                 75.885075
return_std                               1.610863
average_reward                           4.769098
round_time                 0 days 00:12:31.589494
episodes_test                               125.0
episode_length_test                         15.92
returns_test                            75.918807
return_std_test                          2.871213
average_reward_test                      4.768891
round_time_test            0 days 00:00:03.149187
round_time_total           0 days 00:12:31.590563
loss_total             435429987738921243508736.0
loss_critic            544287475360207541895168.0
loss_actor                  -2272593042997.248047
memory_size                                 454.0 

=== epoch 4/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:03,  2.37it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                      124
episode_length                          16.064516
returns                                   76.7178
return_std                               7.284031
average_reward                            4.77564
round_time                 0 days 00:12:26.429189
episodes_test                               123.0
episode_length_test                     16.195122
returns_test                            77.477328
return_std_test                          7.483859
average_reward_test                       4.78401
round_time_test            0 days 00:00:03.158789
round_time_total           0 days 00:12:26.430257
loss_total             444475921755999301533696.0
loss_critic            555594892745321268379648.0
loss_actor                  -2292616300855.295898
memory_size                                 454.0 

=== epoch 4/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:16,  2.33it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:28<00:00,  2.67it/s]
episodes                                      124
episode_length                          15.935484
returns                                 76.017217
return_std                                1.64749
average_reward                           4.770152
round_time                 0 days 00:12:29.014033
episodes_test                               125.0
episode_length_test                        15.896
returns_test                            75.828574
return_std_test                          1.993369
average_reward_test                        4.7703
round_time_test            0 days 00:00:03.106263
round_time_total           0 days 00:12:29.015124
loss_total             452108197332216031739904.0
loss_critic            565135236635753678110720.0
loss_actor                  -2315495113621.503906
memory_size                                 454.0 

=== epoch 4/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:47,  2.25it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:27<00:00,  2.67it/s]
episodes                                      125
episode_length                             15.944
returns                                  76.02997
return_std                               1.520716
average_reward                           4.768632
round_time                 0 days 00:12:28.212585
episodes_test                               124.0
episode_length_test                     16.024194
returns_test                            76.532973
return_std_test                          7.112423
average_reward_test                      4.776938
round_time_test            0 days 00:00:03.172361
round_time_total           0 days 00:12:28.213654
loss_total             468410366831998072258560.0
loss_critic            585512949048661384364032.0
loss_actor                   -2339432898756.60791
memory_size                                 454.0 

=== epoch 4/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:06,  2.07it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:26<00:00,  2.68it/s]
episodes                                      125
episode_length                             15.856
returns                                 75.643399
return_std                               2.091347
average_reward                           4.770567
round_time                 0 days 00:12:27.430537
episodes_test                               126.0
episode_length_test                     15.857143
returns_test                            75.615432
return_std_test                          1.680031
average_reward_test                       4.76869
round_time_test            0 days 00:00:03.183641
round_time_total           0 days 00:12:27.431606
loss_total             474345667338150707462144.0
loss_critic            592932073843682661367808.0
loss_actor                  -2360414213767.167969
memory_size                                 454.0 

=== epoch 4/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:36,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:28<00:00,  2.67it/s]
episodes                                      124
episode_length                          15.943548
returns                                 76.068918
return_std                                 4.7053
average_reward                           4.771072
round_time                 0 days 00:12:28.684166
episodes_test                               124.0
episode_length_test                     16.032258
returns_test                            76.477649
return_std_test                          6.749969
average_reward_test                      4.770317
round_time_test            0 days 00:00:03.136151
round_time_total           0 days 00:12:28.685238
loss_total             476911828520911142649856.0
loss_critic            596139775396442649133056.0
loss_actor                  -2381301581152.255859
memory_size                                 454.0 

=== epoch 4/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:49,  2.25it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                      124
episode_length                               16.0
returns                                 76.319005
return_std                               5.127923
average_reward                           4.769909
round_time                 0 days 00:12:25.043479
episodes_test                               126.0
episode_length_test                     15.865079
returns_test                            75.643029
return_std_test                          1.734258
average_reward_test                      4.767975
round_time_test            0 days 00:00:03.174542
round_time_total           0 days 00:12:25.044557
loss_total             496317819670549086863360.0
loss_critic            620397263984460971376640.0
loss_actor                  -2405992371257.344238
memory_size                                 454.0 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 5/10 ===== round 1/50 ======================================
  0%|          | 4/2000 [00:01<14:32,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                      124
episode_length                          16.064516
returns                                 76.645031
return_std                               6.898396
average_reward                           4.771165
round_time                 0 days 00:12:24.563489
episodes_test                               125.0
episode_length_test                         15.92
returns_test                            75.948542
return_std_test                           1.77861
average_reward_test                      4.770737
round_time_test            0 days 00:00:03.168086
round_time_total           0 days 00:12:24.564606
loss_total             488609751273310307483648.0
loss_critic            610762178832437895757824.0
loss_actor                  -2425007303098.368164
memory_size                                 454.0 

=== epoch 5/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:50,  2.40it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                      125
episode_length                              15.88
returns                                 75.841336
return_std                               4.056741
average_reward                            4.77585
round_time                 0 days 00:12:23.634917
episodes_test                               125.0
episode_length_test                        15.888
returns_test                            75.784459
return_std_test                          2.143428
average_reward_test                      4.769883
round_time_test            0 days 00:00:03.175833
round_time_total           0 days 00:12:23.636009
loss_total             504152165749740391104512.0
loss_critic            630190196484370943967232.0
loss_actor                   -2451351719051.26416
memory_size                                 454.0 

=== epoch 5/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:37,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                      125
episode_length                             15.864
returns                                 75.703743
return_std                               2.576675
average_reward                           4.772119
round_time                 0 days 00:12:25.190701
episodes_test                               126.0
episode_length_test                     15.857143
returns_test                            75.629149
return_std_test                          4.654229
average_reward_test                      4.769549
round_time_test            0 days 00:00:03.111039
round_time_total           0 days 00:12:25.191796
loss_total             516344030510039376592896.0
loss_critic            645430027216320121012224.0
loss_actor                  -2474830624260.096191
memory_size                                 454.0 

=== epoch 5/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:11,  2.35it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                      124
episode_length                          15.991935
returns                                 76.338074
return_std                               4.226758
average_reward                           4.773448
round_time                 0 days 00:12:23.591588
episodes_test                               125.0
episode_length_test                        15.904
returns_test                            75.890992
return_std_test                          1.880928
average_reward_test                      4.771863
round_time_test            0 days 00:00:03.147803
round_time_total           0 days 00:12:23.592682
loss_total             535208996075918682226688.0
loss_critic            669011233955244715016192.0
loss_actor                  -2500237437698.047852
memory_size                                 454.0 

=== epoch 5/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:31,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:27<00:00,  2.68it/s]
episodes                                      125
episode_length                             15.936
returns                                 76.030902
return_std                               2.009329
average_reward                           4.770957
round_time                 0 days 00:12:27.904809
episodes_test                               125.0
episode_length_test                        15.984
returns_test                            76.219518
return_std_test                          1.786061
average_reward_test                      4.768627
round_time_test            0 days 00:00:03.158837
round_time_total           0 days 00:12:27.905907
loss_total             537990841337183010291712.0
loss_critic            672488540545335800365056.0
loss_actor                  -2524295846625.279785
memory_size                                 454.0 

=== epoch 5/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:58,  2.22it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:18<00:00,  2.71it/s]
episodes                                      122
episode_length                          16.262295
returns                                 77.523258
return_std                              11.632116
average_reward                           4.767147
round_time                 0 days 00:12:19.126245
episodes_test                               125.0
episode_length_test                        15.968
returns_test                            76.169733
return_std_test                          1.617466
average_reward_test                      4.770282
round_time_test            0 days 00:00:03.185371
round_time_total           0 days 00:12:19.127330
loss_total             565537475843637806891008.0
loss_critic            706921833128965286395904.0
loss_actor                  -2551894589308.928223
memory_size                               455.251 

=== epoch 5/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:34,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:30<00:00,  2.67it/s]
episodes                                      123
episode_length                           16.04878
returns                                 76.593539
return_std                               4.433899
average_reward                           4.772416
round_time                 0 days 00:12:30.517894
episodes_test                               124.0
episode_length_test                     16.008065
returns_test                            76.326528
return_std_test                          1.288756
average_reward_test                      4.768156
round_time_test            0 days 00:00:03.171815
round_time_total           0 days 00:12:30.518976
loss_total             603028911372146564923392.0
loss_critic            753786126064672286179328.0
loss_actor                  -2571885224001.536133
memory_size                                 463.0 

=== epoch 5/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:22,  2.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                      124
episode_length                          16.040323
returns                                 76.486486
return_std                               2.044439
average_reward                           4.768418
round_time                 0 days 00:12:24.120000
episodes_test                               124.0
episode_length_test                     16.048387
returns_test                             76.53514
return_std_test                           6.21572
average_reward_test                      4.769096
round_time_test            0 days 00:00:03.131864
round_time_total           0 days 00:12:24.121080
loss_total             617832587377288855683072.0
loss_critic            772290720821150501306368.0
loss_actor                   -2590347054415.87207
memory_size                                 463.0 

=== epoch 5/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:51,  2.24it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                      123
episode_length                          16.081301
returns                                 76.703758
return_std                               1.790563
average_reward                           4.769654
round_time                 0 days 00:12:26.461226
episodes_test                               124.0
episode_length_test                     16.024194
returns_test                            76.550131
return_std_test                          4.609209
average_reward_test                      4.777093
round_time_test            0 days 00:00:03.190444
round_time_total           0 days 00:12:26.462326
loss_total             628022703182011718172672.0
loss_critic            785028364435391541936128.0
loss_actor                  -2619980120850.432129
memory_size                                 463.0 

=== epoch 5/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:28,  2.30it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:27<00:00,  2.68it/s]
episodes                                      123
episode_length                          16.195122
returns                                 77.203455
return_std                               7.109171
average_reward                           4.767147
round_time                 0 days 00:12:27.613119
episodes_test                               123.0
episode_length_test                     16.154472
returns_test                            77.104344
return_std_test                          5.185745
average_reward_test                      4.773048
round_time_test            0 days 00:00:03.140956
round_time_total           0 days 00:12:27.614207
loss_total             643538343317304365809664.0
loss_critic            804422915280047188213760.0
loss_actor                  -2655762658492.416016
memory_size                                 463.0 

=== epoch 5/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:13,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                      122
episode_length                          16.172131
returns                                 77.114914
return_std                               9.609104
average_reward                            4.76834
round_time                 0 days 00:12:25.703949
episodes_test                               124.0
episode_length_test                     16.072581
returns_test                            76.669224
return_std_test                          4.308159
average_reward_test                      4.770289
round_time_test            0 days 00:00:03.106831
round_time_total           0 days 00:12:25.705036
loss_total             669662982319267632906240.0
loss_critic            837078712715198328733696.0
loss_actor                  -2670347045044.224121
memory_size                               466.073 

=== epoch 5/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:49,  2.25it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                      124
episode_length                          16.016129
returns                                 76.402185
return_std                               4.238555
average_reward                           4.770282
round_time                 0 days 00:12:24.939381
episodes_test                               124.0
episode_length_test                     16.024194
returns_test                            76.424602
return_std_test                          1.227393
average_reward_test                      4.769353
round_time_test            0 days 00:00:03.144461
round_time_total           0 days 00:12:24.940469
loss_total             684571345038180127604736.0
loss_critic            855714165053241333645312.0
loss_actor                  -2696475966701.567871
memory_size                                 467.0 

=== epoch 5/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:40,  2.27it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:22<00:00,  2.69it/s]
episodes                                      124
episode_length                          16.048387
returns                                  76.58494
return_std                              10.261667
average_reward                           4.772162
round_time                 0 days 00:12:23.177216
episodes_test                               125.0
episode_length_test                        15.904
returns_test                            75.844286
return_std_test                          1.665939
average_reward_test                       4.76891
round_time_test            0 days 00:00:03.163753
round_time_total           0 days 00:12:23.178305
loss_total             699363670594159984508928.0
loss_critic            874204572876418049900544.0
loss_actor                  -2709922748563.456055
memory_size                              468.5325 

=== epoch 5/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:10,  2.35it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:22<00:00,  2.69it/s]
episodes                                      122
episode_length                          16.090164
returns                                 76.847151
return_std                               7.726087
average_reward                           4.775845
round_time                 0 days 00:12:22.601882
episodes_test                               124.0
episode_length_test                     16.040323
returns_test                            76.441218
return_std_test                           8.18646
average_reward_test                      4.765653
round_time_test            0 days 00:00:03.166288
round_time_total           0 days 00:12:22.602964
loss_total             730924033244565524709376.0
loss_critic            913655025721050637795328.0
loss_actor                  -2732284920332.288086
memory_size                                 472.0 

=== epoch 5/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:38,  2.27it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:29<00:00,  2.67it/s]
episodes                                      123
episode_length                           16.03252
returns                                 76.550246
return_std                               5.513639
average_reward                           4.775311
round_time                 0 days 00:12:30.340932
episodes_test                               125.0
episode_length_test                        15.912
returns_test                            75.944648
return_std_test                          2.496401
average_reward_test                      4.773179
round_time_test            0 days 00:00:03.137407
round_time_total           0 days 00:12:30.342003
loss_total             724199978217405178445824.0
loss_critic            905249956151222020866048.0
loss_actor                  -2748446815420.416016
memory_size                               480.955 

=== epoch 5/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:36,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:22<00:00,  2.69it/s]
episodes                                      125
episode_length                             15.968
returns                                 76.154028
return_std                               4.447152
average_reward                           4.769315
round_time                 0 days 00:12:23.162017
episodes_test                               126.0
episode_length_test                     15.857143
returns_test                            75.671686
return_std_test                          2.709126
average_reward_test                      4.772232
round_time_test            0 days 00:00:03.137225
round_time_total           0 days 00:12:23.163107
loss_total             753976611737648962207744.0
loss_critic            942470747364727925506048.0
loss_actor                  -2784732059664.383789
memory_size                                 481.0 

=== epoch 5/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:13,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:30<00:00,  2.66it/s]
episodes                                      124
episode_length                          15.903226
returns                                 75.928394
return_std                               3.098951
average_reward                           4.774258
round_time                 0 days 00:12:31.051298
episodes_test                               125.0
episode_length_test                        15.968
returns_test                            76.155974
return_std_test                          1.354087
average_reward_test                      4.769432
round_time_test            0 days 00:00:03.156466
round_time_total           0 days 00:12:31.052379
loss_total             764113879377004474662912.0
loss_critic            955142333116383437520896.0
loss_actor                  -2797708148998.144043
memory_size                                 481.0 

=== epoch 5/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:11,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:31<00:00,  2.66it/s]
episodes                                      124
episode_length                          16.032258
returns                                 76.452112
return_std                               5.577581
average_reward                           4.768597
round_time                 0 days 00:12:31.864117
episodes_test                               124.0
episode_length_test                     16.032258
returns_test                            76.505771
return_std_test                          8.591586
average_reward_test                      4.771883
round_time_test            0 days 00:00:03.118217
round_time_total           0 days 00:12:31.865198
loss_total             784153929498926602256384.0
loss_critic            980192395061720730566656.0
loss_actor                   -2825994092937.21582
memory_size                                 481.0 

=== epoch 5/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:47,  2.25it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:27<00:00,  2.68it/s]
episodes                                       124
episode_length                           15.983871
returns                                  76.254672
return_std                                3.690064
average_reward                            4.770592
round_time                  0 days 00:12:28.118210
episodes_test                                124.0
episode_length_test                      16.016129
returns_test                             76.403613
return_std_test                           4.167315
average_reward_test                       4.770365
round_time_test             0 days 00:00:03.131865
round_time_total            0 days 00:12:28.119305
loss_total              801144425788056161746944.0
loss_critic            1001430515364586001006592.0
loss_actor                   -2843859045187.583984
memory_size                                  481.0 

=== epoch 5/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:36,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:32<00:00,  2.66it/s]
episodes                                       125
episode_length                              15.872
returns                                  75.705136
return_std                                2.518043
average_reward                            4.769686
round_time                  0 days 00:12:33.111036
episodes_test                                124.0
episode_length_test                      16.056452
returns_test                              76.51612
return_std_test                           5.946018
average_reward_test                       4.765501
round_time_test             0 days 00:00:03.115676
round_time_total            0 days 00:12:33.112112
loss_total              806086260616110089961472.0
loss_critic            1007607807753487399256064.0
loss_actor                   -2874781272244.224121
memory_size                                  481.0 

=== epoch 5/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:24,  2.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                       125
episode_length                              15.912
returns                                    75.8845
return_std                                1.715114
average_reward                            4.769072
round_time                  0 days 00:12:24.590918
episodes_test                                125.0
episode_length_test                         15.896
returns_test                             75.813903
return_std_test                           1.951465
average_reward_test                       4.769344
round_time_test             0 days 00:00:03.113950
round_time_total            0 days 00:12:24.591988
loss_total              822580053308062813913088.0
loss_critic            1028225048427025267687424.0
loss_actor                   -2897208293195.775879
memory_size                                  481.0 

=== epoch 5/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:01,  2.37it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:31<00:00,  2.66it/s]
episodes                                       124
episode_length                           16.056452
returns                                  76.634301
return_std                                4.904498
average_reward                            4.772807
round_time                  0 days 00:12:31.520908
episodes_test                                124.0
episode_length_test                      16.072581
returns_test                             76.742754
return_std_test                           7.201553
average_reward_test                       4.774831
round_time_test             0 days 00:00:03.182046
round_time_total            0 days 00:12:31.521981
loss_total              845907037675680260161536.0
loss_critic            1057383779224316993863680.0
loss_actor                   -2907546537164.799805
memory_size                                  481.0 

=== epoch 5/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:58,  2.22it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                       123
episode_length                           16.138211
returns                                  76.984161
return_std                                5.447594
average_reward                            4.770123
round_time                  0 days 00:12:26.370012
episodes_test                                123.0
episode_length_test                      16.235772
returns_test                             77.493971
return_std_test                           5.975962
average_reward_test                       4.773175
round_time_test             0 days 00:00:03.185983
round_time_total            0 days 00:12:26.371079
loss_total              854995915023911208615936.0
loss_critic            1068744875508785305616384.0
loss_actor                    -2924024181424.12793
memory_size                                  481.0 

=== epoch 5/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:08,  2.20it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:29<00:00,  2.67it/s]
episodes                                       123
episode_length                           16.195122
returns                                  77.346694
return_std                                6.898797
average_reward                            4.775941
round_time                  0 days 00:12:30.449238
episodes_test                                124.0
episode_length_test                       16.08871
returns_test                             76.782127
return_std_test                           2.773757
average_reward_test                       4.772487
round_time_test             0 days 00:00:03.142771
round_time_total            0 days 00:12:30.450307
loss_total              872289912458780285599744.0
loss_critic            1090362372036659272220672.0
loss_actor                   -2964231352877.056152
memory_size                                  481.0 

=== epoch 5/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:39,  2.27it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                       125
episode_length                              15.896
returns                                  75.808406
return_std                                1.858351
average_reward                            4.769008
round_time                  0 days 00:12:23.609766
episodes_test                                125.0
episode_length_test                         15.992
returns_test                             76.280004
return_std_test                           1.295177
average_reward_test                       4.769948
round_time_test             0 days 00:00:03.085270
round_time_total            0 days 00:12:23.610833
loss_total              856497328022045287514112.0
loss_critic            1070621641576308990279680.0
loss_actor                   -2968509216194.560059
memory_size                                  481.0 

=== epoch 5/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:53,  2.24it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:22<00:00,  2.69it/s]
episodes                                       122
episode_length                           16.188525
returns                                  77.260884
return_std                                8.504649
average_reward                            4.772501
round_time                  0 days 00:12:23.424017
episodes_test                                124.0
episode_length_test                      16.016129
returns_test                             76.456495
return_std_test                            5.36848
average_reward_test                       4.773722
round_time_test             0 days 00:00:03.171565
round_time_total            0 days 00:12:23.425087
loss_total              901667271472760190140416.0
loss_critic            1127084071691343399747584.0
loss_actor                   -2990051073982.463867
memory_size                               481.9455 

=== epoch 5/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:12,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:21<00:00,  2.70it/s]
episodes                                       124
episode_length                           16.112903
returns                                  76.900611
return_std                                3.405717
average_reward                            4.772582
round_time                  0 days 00:12:22.176353
episodes_test                                125.0
episode_length_test                         15.984
returns_test                             76.225047
return_std_test                           1.814612
average_reward_test                       4.768966
round_time_test             0 days 00:00:03.186383
round_time_total            0 days 00:12:22.177411
loss_total              937557378159615458410496.0
loss_critic            1171946703086343027687424.0
loss_actor                   -3020432386162.687988
memory_size                                  482.0 

=== epoch 5/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:21,  2.17it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:31<00:00,  2.66it/s]
episodes                                       125
episode_length                              15.952
returns                                   76.09809
return_std                                 2.18234
average_reward                            4.770559
round_time                  0 days 00:12:31.980341
episodes_test                                124.0
episode_length_test                      16.048387
returns_test                              76.55953
return_std_test                           2.427823
average_reward_test                       4.770635
round_time_test             0 days 00:00:03.225273
round_time_total            0 days 00:12:31.981425
loss_total              958668000836453945835520.0
loss_critic            1198334981270261456699392.0
loss_actor                   -3053470987845.631836
memory_size                                  482.0 

=== epoch 5/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:46,  2.25it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:27<00:00,  2.68it/s]
episodes                                       125
episode_length                               15.92
returns                                  75.944456
return_std                                1.636641
average_reward                            4.770219
round_time                  0 days 00:12:28.019585
episodes_test                                125.0
episode_length_test                         15.992
returns_test                             76.329881
return_std_test                           1.780116
average_reward_test                       4.773066
round_time_test             0 days 00:00:03.167810
round_time_total            0 days 00:12:28.020658
loss_total              946669292464775490437120.0
loss_critic            1183336594341993528688640.0
loss_actor                   -3070768746463.231934
memory_size                                  482.0 

=== epoch 5/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:24,  2.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:29<00:00,  2.67it/s]
episodes                                       124
episode_length                           16.120968
returns                                  76.992946
return_std                                8.762276
average_reward                            4.776013
round_time                  0 days 00:12:30.068440
episodes_test                                125.0
episode_length_test                         15.928
returns_test                             75.967819
return_std_test                           1.411242
average_reward_test                       4.769537
round_time_test             0 days 00:00:03.116880
round_time_total            0 days 00:12:30.069512
loss_total             1001006236666025360752640.0
loss_critic            1251257773859469149601792.0
loss_actor                   -3079012627906.560059
memory_size                               483.5845 

=== epoch 5/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:53,  2.40it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:29<00:00,  2.67it/s]
episodes                                       124
episode_length                           15.967742
returns                                  76.157866
return_std                                1.205904
average_reward                            4.769493
round_time                  0 days 00:12:29.698637
episodes_test                                125.0
episode_length_test                         15.928
returns_test                             75.969432
return_std_test                           1.499506
average_reward_test                       4.769585
round_time_test             0 days 00:00:03.164451
round_time_total            0 days 00:12:29.699704
loss_total             1018685065464074215620608.0
loss_critic            1273356311194599098941440.0
loss_actor                    -3106664780267.52002
memory_size                                  484.0 

=== epoch 5/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:04,  2.36it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:30<00:00,  2.67it/s]
episodes                                       122
episode_length                           16.295082
returns                                  77.769983
return_std                                9.442081
average_reward                            4.772556
round_time                  0 days 00:12:30.812355
episodes_test                                124.0
episode_length_test                      16.096774
returns_test                             76.844845
return_std_test                           7.009694
average_reward_test                       4.774056
round_time_test             0 days 00:00:03.159299
round_time_total            0 days 00:12:30.813425
loss_total             1032093707214254099136512.0
loss_critic            1290117112427561145597952.0
loss_actor                   -3133882836582.399902
memory_size                                  484.0 

=== epoch 5/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:15,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:22<00:00,  2.69it/s]
episodes                                       123
episode_length                            16.04065
returns                                   76.51949
return_std                               10.123869
average_reward                            4.770245
round_time                  0 days 00:12:23.054601
episodes_test                                124.0
episode_length_test                      16.064516
returns_test                             76.626105
return_std_test                           3.329199
average_reward_test                       4.769906
round_time_test             0 days 00:00:03.160174
round_time_total            0 days 00:12:23.055660
loss_total             1061194870893424307863552.0
loss_critic            1326493566729286064275456.0
loss_actor                    -3158024720875.52002
memory_size                               485.0905 

=== epoch 5/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:01,  2.37it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                       125
episode_length                              15.896
returns                                  75.930899
return_std                                4.797816
average_reward                            4.776771
round_time                  0 days 00:12:25.886833
episodes_test                                125.0
episode_length_test                         15.992
returns_test                             76.293767
return_std_test                           2.792834
average_reward_test                        4.77082
round_time_test             0 days 00:00:03.134675
round_time_total            0 days 00:12:25.887900
loss_total             1090988394470516272922624.0
loss_critic            1363735471862680433721344.0
loss_actor                   -3176376984272.895996
memory_size                                  490.0 

=== epoch 5/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:22,  2.32it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                       123
episode_length                           16.203252
returns                                  77.176588
return_std                                7.969883
average_reward                            4.763102
round_time                  0 days 00:12:24.123540
episodes_test                                125.0
episode_length_test                         15.984
returns_test                             76.279431
return_std_test                            4.18907
average_reward_test                       4.772354
round_time_test             0 days 00:00:03.175667
round_time_total            0 days 00:12:24.124618
loss_total             1148079670578215661862912.0
loss_critic            1435099564358195241549824.0
loss_actor                   -3210674097618.943848
memory_size                               490.5445 

=== epoch 5/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:56,  2.39it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:33<00:00,  2.65it/s]
episodes                                       123
episode_length                           16.113821
returns                                  77.158866
return_std                                9.527474
average_reward                            4.788176
round_time                  0 days 00:12:34.130637
episodes_test                                123.0
episode_length_test                      16.186992
returns_test                             77.417506
return_std_test                          13.254579
average_reward_test                       4.782694
round_time_test             0 days 00:00:03.191315
round_time_total            0 days 00:12:34.131708
loss_total             1193206112769338014433280.0
loss_critic            1491507614840794524942336.0
loss_actor                   -3229511045087.231934
memory_size                                  492.0 

=== epoch 5/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:23,  2.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:31<00:00,  2.66it/s]
episodes                                       125
episode_length                              15.808
returns                                  75.367037
return_std                                1.960998
average_reward                             4.76756
round_time                  0 days 00:12:31.834226
episodes_test                                125.0
episode_length_test                         15.888
returns_test                             75.799681
return_std_test                            3.83635
average_reward_test                       4.770828
round_time_test             0 days 00:00:03.192487
round_time_total            0 days 00:12:31.835328
loss_total             1168660375565589368799232.0
loss_critic            1460825443164972155666432.0
loss_actor                   -3224354681913.344238
memory_size                                  492.0 

=== epoch 5/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:56,  2.39it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:31<00:00,  2.66it/s]
episodes                                       125
episode_length                              15.896
returns                                  75.841704
return_std                                3.055736
average_reward                            4.771102
round_time                  0 days 00:12:32.180118
episodes_test                                126.0
episode_length_test                      15.761905
returns_test                             75.207647
return_std_test                           3.527332
average_reward_test                       4.771475
round_time_test             0 days 00:00:03.176407
round_time_total            0 days 00:12:32.181192
loss_total             1209287998844460756631552.0
loss_critic            1511609973722727606910976.0
loss_actor                   -3270476025298.943848
memory_size                                  492.0 

=== epoch 5/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:19,  2.17it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:33<00:00,  2.65it/s]
episodes                                       124
episode_length                           15.943548
returns                                  76.067979
return_std                                2.498866
average_reward                            4.770835
round_time                  0 days 00:12:34.292205
episodes_test                                124.0
episode_length_test                      16.008065
returns_test                             76.382834
return_std_test                            3.45028
average_reward_test                       4.771588
round_time_test             0 days 00:00:03.205307
round_time_total            0 days 00:12:34.293278
loss_total             1229940613978020548444160.0
loss_critic            1537425740252769538801664.0
loss_actor                   -3296753501470.720215
memory_size                                  492.0 

=== epoch 5/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:01,  2.22it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:31<00:00,  2.66it/s]
episodes                                       123
episode_length                           16.219512
returns                                  77.509317
return_std                                6.587497
average_reward                             4.77885
round_time                  0 days 00:12:32.060987
episodes_test                                124.0
episode_length_test                      16.056452
returns_test                             76.712395
return_std_test                           5.086772
average_reward_test                        4.77776
round_time_test             0 days 00:00:03.191864
round_time_total            0 days 00:12:32.062053
loss_total             1250848537309764703485952.0
loss_critic            1563560643813967367503872.0
loss_actor                   -3314911116787.711914
memory_size                                  492.0 

=== epoch 5/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:56,  2.23it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:32<00:00,  2.66it/s]
episodes                                       123
episode_length                           16.056911
returns                                     76.681
return_std                                4.724676
average_reward                            4.775337
round_time                  0 days 00:12:32.715857
episodes_test                                124.0
episode_length_test                      16.120968
returns_test                             76.904287
return_std_test                            4.53782
average_reward_test                       4.770512
round_time_test             0 days 00:00:03.174478
round_time_total            0 days 00:12:32.716924
loss_total             1285445292761642046586880.0
loss_critic            1606806587462281158721536.0
loss_actor                   -3363120667426.815918
memory_size                                  492.0 

=== epoch 5/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:57,  2.22it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                       125
episode_length                               15.92
returns                                  75.935336
return_std                                1.794093
average_reward                            4.769894
round_time                  0 days 00:12:25.499975
episodes_test                                124.0
episode_length_test                      16.040323
returns_test                             76.604744
return_std_test                            4.05651
average_reward_test                       4.775842
round_time_test             0 days 00:00:03.196841
round_time_total            0 days 00:12:25.501035
loss_total             1294760643254838458580992.0
loss_critic            1618450776524532700151808.0
loss_actor                   -3377591469735.936035
memory_size                                  492.0 

=== epoch 5/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:21,  2.32it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:28<00:00,  2.67it/s]
episodes                                       124
episode_length                           16.032258
returns                                  76.490128
return_std                                2.769089
average_reward                            4.771077
round_time                  0 days 00:12:28.883284
episodes_test                                125.0
episode_length_test                         15.952
returns_test                             76.138768
return_std_test                           4.471503
average_reward_test                       4.773089
round_time_test             0 days 00:00:03.138438
round_time_total            0 days 00:12:28.884366
loss_total             1285412703922104863555584.0
loss_critic            1606765850025751002218496.0
loss_actor                   -3397759892193.279785
memory_size                                  492.0 

=== epoch 5/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:38,  2.27it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:29<00:00,  2.67it/s]
episodes                                       121
episode_length                           16.429752
returns                                  78.573878
return_std                                9.909415
average_reward                            4.782193
round_time                  0 days 00:12:30.174169
episodes_test                                124.0
episode_length_test                      16.080645
returns_test                              76.76621
return_std_test                           3.844208
average_reward_test                        4.77398
round_time_test             0 days 00:00:03.176048
round_time_total            0 days 00:12:30.175230
loss_total             1308245261955711381798912.0
loss_critic            1635306548801745465638912.0
loss_actor                   -3427308110872.576172
memory_size                                  492.0 

=== epoch 5/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:12,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                       121
episode_length                            16.46281
returns                                   78.62156
return_std                                10.67211
average_reward                            4.775718
round_time                  0 days 00:12:26.221758
episodes_test                                120.0
episode_length_test                      16.558333
returns_test                             79.214601
return_std_test                          10.349942
average_reward_test                       4.783871
round_time_test             0 days 00:00:03.168219
round_time_total            0 days 00:12:26.222835
loss_total             1327881165388141253623808.0
loss_critic            1659851427295145867870208.0
loss_actor                   -3455502082637.824219
memory_size                                  492.0 

=== epoch 5/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:55,  2.39it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:27<00:00,  2.68it/s]
episodes                                       123
episode_length                           16.170732
returns                                  77.276348
return_std                                6.487731
average_reward                             4.77873
round_time                  0 days 00:12:27.819095
episodes_test                                123.0
episode_length_test                      16.154472
returns_test                             77.119207
return_std_test                           5.666185
average_reward_test                       4.773782
round_time_test             0 days 00:00:03.146345
round_time_total            0 days 00:12:27.820160
loss_total             1367342417558435417554944.0
loss_critic            1709177990467882900783104.0
loss_actor                   -3488540617474.047852
memory_size                                  492.0 

=== epoch 5/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:18,  2.33it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:27<00:00,  2.68it/s]
episodes                                       122
episode_length                           16.229508
returns                                  77.598627
return_std                                7.258985
average_reward                            4.781151
round_time                  0 days 00:12:27.674769
episodes_test                                125.0
episode_length_test                         15.976
returns_test                             76.225314
return_std_test                           3.521634
average_reward_test                       4.771386
round_time_test             0 days 00:00:03.185495
round_time_total            0 days 00:12:27.675865
loss_total             1383697686164156106932224.0
loss_critic            1729622075463925426552832.0
loss_actor                   -3527673031950.335938
memory_size                                  492.0 

=== epoch 5/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:13,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:37<00:00,  2.64it/s]
episodes                                       123
episode_length                           16.130081
returns                                  76.941452
return_std                                 4.85465
average_reward                            4.770101
round_time                  0 days 00:12:37.917347
episodes_test                                124.0
episode_length_test                      16.032258
returns_test                             76.494157
return_std_test                           3.666844
average_reward_test                       4.771323
round_time_test             0 days 00:00:03.163413
round_time_total            0 days 00:12:37.918429
loss_total             1403916836588980929560576.0
loss_critic            1754896012589732907188224.0
loss_actor                   -3549753000722.432129
memory_size                                  492.0 

=== epoch 5/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:52,  2.24it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:30<00:00,  2.67it/s]
episodes                                       122
episode_length                           16.237705
returns                                  77.440316
return_std                                7.926151
average_reward                            4.769246
round_time                  0 days 00:12:30.856360
episodes_test                                118.0
episode_length_test                      16.898305
returns_test                             80.699943
return_std_test                          18.214236
average_reward_test                       4.775645
round_time_test             0 days 00:00:03.184048
round_time_total            0 days 00:12:30.857431
loss_total             1440659605397772968132608.0
loss_critic            1800824474771658949787648.0
loss_actor                   -3579493486952.448242
memory_size                                  492.0 

=== epoch 5/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:13,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:30<00:00,  2.67it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                       108
episode_length                           18.407407
returns                                  87.894166
return_std                               22.830424
average_reward                            4.774974
round_time                  0 days 00:12:30.622506
episodes_test                                120.0
episode_length_test                      16.566667
returns_test                             79.280378
return_std_test                          13.562599
average_reward_test                       4.785348
round_time_test             0 days 00:00:03.154078
round_time_total            0 days 00:12:30.623585
loss_total             1391795126372654102085632.0
loss_critic            1739743875958734971731968.0
loss_actor                   -3585405284646.912109
memory_size                                  492.0 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 6/10 ===== round 1/50 ======================================
  0%|          | 4/2000 [00:01<14:35,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                       112
episode_length                           17.848214
returns                                  85.646507
return_std                               20.214297
average_reward                            4.798676
round_time                  0 days 00:12:25.243324
episodes_test                                108.0
episode_length_test                      18.490741
returns_test                             88.415202
return_std_test                          22.921436
average_reward_test                       4.781716
round_time_test             0 days 00:00:03.131191
round_time_total            0 days 00:12:25.244424
loss_total             1466197377008637036199936.0
loss_critic            1832746690339081201647616.0
loss_actor                   -3622382228209.664062
memory_size                                  492.0 

=== epoch 6/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:29,  2.30it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:29<00:00,  2.67it/s]
episodes                                       120
episode_length                           16.458333
returns                                  78.677301
return_std                               10.199285
average_reward                             4.78027
round_time                  0 days 00:12:30.159532
episodes_test                                121.0
episode_length_test                      16.479339
returns_test                             78.734917
return_std_test                          10.218304
average_reward_test                       4.777803
round_time_test             0 days 00:00:03.177939
round_time_total            0 days 00:12:30.160629
loss_total             1463311161816800510869504.0
loss_critic            1829138919872104692187136.0
loss_actor                   -3644739619127.295898
memory_size                                  492.0 

=== epoch 6/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:53,  2.24it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:29<00:00,  2.67it/s]
episodes                                       124
episode_length                           16.080645
returns                                  76.868487
return_std                                 8.27268
average_reward                            4.780157
round_time                  0 days 00:12:29.943006
episodes_test                                123.0
episode_length_test                      16.162602
returns_test                             77.164534
return_std_test                           6.807437
average_reward_test                       4.774349
round_time_test             0 days 00:00:03.139669
round_time_total            0 days 00:12:29.944088
loss_total             1495684740280582607470592.0
loss_critic            1869605893519285875113984.0
loss_actor                    -3665306125860.86377
memory_size                                  492.0 

=== epoch 6/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:56,  2.23it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:32<00:00,  2.66it/s]
episodes                                       124
episode_length                           15.935484
returns                                  76.014661
return_std                                1.436161
average_reward                            4.770193
round_time                  0 days 00:12:33.355961
episodes_test                                125.0
episode_length_test                         15.944
returns_test                             76.037351
return_std_test                           1.097715
average_reward_test                       4.769486
round_time_test             0 days 00:00:03.146040
round_time_total            0 days 00:12:33.357066
loss_total             1542876997005477777768448.0
loss_critic            1928596211489057835843584.0
loss_actor                   -3707136983695.359863
memory_size                                  492.0 

=== epoch 6/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:42,  2.26it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:26<00:00,  2.68it/s]
episodes                                       124
episode_length                           16.040323
returns                                  76.486784
return_std                                3.010286
average_reward                             4.76874
round_time                  0 days 00:12:26.871925
episodes_test                                124.0
episode_length_test                      16.112903
returns_test                             76.885602
return_std_test                           6.585331
average_reward_test                       4.771818
round_time_test             0 days 00:00:03.180958
round_time_total            0 days 00:12:26.873019
loss_total             1564698262519386044628992.0
loss_critic            1955872793912868331323392.0
loss_actor                   -3738194810765.312012
memory_size                                  492.0 

=== epoch 6/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:06,  2.36it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:35<00:00,  2.65it/s]
episodes                                       120
episode_length                           16.383333
returns                                  78.212462
return_std                               10.760129
average_reward                            4.779053
round_time                  0 days 00:12:35.479972
episodes_test                                124.0
episode_length_test                      16.080645
returns_test                             76.700177
return_std_test                           4.972938
average_reward_test                       4.769764
round_time_test             0 days 00:00:03.158121
round_time_total            0 days 00:12:35.481102
loss_total             1613259753719051140988928.0
loss_critic            2016574657290952633221120.0
loss_actor                   -3746205670572.032227
memory_size                                493.721 

=== epoch 6/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:50,  2.24it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:26<00:00,  2.68it/s]
episodes                                       120
episode_length                           16.558333
returns                                  79.135332
return_std                               16.324243
average_reward                            4.779069
round_time                  0 days 00:12:26.856911
episodes_test                                124.0
episode_length_test                      16.008065
returns_test                             76.308455
return_std_test                            4.71543
average_reward_test                       4.766824
round_time_test             0 days 00:00:03.179820
round_time_total            0 days 00:12:26.857982
loss_total             1712539006873807167160320.0
loss_critic            2140673724797247385763840.0
loss_actor                   -3771724393086.976074
memory_size                               500.7775 

=== epoch 6/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:34,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:29<00:00,  2.67it/s]
episodes                                       123
episode_length                           16.065041
returns                                  76.656637
return_std                                4.454677
average_reward                            4.771537
round_time                  0 days 00:12:30.273603
episodes_test                                123.0
episode_length_test                      16.170732
returns_test                             77.222242
return_std_test                           6.379484
average_reward_test                       4.775491
round_time_test             0 days 00:00:03.187847
round_time_total            0 days 00:12:30.274673
loss_total             1761922800421514715856896.0
loss_critic            2202403465903219354370048.0
loss_actor                   -3797070397964.288086
memory_size                                  504.0 

=== epoch 6/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:12,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:28<00:00,  2.67it/s]
episodes                                       124
episode_length                           16.040323
returns                                   76.55874
return_std                                2.673407
average_reward                            4.772934
round_time                  0 days 00:12:29.253915
episodes_test                                123.0
episode_length_test                      16.211382
returns_test                              77.40345
return_std_test                            6.29611
average_reward_test                       4.774731
round_time_test             0 days 00:00:03.093568
round_time_total            0 days 00:12:29.254996
loss_total             1797590990759550584356864.0
loss_critic            2246988701682050852192256.0
loss_actor                   -3830526584291.328125
memory_size                                  504.0 

=== epoch 6/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:19,  2.32it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:36<00:00,  2.64it/s]
episodes                                       125
episode_length                              15.888
returns                                  75.767308
return_std                                1.490957
average_reward                            4.768904
round_time                  0 days 00:12:37.093926
episodes_test                                124.0
episode_length_test                      16.008065
returns_test                             76.396922
return_std_test                           2.183509
average_reward_test                       4.772379
round_time_test             0 days 00:00:03.159448
round_time_total            0 days 00:12:37.094994
loss_total             1837551153660134242123776.0
loss_critic            2296938906037363431964672.0
loss_actor                    -3862757675892.73584
memory_size                                  504.0 

=== epoch 6/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:13,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:33<00:00,  2.65it/s]
episodes                                       122
episode_length                           16.213115
returns                                  77.491253
return_std                                7.091377
average_reward                             4.77938
round_time                  0 days 00:12:34.437725
episodes_test                                125.0
episode_length_test                         15.912
returns_test                             75.881618
return_std_test                           1.464091
average_reward_test                         4.7689
round_time_test             0 days 00:00:03.195516
round_time_total            0 days 00:12:34.438803
loss_total             1817783254100124723314688.0
loss_critic            2272229029776904773697536.0
loss_actor                   -3872781586268.160156
memory_size                                  504.0 

=== epoch 6/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:35,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:27<00:00,  2.67it/s]
episodes                                       122
episode_length                           16.221311
returns                                  77.523253
return_std                                6.542255
average_reward                            4.778954
round_time                  0 days 00:12:28.421968
episodes_test                                124.0
episode_length_test                      16.096774
returns_test                              76.87534
return_std_test                           4.311179
average_reward_test                       4.775926
round_time_test             0 days 00:00:03.088967
round_time_total            0 days 00:12:28.423051
loss_total             1859922925393463241342976.0
loss_critic            2324903616686813792436224.0
loss_actor                   -3902838498131.967773
memory_size                                  504.0 

=== epoch 6/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:20,  2.32it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:32<00:00,  2.66it/s]
episodes                                       123
episode_length                           16.243902
returns                                  77.614985
return_std                                6.526016
average_reward                            4.778074
round_time                  0 days 00:12:33.192887
episodes_test                                120.0
episode_length_test                          16.55
returns_test                             79.020297
return_std_test                          14.218522
average_reward_test                       4.774597
round_time_test             0 days 00:00:03.220195
round_time_total            0 days 00:12:33.193953
loss_total             1895189327775290439499776.0
loss_critic            2368986622672502331015168.0
loss_actor                   -3916904413396.992188
memory_size                                  504.0 

=== epoch 6/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:32,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                       124
episode_length                           16.008065
returns                                  76.307999
return_std                                3.197499
average_reward                            4.766823
round_time                  0 days 00:12:24.954232
episodes_test                                123.0
episode_length_test                      16.195122
returns_test                              77.33809
return_std_test                           6.765621
average_reward_test                       4.775483
round_time_test             0 days 00:00:03.153377
round_time_total            0 days 00:12:24.955310
loss_total             1903915939766850474213376.0
loss_critic            2379894882194582486908928.0
loss_actor                   -3940355000893.439941
memory_size                                  504.0 

=== epoch 6/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:27,  2.30it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:29<00:00,  2.67it/s]
episodes                                       123
episode_length                           16.162602
returns                                  77.231095
return_std                                6.299181
average_reward                            4.778316
round_time                  0 days 00:12:29.470273
episodes_test                                125.0
episode_length_test                         15.976
returns_test                             76.204562
return_std_test                           3.457419
average_reward_test                        4.77007
round_time_test             0 days 00:00:03.150143
round_time_total            0 days 00:12:29.471347
loss_total             1935712380857521088233472.0
loss_critic            2419640435530497295122432.0
loss_actor                   -3969467221147.647949
memory_size                                  504.0 

=== epoch 6/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:34,  2.01it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:28<00:00,  2.67it/s]
episodes                                       125
episode_length                              15.936
returns                                  75.999107
return_std                                1.685928
average_reward                             4.76905
round_time                  0 days 00:12:28.607228
episodes_test                                124.0
episode_length_test                      16.080645
returns_test                             76.824728
return_std_test                           4.451246
average_reward_test                       4.777567
round_time_test             0 days 00:00:03.175551
round_time_total            0 days 00:12:28.608362
loss_total             1995445054823988610465792.0
loss_critic            2494306277258998958784512.0
loss_actor                    -4000877842923.52002
memory_size                                  504.0 

=== epoch 6/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:33,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:35<00:00,  2.65it/s]
episodes                                       123
episode_length                           16.121951
returns                                  76.938362
return_std                                6.951681
average_reward                            4.772164
round_time                  0 days 00:12:36.357646
episodes_test                                121.0
episode_length_test                      16.504132
returns_test                             78.854381
return_std_test                          11.622441
average_reward_test                       4.777958
round_time_test             0 days 00:00:03.215088
round_time_total            0 days 00:12:36.358753
loss_total             2014668203548728493080576.0
loss_critic            2518335214083657850421248.0
loss_actor                   -4042852700127.231934
memory_size                                  504.0 

=== epoch 6/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:55,  2.23it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:32<00:00,  2.66it/s]
episodes                                       120
episode_length                           16.366667
returns                                  78.030752
return_std                                9.301239
average_reward                             4.76759
round_time                  0 days 00:12:33.089657
episodes_test                                121.0
episode_length_test                      16.454545
returns_test                             78.506636
return_std_test                          13.506115
average_reward_test                       4.771224
round_time_test             0 days 00:00:03.183509
round_time_total            0 days 00:12:33.090760
loss_total             2045046669435244690538496.0
loss_critic            2556308296342724174413824.0
loss_actor                   -4075178528342.016113
memory_size                                  504.0 

=== epoch 6/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:06,  2.36it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                       125
episode_length                              15.952
returns                                  76.140694
return_std                                 2.96602
average_reward                            4.773194
round_time                  0 days 00:12:25.323895
episodes_test                                124.0
episode_length_test                      16.112903
returns_test                             76.862361
return_std_test                           5.557728
average_reward_test                        4.77036
round_time_test             0 days 00:00:03.132335
round_time_total            0 days 00:12:25.324976
loss_total             2102464504650401363001344.0
loss_critic            2628080587632488612888576.0
loss_actor                   -4113103312257.023926
memory_size                                  504.0 

=== epoch 6/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:14,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:28<00:00,  2.67it/s]
episodes                                       125
episode_length                               15.88
returns                                  75.752982
return_std                                1.532603
average_reward                            4.770363
round_time                  0 days 00:12:29.424438
episodes_test                                125.0
episode_length_test                           16.0
returns_test                             76.418079
return_std_test                            5.51011
average_reward_test                        4.77613
round_time_test             0 days 00:00:03.199916
round_time_total            0 days 00:12:29.425513
loss_total             2162999032821163063508992.0
loss_critic            2703748746530889666658304.0
loss_actor                   -4153016826200.063965
memory_size                                  504.0 

=== epoch 6/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:36,  2.44it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:35<00:00,  2.65it/s]
episodes                                       123
episode_length                           16.121951
returns                                  76.989268
return_std                                8.804317
average_reward                            4.775356
round_time                  0 days 00:12:35.692406
episodes_test                                125.0
episode_length_test                          15.92
returns_test                             75.893891
return_std_test                           1.668262
average_reward_test                       4.767296
round_time_test             0 days 00:00:03.228217
round_time_total            0 days 00:12:35.693480
loss_total             2146443504506295407345664.0
loss_critic            2683054336227377003102208.0
loss_actor                   -4177968578297.855957
memory_size                                  504.0 

=== epoch 6/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:09,  2.35it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:35<00:00,  2.65it/s]
episodes                                       123
episode_length                            16.04878
returns                                  76.538299
return_std                                5.924331
average_reward                            4.769018
round_time                  0 days 00:12:36.363411
episodes_test                                124.0
episode_length_test                      16.032258
returns_test                             76.516822
return_std_test                           3.023906
average_reward_test                       4.772716
round_time_test             0 days 00:00:03.173230
round_time_total            0 days 00:12:36.364497
loss_total             2210079294174639996010496.0
loss_critic            2762599068809207910236160.0
loss_actor                   -4222018130542.591797
memory_size                                  504.0 

=== epoch 6/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:56,  2.39it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:35<00:00,  2.65it/s]
episodes                                       125
episode_length                              15.944
returns                                  76.095397
return_std                                4.194668
average_reward                            4.772775
round_time                  0 days 00:12:36.060952
episodes_test                                124.0
episode_length_test                      16.008065
returns_test                             76.389075
return_std_test                           4.607169
average_reward_test                       4.771892
round_time_test             0 days 00:00:03.160862
round_time_total            0 days 00:12:36.062022
loss_total             2207988560557596049670144.0
loss_critic            2759985652859759981756416.0
loss_actor                   -4234005944336.383789
memory_size                                  504.0 

=== epoch 6/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:42,  2.26it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:37<00:00,  2.64it/s]
episodes                                       124
episode_length                                16.0
returns                                  76.360651
return_std                                5.025272
average_reward                            4.772489
round_time                  0 days 00:12:37.880801
episodes_test                                125.0
episode_length_test                         15.944
returns_test                             76.057872
return_std_test                           3.905891
average_reward_test                       4.770323
round_time_test             0 days 00:00:03.242113
round_time_total            0 days 00:12:37.881882
loss_total             2253871150298696861614080.0
loss_critic            2817338886856594591580160.0
loss_actor                   -4273316751736.832031
memory_size                                  504.0 

=== epoch 6/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:15,  2.18it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:34<00:00,  2.65it/s]
episodes                                       124
episode_length                           15.991935
returns                                  76.339286
return_std                                5.324428
average_reward                            4.773685
round_time                  0 days 00:12:35.275613
episodes_test                                123.0
episode_length_test                      16.195122
returns_test                              77.37026
return_std_test                           9.091981
average_reward_test                       4.777363
round_time_test             0 days 00:00:03.196252
round_time_total            0 days 00:12:35.276690
loss_total             2282799223674623796183040.0
loss_critic            2853498984449197062750208.0
loss_actor                   -4301093864734.720215
memory_size                                  504.0 

=== epoch 6/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:44,  2.26it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:55<00:00,  1.97it/s]
episodes                                       122
episode_length                           16.270492
returns                                  77.636149
return_std                                6.950002
average_reward                            4.771581
round_time                  0 days 00:16:56.415660
episodes_test                                124.0
episode_length_test                      16.129032
returns_test                             76.957372
return_std_test                           8.519537
average_reward_test                       4.771357
round_time_test             0 days 00:00:03.145945
round_time_total            0 days 00:16:56.418009
loss_total             2281020733302685407641600.0
loss_critic            2851275867376991080546304.0
loss_actor                   -4323526684573.695801
memory_size                                  504.0 

=== epoch 6/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:03<56:18,  1.69s/it]  /home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [25:02<00:00,  1.33it/s]
episodes                                       123
episode_length                           16.178862
returns                                  77.248853
return_std                                5.435529
average_reward                            4.774515
round_time                  0 days 00:25:05.661206
episodes_test                                123.0
episode_length_test                      16.219512
returns_test                             77.440106
return_std_test                           5.474415
average_reward_test                       4.774607
round_time_test             0 days 00:00:12.602869
round_time_total            0 days 00:25:05.662304
loss_total             2312796449584475791163392.0
loss_critic            2890995511143961999704064.0
loss_actor                    -4349138718031.87207
memory_size                                  504.0 

=== epoch 6/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:48,  2.25it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [13:28<00:00,  2.48it/s]
episodes                                       124
episode_length                           16.080645
returns                                  76.685229
return_std                                4.495321
average_reward                            4.768884
round_time                  0 days 00:13:28.544080
episodes_test                                126.0
episode_length_test                      15.857143
returns_test                             75.606431
return_std_test                           1.661553
average_reward_test                       4.768095
round_time_test             0 days 00:00:03.167418
round_time_total            0 days 00:13:28.545156
loss_total             2350563029262605370785792.0
loss_critic            2938203737633135825780736.0
loss_actor                   -4377049082757.120117
memory_size                                  504.0 

=== epoch 6/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:07,  2.20it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [13:25<00:00,  2.48it/s]
episodes                                       124
episode_length                           15.943548
returns                                  76.034118
return_std                                1.271001
average_reward                            4.768861
round_time                  0 days 00:13:26.285633
episodes_test                                124.0
episode_length_test                      16.032258
returns_test                             76.457284
return_std_test                           2.695629
average_reward_test                       4.768972
round_time_test             0 days 00:00:03.191169
round_time_total            0 days 00:13:26.286759
loss_total             2455391067538332911140864.0
loss_critic            3069238780190569579675648.0
loss_actor                   -4433745963122.688477
memory_size                                  504.0 

=== epoch 6/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:42,  2.26it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [13:21<00:00,  2.50it/s]
episodes                                       125
episode_length                              15.896
returns                                  75.837566
return_std                                1.865925
average_reward                            4.770933
round_time                  0 days 00:13:21.995242
episodes_test                                124.0
episode_length_test                      16.024194
returns_test                             76.506079
return_std_test                           4.354086
average_reward_test                       4.774368
round_time_test             0 days 00:00:03.190845
round_time_total            0 days 00:13:21.996338
loss_total             2521119531695016761098240.0
loss_critic            3151399359008322931916800.0
loss_actor                   -4461043702824.959961
memory_size                                  504.0 

=== epoch 6/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:48,  2.25it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [13:03<00:00,  2.55it/s]
episodes                                       123
episode_length                           16.146341
returns                                  77.073816
return_std                                8.077005
average_reward                            4.773411
round_time                  0 days 00:13:03.950107
episodes_test                                125.0
episode_length_test                         15.888
returns_test                             75.798537
return_std_test                           1.679831
average_reward_test                       4.770797
round_time_test             0 days 00:00:03.172027
round_time_total            0 days 00:13:03.951172
loss_total             2537203641179261450059776.0
loss_critic            3171504493260547899260928.0
loss_actor                   -4505557921955.839844
memory_size                                  504.0 

=== epoch 6/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:35,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:26<00:00,  2.68it/s]
episodes                                       124
episode_length                           15.991935
returns                                  76.338763
return_std                                3.342048
average_reward                            4.773497
round_time                  0 days 00:12:27.338765
episodes_test                                125.0
episode_length_test                           16.0
returns_test                             76.317438
return_std_test                           1.610308
average_reward_test                        4.76984
round_time_test             0 days 00:00:03.150696
round_time_total            0 days 00:12:27.339828
loss_total             2611433096065431067688960.0
loss_critic            3264291310066820446158848.0
loss_actor                   -4548234082910.208008
memory_size                                  504.0 

=== epoch 6/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:57,  2.23it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:31<00:00,  2.66it/s]
episodes                                       124
episode_length                           16.024194
returns                                  76.418029
return_std                                1.495802
average_reward                            4.768896
round_time                  0 days 00:12:31.854847
episodes_test                                124.0
episode_length_test                      16.032258
returns_test                             76.547845
return_std_test                           5.794034
average_reward_test                       4.774539
round_time_test             0 days 00:00:03.206173
round_time_total            0 days 00:12:31.855934
loss_total             2590590873334721160413184.0
loss_critic            3238238532923447912169472.0
loss_actor                   -4574949168906.240234
memory_size                                  504.0 

=== epoch 6/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:50,  2.24it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [13:25<00:00,  2.48it/s]
episodes                                       123
episode_length                           16.146341
returns                                  77.089599
return_std                                 5.16601
average_reward                            4.774387
round_time                  0 days 00:13:26.286046
episodes_test                                124.0
episode_length_test                      16.032258
returns_test                             76.466509
return_std_test                            1.90872
average_reward_test                       4.769624
round_time_test             0 days 00:00:03.196960
round_time_total            0 days 00:13:26.287128
loss_total             2656652382647017246031872.0
loss_critic            3320815419302608845668352.0
loss_actor                   -4624335415410.688477
memory_size                                  504.0 

=== epoch 6/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:34,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:42<00:00,  2.62it/s]
episodes                                       123
episode_length                            16.04878
returns                                  76.553119
return_std                                2.532186
average_reward                            4.769932
round_time                  0 days 00:12:42.996061
episodes_test                                123.0
episode_length_test                      16.211382
returns_test                             77.302146
return_std_test                           7.817137
average_reward_test                       4.768489
round_time_test             0 days 00:00:03.178523
round_time_total            0 days 00:12:42.997132
loss_total             2739948510423886770208768.0
loss_critic            3424935576564730874036224.0
loss_actor                    -4662859543609.34375
memory_size                                  504.0 

=== epoch 6/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:12,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:39<00:00,  2.63it/s]
episodes                                       124
episode_length                           16.048387
returns                                  76.547834
return_std                                2.231466
average_reward                            4.769866
round_time                  0 days 00:12:40.281695
episodes_test                                124.0
episode_length_test                      16.008065
returns_test                             76.391663
return_std_test                           4.306092
average_reward_test                       4.772042
round_time_test             0 days 00:00:03.187231
round_time_total            0 days 00:12:40.282767
loss_total             2771694793556413250535424.0
loss_critic            3464618430678547356975104.0
loss_actor                    -4700717681999.87207
memory_size                                  504.0 

=== epoch 6/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:02,  2.21it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:40<00:00,  2.63it/s]
episodes                                       122
episode_length                           16.229508
returns                                  77.434478
return_std                                7.963861
average_reward                            4.770957
round_time                  0 days 00:12:41.406354
episodes_test                                124.0
episode_length_test                      16.016129
returns_test                             76.371604
return_std_test                            2.68371
average_reward_test                       4.768425
round_time_test             0 days 00:00:03.150697
round_time_total            0 days 00:12:41.407428
loss_total             2738649301266801671274496.0
loss_critic            3423311566073137274552320.0
loss_actor                   -4711976597454.847656
memory_size                                  504.0 

=== epoch 6/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:58,  2.22it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [17:43<00:00,  1.88it/s]
episodes                                       124
episode_length                           16.008065
returns                                  76.339153
return_std                                2.061575
average_reward                            4.768799
round_time                  0 days 00:17:43.902744
episodes_test                                125.0
episode_length_test                         15.984
returns_test                             76.310473
return_std_test                           2.594673
average_reward_test                       4.774305
round_time_test             0 days 00:00:03.179233
round_time_total            0 days 00:17:43.904656
loss_total             2805645921648650720641024.0
loss_critic            3507057341991801707298816.0
loss_actor                   -4735147142938.624023
memory_size                                  504.0 

=== epoch 6/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<27:49,  1.20it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [23:26<00:00,  1.42it/s]
episodes                                       123
episode_length                           16.178862
returns                                  77.130268
return_std                                7.142734
average_reward                            4.767404
round_time                  0 days 00:23:28.014675
episodes_test                                124.0
episode_length_test                      16.064516
returns_test                             76.741439
return_std_test                           4.577503
average_reward_test                       4.777113
round_time_test             0 days 00:00:04.685956
round_time_total            0 days 00:23:28.016549
loss_total             2914297149090746958086144.0
loss_critic            3642871371079253401534464.0
loss_actor                    -4791265313161.21582
memory_size                                  504.0 

=== epoch 6/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<25:19,  1.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [24:20<00:00,  1.37it/s]
episodes                                       124
episode_length                           15.935484
returns                                  75.992582
return_std                                1.785018
average_reward                            4.768638
round_time                  0 days 00:24:21.390956
episodes_test                                122.0
episode_length_test                      16.270492
returns_test                             77.708214
return_std_test                          13.078335
average_reward_test                       4.776057
round_time_test             0 days 00:00:04.972570
round_time_total            0 days 00:24:21.392798
loss_total             2972941515489095047970816.0
loss_critic            3716176829491519982403584.0
loss_actor                   -4816370159190.015625
memory_size                                  504.0 

=== epoch 6/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<28:10,  1.18it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [27:29<00:00,  1.21it/s]
episodes                                       123
episode_length                           16.138211
returns                                  77.078695
return_std                                5.278293
average_reward                            4.776113
round_time                  0 days 00:27:30.455978
episodes_test                                123.0
episode_length_test                      16.252033
returns_test                             77.632728
return_std_test                           9.759981
average_reward_test                       4.776865
round_time_test             0 days 00:00:05.103761
round_time_total            0 days 00:27:30.457845
loss_total             3011522685443178667966464.0
loss_critic            3764403293483363100262400.0
loss_actor                   -4865706006282.240234
memory_size                                  504.0 

=== epoch 6/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<30:16,  1.10it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [27:08<00:00,  1.23it/s]
episodes                                       124
episode_length                           16.048387
returns                                  76.611159
return_std                                4.700492
average_reward                            4.773963
round_time                  0 days 00:27:09.357110
episodes_test                                124.0
episode_length_test                      16.056452
returns_test                             76.691444
return_std_test                           4.526858
average_reward_test                       4.776383
round_time_test             0 days 00:00:05.752142
round_time_total            0 days 00:27:09.359084
loss_total             3061117313674376327987200.0
loss_critic            3826396577997740140658688.0
loss_actor                   -4911190592585.727539
memory_size                                  504.0 

=== epoch 6/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<24:23,  1.36it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [25:39<00:00,  1.30it/s]
episodes                                       123
episode_length                           16.178862
returns                                   77.23891
return_std                                11.87543
average_reward                            4.774161
round_time                  0 days 00:25:40.925466
episodes_test                                125.0
episode_length_test                          15.96
returns_test                              76.09286
return_std_test                           1.970021
average_reward_test                        4.76786
round_time_test             0 days 00:00:04.250768
round_time_total            0 days 00:25:40.927236
loss_total             3094258383733180554280960.0
loss_critic            3867822910058839716921344.0
loss_actor                   -4944294768017.408203
memory_size                               505.1305 

=== epoch 6/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<25:58,  1.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [25:44<00:00,  1.29it/s]
episodes                                       124
episode_length                           15.927419
returns                                  75.982483
return_std                                3.647437
average_reward                            4.770532
round_time                  0 days 00:25:45.919290
episodes_test                                125.0
episode_length_test                         15.944
returns_test                             76.058573
return_std_test                           1.476382
average_reward_test                       4.770492
round_time_test             0 days 00:00:04.719602
round_time_total            0 days 00:25:45.921190
loss_total             3222518888984436189167616.0
loss_critic            4028148541983197558210560.0
loss_actor                   -4967129504284.671875
memory_size                                  511.0 

=== epoch 6/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<27:55,  1.19it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [28:01<00:00,  1.19it/s]
episodes                                       125
episode_length                               15.92
returns                                  75.941368
return_std                                2.495452
average_reward                            4.770267
round_time                  0 days 00:28:02.879133
episodes_test                                125.0
episode_length_test                         15.944
returns_test                             76.058131
return_std_test                           1.845479
average_reward_test                       4.770409
round_time_test             0 days 00:00:04.276569
round_time_total            0 days 00:28:02.880944
loss_total             3274013712215383711154176.0
loss_critic            4092517069760873925443584.0
loss_actor                   -4998683677556.736328
memory_size                                  511.0 

=== epoch 6/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<28:41,  1.16it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [30:39<00:00,  1.09it/s]
episodes                                       125
episode_length                                15.8
returns                                  75.373049
return_std                                2.216279
average_reward                            4.770445
round_time                  0 days 00:30:40.276403
episodes_test                                126.0
episode_length_test                      15.777778
returns_test                             75.271128
return_std_test                           2.051953
average_reward_test                       4.770692
round_time_test             0 days 00:00:04.807047
round_time_total            0 days 00:30:40.278209
loss_total             3306345191137935490023424.0
loss_critic            4132931415964105587032064.0
loss_actor                   -5019831896113.152344
memory_size                                  511.0 

=== epoch 6/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<28:24,  1.17it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [28:26<00:00,  1.17it/s]
episodes                                       126
episode_length                           15.833333
returns                                  75.499591
return_std                                2.138353
average_reward                            4.768507
round_time                  0 days 00:28:27.544849
episodes_test                                125.0
episode_length_test                         15.896
returns_test                             75.811241
return_std_test                            1.89657
average_reward_test                       4.769195
round_time_test             0 days 00:00:04.839115
round_time_total            0 days 00:28:27.547233
loss_total             3420328206003552720519168.0
loss_critic            4275410182924831463309312.0
loss_actor                   -5059711141675.007812
memory_size                                  511.0 

=== epoch 6/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<30:11,  1.10it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [29:14<00:00,  1.14it/s]
episodes                                       125
episode_length                              15.792
returns                                  75.302805
return_std                                1.921376
average_reward                             4.76852
round_time                  0 days 00:29:16.702489
episodes_test                                125.0
episode_length_test                         15.928
returns_test                             76.038958
return_std_test                           6.848439
average_reward_test                       4.774006
round_time_test             0 days 00:00:04.821965
round_time_total            0 days 00:29:16.705433
loss_total             3406848508404007543242752.0
loss_critic            4258560560619154462212096.0
loss_actor                   -5111935638700.032227
memory_size                                  511.0 

=== epoch 6/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:02<37:26,  1.12s/it]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [28:25<00:00,  1.17it/s]
episodes                                       124
episode_length                           16.096774
returns                                  76.773634
return_std                                8.843616
average_reward                            4.769636
round_time                  0 days 00:28:27.550473
episodes_test                                126.0
episode_length_test                      15.793651
returns_test                             75.377074
return_std_test                           2.756204
average_reward_test                       4.772673
round_time_test             0 days 00:00:05.888891
round_time_total            0 days 00:28:27.564120
loss_total             3473257724106314308648960.0
loss_critic            4341572086083703858528256.0
loss_actor                   -5138114086699.007812
memory_size                                  511.0 

=== epoch 6/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 1/2000 [00:01<44:01,  1.32s/it]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:43<00:00,  1.47it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                       124
episode_length                           16.008065
returns                                  76.360504
return_std                                2.612966
average_reward                            4.770045
round_time                  0 days 00:22:46.397189
episodes_test                                124.0
episode_length_test                      16.072581
returns_test                             76.697082
return_std_test                           5.897559
average_reward_test                       4.771989
round_time_test             0 days 00:00:04.903616
round_time_total            0 days 00:22:46.398546
loss_total             3492643912360937478160384.0
loss_critic            4365804821636169133457408.0
loss_actor                   -5179692668682.240234
memory_size                                  511.0 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 7/10 ===== round 1/50 ======================================
  0%|          | 4/2000 [00:02<21:50,  1.52it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [23:15<00:00,  1.43it/s]
episodes                                       125
episode_length                              15.936
returns                                  76.050638
return_std                                3.448263
average_reward                            4.772433
round_time                  0 days 00:23:16.056112
episodes_test                                125.0
episode_length_test                           16.0
returns_test                             76.379307
return_std_test                           3.407938
average_reward_test                       4.773707
round_time_test             0 days 00:00:03.965526
round_time_total            0 days 00:23:16.057911
loss_total             3540276786357452448727040.0
loss_critic            4425345910853192921382912.0
loss_actor                   -5210377757917.183594
memory_size                                  511.0 

=== epoch 7/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<24:54,  1.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [23:36<00:00,  1.41it/s]
episodes                                       123
episode_length                           16.097561
returns                                  76.918139
return_std                                4.702619
average_reward                            4.778179
round_time                  0 days 00:23:36.973018
episodes_test                                124.0
episode_length_test                      16.040323
returns_test                             76.660392
return_std_test                           4.937609
average_reward_test                       4.779209
round_time_test             0 days 00:00:04.654483
round_time_total            0 days 00:23:36.974870
loss_total             3634804208535021616627712.0
loss_critic            4543505189746090241425408.0
loss_actor                   -5252296939405.311523
memory_size                                  511.0 

=== epoch 7/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<23:25,  1.42it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:26<00:00,  1.49it/s]
episodes                                       125
episode_length                               15.96
returns                                     76.126
return_std                                 1.67613
average_reward                            4.769843
round_time                  0 days 00:22:27.086046
episodes_test                                124.0
episode_length_test                      16.008065
returns_test                             76.398858
return_std_test                           2.320454
average_reward_test                       4.772464
round_time_test             0 days 00:00:03.854482
round_time_total            0 days 00:22:27.088092
loss_total             3611190002903813579603968.0
loss_critic            4513987429014128801349632.0
loss_actor                   -5294816235618.303711
memory_size                                  511.0 

=== epoch 7/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:20,  1.72it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:06<00:00,  1.51it/s]
episodes                                       124
episode_length                                16.0
returns                                  76.327613
return_std                                 3.14368
average_reward                            4.770553
round_time                  0 days 00:22:07.606138
episodes_test                                123.0
episode_length_test                      16.170732
returns_test                             77.197243
return_std_test                          10.036226
average_reward_test                       4.773925
round_time_test             0 days 00:00:03.967823
round_time_total            0 days 00:22:07.607997
loss_total             3682620907002362052214784.0
loss_critic            4603276057371903115919360.0
loss_actor                   -5325603666395.135742
memory_size                                  511.0 

=== epoch 7/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<24:09,  1.38it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:33<00:00,  1.48it/s]
episodes                                       123
episode_length                            16.02439
returns                                  76.537607
return_std                                7.829154
average_reward                            4.776177
round_time                  0 days 00:22:34.770026
episodes_test                                124.0
episode_length_test                      16.048387
returns_test                             76.608157
return_std_test                           6.116398
average_reward_test                       4.773623
round_time_test             0 days 00:00:04.473425
round_time_total            0 days 00:22:34.771940
loss_total             3703393298970891786387456.0
loss_critic            4629241547476679958462464.0
loss_actor                   -5355925243691.007812
memory_size                                  511.0 

=== epoch 7/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<26:02,  1.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:13<00:00,  1.50it/s]
episodes                                       125
episode_length                              15.976
returns                                  76.184016
return_std                                0.952873
average_reward                            4.768725
round_time                  0 days 00:22:14.854452
episodes_test                                124.0
episode_length_test                      16.064516
returns_test                              76.69131
return_std_test                           5.051759
average_reward_test                       4.774031
round_time_test             0 days 00:00:04.940375
round_time_total            0 days 00:22:14.855889
loss_total             3773740171939382734880768.0
loss_critic            4717175136669681485611008.0
loss_actor                    -5388575394758.65625
memory_size                                  511.0 

=== epoch 7/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<21:51,  1.52it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:00<00:00,  1.52it/s]
episodes                                       123
episode_length                            16.02439
returns                                  76.502123
return_std                                3.667721
average_reward                            4.773979
round_time                  0 days 00:22:01.102137
episodes_test                                124.0
episode_length_test                      16.032258
returns_test                             76.581043
return_std_test                           5.585283
average_reward_test                       4.776606
round_time_test             0 days 00:00:04.498754
round_time_total            0 days 00:22:01.105946
loss_total             3847325621446325789786112.0
loss_critic            4809156947292352134774784.0
loss_actor                   -5432574856396.799805
memory_size                                  511.0 

=== epoch 7/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:02<35:24,  1.06s/it]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [24:06<00:00,  1.38it/s]
episodes                                       124
episode_length                           16.072581
returns                                  76.719456
return_std                                  4.6049
average_reward                            4.773356
round_time                  0 days 00:24:08.788743
episodes_test                                125.0
episode_length_test                          15.96
returns_test                             76.121149
return_std_test                           2.454952
average_reward_test                       4.769632
round_time_test             0 days 00:00:04.242497
round_time_total            0 days 00:24:08.790610
loss_total             3868068452939822620737536.0
loss_critic            4835085488424634449657856.0
loss_actor                        -5461581856768.0
memory_size                                  511.0 

=== epoch 7/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<25:24,  1.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:22<00:00,  1.56it/s]
episodes                                       123
episode_length                           16.056911
returns                                  76.699035
return_std                                7.735637
average_reward                            4.776582
round_time                  0 days 00:21:23.513652
episodes_test                                124.0
episode_length_test                      16.024194
returns_test                             76.377102
return_std_test                           2.036006
average_reward_test                       4.766434
round_time_test             0 days 00:00:04.073736
round_time_total            0 days 00:21:23.515630
loss_total             3901213611906800872849408.0
loss_critic            4876516937727832412389376.0
loss_actor                   -5507711125684.223633
memory_size                                  511.0 

=== epoch 7/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:08,  1.65it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [23:11<00:00,  1.44it/s]
episodes                                       124
episode_length                           16.080645
returns                                  76.692154
return_std                                4.499312
average_reward                            4.769329
round_time                  0 days 00:23:12.664507
episodes_test                                124.0
episode_length_test                      16.096774
returns_test                             76.856739
return_std_test                           4.255388
average_reward_test                       4.774749
round_time_test             0 days 00:00:05.068158
round_time_total            0 days 00:23:12.666290
loss_total             4032202298733125752061952.0
loss_critic            5040252787163467484233728.0
loss_actor                   -5542869585887.232422
memory_size                                  511.0 

=== epoch 7/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<20:28,  1.63it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [25:42<00:00,  1.30it/s]
episodes                                       124
episode_length                           15.943548
returns                                  76.035451
return_std                                1.503836
average_reward                            4.768993
round_time                  0 days 00:25:43.728907
episodes_test                                124.0
episode_length_test                      16.032258
returns_test                             76.504944
return_std_test                           3.332128
average_reward_test                       4.771912
round_time_test             0 days 00:00:03.869373
round_time_total            0 days 00:25:43.730762
loss_total             4069183262637978992771072.0
loss_critic            5086479000727474533302272.0
loss_actor                   -5590525982015.488281
memory_size                                  511.0 

=== epoch 7/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<28:08,  1.18it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [26:40<00:00,  1.25it/s]
episodes                                       123
episode_length                           16.121951
returns                                  76.883789
return_std                                6.498662
average_reward                            4.768883
round_time                  0 days 00:26:41.571162
episodes_test                                125.0
episode_length_test                           16.0
returns_test                             76.322097
return_std_test                           1.457362
average_reward_test                       4.770131
round_time_test             0 days 00:00:04.805419
round_time_total            0 days 00:26:41.572597
loss_total             4089995982215472792731648.0
loss_critic            5112494896920720650010624.0
loss_actor                   -5630865940676.608398
memory_size                                  511.0 

=== epoch 7/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<25:52,  1.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [28:18<00:00,  1.18it/s]
episodes                                       124
episode_length                           16.032258
returns                                  76.453306
return_std                                6.731993
average_reward                            4.768744
round_time                  0 days 00:28:19.530435
episodes_test                                125.0
episode_length_test                         15.952
returns_test                             76.078616
return_std_test                           1.568237
average_reward_test                       4.769317
round_time_test             0 days 00:00:04.579660
round_time_total            0 days 00:28:19.532055
loss_total             4077676081928214356688896.0
loss_critic            5097095019580063913869312.0
loss_actor                    -5623511462969.34375
memory_size                                  511.0 

=== epoch 7/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<30:43,  1.08it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [28:14<00:00,  1.18it/s]
episodes                                       125
episode_length                               15.92
returns                                   75.95388
return_std                                1.794679
average_reward                            4.771011
round_time                  0 days 00:28:15.964928
episodes_test                                125.0
episode_length_test                           16.0
returns_test                             76.300586
return_std_test                           3.184735
average_reward_test                       4.768787
round_time_test             0 days 00:00:05.241430
round_time_total            0 days 00:28:15.967372
loss_total             4107499017235450476625920.0
loss_critic            5134373686120035253223424.0
loss_actor                   -5677047628431.360352
memory_size                                  511.0 

=== epoch 7/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 1/2000 [00:01<50:02,  1.50s/it]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [27:30<00:00,  1.21it/s]
episodes                                       124
episode_length                           15.927419
returns                                  75.986953
return_std                                1.655253
average_reward                            4.770664
round_time                  0 days 00:27:32.645337
episodes_test                                126.0
episode_length_test                      15.865079
returns_test                             75.645727
return_std_test                           1.725826
average_reward_test                       4.768132
round_time_test             0 days 00:00:04.884111
round_time_total            0 days 00:27:32.647137
loss_total             4252687048130795107516416.0
loss_critic            5315858724649144135188480.0
loss_actor                   -5719252600881.152344
memory_size                                  511.0 

=== epoch 7/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<22:31,  1.48it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [26:10<00:00,  1.27it/s]
episodes                                       123
episode_length                           16.130081
returns                                  77.055683
return_std                                6.072376
average_reward                            4.777047
round_time                  0 days 00:26:11.119567
episodes_test                                125.0
episode_length_test                         15.888
returns_test                             75.774049
return_std_test                           1.730423
average_reward_test                       4.769279
round_time_test             0 days 00:00:04.270020
round_time_total            0 days 00:26:11.120986
loss_total             4280376120000277647458304.0
loss_critic            5350470059081677139869696.0
loss_actor                   -5756528177184.767578
memory_size                                  511.0 

=== epoch 7/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<25:58,  1.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:00<00:00,  1.51it/s]
episodes                                       124
episode_length                           15.959677
returns                                  76.103652
return_std                                4.833079
average_reward                            4.768455
round_time                  0 days 00:22:01.713169
episodes_test                                125.0
episode_length_test                         15.912
returns_test                             75.943267
return_std_test                           1.926609
average_reward_test                        4.77273
round_time_test             0 days 00:00:03.988026
round_time_total            0 days 00:22:01.714943
loss_total             4348245847729479174062080.0
loss_critic            5435307221049022032642048.0
loss_actor                   -5796363658854.400391
memory_size                                  511.0 

=== epoch 7/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<25:39,  1.30it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:39<00:00,  1.54it/s]
episodes                                       125
episode_length                              15.976
returns                                  76.199237
return_std                                3.133282
average_reward                            4.769704
round_time                  0 days 00:21:40.799512
episodes_test                                125.0
episode_length_test                         15.944
returns_test                             76.066966
return_std_test                           1.458274
average_reward_test                        4.77099
round_time_test             0 days 00:00:04.669061
round_time_total            0 days 00:21:40.801382
loss_total             4433149432242893070794752.0
loss_critic            5541436699348918354313216.0
loss_actor                    -5826541974192.12793
memory_size                                  511.0 

=== epoch 7/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<22:04,  1.51it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [20:01<00:00,  1.67it/s]
episodes                                       124
episode_length                           15.919355
returns                                  75.919525
return_std                                1.538802
average_reward                            4.768784
round_time                  0 days 00:20:01.910018
episodes_test                                124.0
episode_length_test                      16.129032
returns_test                             76.873704
return_std_test                           5.972866
average_reward_test                        4.76617
round_time_test             0 days 00:00:03.936003
round_time_total            0 days 00:20:01.911454
loss_total             4475587463444693730197504.0
loss_critic            5594484230064545982316544.0
loss_actor                   -5858326597861.375977
memory_size                                  511.0 

=== epoch 7/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:37,  1.79it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:50<00:00,  2.11it/s]
episodes                                       125
episode_length                               15.92
returns                                  76.000789
return_std                                2.386017
average_reward                            4.773897
round_time                  0 days 00:15:50.924669
episodes_test                                125.0
episode_length_test                         15.952
returns_test                             76.086678
return_std_test                           1.180514
average_reward_test                       4.769771
round_time_test             0 days 00:00:04.055422
round_time_total            0 days 00:15:50.925894
loss_total             4340667491837570787049472.0
loss_critic            5425834267447153685168128.0
loss_actor                   -5878451036487.679688
memory_size                                  511.0 

=== epoch 7/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:17,  1.92it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:53<00:00,  2.24it/s]
episodes                                       124
episode_length                           16.040323
returns                                  76.545657
return_std                                6.111065
average_reward                            4.772077
round_time                  0 days 00:14:54.299272
episodes_test                                125.0
episode_length_test                         15.944
returns_test                             76.066876
return_std_test                           1.962854
average_reward_test                       4.770924
round_time_test             0 days 00:00:03.523217
round_time_total            0 days 00:14:54.300459
loss_total             4423644907687060529217536.0
loss_critic            5529556042411134613979136.0
loss_actor                   -5925892482138.112305
memory_size                                  511.0 

=== epoch 7/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:53,  2.09it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [13:40<00:00,  2.44it/s]
episodes                                       124
episode_length                           15.935484
returns                                   75.98667
return_std                                  1.4551
average_reward                            4.768325
round_time                  0 days 00:13:41.304364
episodes_test                                125.0
episode_length_test                         15.936
returns_test                             76.001585
return_std_test                           1.971196
average_reward_test                       4.769205
round_time_test             0 days 00:00:03.280107
round_time_total            0 days 00:13:41.305510
loss_total             4584354263521435316125696.0
loss_critic            5730442725692902207389696.0
loss_actor                    -5980024573263.87207
memory_size                                  511.0 

=== epoch 7/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:38,  2.13it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:55<00:00,  2.58it/s]
episodes                                       124
episode_length                           16.032258
returns                                  76.599658
return_std                                6.254114
average_reward                            4.777925
round_time                  0 days 00:12:56.517512
episodes_test                                125.0
episode_length_test                          15.92
returns_test                             75.963226
return_std_test                           2.833761
average_reward_test                       4.771549
round_time_test             0 days 00:00:03.244753
round_time_total            0 days 00:12:56.518631
loss_total             4606177965987145634021376.0
loss_critic            5757722362566065736646656.0
loss_actor                   -6004487985037.311523
memory_size                                  511.0 

=== epoch 7/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:17,  2.18it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:46<00:00,  2.61it/s]
episodes                                       125
episode_length                              15.912
returns                                  75.904996
return_std                                1.458899
average_reward                            4.770354
round_time                  0 days 00:12:46.994196
episodes_test                                125.0
episode_length_test                         15.928
returns_test                             75.962614
return_std_test                            1.23552
average_reward_test                       4.769253
round_time_test             0 days 00:00:03.254505
round_time_total            0 days 00:12:46.995271
loss_total             4549051358635966148378624.0
loss_critic            5686314099251794619138048.0
loss_actor                   -6015458938978.303711
memory_size                                  511.0 

=== epoch 7/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:47,  2.25it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:33<00:00,  2.66it/s]
episodes                                       124
episode_length                           16.008065
returns                                  76.323829
return_std                                4.342562
average_reward                            4.767817
round_time                  0 days 00:12:33.609330
episodes_test                                126.0
episode_length_test                      15.865079
returns_test                             75.676984
return_std_test                            1.90449
average_reward_test                       4.770107
round_time_test             0 days 00:00:03.136547
round_time_total            0 days 00:12:33.610417
loss_total             4609763097183054376992768.0
loss_critic            5762203773786734578368512.0
loss_actor                   -6056551759740.927734
memory_size                                  511.0 

=== epoch 7/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:08,  2.20it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:31<00:00,  2.66it/s]
episodes                                       124
episode_length                           16.129032
returns                                  76.930715
return_std                                 5.52182
average_reward                            4.769704
round_time                  0 days 00:12:31.661851
episodes_test                                125.0
episode_length_test                          15.96
returns_test                             76.128945
return_std_test                           1.396235
average_reward_test                       4.770138
round_time_test             0 days 00:00:03.151042
round_time_total            0 days 00:12:31.662926
loss_total             4753902649804952063442944.0
loss_critic            5942378206673800523677696.0
loss_actor                    -6105372013625.34375
memory_size                                  511.0 

=== epoch 7/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:09,  2.06it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:28<00:00,  2.67it/s]
episodes                                       125
episode_length                              15.904
returns                                  75.873186
return_std                                1.531483
average_reward                            4.770713
round_time                  0 days 00:12:29.415136
episodes_test                                125.0
episode_length_test                         15.896
returns_test                             75.828364
return_std_test                           1.686761
average_reward_test                       4.770247
round_time_test             0 days 00:00:03.193396
round_time_total            0 days 00:12:29.416205
loss_total             4835402589252298196647936.0
loss_critic            6044253129073456836509696.0
loss_actor                   -6139629176881.152344
memory_size                                  511.0 

=== epoch 7/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:38,  2.27it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:28<00:00,  2.67it/s]
episodes                                       125
episode_length                              15.864
returns                                  75.674866
return_std                                1.834689
average_reward                            4.770159
round_time                  0 days 00:12:29.107435
episodes_test                                125.0
episode_length_test                         15.992
returns_test                             76.370514
return_std_test                           3.834785
average_reward_test                       4.775601
round_time_test             0 days 00:00:03.197759
round_time_total            0 days 00:12:29.108514
loss_total             4988940200326030635827200.0
loss_critic            6236175139907217777491968.0
loss_actor                   -6199467621482.496094
memory_size                                  511.0 

=== epoch 7/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:17,  2.33it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:33<00:00,  2.65it/s]
episodes                                       123
episode_length                           16.162602
returns                                  77.117347
return_std                                7.979135
average_reward                            4.771372
round_time                  0 days 00:12:34.458578
episodes_test                                124.0
episode_length_test                      16.040323
returns_test                             76.567405
return_std_test                           5.075778
average_reward_test                       4.773607
round_time_test             0 days 00:00:03.225986
round_time_total            0 days 00:12:34.459647
loss_total             4941840067289558269558784.0
loss_critic            6177299970945496037457920.0
loss_actor                   -6189509143429.120117
memory_size                                  511.0 

=== epoch 7/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:03,  2.37it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:36<00:00,  2.64it/s]
episodes                                       125
episode_length                              15.936
returns                                  75.978852
return_std                                1.296587
average_reward                            4.767745
round_time                  0 days 00:12:37.457675
episodes_test                                124.0
episode_length_test                      16.080645
returns_test                             76.771647
return_std_test                           8.254476
average_reward_test                       4.774224
round_time_test             0 days 00:00:03.169094
round_time_total            0 days 00:12:37.458751
loss_total             5073882732041079235805184.0
loss_critic            6342353303776409700794368.0
loss_actor                   -6240977664606.208008
memory_size                                  511.0 

=== epoch 7/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:17,  2.33it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:31<00:00,  2.66it/s]
episodes                                       124
episode_length                           15.951613
returns                                  76.071028
return_std                                1.310856
average_reward                            4.768758
round_time                  0 days 00:12:32.199121
episodes_test                                125.0
episode_length_test                           16.0
returns_test                             76.285102
return_std_test                           2.576327
average_reward_test                       4.767819
round_time_test             0 days 00:00:03.209286
round_time_total            0 days 00:12:32.200201
loss_total             5033414794875779885826048.0
loss_critic            6291768384067181146537984.0
loss_actor                   -6289821134487.551758
memory_size                                  511.0 

=== epoch 7/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:37,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:27<00:00,  2.67it/s]
episodes                                       125
episode_length                              15.928
returns                                  75.969018
return_std                                2.691112
average_reward                            4.769506
round_time                  0 days 00:12:28.227817
episodes_test                                125.0
episode_length_test                         15.992
returns_test                             76.268115
return_std_test                           1.232782
average_reward_test                       4.769211
round_time_test             0 days 00:00:03.145218
round_time_total            0 days 00:12:28.228881
loss_total             5119063305464095317164032.0
loss_critic            6398829012070398089494528.0
loss_actor                   -6317924360650.751953
memory_size                                  511.0 

=== epoch 7/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:16,  2.33it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:33<00:00,  2.66it/s]
episodes                                       125
episode_length                              15.856
returns                                  75.625707
return_std                                1.917153
average_reward                             4.76959
round_time                  0 days 00:12:33.724555
episodes_test                                125.0
episode_length_test                          15.92
returns_test                             75.886932
return_std_test                           1.436517
average_reward_test                       4.766846
round_time_test             0 days 00:00:03.186483
round_time_total            0 days 00:12:33.725633
loss_total             5254134094408173710475264.0
loss_critic            6567667493999097201819648.0
loss_actor                   -6382328344477.696289
memory_size                                  511.0 

=== epoch 7/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:02,  2.37it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:33<00:00,  2.65it/s]
episodes                                       124
episode_length                           15.919355
returns                                  75.974043
return_std                                1.985375
average_reward                            4.772499
round_time                  0 days 00:12:34.283384
episodes_test                                125.0
episode_length_test                         15.984
returns_test                             76.210449
return_std_test                           3.438186
average_reward_test                       4.768077
round_time_test             0 days 00:00:03.225447
round_time_total            0 days 00:12:34.284463
loss_total             5345192579917544020770816.0
loss_critic            6681490605713669815795712.0
loss_actor                   -6410528141606.912109
memory_size                                  511.0 

=== epoch 7/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:36,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:32<00:00,  2.66it/s]
episodes                                       125
episode_length                              15.968
returns                                  76.219121
return_std                                4.042284
average_reward                            4.773303
round_time                  0 days 00:12:33.267583
episodes_test                                124.0
episode_length_test                      16.056452
returns_test                             76.624036
return_std_test                           5.061322
average_reward_test                       4.772263
round_time_test             0 days 00:00:03.179301
round_time_total            0 days 00:12:33.268659
loss_total             5394795728598948502306816.0
loss_critic            6743494535584644803330048.0
loss_actor                   -6460645912346.624023
memory_size                                  511.0 

=== epoch 7/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:31,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:29<00:00,  2.67it/s]
episodes                                       124
episode_length                           15.959677
returns                                  76.087546
return_std                                1.268785
average_reward                            4.767576
round_time                  0 days 00:12:29.840277
episodes_test                                125.0
episode_length_test                         15.936
returns_test                             75.982166
return_std_test                           1.443032
average_reward_test                       4.768033
round_time_test             0 days 00:00:03.226638
round_time_total            0 days 00:12:29.841353
loss_total             5413184565748938024419328.0
loss_critic            6766480582526534544261120.0
loss_actor                   -6481903423848.448242
memory_size                                  511.0 

=== epoch 7/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:54,  2.23it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:34<00:00,  2.65it/s]
episodes                                       124
episode_length                           15.983871
returns                                  76.241137
return_std                                3.059224
average_reward                             4.76989
round_time                  0 days 00:12:35.527927
episodes_test                                125.0
episode_length_test                         15.952
returns_test                             76.103266
return_std_test                           2.989468
average_reward_test                       4.770907
round_time_test             0 days 00:00:03.192244
round_time_total            0 days 00:12:35.529006
loss_total             5503401707510897815060480.0
loss_critic            6879252012070856539766784.0
loss_actor                   -6503214185709.568359
memory_size                                  511.0 

=== epoch 7/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:15,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:34<00:00,  2.65it/s]
episodes                                       124
episode_length                           15.943548
returns                                  76.026243
return_std                                1.517082
average_reward                            4.768336
round_time                  0 days 00:12:34.593486
episodes_test                                125.0
episode_length_test                         15.976
returns_test                             76.231996
return_std_test                           2.274424
average_reward_test                       4.771794
round_time_test             0 days 00:00:03.192162
round_time_total            0 days 00:12:34.594570
loss_total             5589706600595195793768448.0
loss_critic            6987133122571549175447552.0
loss_actor                   -6535176563720.192383
memory_size                                  511.0 

=== epoch 7/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:18,  2.33it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:38<00:00,  2.64it/s]
episodes                                       125
episode_length                              15.936
returns                                  76.022655
return_std                                1.389946
average_reward                            4.770583
round_time                  0 days 00:12:39.365941
episodes_test                                124.0
episode_length_test                      16.064516
returns_test                             76.584566
return_std_test                            5.83516
average_reward_test                       4.767428
round_time_test             0 days 00:00:03.162538
round_time_total            0 days 00:12:39.367001
loss_total             5515730541210433876918272.0
loss_critic            6894663040900650418307072.0
loss_actor                   -6566368799621.120117
memory_size                                  511.0 

=== epoch 7/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:31,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:40<00:00,  2.63it/s]
episodes                                       124
episode_length                           16.040323
returns                                  76.580086
return_std                                5.450799
average_reward                            4.774284
round_time                  0 days 00:12:40.869920
episodes_test                                124.0
episode_length_test                      16.072581
returns_test                             76.751891
return_std_test                           4.869606
average_reward_test                       4.775378
round_time_test             0 days 00:00:03.191424
round_time_total            0 days 00:12:40.871073
loss_total             5644056373227926891528192.0
loss_critic            7055070336849253459034112.0
loss_actor                   -6621834660544.511719
memory_size                                  511.0 

=== epoch 7/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:55,  2.23it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:35<00:00,  2.65it/s]
episodes                                       123
episode_length                           16.105691
returns                                  76.972959
return_std                                4.976486
average_reward                            4.779279
round_time                  0 days 00:12:36.129801
episodes_test                                124.0
episode_length_test                      16.096774
returns_test                             76.849477
return_std_test                           2.694089
average_reward_test                       4.774325
round_time_test             0 days 00:00:03.192245
round_time_total            0 days 00:12:36.130864
loss_total             5773290973932089442828288.0
loss_critic            7216613585315527187759104.0
loss_actor                   -6663123918389.248047
memory_size                                  511.0 

=== epoch 7/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:07,  2.36it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:34<00:00,  2.65it/s]
episodes                                       124
episode_length                           15.959677
returns                                  76.114943
return_std                                 1.27439
average_reward                             4.77153
round_time                  0 days 00:12:35.454932
episodes_test                                125.0
episode_length_test                         15.968
returns_test                             76.153597
return_std_test                           1.444961
average_reward_test                       4.769227
round_time_test             0 days 00:00:03.148535
round_time_total            0 days 00:12:35.456030
loss_total             5738199351856286671044608.0
loss_critic            7172749062638705087348736.0
loss_actor                   -6708982781050.879883
memory_size                                  511.0 

=== epoch 7/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:44,  2.26it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:37<00:00,  2.64it/s]
episodes                                       125
episode_length                               15.92
returns                                  75.930017
return_std                                1.652412
average_reward                            4.769417
round_time                  0 days 00:12:38.084386
episodes_test                                125.0
episode_length_test                         15.912
returns_test                             75.907179
return_std_test                           2.254886
average_reward_test                       4.770498
round_time_test             0 days 00:00:03.159031
round_time_total            0 days 00:12:38.085457
loss_total             5576908123528124495298560.0
loss_critic            6971135024670456621826048.0
loss_actor                   -6693148765454.335938
memory_size                                  511.0 

=== epoch 7/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:16,  2.18it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:34<00:00,  2.65it/s]
episodes                                       124
episode_length                                16.0
returns                                  76.479866
return_std                                9.027676
average_reward                            4.779799
round_time                  0 days 00:12:35.131618
episodes_test                                125.0
episode_length_test                          15.88
returns_test                             75.740912
return_std_test                           2.080842
average_reward_test                       4.769675
round_time_test             0 days 00:00:03.134034
round_time_total            0 days 00:12:35.132705
loss_total             5723087546509191102332928.0
loss_critic            7153859305450431714426880.0
loss_actor                   -6741782577283.072266
memory_size                                  511.0 

=== epoch 7/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:49,  2.24it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:36<00:00,  2.65it/s]
episodes                                       125
episode_length                              15.824
returns                                  75.489067
return_std                                1.974489
average_reward                            4.770544
round_time                  0 days 00:12:36.580431
episodes_test                                124.0
episode_length_test                      16.016129
returns_test                             76.480973
return_std_test                           8.072096
average_reward_test                       4.775239
round_time_test             0 days 00:00:03.161449
round_time_total            0 days 00:12:36.581513
loss_total             6009498399524385248509952.0
loss_critic            7511872867035680419610624.0
loss_actor                    -6822173641277.44043
memory_size                                  511.0 

=== epoch 7/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:21,  2.32it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:35<00:00,  2.65it/s]
episodes                                       123
episode_length                           16.081301
returns                                  76.721301
return_std                                6.543552
average_reward                              4.7708
round_time                  0 days 00:12:35.707049
episodes_test                                126.0
episode_length_test                      15.833333
returns_test                             75.508835
return_std_test                           1.877628
average_reward_test                       4.769112
round_time_test             0 days 00:00:03.187512
round_time_total            0 days 00:12:35.708111
loss_total             6008193566920603754037248.0
loss_critic            7510241813995134103060480.0
loss_actor                    -6841623357161.47168
memory_size                                  511.0 

=== epoch 7/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:26,  2.30it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:32<00:00,  2.66it/s]
episodes                                       125
episode_length                              15.928
returns                                  75.956764
return_std                                 3.09719
average_reward                            4.768804
round_time                  0 days 00:12:33.186747
episodes_test                                126.0
episode_length_test                      15.873016
returns_test                             75.690425
return_std_test                           1.796156
average_reward_test                       4.768497
round_time_test             0 days 00:00:03.201177
round_time_total            0 days 00:12:33.187823
loss_total             6240584246808647066189824.0
loss_critic            7800730173366790067847168.0
loss_actor                   -6923453764468.736328
memory_size                                  511.0 

=== epoch 7/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:41,  2.27it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:45<00:00,  2.61it/s]
episodes                                       124
episode_length                           15.943548
returns                                   76.08916
return_std                                3.348226
average_reward                            4.772188
round_time                  0 days 00:12:45.635463
episodes_test                                125.0
episode_length_test                         15.944
returns_test                             76.102857
return_std_test                           2.214641
average_reward_test                       4.773216
round_time_test             0 days 00:00:03.140335
round_time_total            0 days 00:12:45.636535
loss_total             6132604108565273460604928.0
loss_critic            7665755001247121591500800.0
loss_actor                   -6925509426151.423828
memory_size                                  511.0 

=== epoch 7/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:30,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:42<00:00,  2.62it/s]
episodes                                       124
episode_length                           15.983871
returns                                  76.220361
return_std                                1.611874
average_reward                            4.768575
round_time                  0 days 00:12:42.895828
episodes_test                                123.0
episode_length_test                      16.195122
returns_test                             77.316359
return_std_test                           9.048526
average_reward_test                       4.774145
round_time_test             0 days 00:00:03.182339
round_time_total            0 days 00:12:42.896970
loss_total             6297832843559249096212480.0
loss_critic            7872290911270621767794688.0
loss_actor                   -6985933852246.015625
memory_size                                  511.0 

=== epoch 7/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:47,  1.98it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:53<00:00,  2.59it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                       124
episode_length                           15.991935
returns                                  76.249684
return_std                                1.279843
average_reward                            4.767918
round_time                  0 days 00:12:53.586457
episodes_test                                125.0
episode_length_test                         15.984
returns_test                             76.250652
return_std_test                            4.06067
average_reward_test                       4.770543
round_time_test             0 days 00:00:03.192014
round_time_total            0 days 00:12:53.587534
loss_total             6454335419623786781081600.0
loss_critic            8067919139313658582007808.0
loss_actor                   -7043258218971.135742
memory_size                                  511.0 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 8/10 ===== round 1/50 ======================================
  0%|          | 4/2000 [00:01<14:45,  2.25it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:47<00:00,  2.61it/s]
episodes                                       125
episode_length                              15.992
returns                                  76.235297
return_std                                6.256396
average_reward                            4.767155
round_time                  0 days 00:12:47.243520
episodes_test                                125.0
episode_length_test                          15.92
returns_test                             75.914499
return_std_test                           1.314479
average_reward_test                       4.768598
round_time_test             0 days 00:00:03.196716
round_time_total            0 days 00:12:47.244649
loss_total             6608666836536933920800768.0
loss_critic            8260833404114024534114304.0
loss_actor                   -7093604410195.967773
memory_size                                  511.0 

=== epoch 8/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:43,  2.12it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:37<00:00,  2.64it/s]
episodes                                       125
episode_length                              15.856
returns                                  75.619532
return_std                                1.954118
average_reward                            4.769224
round_time                  0 days 00:12:37.538485
episodes_test                                126.0
episode_length_test                       15.81746
returns_test                             75.419228
return_std_test                           2.028142
average_reward_test                       4.768172
round_time_test             0 days 00:00:03.181452
round_time_total            0 days 00:12:37.539608
loss_total             6206452803054096959406080.0
loss_critic            7758065874690412187222016.0
loss_actor                   -7056505847676.927734
memory_size                                  511.0 

=== epoch 8/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:20,  2.04it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:47<00:00,  2.60it/s]
episodes                                       124
episode_length                           15.935484
returns                                   75.99021
return_std                                1.711874
average_reward                            4.768606
round_time                  0 days 00:12:48.362478
episodes_test                                125.0
episode_length_test                         15.888
returns_test                             75.759204
return_std_test                           1.710236
average_reward_test                        4.76849
round_time_test             0 days 00:00:03.202506
round_time_total            0 days 00:12:48.363581
loss_total             6315629938558851532980224.0
loss_critic            7894537286000905965010944.0
loss_actor                   -7100544871563.263672
memory_size                                  511.0 

=== epoch 8/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:58,  2.22it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [17:27<00:00,  1.91it/s]
episodes                                       123
episode_length                           16.130081
returns                                  76.848727
return_std                                5.945264
average_reward                            4.764455
round_time                  0 days 00:17:27.989881
episodes_test                                124.0
episode_length_test                      16.016129
returns_test                             76.437923
return_std_test                           7.243997
average_reward_test                       4.772542
round_time_test             0 days 00:00:03.172188
round_time_total            0 days 00:17:27.991788
loss_total             6224802480072259914432512.0
loss_critic            7781002965991142016942080.0
loss_actor                   -7112987895660.543945
memory_size                                  511.0 

=== epoch 8/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<26:05,  1.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [31:56<00:00,  1.04it/s]
episodes                                       123
episode_length                           16.089431
returns                                  76.743075
return_std                                3.348486
average_reward                            4.769997
round_time                  0 days 00:31:57.410362
episodes_test                                122.0
episode_length_test                      16.327869
returns_test                             77.874645
return_std_test                           8.663124
average_reward_test                        4.76952
round_time_test             0 days 00:00:03.825340
round_time_total            0 days 00:31:57.413883
loss_total             6436242711226947978919936.0
loss_critic            8045303239838436155195392.0
loss_actor                   -7196077019627.519531
memory_size                                  511.0 

=== epoch 8/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 1/2000 [00:01<38:30,  1.16s/it]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [29:57<00:00,  1.11it/s]
episodes                                       124
episode_length                           16.040323
returns                                  76.511988
return_std                                3.081051
average_reward                            4.770242
round_time                  0 days 00:29:59.965938
episodes_test                                122.0
episode_length_test                      16.286885
returns_test                             77.687998
return_std_test                           6.664609
average_reward_test                        4.76995
round_time_test             0 days 00:00:04.290517
round_time_total            0 days 00:29:59.969259
loss_total             6541288042672289768013824.0
loss_critic            8176609920934533541134336.0
loss_actor                   -7231056887676.927734
memory_size                                  511.0 

=== epoch 8/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 1/2000 [00:01<55:30,  1.67s/it]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [30:24<00:00,  1.10it/s]
episodes                                       122
episode_length                           16.147541
returns                                  77.083096
return_std                                5.836066
average_reward                            4.777159
round_time                  0 days 00:30:26.951197
episodes_test                                125.0
episode_length_test                         15.984
returns_test                             76.241771
return_std_test                           1.849523
average_reward_test                       4.770033
round_time_test             0 days 00:00:04.467605
round_time_total            0 days 00:30:26.953821
loss_total             6724876235061667427254272.0
loss_critic            8406095150288357591875584.0
loss_actor                    -7273684575977.47168
memory_size                                  511.0 

=== epoch 8/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 1/2000 [00:01<38:58,  1.17s/it]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [31:50<00:00,  1.05it/s]
episodes                                       124
episode_length                           15.991935
returns                                  76.253767
return_std                                2.568761
average_reward                              4.7687
round_time                  0 days 00:31:52.525158
episodes_test                                123.0
episode_length_test                      16.154472
returns_test                             77.077963
return_std_test                           7.216174
average_reward_test                       4.771333
round_time_test             0 days 00:00:04.180458
round_time_total            0 days 00:31:52.528364
loss_total             6755866942655411295092736.0
loss_critic            8444833530889426710822912.0
loss_actor                   -7352510242619.391602
memory_size                                  511.0 

=== epoch 8/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 1/2000 [00:01<44:59,  1.35s/it]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [32:18<00:00,  1.03it/s]
episodes                                       125
episode_length                              15.952
returns                                   76.09923
return_std                                2.817616
average_reward                             4.77057
round_time                  0 days 00:32:20.553102
episodes_test                                125.0
episode_length_test                         15.968
returns_test                              76.15779
return_std_test                           1.976841
average_reward_test                        4.76954
round_time_test             0 days 00:00:04.245150
round_time_total            0 days 00:32:20.556099
loss_total             6977243094139011793420288.0
loss_critic            8721553718802775485186048.0
loss_actor                    -7405352842690.55957
memory_size                                  511.0 

=== epoch 8/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 1/2000 [00:00<31:18,  1.06it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [29:07<00:00,  1.14it/s]
episodes                                       121
episode_length                            16.31405
returns                                  77.830261
return_std                               15.054352
average_reward                            4.770643
round_time                  0 days 00:29:09.842669
episodes_test                                125.0
episode_length_test                          15.96
returns_test                             76.190592
return_std_test                           3.740391
average_reward_test                       4.773993
round_time_test             0 days 00:00:05.038664
round_time_total            0 days 00:29:09.845387
loss_total             7352884708075821250641920.0
loss_critic            9191105738637717035548672.0
loss_actor                   -7484441812008.959961
memory_size                                511.836 

=== epoch 8/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 1/2000 [00:00<27:02,  1.23it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [31:39<00:00,  1.05it/s]
episodes                                        125
episode_length                               15.968
returns                                   76.161888
return_std                                 1.922193
average_reward                             4.769819
round_time                   0 days 00:31:41.363480
episodes_test                                 125.0
episode_length_test                          15.992
returns_test                              76.283573
return_std_test                            1.727265
average_reward_test                        4.770174
round_time_test              0 days 00:00:04.032497
round_time_total             0 days 00:31:41.366443
loss_total              9040856043162054120439808.0
loss_critic            11301069858388256246202368.0
loss_actor                    -7573906658426.879883
memory_size                                   527.0 

=== epoch 8/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:02<38:34,  1.16s/it]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [30:54<00:00,  1.08it/s]
episodes                                        122
episode_length                            16.180328
returns                                   77.283706
return_std                                10.167296
average_reward                             4.776314
round_time                   0 days 00:30:57.369957
episodes_test                                 125.0
episode_length_test                           15.96
returns_test                              76.185588
return_std_test                            3.773823
average_reward_test                        4.773592
round_time_test              0 days 00:00:04.431584
round_time_total             0 days 00:30:57.373409
loss_total              9263561856999305053732864.0
loss_critic            11579452122874575566405632.0
loss_actor                    -7650425727090.688477
memory_size                                   527.0 

=== epoch 8/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 1/2000 [00:01<37:36,  1.13s/it]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [31:55<00:00,  1.04it/s]
episodes                                        124
episode_length                            16.032258
returns                                   76.504222
return_std                                 4.858921
average_reward                             4.771894
round_time                   0 days 00:31:58.133063
episodes_test                                 126.0
episode_length_test                       15.873016
returns_test                              75.712339
return_std_test                            3.359505
average_reward_test                        4.769877
round_time_test              0 days 00:00:04.935203
round_time_total             0 days 00:31:58.135803
loss_total              9318284121852811784749056.0
loss_critic            11647854956319359570542592.0
loss_actor                    -7672170592862.208008
memory_size                                   527.0 

=== epoch 8/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 1/2000 [00:01<46:41,  1.40s/it]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [35:24<00:00,  1.06s/it]
episodes                                        124
episode_length                            15.983871
returns                                   76.281776
return_std                                 6.039533
average_reward                             4.772355
round_time                   0 days 00:35:26.366387
episodes_test                                 125.0
episode_length_test                            16.0
returns_test                              76.404142
return_std_test                            6.105103
average_reward_test                        4.775259
round_time_test              0 days 00:00:04.350694
round_time_total             0 days 00:35:26.369276
loss_total              9623711631820449409662976.0
loss_critic            12029639345400202263527424.0
loss_actor                    -7763475761463.295898
memory_size                                   527.0 

=== epoch 8/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 1/2000 [00:01<36:11,  1.09s/it]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [44:01<00:00,  1.32s/it]
episodes                                        125
episode_length                               15.856
returns                                   75.618434
return_std                                 2.080038
average_reward                             4.769075
round_time                   0 days 00:44:03.351707
episodes_test                                 125.0
episode_length_test                          15.904
returns_test                              75.893828
return_std_test                            2.517783
average_reward_test                        4.771944
round_time_test              0 days 00:00:04.060951
round_time_total             0 days 00:44:03.353182
loss_total              9749107480359083188420608.0
loss_critic            12186384148399360470679552.0
loss_actor                    -7831630969831.423828
memory_size                                   527.0 

=== epoch 8/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<25:13,  1.32it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [24:48<00:00,  1.34it/s]
episodes                                        124
episode_length                            16.024194
returns                                   76.516936
return_std                                 8.703886
average_reward                             4.775103
round_time                   0 days 00:24:49.937528
episodes_test                                 125.0
episode_length_test                          15.952
returns_test                              76.121877
return_std_test                              3.0154
average_reward_test                        4.772053
round_time_test              0 days 00:00:05.227201
round_time_total             0 days 00:24:49.939704
loss_total              9937341653137618148261888.0
loss_critic            12421676852771257715785728.0
loss_actor                    -7911386473824.255859
memory_size                                   527.0 

=== epoch 8/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<24:43,  1.35it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:51<00:00,  1.46it/s]
episodes                                        125
episode_length                               15.912
returns                                   75.872315
return_std                                 2.133909
average_reward                             4.768302
round_time                   0 days 00:22:52.128764
episodes_test                                 125.0
episode_length_test                          15.984
returns_test                               76.40048
return_std_test                            6.935607
average_reward_test                        4.779941
round_time_test              0 days 00:00:04.224892
round_time_total             0 days 00:22:52.130776
loss_total             10171816272224076487983104.0
loss_critic            12714770128755030214311936.0
loss_actor                    -7995429580701.696289
memory_size                                   527.0 

=== epoch 8/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<26:44,  1.24it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:03<00:00,  1.51it/s]
episodes                                        125
episode_length                               15.896
returns                                   75.804781
return_std                                 1.582577
average_reward                             4.768795
round_time                   0 days 00:22:04.198522
episodes_test                                 124.0
episode_length_test                       16.008065
returns_test                              76.391832
return_std_test                            2.473057
average_reward_test                        4.772082
round_time_test              0 days 00:00:03.836242
round_time_total             0 days 00:22:04.200335
loss_total             10321626752596844589613056.0
loss_critic            12902033229473189747228672.0
loss_actor                    -8061007133671.423828
memory_size                                   527.0 

=== epoch 8/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<27:00,  1.23it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:19<00:00,  1.49it/s]
episodes                                        124
episode_length                            16.008065
returns                                   76.410927
return_std                                 2.857257
average_reward                             4.773233
round_time                   0 days 00:22:20.272888
episodes_test                                 125.0
episode_length_test                          15.936
returns_test                              75.999752
return_std_test                            1.466735
average_reward_test                        4.769201
round_time_test              0 days 00:00:04.454928
round_time_total             0 days 00:22:20.274293
loss_total             10377759805269788080472064.0
loss_critic            12972199542828381910335488.0
loss_actor                    -8120946692980.736328
memory_size                                   527.0 

=== epoch 8/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<22:14,  1.50it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:22<00:00,  1.49it/s]
episodes                                        124
episode_length                            15.919355
returns                                   75.935784
return_std                                 1.497181
average_reward                             4.769925
round_time                   0 days 00:22:23.753046
episodes_test                                 125.0
episode_length_test                          15.952
returns_test                               76.08217
return_std_test                            2.015107
average_reward_test                        4.769574
round_time_test              0 days 00:00:04.620727
round_time_total             0 days 00:22:23.754459
loss_total             10510881200090346399203328.0
loss_critic            13138601273419743384043520.0
loss_actor                    -8194263290478.591797
memory_size                                   527.0 

=== epoch 8/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<23:07,  1.44it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:17<00:00,  1.50it/s]
episodes                                        124
episode_length                            15.991935
returns                                   76.257766
return_std                                 5.133257
average_reward                             4.768444
round_time                   0 days 00:22:18.322222
episodes_test                                 125.0
episode_length_test                          15.968
returns_test                              76.199202
return_std_test                            4.229999
average_reward_test                        4.772142
round_time_test              0 days 00:00:04.428469
round_time_total             0 days 00:22:18.324114
loss_total             10576322601456103719960576.0
loss_critic            13220403015579308803489792.0
loss_actor                    -8224299306975.232422
memory_size                                   527.0 

=== epoch 8/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<25:33,  1.30it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:45<00:00,  1.46it/s]
episodes                                        123
episode_length                            16.138211
returns                                   76.997881
return_std                                 2.718159
average_reward                              4.77115
round_time                   0 days 00:22:46.520736
episodes_test                                 125.0
episode_length_test                            16.0
returns_test                              76.303727
return_std_test                            1.364632
average_reward_test                        4.768983
round_time_test              0 days 00:00:04.441917
round_time_total             0 days 00:22:46.522513
loss_total             10837011102829706018291712.0
loss_critic            13546263637648596032028672.0
loss_actor                    -8309391331950.591797
memory_size                                   527.0 

=== epoch 8/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<21:40,  1.54it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [23:00<00:00,  1.45it/s]
episodes                                        123
episode_length                            16.195122
returns                                   77.365525
return_std                                  8.49055
average_reward                             4.777143
round_time                   0 days 00:23:01.210798
episodes_test                                 122.0
episode_length_test                       16.360656
returns_test                              78.041791
return_std_test                            8.363922
average_reward_test                        4.770175
round_time_test              0 days 00:00:03.802427
round_time_total             0 days 00:23:01.212637
loss_total             11045223155356763045232640.0
loss_critic            13806528707702929796104192.0
loss_actor                    -8386619469463.551758
memory_size                                   527.0 

=== epoch 8/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<24:04,  1.38it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:19<00:00,  1.49it/s]
episodes                                        124
episode_length                            16.016129
returns                                   76.375747
return_std                                 2.629117
average_reward                             4.768678
round_time                   0 days 00:22:20.098923
episodes_test                                 124.0
episode_length_test                       16.008065
returns_test                              76.384526
return_std_test                            3.018268
average_reward_test                         4.77164
round_time_test              0 days 00:00:03.817171
round_time_total             0 days 00:22:20.100715
loss_total             11013198908974370767503360.0
loss_critic            13766498392230950051250176.0
loss_actor                    -8449861535399.935547
memory_size                                   527.0 

=== epoch 8/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<22:41,  1.47it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:46<00:00,  1.53it/s]
episodes                                        123
episode_length                            16.065041
returns                                    76.62483
return_std                                  2.82152
average_reward                             4.769579
round_time                   0 days 00:21:47.430551
episodes_test                                 124.0
episode_length_test                       16.016129
returns_test                              76.425024
return_std_test                            2.941696
average_reward_test                        4.771746
round_time_test              0 days 00:00:03.943069
round_time_total             0 days 00:21:47.432438
loss_total             11035912016882139112931328.0
loss_critic            13794889779781792033669120.0
loss_actor                    -8454350259879.935547
memory_size                                   527.0 

=== epoch 8/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<23:58,  1.39it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:59<00:00,  1.52it/s]
episodes                                        124
episode_length                            16.032258
returns                                   76.520765
return_std                                 6.789174
average_reward                             4.772948
round_time                   0 days 00:22:00.030753
episodes_test                                 126.0
episode_length_test                       15.873016
returns_test                              75.710104
return_std_test                            1.988436
average_reward_test                        4.769737
round_time_test              0 days 00:00:04.130479
round_time_total             0 days 00:22:00.032136
loss_total             11238942569230937926664192.0
loss_critic            14048677970289847754031104.0
loss_actor                    -8509678085472.255859
memory_size                                   527.0 

=== epoch 8/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<28:22,  1.17it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:20<00:00,  1.49it/s]
episodes                                        124
episode_length                            15.951613
returns                                   76.022915
return_std                                 5.701648
average_reward                             4.765932
round_time                   0 days 00:22:20.902290
episodes_test                                 125.0
episode_length_test                          15.992
returns_test                              76.346652
return_std_test                            8.303219
average_reward_test                        4.774094
round_time_test              0 days 00:00:05.896009
round_time_total             0 days 00:22:20.903800
loss_total             11215949297650556223881216.0
loss_critic            14019936383984903135428608.0
loss_actor                    -8556676952358.912109
memory_size                                   527.0 

=== epoch 8/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<19:48,  1.68it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:23<00:00,  1.49it/s]
episodes                                        125
episode_length                               15.904
returns                                   75.888432
return_std                                 2.005949
average_reward                             4.771675
round_time                   0 days 00:22:24.422162
episodes_test                                 124.0
episode_length_test                       16.129032
returns_test                              77.027175
return_std_test                            9.486755
average_reward_test                        4.775685
round_time_test              0 days 00:00:04.028041
round_time_total             0 days 00:22:24.423601
loss_total             11198555215586261370667008.0
loss_critic            13998193782917743889088512.0
loss_actor                    -8616096657833.984375
memory_size                                   527.0 

=== epoch 8/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<24:35,  1.35it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [23:14<00:00,  1.43it/s]
episodes                                        122
episode_length                            16.229508
returns                                   77.471473
return_std                                 5.129232
average_reward                             4.773399
round_time                   0 days 00:23:14.933277
episodes_test                                 125.0
episode_length_test                          15.976
returns_test                              76.210299
return_std_test                            2.576008
average_reward_test                         4.77044
round_time_test              0 days 00:00:04.162232
round_time_total             0 days 00:23:14.934678
loss_total             11176826633468136394850304.0
loss_critic            13971033039345361474289664.0
loss_actor                    -8640049873944.576172
memory_size                                   527.0 

=== epoch 8/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<21:49,  1.52it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:03<00:00,  1.51it/s]
episodes                                        124
episode_length                            16.040323
returns                                   76.596519
return_std                                 6.392818
average_reward                             4.775261
round_time                   0 days 00:22:04.020872
episodes_test                                 125.0
episode_length_test                          15.968
returns_test                              76.202039
return_std_test                             2.82091
average_reward_test                        4.772329
round_time_test              0 days 00:00:04.048236
round_time_total             0 days 00:22:04.022587
loss_total             11359340126762345825304576.0
loss_critic            14199174915546781216407552.0
loss_actor                    -8699874044018.688477
memory_size                                   527.0 

=== epoch 8/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:02<19:22,  1.72it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:48<00:00,  1.53it/s]
episodes                                        124
episode_length                            16.008065
returns                                    76.40575
return_std                                 2.733829
average_reward                             4.772901
round_time                   0 days 00:21:49.116787
episodes_test                                 124.0
episode_length_test                       16.008065
returns_test                              76.343225
return_std_test                            2.913483
average_reward_test                        4.768902
round_time_test              0 days 00:00:04.024658
round_time_total             0 days 00:21:49.118677
loss_total             11406873866082711536402432.0
loss_critic            14258592102451433651044352.0
loss_actor                    -8762058409771.007812
memory_size                                   527.0 

=== epoch 8/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<25:27,  1.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:21<00:00,  1.49it/s]
episodes                                        122
episode_length                            16.254098
returns                                    77.50484
return_std                                 7.732189
average_reward                             4.768288
round_time                   0 days 00:22:22.501927
episodes_test                                 123.0
episode_length_test                       16.154472
returns_test                              77.077083
return_std_test                            3.698401
average_reward_test                        4.771261
round_time_test              0 days 00:00:04.148754
round_time_total             0 days 00:22:22.503889
loss_total             11701157358564152057004032.0
loss_critic            14626446457821056150274048.0
loss_actor                    -8846728900378.623047
memory_size                                   527.0 

=== epoch 8/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:27,  1.71it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [22:35<00:00,  1.48it/s]
episodes                                        120
episode_length                            16.533333
returns                                    79.02571
return_std                                12.552988
average_reward                             4.779991
round_time                   0 days 00:22:35.920205
episodes_test                                 124.0
episode_length_test                       16.120968
returns_test                              76.926187
return_std_test                             2.96794
average_reward_test                        4.771873
round_time_test              0 days 00:00:04.423912
round_time_total             0 days 00:22:35.922095
loss_total             11784599465046565288148992.0
loss_critic            14730749072621444293197824.0
loss_actor                    -8880728241340.416016
memory_size                                531.0915 

=== epoch 8/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<27:19,  1.22it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:45<00:00,  1.53it/s]
episodes                                        117
episode_length                            16.854701
returns                                   80.573153
return_std                                11.228994
average_reward                              4.78055
round_time                   0 days 00:21:47.029711
episodes_test                                 119.0
episode_length_test                       16.714286
returns_test                              79.818654
return_std_test                            9.592116
average_reward_test                        4.775459
round_time_test              0 days 00:00:04.138705
round_time_total             0 days 00:21:47.031061
loss_total             12159219837976071030964224.0
loss_critic            15199024531937855660883968.0
loss_actor                    -8916188421160.960938
memory_size                                   534.0 

=== epoch 8/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<24:52,  1.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:35<00:00,  2.01it/s]
episodes                                        124
episode_length                            15.991935
returns                                   76.288486
return_std                                  2.31225
average_reward                             4.773675
round_time                   0 days 00:16:36.371973
episodes_test                                 122.0
episode_length_test                       16.311475
returns_test                              77.905988
return_std_test                            5.026689
average_reward_test                        4.776192
round_time_test              0 days 00:00:03.751048
round_time_total             0 days 00:16:36.373459
loss_total             12243812032370265240371200.0
loss_critic            15304764766932203770740736.0
loss_actor                    -8987310132559.871094
memory_size                                   534.0 

=== epoch 8/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:43,  1.88it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:38<00:00,  2.28it/s]
episodes                                        122
episode_length                            16.286885
returns                                   77.815634
return_std                                 7.204331
average_reward                             4.777811
round_time                   0 days 00:14:39.221384
episodes_test                                 125.0
episode_length_test                          15.944
returns_test                              76.066204
return_std_test                            1.961388
average_reward_test                         4.77094
round_time_test              0 days 00:00:03.554146
round_time_total             0 days 00:14:39.222537
loss_total             12474821607411813164515328.0
loss_critic            15593526737895866046087168.0
loss_actor                    -9063596562841.599609
memory_size                                   534.0 

=== epoch 8/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:53,  2.09it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [13:21<00:00,  2.49it/s]
episodes                                        123
episode_length                            16.056911
returns                                   76.623722
return_std                                 4.473389
average_reward                             4.772026
round_time                   0 days 00:13:22.241653
episodes_test                                 122.0
episode_length_test                       16.278689
returns_test                              77.760372
return_std_test                            7.224181
average_reward_test                         4.77674
round_time_test              0 days 00:00:03.268037
round_time_total             0 days 00:13:22.242792
loss_total             12553122134199527115063296.0
loss_critic            15691402405315652683300864.0
loss_actor                    -9137275055112.191406
memory_size                                   534.0 

=== epoch 8/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:07,  2.20it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:59<00:00,  2.56it/s]
episodes                                        124
episode_length                            16.072581
returns                                   76.720257
return_std                                 4.068248
average_reward                             4.773454
round_time                   0 days 00:13:00.457314
episodes_test                                 122.0
episode_length_test                       16.278689
returns_test                              77.682964
return_std_test                            5.900827
average_reward_test                        4.772018
round_time_test              0 days 00:00:03.282630
round_time_total             0 days 00:13:00.458408
loss_total             12494846059424511162318848.0
loss_critic            15618557303632317554622464.0
loss_actor                    -9183139835936.767578
memory_size                                   534.0 

=== epoch 8/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:35,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:44<00:00,  2.62it/s]
episodes                                        124
episode_length                            15.919355
returns                                   75.910359
return_std                                 1.299475
average_reward                             4.768137
round_time                   0 days 00:12:44.558785
episodes_test                                 126.0
episode_length_test                       15.857143
returns_test                              75.617429
return_std_test                            1.776948
average_reward_test                        4.768774
round_time_test              0 days 00:00:03.090702
round_time_total             0 days 00:12:44.559868
loss_total             12640024208269351648755712.0
loss_critic            15800029976573885777182720.0
loss_actor                    -9214700784451.583984
memory_size                                   534.0 

=== epoch 8/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:37,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:34<00:00,  2.65it/s]
episodes                                        124
episode_length                            16.040323
returns                                   76.514643
return_std                                 2.630448
average_reward                             4.770128
round_time                   0 days 00:12:35.386530
episodes_test                                 125.0
episode_length_test                          15.976
returns_test                              76.189466
return_std_test                            3.145842
average_reward_test                        4.769157
round_time_test              0 days 00:00:03.155611
round_time_total             0 days 00:12:35.387603
loss_total             12962019587495699878510592.0
loss_critic            16202524201543567927148544.0
loss_actor                    -9276094172889.087891
memory_size                                   534.0 

=== epoch 8/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:59,  2.22it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:37<00:00,  2.64it/s]
episodes                                        122
episode_length                            16.286885
returns                                   77.807389
return_std                                 7.664172
average_reward                             4.777842
round_time                   0 days 00:12:37.523236
episodes_test                                 120.0
episode_length_test                       16.616667
returns_test                              79.170022
return_std_test                            8.105733
average_reward_test                        4.764638
round_time_test              0 days 00:00:03.202614
round_time_total             0 days 00:12:37.524312
loss_total             13194608162036553571368960.0
loss_critic            16493259913594740022444032.0
loss_actor                    -9372034096955.392578
memory_size                                   534.0 

=== epoch 8/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:47,  2.25it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:39<00:00,  2.63it/s]
episodes                                        123
episode_length                            16.162602
returns                                   77.165607
return_std                                 5.142364
average_reward                             4.774281
round_time                   0 days 00:12:39.528707
episodes_test                                 125.0
episode_length_test                          15.992
returns_test                              76.331435
return_std_test                             2.31542
average_reward_test                        4.773183
round_time_test              0 days 00:00:03.157178
round_time_total             0 days 00:12:39.529787
loss_total             13377841121872424627863552.0
loss_critic            16722301124774678688169984.0
loss_actor                    -9408483401400.320312
memory_size                                   534.0 

=== epoch 8/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:59,  2.38it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:35<00:00,  2.65it/s]
episodes                                        123
episode_length                            16.089431
returns                                   76.852357
return_std                                 4.863464
average_reward                             4.776498
round_time                   0 days 00:12:36.350197
episodes_test                                 124.0
episode_length_test                       16.112903
returns_test                               76.88627
return_std_test                            3.904509
average_reward_test                          4.7718
round_time_test              0 days 00:00:03.136022
round_time_total             0 days 00:12:36.351259
loss_total             12881629756821693488168960.0
loss_critic            16102036932296321865351168.0
loss_actor                    -9461218949988.351562
memory_size                                   534.0 

=== epoch 8/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:02,  2.21it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:36<00:00,  2.64it/s]
episodes                                        124
episode_length                            15.967742
returns                                   76.183667
return_std                                 3.898726
average_reward                             4.771057
round_time                   0 days 00:12:37.220631
episodes_test                                 124.0
episode_length_test                       16.120968
returns_test                              76.939346
return_std_test                            6.312204
average_reward_test                          4.7727
round_time_test              0 days 00:00:03.179272
round_time_total             0 days 00:12:37.221711
loss_total             13001369889976366189772800.0
loss_critic            16251712082022300869722112.0
loss_actor                    -9485254201704.447266
memory_size                                   534.0 

=== epoch 8/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:16,  2.33it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:36<00:00,  2.64it/s]
episodes                                        122
episode_length                            16.270492
returns                                   77.626782
return_std                                 5.648912
average_reward                             4.771007
round_time                   0 days 00:12:37.359311
episodes_test                                 123.0
episode_length_test                       16.219512
returns_test                              77.347761
return_std_test                            6.606027
average_reward_test                        4.768937
round_time_test              0 days 00:00:03.225051
round_time_total             0 days 00:12:37.360384
loss_total             13408383883432830642946048.0
loss_critic            16760479555179965950459904.0
loss_actor                    -9603791330213.888672
memory_size                                   534.0 

=== epoch 8/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:51,  2.24it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:35<00:00,  2.65it/s]
episodes                                        121
episode_length                            16.355372
returns                                   78.019397
return_std                                 6.967358
average_reward                             4.770119
round_time                   0 days 00:12:36.079611
episodes_test                                 123.0
episode_length_test                       16.203252
returns_test                              77.273367
return_std_test                            4.507021
average_reward_test                        4.769027
round_time_test              0 days 00:00:03.147489
round_time_total             0 days 00:12:36.080671
loss_total             13294622092442554419118080.0
loss_critic            16618277352110628989304832.0
loss_actor                    -9623986084511.744141
memory_size                                   534.0 

=== epoch 8/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:15,  2.18it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:39<00:00,  2.63it/s]
episodes                                        124
episode_length                            15.991935
returns                                   76.340207
return_std                                 3.729865
average_reward                             4.773748
round_time                   0 days 00:12:39.923656
episodes_test                                 124.0
episode_length_test                       16.080645
returns_test                              76.760309
return_std_test                             4.86172
average_reward_test                        4.773576
round_time_test              0 days 00:00:03.168443
round_time_total             0 days 00:12:39.924729
loss_total             13509882145512242024546304.0
loss_critic            16887352380185155293151232.0
loss_actor                    -9708449947189.248047
memory_size                                   534.0 

=== epoch 8/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:20,  2.32it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:30<00:00,  2.67it/s]
episodes                                        123
episode_length                            16.162602
returns                                   77.172409
return_std                                 5.230004
average_reward                             4.774745
round_time                   0 days 00:12:30.857648
episodes_test                                 123.0
episode_length_test                       16.235772
returns_test                              77.534334
return_std_test                            6.018248
average_reward_test                        4.775618
round_time_test              0 days 00:00:03.167185
round_time_total             0 days 00:12:30.858715
loss_total             13847221523288566450356224.0
loss_critic            17309026602982023282819072.0
loss_actor                    -9805317640552.447266
memory_size                                   534.0 

=== epoch 8/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:09,  2.20it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:35<00:00,  2.65it/s]
episodes                                        125
episode_length                               15.928
returns                                   75.955726
return_std                                 2.952864
average_reward                             4.768743
round_time                   0 days 00:12:36.257042
episodes_test                                 124.0
episode_length_test                        16.08871
returns_test                              76.729035
return_std_test                            5.062941
average_reward_test                        4.769205
round_time_test              0 days 00:00:03.246017
round_time_total             0 days 00:12:36.258099
loss_total             14102137099819285160656896.0
loss_critic            17627671062260321825587200.0
loss_actor                    -9878101782167.552734
memory_size                                   534.0 

=== epoch 8/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:34,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:31<00:00,  2.66it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                        124
episode_length                            15.959677
returns                                   76.136838
return_std                                 3.378872
average_reward                             4.770469
round_time                   0 days 00:12:32.412211
episodes_test                                 125.0
episode_length_test                          15.992
returns_test                               76.27864
return_std_test                            3.028879
average_reward_test                        4.769876
round_time_test              0 days 00:00:03.134327
round_time_total             0 days 00:12:32.413271
loss_total             14133581032097474452389888.0
loss_critic            17666975986615258231865344.0
loss_actor                    -9902111099715.583984
memory_size                                   534.0 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 9/10 ===== round 1/50 ======================================
  0%|          | 4/2000 [00:01<14:29,  2.30it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:37<00:00,  2.64it/s]
episodes                                        126
episode_length                             15.84127
returns                                    75.54806
return_std                                 1.909818
average_reward                             4.769194
round_time                   0 days 00:12:37.560083
episodes_test                                 126.0
episode_length_test                       15.865079
returns_test                              75.687361
return_std_test                            2.225459
average_reward_test                        4.770769
round_time_test              0 days 00:00:03.185708
round_time_total             0 days 00:12:37.561183
loss_total             14405581928801953229307904.0
loss_critic            18006977102163593064873984.0
loss_actor                    -9961669751209.984375
memory_size                                   534.0 

=== epoch 9/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:32,  2.14it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:41<00:00,  2.63it/s]
episodes                                        124
episode_length                            15.975806
returns                                   76.219793
return_std                                 2.700083
average_reward                             4.770967
round_time                   0 days 00:12:41.986735
episodes_test                                 123.0
episode_length_test                       16.219512
returns_test                              77.440891
return_std_test                            5.898447
average_reward_test                        4.774648
round_time_test              0 days 00:00:03.191882
round_time_total             0 days 00:12:41.987817
loss_total             14539232115826400014893056.0
loss_critic            18174039831476580252647424.0
loss_actor                   -10028945635278.847656
memory_size                                   534.0 

=== epoch 9/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:05,  2.21it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:35<00:00,  2.65it/s]
episodes                                        123
episode_length                            16.073171
returns                                   76.638144
return_std                                 5.633178
average_reward                             4.767955
round_time                   0 days 00:12:36.315049
episodes_test                                 123.0
episode_length_test                       16.154472
returns_test                              77.082953
return_std_test                            4.223746
average_reward_test                        4.771596
round_time_test              0 days 00:00:03.211835
round_time_total             0 days 00:12:36.316173
loss_total             14767685589240568321409024.0
loss_critic            18459606674469270069444608.0
loss_actor                   -10078994859294.720703
memory_size                                   534.0 

=== epoch 9/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:04,  2.21it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:35<00:00,  2.65it/s]
episodes                                        125
episode_length                               15.944
returns                                   76.072328
return_std                                 1.140674
average_reward                             4.771268
round_time                   0 days 00:12:35.980144
episodes_test                                 125.0
episode_length_test                           15.92
returns_test                              75.919623
return_std_test                            1.594932
average_reward_test                        4.768874
round_time_test              0 days 00:00:03.193066
round_time_total             0 days 00:12:35.981233
loss_total             15032262963880382764679168.0
loss_critic            18790328385130935930060800.0
loss_actor                   -10164473756123.136719
memory_size                                   534.0 

=== epoch 9/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:10,  2.35it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:37<00:00,  2.64it/s]
episodes                                        124
episode_length                            15.967742
returns                                   76.156488
return_std                                  2.81391
average_reward                             4.769401
round_time                   0 days 00:12:38.203636
episodes_test                                 125.0
episode_length_test                          15.936
returns_test                              75.981508
return_std_test                            1.445275
average_reward_test                        4.767967
round_time_test              0 days 00:00:03.177185
round_time_total             0 days 00:12:38.204727
loss_total             15293892100330756067295232.0
loss_critic            19117364788832423058079744.0
loss_actor                   -10259298880323.583984
memory_size                                   534.0 

=== epoch 9/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:51,  2.24it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:37<00:00,  2.64it/s]
episodes                                        125
episode_length                               15.872
returns                                   75.751654
return_std                                 6.649549
average_reward                             4.772645
round_time                   0 days 00:12:37.542719
episodes_test                                 124.0
episode_length_test                       16.008065
returns_test                              76.400219
return_std_test                            5.839614
average_reward_test                        4.772561
round_time_test              0 days 00:00:03.213767
round_time_total             0 days 00:12:37.543802
loss_total             15497693000241287952072704.0
loss_critic            19372115904857505213186048.0
loss_actor                   -10305075854442.496094
memory_size                                   534.0 

=== epoch 9/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:16,  2.33it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:35<00:00,  2.65it/s]
episodes                                        125
episode_length                               15.848
returns                                   75.640649
return_std                                 3.329815
average_reward                             4.772747
round_time                   0 days 00:12:36.020930
episodes_test                                 126.0
episode_length_test                        15.81746
returns_test                              75.446342
return_std_test                             3.61537
average_reward_test                        4.769879
round_time_test              0 days 00:00:03.132237
round_time_total             0 days 00:12:36.022052
loss_total             15691165390018578908971008.0
loss_critic            19613956404977426463981568.0
loss_actor                   -10387325326983.167969
memory_size                                   534.0 

=== epoch 9/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:05,  2.21it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:36<00:00,  2.64it/s]
episodes                                        125
episode_length                                15.92
returns                                    75.95418
return_std                                 4.695364
average_reward                             4.771026
round_time                   0 days 00:12:37.066669
episodes_test                                 125.0
episode_length_test                          15.904
returns_test                              75.868772
return_std_test                             1.52663
average_reward_test                        4.770531
round_time_test              0 days 00:00:03.202612
round_time_total             0 days 00:12:37.067743
loss_total             15872934901624712028225536.0
loss_critic            19841168282235301210882048.0
loss_actor                   -10451550837669.888672
memory_size                                   534.0 

=== epoch 9/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:50,  2.24it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:39<00:00,  2.63it/s]
episodes                                        124
episode_length                            15.975806
returns                                   76.218016
return_std                                 3.196357
average_reward                             4.770764
round_time                   0 days 00:12:39.645983
episodes_test                                 124.0
episode_length_test                       16.008065
returns_test                              76.426427
return_std_test                            4.896052
average_reward_test                        4.774277
round_time_test              0 days 00:00:03.182999
round_time_total             0 days 00:12:39.647066
loss_total             15744940835092019562938368.0
loss_critic            19681175715786800186261504.0
loss_actor                   -10533142759211.007812
memory_size                                   534.0 

=== epoch 9/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:01,  2.22it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:36<00:00,  2.64it/s]
episodes                                        124
episode_length                            16.064516
returns                                   76.751442
return_std                                 8.062049
average_reward                             4.777537
round_time                   0 days 00:12:36.657996
episodes_test                                 122.0
episode_length_test                       16.286885
returns_test                              77.734879
return_std_test                            7.316772
average_reward_test                        4.772835
round_time_test              0 days 00:00:03.183771
round_time_total             0 days 00:12:36.659078
loss_total             15370438031909407594381312.0
loss_critic            19213047213898202171310080.0
loss_actor                   -10524418751594.496094
memory_size                                   534.0 

=== epoch 9/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:32,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:33<00:00,  2.65it/s]
episodes                                        121
episode_length                            16.454545
returns                                   78.575506
return_std                                 7.347655
average_reward                             4.775402
round_time                   0 days 00:12:34.196550
episodes_test                                 123.0
episode_length_test                       16.195122
returns_test                              77.353567
return_std_test                              6.4964
average_reward_test                        4.776484
round_time_test              0 days 00:00:03.128861
round_time_total             0 days 00:12:34.197641
loss_total             16050433063299510748315648.0
loss_critic            20063040982022958371307520.0
loss_actor                   -10617983160287.232422
memory_size                                   534.0 

=== epoch 9/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:26,  2.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:31<00:00,  2.66it/s]
episodes                                        122
episode_length                            16.237705
returns                                   77.479067
return_std                                 6.554449
average_reward                             4.771534
round_time                   0 days 00:12:32.394588
episodes_test                                 123.0
episode_length_test                       16.162602
returns_test                              77.118727
return_std_test                            3.714586
average_reward_test                         4.77136
round_time_test              0 days 00:00:03.185707
round_time_total             0 days 00:12:32.395674
loss_total             16644443638915758242660352.0
loss_critic            20805554208388738550071296.0
loss_actor                    -10702558219206.65625
memory_size                                   534.0 

=== epoch 9/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:53,  2.24it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:30<00:00,  2.66it/s]
episodes                                        125
episode_length                               15.928
returns                                   75.994166
return_std                                 4.216451
average_reward                              4.77113
round_time                   0 days 00:12:30.974975
episodes_test                                 125.0
episode_length_test                          15.944
returns_test                              76.051439
return_std_test                            2.212991
average_reward_test                        4.769925
round_time_test              0 days 00:00:03.180843
round_time_total             0 days 00:12:30.976050
loss_total             16392356350332913594662912.0
loss_critic            20490445096723433729818624.0
loss_actor                   -10747119755853.824219
memory_size                                   534.0 

=== epoch 9/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:31,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:28<00:00,  2.67it/s]
episodes                                        123
episode_length                            16.154472
returns                                   77.134689
return_std                                10.726294
average_reward                             4.774719
round_time                   0 days 00:12:28.561688
episodes_test                                 125.0
episode_length_test                           15.96
returns_test                               76.12098
return_std_test                            1.912234
average_reward_test                        4.769599
round_time_test              0 days 00:00:03.137934
round_time_total             0 days 00:12:28.562769
loss_total             16913850735477267896467456.0
loss_critic            21142313070587828932444160.0
loss_actor                    -10844398492844.03125
memory_size                                534.9735 

=== epoch 9/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:19,  2.32it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:30<00:00,  2.66it/s]
episodes                                        125
episode_length                               15.864
returns                                   75.645071
return_std                                 2.120421
average_reward                             4.768321
round_time                   0 days 00:12:31.124140
episodes_test                                 124.0
episode_length_test                        16.08871
returns_test                              76.749868
return_std_test                            4.970572
average_reward_test                        4.770551
round_time_test              0 days 00:00:03.139156
round_time_total             0 days 00:12:31.125211
loss_total             17345521870312369284972544.0
loss_critic            21681901967010024564719616.0
loss_actor                   -10863324767453.183594
memory_size                                   540.0 

=== epoch 9/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:58,  2.38it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:36<00:00,  2.64it/s]
episodes                                        125
episode_length                                15.84
returns                                   75.605815
return_std                                 3.946553
average_reward                              4.77295
round_time                   0 days 00:12:36.910473
episodes_test                                 125.0
episode_length_test                          15.912
returns_test                              75.940711
return_std_test                            1.981632
average_reward_test                        4.772566
round_time_test              0 days 00:00:03.166112
round_time_total             0 days 00:12:36.911552
loss_total             17210024130612012733431808.0
loss_critic            21512529810543095549263872.0
loss_actor                   -10926810308018.175781
memory_size                                   540.0 

=== epoch 9/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:35,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:33<00:00,  2.66it/s]
episodes                                        124
episode_length                            16.008065
returns                                   76.395172
return_std                                 4.299951
average_reward                             4.772172
round_time                   0 days 00:12:33.509201
episodes_test                                 125.0
episode_length_test                          15.912
returns_test                              75.921258
return_std_test                            1.778779
average_reward_test                        4.771347
round_time_test              0 days 00:00:03.148383
round_time_total             0 days 00:12:33.510277
loss_total             17777265060435665214242816.0
loss_critic            22221580925624932137697280.0
loss_actor                     -10981369812877.3125
memory_size                                   540.0 

=== epoch 9/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:13,  2.19it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:31<00:00,  2.66it/s]
episodes                                        124
episode_length                            16.048387
returns                                   76.597939
return_std                                 4.953801
average_reward                             4.772783
round_time                   0 days 00:12:31.879851
episodes_test                                 124.0
episode_length_test                       16.056452
returns_test                              76.592934
return_std_test                            2.992833
average_reward_test                        4.770358
round_time_test              0 days 00:00:03.176151
round_time_total             0 days 00:12:31.880915
loss_total             18213194401320488281309184.0
loss_critic            22766492612755774263787520.0
loss_actor                   -11059556956241.919922
memory_size                                   540.0 

=== epoch 9/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:51,  2.24it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:33<00:00,  2.65it/s]
episodes                                        125
episode_length                                15.96
returns                                   76.202923
return_std                                  4.54928
average_reward                             4.774674
round_time                   0 days 00:12:34.139224
episodes_test                                 124.0
episode_length_test                       16.072581
returns_test                              76.719372
return_std_test                            5.562342
average_reward_test                        4.773362
round_time_test              0 days 00:00:03.156376
round_time_total             0 days 00:12:34.140316
loss_total             17989181998259429032591360.0
loss_critic            22486477126295328514899968.0
loss_actor                   -11075488579059.712891
memory_size                                   540.0 

=== epoch 9/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:21,  2.32it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:34<00:00,  2.65it/s]
episodes                                        124
episode_length                            15.983871
returns                                   76.344108
return_std                                 5.964171
average_reward                             4.776271
round_time                   0 days 00:12:35.076865
episodes_test                                 127.0
episode_length_test                       15.653543
returns_test                              74.684303
return_std_test                            2.375941
average_reward_test                        4.771166
round_time_test              0 days 00:00:03.238903
round_time_total             0 days 00:12:35.077939
loss_total             18573757257972786401378304.0
loss_critic            23217196169736088201461760.0
loss_actor                   -11174457538772.992188
memory_size                                   540.0 

=== epoch 9/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:26,  2.30it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:38<00:00,  2.64it/s]
episodes                                        125
episode_length                               15.816
returns                                   75.494867
return_std                                 4.417294
average_reward                             4.773144
round_time                   0 days 00:12:38.716989
episodes_test                                 125.0
episode_length_test                          15.976
returns_test                              76.304768
return_std_test                            5.000573
average_reward_test                        4.776412
round_time_test              0 days 00:00:03.198104
round_time_total             0 days 00:12:38.718091
loss_total             18520958608926760415264768.0
loss_critic            23151197876587072929660928.0
loss_actor                   -11249876875608.064453
memory_size                                   540.0 

=== epoch 9/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:09,  2.35it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:39<00:00,  2.63it/s]
episodes                                        125
episode_length                                15.88
returns                                   75.752506
return_std                                 2.842547
average_reward                             4.770294
round_time                   0 days 00:12:39.996390
episodes_test                                 125.0
episode_length_test                          15.976
returns_test                              76.277158
return_std_test                            4.771305
average_reward_test                         4.77473
round_time_test              0 days 00:00:03.132423
round_time_total             0 days 00:12:39.997461
loss_total             19238419295736903558496256.0
loss_critic            24048023705051734380380160.0
loss_actor                   -11367811483959.296875
memory_size                                   540.0 

=== epoch 9/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:55,  2.23it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:33<00:00,  2.66it/s]
episodes                                        123
episode_length                            16.138211
returns                                   76.982611
return_std                                 8.468846
average_reward                             4.770224
round_time                   0 days 00:12:33.636526
episodes_test                                 123.0
episode_length_test                       16.252033
returns_test                              77.813719
return_std_test                           14.421259
average_reward_test                        4.788004
round_time_test              0 days 00:00:03.172605
round_time_total             0 days 00:12:33.637595
loss_total             19397479690849520837459968.0
loss_critic            24246849190367651554131968.0
loss_actor                   -11414338438955.007812
memory_size                                   540.0 

=== epoch 9/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:00,  2.37it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:27<00:00,  2.68it/s]
episodes                                        124
episode_length                            16.024194
returns                                   76.437529
return_std                                 4.625157
average_reward                             4.770111
round_time                   0 days 00:12:27.924977
episodes_test                                 125.0
episode_length_test                          15.936
returns_test                              76.045731
return_std_test                            2.310401
average_reward_test                        4.772008
round_time_test              0 days 00:00:03.177080
round_time_total             0 days 00:12:27.926067
loss_total             19389391324493549974585344.0
loss_critic            24236738739772563503710208.0
loss_actor                   -11504487538622.464844
memory_size                                   540.0 

=== epoch 9/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:12,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:38<00:00,  2.64it/s]
episodes                                        125
episode_length                               15.856
returns                                   75.650531
return_std                                 6.129506
average_reward                             4.771097
round_time                   0 days 00:12:38.799835
episodes_test                                 125.0
episode_length_test                          15.888
returns_test                              75.777195
return_std_test                            1.822702
average_reward_test                        4.769502
round_time_test              0 days 00:00:03.188154
round_time_total             0 days 00:12:38.800939
loss_total             19576814857121381287460864.0
loss_critic            24471018166221872404889600.0
loss_actor                   -11559914707091.455078
memory_size                                   540.0 

=== epoch 9/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:01,  2.37it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:37<00:00,  2.64it/s]
episodes                                        124
episode_length                            15.983871
returns                                   76.302754
return_std                                 3.773154
average_reward                              4.77357
round_time                   0 days 00:12:37.578345
episodes_test                                 125.0
episode_length_test                          15.888
returns_test                              75.778028
return_std_test                             1.50001
average_reward_test                        4.769555
round_time_test              0 days 00:00:03.218001
round_time_total             0 days 00:12:37.579412
loss_total             19166466561927433330622464.0
loss_critic            23958082774891586356510720.0
loss_actor                   -11584545952890.880859
memory_size                                   540.0 

=== epoch 9/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:15,  2.18it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:29<00:00,  2.67it/s]
episodes                                        127
episode_length                            15.669291
returns                                    74.73313
return_std                                  2.32387
average_reward                             4.769354
round_time                   0 days 00:12:30.429025
episodes_test                                 125.0
episode_length_test                          15.944
returns_test                               76.13634
return_std_test                           11.058135
average_reward_test                        4.775328
round_time_test              0 days 00:00:03.203648
round_time_total             0 days 00:12:30.430086
loss_total             19427001951722344567275520.0
loss_critic            24283752020638022624083968.0
loss_actor                   -11652284142321.664062
memory_size                                   540.0 

=== epoch 9/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:50,  2.24it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:39<00:00,  2.63it/s]
episodes                                        125
episode_length                               15.816
returns                                   75.455985
return_std                                 2.057429
average_reward                             4.770727
round_time                   0 days 00:12:40.086563
episodes_test                                 128.0
episode_length_test                       15.578125
returns_test                              74.358892
return_std_test                            3.832893
average_reward_test                        4.773309
round_time_test              0 days 00:00:03.188517
round_time_total             0 days 00:12:40.087635
loss_total             20070257310546414203305984.0
loss_critic            25087821180545237228978176.0
loss_actor                   -11705245022289.919922
memory_size                                   540.0 

=== epoch 9/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:49,  2.24it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:36<00:00,  2.64it/s]
episodes                                        127
episode_length                            15.716535
returns                                    74.95304
return_std                                 2.140493
average_reward                             4.769187
round_time                   0 days 00:12:36.600786
episodes_test                                 126.0
episode_length_test                        15.81746
returns_test                              75.518484
return_std_test                            3.649062
average_reward_test                        4.774381
round_time_test              0 days 00:00:03.155791
round_time_total             0 days 00:12:36.601859
loss_total             20439761627733951881674752.0
loss_critic            25549701603042448730226688.0
loss_actor                   -11779295736233.984375
memory_size                                   540.0 

=== epoch 9/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:11,  2.19it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:36<00:00,  2.64it/s]
episodes                                        125
episode_length                               15.848
returns                                    75.61303
return_std                                 3.735417
average_reward                             4.771145
round_time                   0 days 00:12:36.652406
episodes_test                                 126.0
episode_length_test                       15.825397
returns_test                              75.648055
return_std_test                            6.011417
average_reward_test                        4.780235
round_time_test              0 days 00:00:03.170378
round_time_total             0 days 00:12:36.653474
loss_total             20295148291675451490828288.0
loss_critic            25368934904866864745152512.0
loss_actor                   -11846260823162.880859
memory_size                                   540.0 

=== epoch 9/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:02,  2.08it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:34<00:00,  2.65it/s]
episodes                                        125
episode_length                                15.84
returns                                   75.580675
return_std                                 5.986733
average_reward                             4.771508
round_time                   0 days 00:12:35.048122
episodes_test                                 127.0
episode_length_test                       15.740157
returns_test                              75.070217
return_std_test                             2.09351
average_reward_test                        4.769398
round_time_test              0 days 00:00:03.187727
round_time_total             0 days 00:12:35.049198
loss_total             20750747357431433653125120.0
loss_critic            25938433719912133598117888.0
loss_actor                   -11943260095250.431641
memory_size                                   540.0 

=== epoch 9/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:19,  2.32it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:31<00:00,  2.66it/s]
episodes                                        126
episode_length                            15.738095
returns                                   75.117475
return_std                                 3.327571
average_reward                             4.772884
round_time                   0 days 00:12:31.840586
episodes_test                                 128.0
episode_length_test                        15.59375
returns_test                              74.378832
return_std_test                            2.396086
average_reward_test                        4.769919
round_time_test              0 days 00:00:03.230104
round_time_total             0 days 00:12:31.841658
loss_total             21155015657371924032389120.0
loss_critic            26443769103989059159588864.0
loss_actor                   -12024683034050.560547
memory_size                                   540.0 

=== epoch 9/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:04,  2.21it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:35<00:00,  2.65it/s]
episodes                                        126
episode_length                            15.753968
returns                                   75.121316
return_std                                 2.121695
average_reward                             4.768424
round_time                   0 days 00:12:35.907457
episodes_test                                 127.0
episode_length_test                       15.708661
returns_test                              74.963571
return_std_test                            2.474564
average_reward_test                        4.772196
round_time_test              0 days 00:00:03.172124
round_time_total             0 days 00:12:35.908533
loss_total             21684484494462982566182912.0
loss_critic            27105605152730785601552384.0
loss_actor                   -12114663482327.039062
memory_size                                   540.0 

=== epoch 9/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:58,  2.22it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:34<00:00,  2.65it/s]
episodes                                        126
episode_length                            15.769841
returns                                   75.212077
return_std                                 2.498418
average_reward                              4.76942
round_time                   0 days 00:12:35.030469
episodes_test                                 126.0
episode_length_test                       15.761905
returns_test                              75.197863
return_std_test                            4.792725
average_reward_test                        4.770785
round_time_test              0 days 00:00:03.176185
round_time_total             0 days 00:12:35.031545
loss_total             21862588442889703345094656.0
loss_critic            27328235090209745556996096.0
loss_actor                   -12207461523521.535156
memory_size                                   540.0 

=== epoch 9/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:31,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:36<00:00,  2.64it/s]
episodes                                        126
episode_length                            15.793651
returns                                   75.324752
return_std                                  1.91314
average_reward                             4.769339
round_time                   0 days 00:12:36.805761
episodes_test                                 125.0
episode_length_test                          15.864
returns_test                              75.761954
return_std_test                            4.673798
average_reward_test                        4.777055
round_time_test              0 days 00:00:03.187661
round_time_total             0 days 00:12:36.806832
loss_total             22100377311094031935078400.0
loss_critic            27625471148371498642702336.0
loss_actor                   -12267617501315.072266
memory_size                                   540.0 

=== epoch 9/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:11,  2.35it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:38<00:00,  2.64it/s]
episodes                                        125
episode_length                               15.864
returns                                   75.663862
return_std                                 1.843046
average_reward                             4.769442
round_time                   0 days 00:12:38.620477
episodes_test                                 125.0
episode_length_test                           15.92
returns_test                              75.918301
return_std_test                            1.524599
average_reward_test                        4.768824
round_time_test              0 days 00:00:03.184637
round_time_total             0 days 00:12:38.621551
loss_total             22439970092109624432918528.0
loss_critic            28049962074560961924038656.0
loss_actor                   -12387572562526.207031
memory_size                                   540.0 

=== epoch 9/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:58,  2.22it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:29<00:00,  2.67it/s]
episodes                                        123
episode_length                            16.146341
returns                                   77.146817
return_std                                11.536371
average_reward                             4.777915
round_time                   0 days 00:12:29.830504
episodes_test                                 125.0
episode_length_test                          15.904
returns_test                              75.879231
return_std_test                            4.180836
average_reward_test                        4.771135
round_time_test              0 days 00:00:03.211766
round_time_total             0 days 00:12:29.831604
loss_total             22840223198506902431465472.0
loss_critic            28550278459286937667633152.0
loss_actor                   -12438125029621.759766
memory_size                                540.2075 

=== epoch 9/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:25,  2.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:31<00:00,  2.66it/s]
episodes                                        126
episode_length                            15.753968
returns                                   75.182826
return_std                                 3.682019
average_reward                             4.772349
round_time                   0 days 00:12:32.271221
episodes_test                                 126.0
episode_length_test                        15.84127
returns_test                              75.554251
return_std_test                             1.82435
average_reward_test                        4.769536
round_time_test              0 days 00:00:03.118793
round_time_total             0 days 00:12:32.272282
loss_total             22826469102346666279698432.0
loss_critic            28533085861496558836514816.0
loss_actor                   -12433163152785.408203
memory_size                                   542.0 

=== epoch 9/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:52,  2.40it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:41<00:00,  2.63it/s]
episodes                                        126
episode_length                            15.674603
returns                                   74.774355
return_std                                 2.386491
average_reward                             4.770207
round_time                   0 days 00:12:42.044361
episodes_test                                 127.0
episode_length_test                       15.700787
returns_test                              74.974752
return_std_test                            4.564768
average_reward_test                        4.775285
round_time_test              0 days 00:00:03.123778
round_time_total             0 days 00:12:42.045426
loss_total             22949831089697130037641216.0
loss_critic            28687288358150599648215040.0
loss_actor                   -12506476776521.728516
memory_size                                   542.0 

=== epoch 9/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:57,  2.23it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:30<00:00,  2.67it/s]
episodes                                        125
episode_length                               15.952
returns                                   76.121269
return_std                                 6.648052
average_reward                             4.772005
round_time                   0 days 00:12:30.826720
episodes_test                                 127.0
episode_length_test                       15.653543
returns_test                              74.644342
return_std_test                            2.369198
average_reward_test                        4.768569
round_time_test              0 days 00:00:03.185325
round_time_total             0 days 00:12:30.827816
loss_total             23109002204761747378667520.0
loss_critic            28886252270716346160906240.0
loss_actor                   -12580452214767.615234
memory_size                                   542.0 

=== epoch 9/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:18,  2.33it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:42<00:00,  2.62it/s]
episodes                                        125
episode_length                                 15.8
returns                                   75.381348
return_std                                 2.353352
average_reward                             4.770931
round_time                   0 days 00:12:42.590635
episodes_test                                 125.0
episode_length_test                           15.96
returns_test                               76.14338
return_std_test                            4.213496
average_reward_test                           4.771
round_time_test              0 days 00:00:03.077807
round_time_total             0 days 00:12:42.591694
loss_total             23425384901255180863930368.0
loss_critic            29281730592478090110173184.0
loss_actor                   -12643316395409.408203
memory_size                                   542.0 

=== epoch 9/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:30,  2.30it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:31<00:00,  2.66it/s]
episodes                                        125
episode_length                               15.856
returns                                   75.646117
return_std                                 6.430864
average_reward                             4.770827
round_time                   0 days 00:12:32.436941
episodes_test                                 126.0
episode_length_test                       15.785714
returns_test                              75.312357
return_std_test                            2.511595
average_reward_test                        4.770891
round_time_test              0 days 00:00:03.076634
round_time_total             0 days 00:12:32.438013
loss_total             22014477884722289389338624.0
loss_critic            27518096881763893095956480.0
loss_actor                     -12614517942386.6875
memory_size                                   542.0 

=== epoch 9/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:07,  2.20it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:28<00:00,  2.67it/s]
episodes                                        125
episode_length                               15.912
returns                                    75.94462
return_std                                 3.729629
average_reward                             4.772824
round_time                   0 days 00:12:28.884211
episodes_test                                 125.0
episode_length_test                          15.896
returns_test                              75.874602
return_std_test                            2.121649
average_reward_test                        4.773111
round_time_test              0 days 00:00:03.134008
round_time_total             0 days 00:12:28.885278
loss_total             22834424370255999382061056.0
loss_critic            28543029993364771880042496.0
loss_actor                   -12749097607888.896484
memory_size                                   542.0 

=== epoch 9/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:30,  2.29it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:27<00:00,  2.67it/s]
episodes                                        126
episode_length                            15.761905
returns                                   75.151648
return_std                                 2.191217
average_reward                             4.767921
round_time                   0 days 00:12:28.235683
episodes_test                                 126.0
episode_length_test                       15.761905
returns_test                              75.175877
return_std_test                            3.897947
average_reward_test                        4.769529
round_time_test              0 days 00:00:03.100416
round_time_total             0 days 00:12:28.236735
loss_total             23299306210888204668633088.0
loss_critic            29124132225484145012244480.0
loss_actor                   -12849478597345.279297
memory_size                                   542.0 

=== epoch 9/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:00,  2.22it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                        125
episode_length                               15.904
returns                                   75.907703
return_std                                 3.378799
average_reward                             4.772861
round_time                   0 days 00:12:25.955104
episodes_test                                 126.0
episode_length_test                       15.785714
returns_test                              75.362349
return_std_test                            2.835096
average_reward_test                        4.774075
round_time_test              0 days 00:00:03.110034
round_time_total             0 days 00:12:25.956162
loss_total             23405770914085931799543808.0
loss_critic            29257213121775126108438528.0
loss_actor                   -12849897035268.095703
memory_size                                   542.0 

=== epoch 9/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:20,  2.32it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                        125
episode_length                               15.832
returns                                   75.537415
return_std                                 2.722324
average_reward                             4.771181
round_time                   0 days 00:12:23.763550
episodes_test                                 126.0
episode_length_test                       15.873016
returns_test                              75.738716
return_std_test                            3.686302
average_reward_test                        4.771539
round_time_test              0 days 00:00:03.113725
round_time_total             0 days 00:12:23.764604
loss_total             23905062853115173603901440.0
loss_critic            29881328020197402178748416.0
loss_actor                   -12953835395874.816406
memory_size                                   542.0 

=== epoch 9/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:40,  2.27it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                        126
episode_length                            15.714286
returns                                   75.039849
return_std                                 6.033142
average_reward                             4.775168
round_time                   0 days 00:12:26.064635
episodes_test                                 125.0
episode_length_test                          15.992
returns_test                              76.428879
return_std_test                            9.479765
average_reward_test                        4.779261
round_time_test              0 days 00:00:03.072011
round_time_total             0 days 00:12:26.065684
loss_total             23707116595538502503890944.0
loss_critic            29633895209179321540280320.0
loss_actor                   -12965984845430.783203
memory_size                                   542.0 

=== epoch 9/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:21,  2.32it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:22<00:00,  2.70it/s]
episodes                                        126
episode_length                            15.777778
returns                                   75.266421
return_std                                 3.946651
average_reward                             4.770449
round_time                   0 days 00:12:22.458195
episodes_test                                 128.0
episode_length_test                       15.617188
returns_test                               74.50119
return_std_test                            4.976035
average_reward_test                        4.770541
round_time_test              0 days 00:00:03.133891
round_time_total             0 days 00:12:22.459244
loss_total             24481618966438545167745024.0
loss_critic            30602023162283963073953792.0
loss_actor                   -13099577895288.832031
memory_size                                   542.0 

=== epoch 9/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:57,  2.38it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                        125
episode_length                               15.952
returns                                   76.121835
return_std                                  6.11444
average_reward                             4.772022
round_time                   0 days 00:12:21.070667
episodes_test                                 127.0
episode_length_test                       15.732283
returns_test                              75.022996
return_std_test                            2.098515
average_reward_test                        4.768868
round_time_test              0 days 00:00:03.070655
round_time_total             0 days 00:12:21.071720
loss_total             24644800413464620716523520.0
loss_critic            30805999959681459275431936.0
loss_actor                   -13119987004735.488281
memory_size                                   542.0 

=== epoch 9/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:24,  2.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                        125
episode_length                               15.888
returns                                   75.769035
return_std                                 2.058458
average_reward                             4.768785
round_time                   0 days 00:12:20.814781
episodes_test                                 125.0
episode_length_test                          15.888
returns_test                              75.769348
return_std_test                            1.626562
average_reward_test                        4.769016
round_time_test              0 days 00:00:03.087118
round_time_total             0 days 00:12:20.815855
loss_total             25181911930604459099422720.0
loss_critic            31477389385217529480216576.0
loss_actor                   -13240983969857.535156
memory_size                                   542.0 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 10/10 ==== round 1/50 ======================================
  0%|          | 4/2000 [00:01<14:12,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:22<00:00,  2.69it/s]
episodes                                        126
episode_length                            15.849206
returns                                   75.592522
return_std                                 1.910028
average_reward                             4.769624
round_time                   0 days 00:12:22.837294
episodes_test                                 126.0
episode_length_test                       15.849206
returns_test                              75.612764
return_std_test                            1.799066
average_reward_test                        4.770893
round_time_test              0 days 00:00:03.107085
round_time_total             0 days 00:12:22.838388
loss_total             25720994459866607409692672.0
loss_critic            32151242500246004979204096.0
loss_actor                   -13330757469601.792969
memory_size                                   542.0 

=== epoch 10/10 ==== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:53,  2.40it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:18<00:00,  2.71it/s]
episodes                                        123
episode_length                            16.081301
returns                                   76.767195
return_std                                 4.252683
average_reward                             4.773574
round_time                   0 days 00:12:19.392764
episodes_test                                 125.0
episode_length_test                          15.912
returns_test                               75.89567
return_std_test                            1.505301
average_reward_test                        4.769684
round_time_test              0 days 00:00:03.132451
round_time_total             0 days 00:12:19.393867
loss_total             26039333765033988649385984.0
loss_critic            32549166661392961380548608.0
loss_actor                   -13417869065846.783203
memory_size                                   542.0 

=== epoch 10/10 ==== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:12,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:16<00:00,  2.72it/s]
episodes                                        125
episode_length                               15.936
returns                                   75.979458
return_std                                 1.672064
average_reward                             4.767769
round_time                   0 days 00:12:16.520805
episodes_test                                 124.0
episode_length_test                       16.048387
returns_test                               76.63115
return_std_test                            2.839241
average_reward_test                        4.775054
round_time_test              0 days 00:00:03.076509
round_time_total             0 days 00:12:16.521878
loss_total             26013334178572748407701504.0
loss_critic            32516667164049007536242688.0
loss_actor                   -13465813842395.136719
memory_size                                   542.0 

=== epoch 10/10 ==== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:54,  2.39it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                        124
episode_length                            15.967742
returns                                   76.177508
return_std                                 4.341743
average_reward                             4.770743
round_time                   0 days 00:12:20.237540
episodes_test                                 126.0
episode_length_test                       15.809524
returns_test                               75.41534
return_std_test                            2.839629
average_reward_test                         4.77034
round_time_test              0 days 00:00:03.123367
round_time_total             0 days 00:12:20.238627
loss_total             26238111799559538966790144.0
loss_critic            32797639200658788829888512.0
loss_actor                   -13519454058577.919922
memory_size                                   542.0 

=== epoch 10/10 ==== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:44,  2.26it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:26<00:00,  2.68it/s]
episodes                                        124
episode_length                            15.927419
returns                                   76.003393
return_std                                 3.750031
average_reward                             4.771752
round_time                   0 days 00:12:27.396824
episodes_test                                 125.0
episode_length_test                          15.888
returns_test                              75.767539
return_std_test                            2.050472
average_reward_test                        4.768907
round_time_test              0 days 00:00:03.110595
round_time_total             0 days 00:12:27.397884
loss_total             26817591964508572413329408.0
loss_critic            33521989357557683069124608.0
loss_actor                   -13611424949469.183594
memory_size                                   542.0 

=== epoch 10/10 ==== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:27,  2.30it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:30<00:00,  2.67it/s]
episodes                                        124
episode_length                            16.008065
returns                                   76.393498
return_std                                 5.308961
average_reward                             4.772168
round_time                   0 days 00:12:30.803388
episodes_test                                 124.0
episode_length_test                       16.064516
returns_test                              76.760989
return_std_test                            8.236696
average_reward_test                        4.778303
round_time_test              0 days 00:00:03.092736
round_time_total             0 days 00:12:30.804479
loss_total             27419427929985503817891840.0
loss_critic            34274284342938654409228288.0
loss_actor                     -13735218747277.3125
memory_size                                   542.0 

=== epoch 10/10 ==== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:18,  2.32it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:33<00:00,  2.65it/s]
episodes                                        124
episode_length                            16.080645
returns                                   76.770021
return_std                                 9.857677
average_reward                             4.774189
round_time                   0 days 00:12:33.839406
episodes_test                                 125.0
episode_length_test                           15.88
returns_test                              75.750332
return_std_test                            1.717369
average_reward_test                        4.770038
round_time_test              0 days 00:00:03.157222
round_time_total             0 days 00:12:33.840476
loss_total             28610341444219641041780736.0
loss_critic            35762926222040385391165440.0
loss_actor                   -13790901638266.880859
memory_size                                 544.269 

=== epoch 10/10 ==== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:38,  2.27it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:28<00:00,  2.67it/s]
episodes                                        125
episode_length                                15.88
returns                                   75.730465
return_std                                 1.756845
average_reward                             4.768891
round_time                   0 days 00:12:29.459415
episodes_test                                 125.0
episode_length_test                           15.92
returns_test                              75.935561
return_std_test                            1.398289
average_reward_test                         4.76986
round_time_test              0 days 00:00:03.125482
round_time_total             0 days 00:12:29.460489
loss_total             29227296734685950809997312.0
loss_critic            36534120326188135381729280.0
loss_actor                   -13866279870398.464844
memory_size                                   546.0 

=== epoch 10/10 ==== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:17,  2.33it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:16<00:00,  2.72it/s]
episodes                                        121
episode_length                            16.322314
returns                                   77.979752
return_std                                14.274893
average_reward                             4.777575
round_time                   0 days 00:12:16.652144
episodes_test                                 123.0
episode_length_test                       16.162602
returns_test                              77.366037
return_std_test                            9.634095
average_reward_test                        4.786691
round_time_test              0 days 00:00:03.110344
round_time_total             0 days 00:12:16.653202
loss_total             29618402979799724363612160.0
loss_critic            37023003125518704534618112.0
loss_actor                   -13907410698633.216797
memory_size                                547.3575 

=== epoch 10/10 ==== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:45,  2.42it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:16<00:00,  2.71it/s]
episodes                                        125
episode_length                               15.936
returns                                   76.011059
return_std                                  2.17277
average_reward                             4.769846
round_time                   0 days 00:12:17.303086
episodes_test                                 125.0
episode_length_test                          15.992
returns_test                              76.308764
return_std_test                            4.890607
average_reward_test                        4.771751
round_time_test              0 days 00:00:03.079763
round_time_total             0 days 00:12:17.304163
loss_total             30754646066548817585504256.0
loss_critic            38443306999375392002801664.0
loss_actor                   -13950208671481.855469
memory_size                                   556.0 

=== epoch 10/10 ==== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:35,  2.45it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                        124
episode_length                            15.967742
returns                                   76.199201
return_std                                 3.670556
average_reward                             4.772047
round_time                   0 days 00:12:21.181252
episodes_test                                 124.0
episode_length_test                       16.056452
returns_test                              76.567555
return_std_test                            5.196308
average_reward_test                        4.768671
round_time_test              0 days 00:00:03.100997
round_time_total             0 days 00:12:21.182313
loss_total             30547613879590157935443968.0
loss_critic            38184516706733959545880576.0
loss_actor                   -13990932683161.599609
memory_size                                   556.0 

=== epoch 10/10 ==== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:46,  2.42it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                        125
episode_length                               15.824
returns                                   75.488464
return_std                                 2.654351
average_reward                             4.770486
round_time                   0 days 00:12:23.582360
episodes_test                                 125.0
episode_length_test                          15.928
returns_test                              76.037768
return_std_test                            4.441107
average_reward_test                        4.773863
round_time_test              0 days 00:00:03.107599
round_time_total             0 days 00:12:23.583439
loss_total             30832276281891447378018304.0
loss_critic            38540344717681023204196352.0
loss_actor                   -14041877554659.328125
memory_size                                   556.0 

=== epoch 10/10 ==== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:55,  2.39it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:24<00:00,  2.68it/s]
episodes                                        125
episode_length                                15.96
returns                                   76.266142
return_std                                 9.051655
average_reward                             4.778646
round_time                   0 days 00:12:25.401364
episodes_test                                 126.0
episode_length_test                       15.833333
returns_test                              75.527554
return_std_test                            2.247198
average_reward_test                        4.770208
round_time_test              0 days 00:00:03.124495
round_time_total             0 days 00:12:25.402431
loss_total             31724409996245291512102912.0
loss_critic            39655511815371154582929408.0
loss_actor                   -14181813659369.472656
memory_size                                   556.0 

=== epoch 10/10 ==== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:25,  2.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                        125
episode_length                               15.832
returns                                   75.496599
return_std                                 1.784525
average_reward                             4.768551
round_time                   0 days 00:12:25.660452
episodes_test                                 124.0
episode_length_test                        16.08871
returns_test                              76.927175
return_std_test                           10.905019
average_reward_test                        4.781559
round_time_test              0 days 00:00:03.107729
round_time_total             0 days 00:12:25.661515
loss_total             31897567089672451937796096.0
loss_critic            39871958223804396195545088.0
loss_actor                   -14243186074976.255859
memory_size                                   556.0 

=== epoch 10/10 ==== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:29,  2.30it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:15<00:00,  2.72it/s]
episodes                                        125
episode_length                               15.936
returns                                    75.92194
return_std                                 4.063816
average_reward                             4.764158
round_time                   0 days 00:12:16.041097
episodes_test                                 125.0
episode_length_test                          15.928
returns_test                              76.067089
return_std_test                            3.433933
average_reward_test                        4.775806
round_time_test              0 days 00:00:03.117390
round_time_total             0 days 00:12:16.042162
loss_total             32012109965670663617249280.0
loss_critic            40015136785367437049593856.0
loss_actor                   -14310793603973.119141
memory_size                                   556.0 

=== epoch 10/10 ==== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:57,  2.38it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:22<00:00,  2.69it/s]
episodes                                        125
episode_length                               15.832
returns                                   75.513139
return_std                                 2.722521
average_reward                             4.769695
round_time                   0 days 00:12:23.403350
episodes_test                                 126.0
episode_length_test                       15.865079
returns_test                              75.699513
return_std_test                            2.056096
average_reward_test                        4.771513
round_time_test              0 days 00:00:03.145877
round_time_total             0 days 00:12:23.404410
loss_total             33041915621580218262618112.0
loss_critic            41302393855974958830714880.0
loss_actor                   -14428555583160.320312
memory_size                                   556.0 

=== epoch 10/10 ==== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:52,  2.40it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                        125
episode_length                               15.824
returns                                   75.495758
return_std                                  1.90061
average_reward                             4.770814
round_time                   0 days 00:12:21.258693
episodes_test                                 127.0
episode_length_test                       15.716535
returns_test                              74.952372
return_std_test                            2.180153
average_reward_test                        4.769098
round_time_test              0 days 00:00:03.131805
round_time_total             0 days 00:12:21.259753
loss_total             32259975362060645434392576.0
loss_critic            40324968525954995512672256.0
loss_actor                   -14448397065912.320312
memory_size                                   556.0 

=== epoch 10/10 ==== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:11,  2.34it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:17<00:00,  2.71it/s]
episodes                                        125
episode_length                               15.872
returns                                   75.750729
return_std                                 4.673353
average_reward                             4.772507
round_time                   0 days 00:12:17.848299
episodes_test                                 126.0
episode_length_test                       15.769841
returns_test                               75.16703
return_std_test                             3.55676
average_reward_test                        4.766568
round_time_test              0 days 00:00:03.058476
round_time_total             0 days 00:12:17.849374
loss_total             32486488942721527984422912.0
loss_critic            40608110545303892096188416.0
loss_actor                   -14531267996942.335938
memory_size                                   556.0 

=== epoch 10/10 ==== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:00,  2.38it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:14<00:00,  2.72it/s]
episodes                                        124
episode_length                            15.991935
returns                                    76.30895
return_std                                 2.851142
average_reward                             4.771623
round_time                   0 days 00:12:14.873735
episodes_test                                 124.0
episode_length_test                       16.008065
returns_test                              76.412025
return_std_test                            7.752112
average_reward_test                        4.773371
round_time_test              0 days 00:00:03.080905
round_time_total             0 days 00:12:14.874822
loss_total             33048256185452397655490560.0
loss_critic            41310319527236347067629568.0
loss_actor                   -14639342902837.248047
memory_size                                   556.0 

=== epoch 10/10 ==== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:39,  2.44it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:15<00:00,  2.72it/s]
episodes                                        125
episode_length                               15.872
returns                                   75.669553
return_std                                 1.792287
average_reward                             4.767448
round_time                   0 days 00:12:16.368980
episodes_test                                 125.0
episode_length_test                          15.904
returns_test                              75.848015
return_std_test                            1.762318
average_reward_test                        4.769173
round_time_test              0 days 00:00:03.090329
round_time_total             0 days 00:12:16.370032
loss_total             33733151352580398290829312.0
loss_critic            42166438489460996388683776.0
loss_actor                   -14711296022806.527344
memory_size                                   556.0 

=== epoch 10/10 ==== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:52,  2.24it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                        126
episode_length                            15.849206
returns                                   75.693836
return_std                                 4.096048
average_reward                             4.775979
round_time                   0 days 00:12:21.409736
episodes_test                                 127.0
episode_length_test                       15.708661
returns_test                              74.948779
return_std_test                            2.277695
average_reward_test                        4.771292
round_time_test              0 days 00:00:03.149636
round_time_total             0 days 00:12:21.410784
loss_total             33693343033873515067473920.0
loss_critic            42116678052598632178253824.0
loss_actor                   -14745500921626.623047
memory_size                                   556.0 

=== epoch 10/10 ==== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:26,  2.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:10<00:00,  2.74it/s]
episodes                                        124
episode_length                                 16.0
returns                                   76.384688
return_std                                   4.0851
average_reward                             4.774033
round_time                   0 days 00:12:10.802205
episodes_test                                 124.0
episode_length_test                       16.040323
returns_test                              76.523393
return_std_test                            5.894502
average_reward_test                        4.770683
round_time_test              0 days 00:00:03.082804
round_time_total             0 days 00:12:10.803265
loss_total             34719020518867548449538048.0
loss_critic            43398774919793925699928064.0
loss_actor                   -14871592436760.576172
memory_size                                   556.0 

=== epoch 10/10 ==== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:23,  2.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:11<00:00,  2.73it/s]
episodes                                        124
episode_length                                 16.0
returns                                   76.365994
return_std                                 2.429254
average_reward                             4.772791
round_time                   0 days 00:12:12.167127
episodes_test                                 124.0
episode_length_test                       16.048387
returns_test                              76.502067
return_std_test                            5.599838
average_reward_test                        4.766999
round_time_test              0 days 00:00:03.083714
round_time_total             0 days 00:12:12.168195
loss_total             35150787340711152551198720.0
loss_critic            43938483461798188325797888.0
loss_actor                    -14972840371027.96875
memory_size                                   556.0 

=== epoch 10/10 ==== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:33,  2.28it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:05<00:00,  2.76it/s]
episodes                                        125
episode_length                                15.84
returns                                   75.554581
return_std                                 1.918064
average_reward                             4.770129
round_time                   0 days 00:12:05.878748
episodes_test                                 125.0
episode_length_test                          15.952
returns_test                              76.094075
return_std_test                             2.76067
average_reward_test                        4.770239
round_time_test              0 days 00:00:03.063540
round_time_total             0 days 00:12:05.879808
loss_total             36122857234473602965831680.0
loss_critic            45153570779857964960317440.0
loss_actor                        -15069213294592.0
memory_size                                   556.0 

=== epoch 10/10 ==== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:06,  2.36it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:09<00:00,  2.74it/s]
episodes                                        125
episode_length                               15.872
returns                                   75.733554
return_std                                 2.017308
average_reward                             4.771612
round_time                   0 days 00:12:09.885219
episodes_test                                 123.0
episode_length_test                       16.146341
returns_test                              77.212062
return_std_test                            9.181043
average_reward_test                        4.781989
round_time_test              0 days 00:00:03.090103
round_time_total             0 days 00:12:09.886271
loss_total             36783199616336492773769216.0
loss_critic            45978998729516463016640512.0
loss_actor                   -15150601311617.023438
memory_size                                   556.0 

=== epoch 10/10 ==== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:30,  2.47it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:08<00:00,  2.74it/s]
episodes                                        125
episode_length                               15.824
returns                                   75.450262
return_std                                  3.33652
average_reward                             4.767946
round_time                   0 days 00:12:09.397763
episodes_test                                 122.0
episode_length_test                       16.278689
returns_test                              77.848585
return_std_test                           14.628973
average_reward_test                        4.782177
round_time_test              0 days 00:00:03.069980
round_time_total             0 days 00:12:09.398818
loss_total             36609363036514920318369792.0
loss_critic            45761703022609778329255936.0
loss_actor                   -15230933745008.640625
memory_size                                   556.0 

=== epoch 10/10 ==== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:24,  2.48it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:05<00:00,  2.76it/s]
episodes                                        125
episode_length                               15.912
returns                                     75.9062
return_std                                 1.380206
average_reward                             4.770417
round_time                   0 days 00:12:05.878088
episodes_test                                 125.0
episode_length_test                          15.936
returns_test                              76.025424
return_std_test                            2.475246
average_reward_test                        4.770693
round_time_test              0 days 00:00:03.078442
round_time_total             0 days 00:12:05.879140
loss_total             37183185529139853281722368.0
loss_critic            46478981157846494623039488.0
loss_actor                   -15328246162784.255859
memory_size                                   556.0 

=== epoch 10/10 ==== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:48,  2.41it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:09<00:00,  2.74it/s]
episodes                                        124
episode_length                            15.951613
returns                                   76.070973
return_std                                 1.462619
average_reward                             4.768728
round_time                   0 days 00:12:09.680154
episodes_test                                 124.0
episode_length_test                       16.032258
returns_test                              76.463381
return_std_test                            2.798927
average_reward_test                        4.769312
round_time_test              0 days 00:00:03.059055
round_time_total             0 days 00:12:09.681208
loss_total             37503693951778755140648960.0
loss_critic            46879616606449428936523776.0
loss_actor                   -15405806065811.455078
memory_size                                   556.0 

=== epoch 10/10 ==== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:43,  2.43it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:10<00:00,  2.74it/s]
episodes                                        125
episode_length                               15.904
returns                                   75.854255
return_std                                  1.42708
average_reward                             4.769525
round_time                   0 days 00:12:11.094019
episodes_test                                 124.0
episode_length_test                       16.016129
returns_test                              76.447092
return_std_test                            2.445677
average_reward_test                        4.773015
round_time_test              0 days 00:00:03.058846
round_time_total             0 days 00:12:11.095076
loss_total             37727289929680415537233920.0
loss_critic            47159111590355712778174464.0
loss_actor                   -15480121908002.816406
memory_size                                   556.0 

=== epoch 10/10 ==== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:44,  2.42it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:06<00:00,  2.75it/s]
episodes                                        124
episode_length                            16.056452
returns                                   76.677014
return_std                                 6.214463
average_reward                             4.775537
round_time                   0 days 00:12:06.955571
episodes_test                                 125.0
episode_length_test                          15.944
returns_test                              76.091642
return_std_test                            2.238556
average_reward_test                        4.772457
round_time_test              0 days 00:00:03.079464
round_time_total             0 days 00:12:06.956622
loss_total             38301854911248303097118720.0
loss_critic            47877317785898465204633600.0
loss_actor                   -15518181575098.367188
memory_size                                   556.0 

=== epoch 10/10 ==== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:56,  2.39it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:07<00:00,  2.75it/s]
episodes                                        121
episode_length                            16.330579
returns                                   78.060341
return_std                                11.869675
average_reward                             4.779849
round_time                   0 days 00:12:07.898157
episodes_test                                 125.0
episode_length_test                            16.0
returns_test                              76.326477
return_std_test                            2.225625
average_reward_test                        4.770405
round_time_test              0 days 00:00:03.095490
round_time_total             0 days 00:12:07.899205
loss_total             39045075974348302006091776.0
loss_critic            48806344084509270994845696.0
loss_actor                   -15665318258016.255859
memory_size                                 556.516 

=== epoch 10/10 ==== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:53,  2.40it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:08<00:00,  2.74it/s]
episodes                                        124
episode_length                                 16.0
returns                                   76.368853
return_std                                 4.490537
average_reward                             4.772965
round_time                   0 days 00:12:09.456133
episodes_test                                 124.0
episode_length_test                       16.056452
returns_test                              76.562651
return_std_test                            5.973347
average_reward_test                        4.768437
round_time_test              0 days 00:00:03.072922
round_time_total             0 days 00:12:09.457189
loss_total             39539984953150564085006336.0
loss_critic            49424980276883221456617472.0
loss_actor                   -15667005640671.232422
memory_size                                   559.0 

=== epoch 10/10 ==== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:28,  2.47it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:01<00:00,  2.77it/s]
episodes                                        125
episode_length                                15.92
returns                                   75.975814
return_std                                 3.085774
average_reward                             4.772396
round_time                   0 days 00:12:01.869630
episodes_test                                 125.0
episode_length_test                          15.984
returns_test                              76.351483
return_std_test                            5.514259
average_reward_test                        4.776924
round_time_test              0 days 00:00:03.076160
round_time_total             0 days 00:12:01.870687
loss_total             38432347276247591467089920.0
loss_critic            48040433241427004708880384.0
loss_actor                   -15646743633854.464844
memory_size                                   559.0 

=== epoch 10/10 ==== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:45,  2.42it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:08<00:00,  2.74it/s]
episodes                                        125
episode_length                               15.864
returns                                   75.678072
return_std                                 1.629707
average_reward                             4.770428
round_time                   0 days 00:12:09.137806
episodes_test                                 123.0
episode_length_test                       16.203252
returns_test                              77.307205
return_std_test                            8.925155
average_reward_test                        4.771126
round_time_test              0 days 00:00:03.083216
round_time_total             0 days 00:12:09.138850
loss_total             38085270757518305453408256.0
loss_critic            47606587626017766980452352.0
loss_actor                    -15734620021587.96875
memory_size                                   559.0 

=== epoch 10/10 ==== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:24,  2.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:08<00:00,  2.75it/s]
episodes                                        125
episode_length                                15.88
returns                                   75.760141
return_std                                 2.007771
average_reward                             4.770794
round_time                   0 days 00:12:08.955367
episodes_test                                 126.0
episode_length_test                       15.777778
returns_test                              75.235642
return_std_test                            1.974216
average_reward_test                        4.768471
round_time_test              0 days 00:00:03.097654
round_time_total             0 days 00:12:08.956445
loss_total             39445438561018730365583360.0
loss_critic            49306797361370104135155712.0
loss_actor                   -15856709775917.056641
memory_size                                   559.0 

=== epoch 10/10 ==== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:09,  2.35it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:06<00:00,  2.75it/s]
episodes                                        125
episode_length                               15.904
returns                                   75.847013
return_std                                 2.566844
average_reward                             4.769005
round_time                   0 days 00:12:07.362821
episodes_test                                 125.0
episode_length_test                          15.928
returns_test                              76.019863
return_std_test                            3.535992
average_reward_test                        4.772794
round_time_test              0 days 00:00:03.085186
round_time_total             0 days 00:12:07.363870
loss_total             39605883272152462347206656.0
loss_critic            49507353196244067141287936.0
loss_actor                   -15907470293598.207031
memory_size                                   559.0 

=== epoch 10/10 ==== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:09,  2.35it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:04<00:00,  2.76it/s]
episodes                                        125
episode_length                               15.848
returns                                   75.633281
return_std                                  3.32847
average_reward                             4.772497
round_time                   0 days 00:12:05.237388
episodes_test                                 125.0
episode_length_test                           15.92
returns_test                              76.044274
return_std_test                            5.353153
average_reward_test                        4.776651
round_time_test              0 days 00:00:03.060013
round_time_total             0 days 00:12:05.238427
loss_total             40837394928958194443091968.0
loss_critic            51046742703552321038581760.0
loss_actor                   -16021627978383.359375
memory_size                                   559.0 

=== epoch 10/10 ==== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:43,  2.43it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:01<00:00,  2.77it/s]
episodes                                        125
episode_length                               15.864
returns                                   75.733512
return_std                                 3.672738
average_reward                             4.773908
round_time                   0 days 00:12:02.024928
episodes_test                                 123.0
episode_length_test                       16.162602
returns_test                              77.370956
return_std_test                           15.448442
average_reward_test                        4.786948
round_time_test              0 days 00:00:03.129912
round_time_total             0 days 00:12:02.025973
loss_total             41293887828239502347862016.0
loss_critic            51617358850568267868667904.0
loss_actor                   -16129073039802.367188
memory_size                                   559.0 

=== epoch 10/10 ==== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:55,  2.39it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:08<00:00,  2.75it/s]
episodes                                        126
episode_length                            15.785714
returns                                   75.359204
return_std                                   3.6102
average_reward                             4.773731
round_time                   0 days 00:12:08.746500
episodes_test                                 126.0
episode_length_test                       15.873016
returns_test                              75.729173
return_std_test                            5.631693
average_reward_test                        4.770938
round_time_test              0 days 00:00:03.043867
round_time_total             0 days 00:12:08.747539
loss_total             40993965061755334660456448.0
loss_critic            51242455413215642301497344.0
loss_actor                   -16176385799749.632812
memory_size                                   559.0 

=== epoch 10/10 ==== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:47,  2.41it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:09<00:00,  2.74it/s]
episodes                                        126
episode_length                            15.849206
returns                                   75.695199
return_std                                   8.0419
average_reward                             4.776154
round_time                   0 days 00:12:09.995356
episodes_test                                 125.0
episode_length_test                          15.928
returns_test                              76.035207
return_std_test                            6.002035
average_reward_test                        4.774284
round_time_test              0 days 00:00:03.059509
round_time_total             0 days 00:12:09.996408
loss_total             41595153642207766971940864.0
loss_critic            51993941135322419757580288.0
loss_actor                   -16245707215208.447266
memory_size                                   559.0 

=== epoch 10/10 ==== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:58,  2.38it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:08<00:00,  2.75it/s]
episodes                                        124
episode_length                            15.943548
returns                                   76.161434
return_std                                 5.114697
average_reward                             4.776756
round_time                   0 days 00:12:08.661710
episodes_test                                 125.0
episode_length_test                          15.912
returns_test                              75.924182
return_std_test                            7.826273
average_reward_test                        4.771529
round_time_test              0 days 00:00:03.072240
round_time_total             0 days 00:12:08.662756
loss_total             42935938435986099424722944.0
loss_critic            53669922099586985864724480.0
loss_actor                        -16356572266496.0
memory_size                                   559.0 

=== epoch 10/10 ==== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:16,  2.33it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:08<00:00,  2.74it/s]
episodes                                        126
episode_length                            15.769841
returns                                   75.194478
return_std                                  2.38061
average_reward                             4.768286
round_time                   0 days 00:12:09.472015
episodes_test                                 126.0
episode_length_test                       15.857143
returns_test                              75.651555
return_std_test                            1.958285
average_reward_test                        4.770969
round_time_test              0 days 00:00:03.104907
round_time_total             0 days 00:12:09.473074
loss_total             42951809159119362498494464.0
loss_critic            53689760521373856326221824.0
loss_actor                   -16489386705682.431641
memory_size                                   559.0 

=== epoch 10/10 ==== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:18,  2.33it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:08<00:00,  2.75it/s]
episodes                                        124
episode_length                            16.048387
returns                                   76.618942
return_std                                 4.685048
average_reward                             4.774211
round_time                   0 days 00:12:08.736382
episodes_test                                 126.0
episode_length_test                       15.873016
returns_test                              75.786446
return_std_test                            6.073994
average_reward_test                        4.774546
round_time_test              0 days 00:00:03.029490
round_time_total             0 days 00:12:08.737425
loss_total             44137750347147362212773888.0
loss_critic            55172186944727557173411840.0
loss_actor                   -16580583247839.232422
memory_size                                   559.0 

=== epoch 10/10 ==== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:59,  2.38it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:07<00:00,  2.75it/s]
episodes                                        124
episode_length                                 16.0
returns                                   76.363864
return_std                                 6.039085
average_reward                             4.772688
round_time                   0 days 00:12:07.528714
episodes_test                                 125.0
episode_length_test                          15.984
returns_test                              76.222025
return_std_test                             1.89842
average_reward_test                         4.76876
round_time_test              0 days 00:00:03.054083
round_time_total             0 days 00:12:07.529758
loss_total             44817952175473673552003072.0
loss_critic            56022439233594207133237248.0
loss_actor                   -16675104580173.824219
memory_size                                   559.0 

=== epoch 10/10 ==== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:24,  2.31it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:08<00:00,  2.75it/s]
episodes                                        124
episode_length                            15.959677
returns                                   76.151804
return_std                                 3.598357
average_reward                             4.771441
round_time                   0 days 00:12:08.638540
episodes_test                                 126.0
episode_length_test                       15.873016
returns_test                              75.682538
return_std_test                            3.030989
average_reward_test                           4.768
round_time_test              0 days 00:00:03.117718
round_time_total             0 days 00:12:08.639601
loss_total             44174963896680759284465664.0
loss_critic            55218703826015833408143360.0
loss_actor                   -16726797905821.695312
memory_size                                   559.0 

=== epoch 10/10 ==== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:54,  2.39it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:04<00:00,  2.76it/s]
episodes                                        123
episode_length                            16.105691
returns                                   76.805925
return_std                                 9.252386
average_reward                             4.768817
round_time                   0 days 00:12:05.338115
episodes_test                                 124.0
episode_length_test                       16.008065
returns_test                              76.356996
return_std_test                            1.538991
average_reward_test                        4.769777
round_time_test              0 days 00:00:03.085475
round_time_total             0 days 00:12:05.339171
loss_total             45850389300354064490430464.0
loss_critic            57312985592713135182053376.0
loss_actor                   -16761376137019.392578
memory_size                                 561.985 

=== epoch 10/10 ==== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:22,  2.49it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:09<00:00,  2.74it/s]
episodes                                        124
episode_length                            16.064516
returns                                   76.690908
return_std                                 6.410476
average_reward                             4.773912
round_time                   0 days 00:12:10.203417
episodes_test                                 124.0
episode_length_test                       16.056452
returns_test                              76.594569
return_std_test                            3.752672
average_reward_test                        4.770386
round_time_test              0 days 00:00:03.089485
round_time_total             0 days 00:12:10.204494
loss_total             46080626712558963777339392.0
loss_critic            57600782344710673773101056.0
loss_actor                   -16817341212917.759766
memory_size                                   563.0 

=== epoch 10/10 ==== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:39,  2.44it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:12<00:00,  2.73it/s]
episodes                                        124
episode_length                            15.975806
returns                                   76.183801
return_std                                 4.187342
average_reward                             4.768691
round_time                   0 days 00:12:12.534492
episodes_test                                 126.0
episode_length_test                       15.865079
returns_test                               75.65662
return_std_test                            1.643495
average_reward_test                        4.768816
round_time_test              0 days 00:00:03.107406
round_time_total             0 days 00:12:12.535527
loss_total             45150375585882235744550912.0
loss_critic            56437968445011666770329600.0
loss_actor                   -16837442518646.783203
memory_size                                   563.0 

=== epoch 10/10 ==== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:52,  2.40it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:04<00:00,  2.76it/s]
episodes                                        126
episode_length                            15.753968
returns                                   75.137492
return_std                                 2.037783
average_reward                             4.769473
round_time                   0 days 00:12:05.068321
episodes_test                                 126.0
episode_length_test                       15.777778
returns_test                              75.258959
return_std_test                            2.075475
average_reward_test                        4.769971
round_time_test              0 days 00:00:03.047043
round_time_total             0 days 00:12:05.069365
loss_total             44236912446687829121040384.0
loss_critic            55296139525630351707734016.0
loss_actor                   -16803391704399.871094
memory_size                                   563.0 

=== epoch 10/10 ==== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:40,  2.43it/s]/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:06<00:00,  2.75it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
episodes                                        125
episode_length                               15.856
returns                                   75.587188
return_std                                 3.432819
average_reward                             4.767196
round_time                   0 days 00:12:07.192986
episodes_test                                 126.0
episode_length_test                       15.809524
returns_test                              75.393465
return_std_test                            1.881896
average_reward_test                        4.768975
round_time_test              0 days 00:00:03.066089
round_time_total             0 days 00:12:07.194044
loss_total             45030667891327018739957760.0
loss_critic            56288333829699950822293504.0
loss_actor                   -16873057722826.751953
memory_size                                   563.0 


