/home/anon/.local/share/miniforge3/envs/dcac/lib/python3.12/site-packages/gymnasium/envs/registration.py:512: DeprecationWarning: WARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.
  logger.deprecation(
=== specification ====================================================
+: rlrd.training:Training
epochs: 10
rounds: 50
steps: 2000
stats_window: null
seed: 0
tag: ''
Env:
   +: rlrd.envs:RandomDelayEnv
   seed_val: 0
   id: Walker2d-v4
   frame_skip: 0
   min_observation_delay: 0
   sup_observation_delay: 1
   min_action_delay: 0
   sup_action_delay: 1
   real_world_sampler: 7
   action_noise: 0.05
Test:
   +: rlrd.testing:Test
   workers: 1
   number: 1
   device: cpu
Agent:
   +: rlrd.dcac:Agent
   batchsize: 128
   memory_size: 1000000
   lr: 0.0003
   discount: 0.99
   target_update: 0.005
   reward_scale: 5.0
   entropy_scale: 1.0
   start_training: 10000
   device: cpu
   training_steps: 1.0
   loss_alpha: 0.2
   rtac: false
   Model:
      +: rlrd.dcac_models:Mlp
      hidden_units: 256
      num_critics: 2
      act_delay: true
      obs_delay: true
   OutputNorm:
      +: rlrd.nn:PopArt
      beta: 0.0003
      zero_debias: true
      start_pop: 8
__format_version__: '3'
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>

=== epoch 1/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [00:02<00:00, 727.07it/s]
episodes                                   93
episode_length                      21.258065
returns                              1.251449
return_std                           6.643733
average_reward                       0.058081
round_time             0 days 00:00:02.770700
episodes_test                            19.0
episode_length_test                     104.0
returns_test                        85.728291
return_std_test                    101.723987
average_reward_test                  0.822014
round_time_test        0 days 00:00:04.144711
round_time_total       0 days 00:00:07.339332 

=== epoch 1/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [00:02<00:00, 796.19it/s]
episodes                                   93
episode_length                      20.892473
returns                                3.4427
return_std                           7.829098
average_reward                       0.172481
round_time             0 days 00:00:04.216011
episodes_test                            19.0
episode_length_test                 98.052632
returns_test                        57.321583
return_std_test                     87.907972
average_reward_test                  0.594885
round_time_test        0 days 00:00:03.351040
round_time_total       0 days 00:00:06.264710 

=== epoch 1/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [00:02<00:00, 792.78it/s]
episodes                                   83
episode_length                      23.891566
returns                              2.959642
return_std                            8.35725
average_reward                       0.123355
round_time             0 days 00:00:04.310064
episodes_test                            23.0
episode_length_test                 86.826087
returns_test                        42.680167
return_std_test                     40.269547
average_reward_test                  0.491641
round_time_test        0 days 00:00:03.656870
round_time_total       0 days 00:00:06.436853 

=== epoch 1/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [00:02<00:00, 730.62it/s]
episodes                                   95
episode_length                      20.578947
returns                              1.831706
return_std                           6.356887
average_reward                       0.091364
round_time             0 days 00:00:04.513927
episodes_test                            18.0
episode_length_test                     101.0
returns_test                        59.503633
return_std_test                     60.676597
average_reward_test                   0.60196
round_time_test        0 days 00:00:03.881729
round_time_total       0 days 00:00:06.864188 

=== epoch 1/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [00:02<00:00, 787.99it/s]
episodes                                   93
episode_length                      21.225806
returns                              2.778808
return_std                           7.468666
average_reward                       0.128766
round_time             0 days 00:00:04.286561
episodes_test                            15.0
episode_length_test                125.733333
returns_test                        87.776153
return_std_test                     93.308246
average_reward_test                  0.679874
round_time_test        0 days 00:00:03.357840
round_time_total       0 days 00:00:06.391028 

=== epoch 1/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
/home/anon/20260123-icml-dcac/dcac/rlrd/nn.py:41: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly.  To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage()
  assert b.storage().data_ptr() == a.storage().data_ptr()
100%|██████████| 2000/2000 [17:46<00:00,  1.88it/s]
starting training
episodes                                   17
episode_length                     106.411765
returns                             65.944329
return_std                         129.568712
average_reward                        0.65345
round_time             0 days 00:17:48.218007
episodes_test                            17.0
episode_length_test                108.882353
returns_test                        75.108863
return_std_test                     87.155338
average_reward_test                  0.713271
round_time_test        0 days 00:00:03.769978
round_time_total       0 days 00:17:48.220160
loss_total                          69.038397
loss_critic                        107.039315
loss_actor                          -82.96528
memory_size                         1088.9365 

=== epoch 1/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [18:14<00:00,  1.83it/s]
episodes                                   39
episode_length                      47.358974
returns                             27.687408
return_std                          41.781998
average_reward                        0.66588
round_time             0 days 00:18:15.744313
episodes_test                             6.0
episode_length_test                     178.5
returns_test                       253.016131
return_std_test                     56.423295
average_reward_test                  1.228369
round_time_test        0 days 00:00:03.255564
round_time_total       0 days 00:18:15.746449
loss_total                         209.882305
loss_critic                        318.918802
loss_actor                        -226.263698
memory_size                          2495.291 

=== epoch 1/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [22:01<00:00,  1.51it/s]
episodes                                  122
episode_length                       16.02459
returns                              0.829915
return_std                           5.681341
average_reward                       0.061663
round_time             0 days 00:22:03.240858
episodes_test                            67.0
episode_length_test                 28.656716
returns_test                        12.077313
return_std_test                     12.183922
average_reward_test                  0.433924
round_time_test        0 days 00:00:03.502565
round_time_total       0 days 00:22:03.243997
loss_total                         113.373315
loss_critic                        239.674997
loss_actor                        -391.833423
memory_size                          2892.011 

=== epoch 1/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:19<00:00,  1.56it/s]
episodes                                  180
episode_length                      11.033333
returns                             -3.327536
return_std                           1.744261
average_reward                      -0.300015
round_time             0 days 00:21:21.734278
episodes_test                           159.0
episode_length_test                 12.490566
returns_test                        -2.767285
return_std_test                      2.252439
average_reward_test                 -0.218981
round_time_test        0 days 00:00:03.233592
round_time_total       0 days 00:21:21.736931
loss_total                          31.086791
loss_critic                        284.934424
loss_actor                        -984.303745
memory_size                            2896.0 

=== epoch 1/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [24:06<00:00,  1.38it/s]
episodes                                  194
episode_length                      10.268041
returns                             -3.998322
return_std                           1.523872
average_reward                      -0.390531
round_time             0 days 00:24:08.347814
episodes_test                           192.0
episode_length_test                 10.411458
returns_test                        -3.774708
return_std_test                      1.341978
average_reward_test                 -0.361854
round_time_test        0 days 00:00:03.916472
round_time_total       0 days 00:24:08.350049
loss_total                         388.682824
loss_critic                       1342.928032
loss_actor                       -3428.298033
memory_size                            2896.0 

=== epoch 1/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:46<00:00,  1.53it/s]
episodes                                  196
episode_length                      10.137755
returns                              -4.22694
return_std                           1.699704
average_reward                      -0.416701
round_time             0 days 00:21:47.824748
episodes_test                           193.0
episode_length_test                 10.316062
returns_test                        -4.053145
return_std_test                      1.282928
average_reward_test                 -0.392987
round_time_test        0 days 00:00:03.207008
round_time_total       0 days 00:21:47.826603
loss_total                       11580.304485
loss_critic                      17768.922219
loss_actor                      -13174.167462
memory_size                            2896.0 

=== epoch 1/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:36<00:00,  1.62it/s]
episodes                                  197
episode_length                      10.091371
returns                             -4.327379
return_std                           1.773775
average_reward                      -0.428918
round_time             0 days 00:20:37.191039
episodes_test                           199.0
episode_length_test                 10.015075
returns_test                        -3.976461
return_std_test                      1.495979
average_reward_test                  -0.39547
round_time_test        0 days 00:00:03.722103
round_time_total       0 days 00:20:37.193829
loss_total                       481424.08934
loss_critic                     614480.203459
loss_actor                      -50800.406463
memory_size                            2896.0 

=== epoch 1/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [23:11<00:00,  1.44it/s]
episodes                                  196
episode_length                      10.122449
returns                             -4.109816
return_std                           1.579248
average_reward                      -0.408603
round_time             0 days 00:23:13.936735
episodes_test                           195.0
episode_length_test                 10.225641
returns_test                        -4.130203
return_std_test                      1.516768
average_reward_test                 -0.403084
round_time_test        0 days 00:00:04.438525
round_time_total       0 days 00:23:13.938594
loss_total                    25816417.040562
loss_critic                   32317496.368813
loss_actor                     -187902.542625
memory_size                            2896.0 

=== epoch 1/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [22:34<00:00,  1.48it/s]
episodes                                  191
episode_length                      10.450262
returns                             -4.137098
return_std                           1.820413
average_reward                      -0.395778
round_time             0 days 00:22:35.981888
episodes_test                           196.0
episode_length_test                 10.173469
returns_test                        -4.355519
return_std_test                       1.25539
average_reward_test                 -0.427012
round_time_test        0 days 00:00:04.007222
round_time_total       0 days 00:22:35.983633
loss_total                      840116176.542
loss_critic                    1050293697.838
loss_actor                     -593978.536016
memory_size                            2896.0 

=== epoch 1/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:47<00:00,  1.60it/s]
episodes                                  178
episode_length                      11.151685
returns                             -4.037274
return_std                           1.490892
average_reward                      -0.361268
round_time             0 days 00:20:49.093700
episodes_test                           177.0
episode_length_test                 11.282486
returns_test                        -3.930374
return_std_test                      1.698559
average_reward_test                 -0.347059
round_time_test        0 days 00:00:03.292913
round_time_total       0 days 00:20:49.096100
loss_total                 10477472633.568001
loss_critic                13097205975.360001
loss_actor                    -1461665.484125
memory_size                            2896.0 

=== epoch 1/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [18:50<00:00,  1.77it/s]
episodes                                  141
episode_length                      14.106383
returns                             -5.219861
return_std                           3.076399
average_reward                       -0.36742
round_time             0 days 00:18:51.928008
episodes_test                           167.0
episode_length_test                 11.976048
returns_test                        -4.232113
return_std_test                      1.640084
average_reward_test                 -0.353381
round_time_test        0 days 00:00:03.154907
round_time_total       0 days 00:18:51.929920
loss_total                 78077209145.856003
loss_critic                   97597239002.112
loss_actor                    -2917656.078062
memory_size                          2919.163 

=== epoch 1/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:25<00:00,  1.72it/s]
episodes                                   62
episode_length                      31.983871
returns                             -1.460939
return_std                          22.031337
average_reward                      -0.051366
round_time             0 days 00:19:26.293521
episodes_test                           139.0
episode_length_test                 14.338129
returns_test                        -5.336079
return_std_test                      2.062876
average_reward_test                 -0.367937
round_time_test        0 days 00:00:04.077527
round_time_total       0 days 00:19:26.295824
loss_total                384443334895.616028
loss_critic               480555319582.719971
loss_actor                    -4635700.538375
memory_size                          3118.017 

=== epoch 1/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:45<00:00,  1.69it/s]
episodes                                   71
episode_length                      27.816901
returns                              2.931468
return_std                          15.189968
average_reward                       0.106174
round_time             0 days 00:19:46.542757
episodes_test                            24.0
episode_length_test                 82.208333
returns_test                         1.398717
return_std_test                     68.779263
average_reward_test                  0.024919
round_time_test        0 days 00:00:02.940466
round_time_total       0 days 00:19:46.545195
loss_total                616112091922.432007
loss_critic               770141546217.472046
loss_actor                      -5777384.9135
memory_size                          4146.111 

=== epoch 1/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:04<00:00,  1.75it/s]
episodes                                  186
episode_length                      10.704301
returns                             -1.291211
return_std                           2.631742
average_reward                      -0.118299
round_time             0 days 00:19:06.443201
episodes_test                           158.0
episode_length_test                 12.588608
returns_test                         0.747177
return_std_test                      5.221449
average_reward_test                  0.060179
round_time_test        0 days 00:00:03.946250
round_time_total       0 days 00:19:06.445741
loss_total                814341370396.671997
loss_critic              1017928431968.255981
loss_actor                     -6946095.05075
memory_size                            4471.0 

=== epoch 1/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [18:56<00:00,  1.76it/s]
episodes                                  195
episode_length                      10.230769
returns                             -2.343319
return_std                           2.335718
average_reward                      -0.230126
round_time             0 days 00:18:57.752848
episodes_test                           192.0
episode_length_test                 10.401042
returns_test                        -2.185827
return_std_test                      2.316869
average_reward_test                 -0.209408
round_time_test        0 days 00:00:04.165431
round_time_total       0 days 00:18:57.754762
loss_total               1197278457757.696045
loss_critic              1496600186970.112061
loss_actor                       -8555575.846
memory_size                            4471.0 

=== epoch 1/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [18:58<00:00,  1.76it/s]
episodes                                  195
episode_length                      10.241026
returns                             -2.557473
return_std                           2.096718
average_reward                      -0.248788
round_time             0 days 00:19:00.198936
episodes_test                           193.0
episode_length_test                 10.362694
returns_test                          -2.5102
return_std_test                      1.979682
average_reward_test                 -0.242234
round_time_test        0 days 00:00:03.349017
round_time_total       0 days 00:19:00.200545
loss_total               1716032120627.199951
loss_critic              2145042741641.216064
loss_actor                      -10511763.682
memory_size                            4471.0 

=== epoch 1/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [18:42<00:00,  1.78it/s]
episodes                                  201
episode_length                       9.900498
returns                             -3.295258
return_std                            1.95296
average_reward                      -0.332947
round_time             0 days 00:18:42.989478
episodes_test                           196.0
episode_length_test                 10.168367
returns_test                        -2.988978
return_std_test                      1.822926
average_reward_test                 -0.291008
round_time_test        0 days 00:00:03.175403
round_time_total       0 days 00:18:42.991455
loss_total               2587213237354.496094
loss_critic              3234019720396.799805
loss_actor                     -12924172.8265
memory_size                            4471.0 

=== epoch 1/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [18:49<00:00,  1.77it/s]
episodes                                  194
episode_length                      10.257732
returns                             -3.070784
return_std                           1.588427
average_reward                       -0.29789
round_time             0 days 00:18:50.596017
episodes_test                           194.0
episode_length_test                  10.28866
returns_test                        -2.843471
return_std_test                      1.853827
average_reward_test                 -0.274936
round_time_test        0 days 00:00:04.006998
round_time_total       0 days 00:18:50.597905
loss_total               3907001511084.032227
loss_critic              4883755777064.959961
loss_actor                      -15925108.661
memory_size                            4471.0 

=== epoch 1/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [18:52<00:00,  1.77it/s]
episodes                                  185
episode_length                      10.783784
returns                             -2.926801
return_std                           1.857325
average_reward                      -0.273413
round_time             0 days 00:18:53.621178
episodes_test                           190.0
episode_length_test                 10.521053
returns_test                        -2.966713
return_std_test                      1.772025
average_reward_test                 -0.281378
round_time_test        0 days 00:00:03.729731
round_time_total       0 days 00:18:53.623027
loss_total               5860982755983.360352
loss_critic              7326233225986.047852
loss_actor                      -19524386.412
memory_size                           4472.41 

=== epoch 1/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:22<00:00,  1.72it/s]
episodes                                  177
episode_length                      11.248588
returns                             -2.612594
return_std                           2.129346
average_reward                      -0.230893
round_time             0 days 00:19:23.700623
episodes_test                           183.0
episode_length_test                 10.918033
returns_test                         -2.72132
return_std_test                      1.725181
average_reward_test                  -0.24832
round_time_test        0 days 00:00:04.301901
round_time_total       0 days 00:19:23.702338
loss_total               8896493729349.632812
loss_critic             11120622872428.544922
loss_actor                      -23780265.934
memory_size                            4479.0 

=== epoch 1/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:12<00:00,  1.74it/s]
episodes                                  179
episode_length                      11.145251
returns                             -2.681031
return_std                           1.765335
average_reward                      -0.240663
round_time             0 days 00:19:13.402004
episodes_test                           176.0
episode_length_test                 11.335227
returns_test                        -2.596029
return_std_test                      1.827006
average_reward_test                 -0.226548
round_time_test        0 days 00:00:04.119068
round_time_total       0 days 00:19:13.404348
loss_total              13580748223610.880859
loss_critic             16975942256820.224609
loss_actor                      -28811923.844
memory_size                            4479.0 

=== epoch 1/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [18:24<00:00,  1.81it/s]
episodes                                  169
episode_length                      11.775148
returns                             -2.298091
return_std                           2.155128
average_reward                      -0.194993
round_time             0 days 00:18:25.443650
episodes_test                           174.0
episode_length_test                 11.482759
returns_test                        -2.362218
return_std_test                      1.846885
average_reward_test                 -0.204688
round_time_test        0 days 00:00:03.528823
round_time_total       0 days 00:18:25.445172
loss_total               20075365182865.40625
loss_critic             25094214780977.152344
loss_actor                      -34788819.054
memory_size                            4479.0 

=== epoch 1/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:55<00:00,  1.67it/s]
episodes                                  171
episode_length                      11.637427
returns                             -2.101357
return_std                           2.537866
average_reward                      -0.180264
round_time             0 days 00:19:56.398058
episodes_test                           175.0
episode_length_test                 11.382857
returns_test                        -2.482941
return_std_test                      1.883542
average_reward_test                 -0.217321
round_time_test        0 days 00:00:03.192582
round_time_total       0 days 00:19:56.399637
loss_total              29902820360585.214844
loss_critic             37378535306821.632812
loss_actor                      -41864123.264
memory_size                          4479.606 

=== epoch 1/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:02<00:00,  1.75it/s]
episodes                                  161
episode_length                      12.347826
returns                             -1.975675
return_std                           2.692433
average_reward                      -0.159944
round_time             0 days 00:19:04.036122
episodes_test                           171.0
episode_length_test                 11.672515
returns_test                        -1.715531
return_std_test                       1.84948
average_reward_test                  -0.14648
round_time_test        0 days 00:00:03.340835
round_time_total       0 days 00:19:04.038022
loss_total               43781524769734.65625
loss_critic             54726916977197.054688
loss_actor                      -50035238.298
memory_size                            4482.0 

=== epoch 1/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:13<00:00,  1.73it/s]
episodes                                  158
episode_length                      12.613924
returns                             -1.271944
return_std                           2.503098
average_reward                      -0.098501
round_time             0 days 00:19:14.950737
episodes_test                           160.0
episode_length_test                   12.4375
returns_test                        -1.206336
return_std_test                      2.546815
average_reward_test                 -0.097071
round_time_test        0 days 00:00:03.708078
round_time_total       0 days 00:19:14.952236
loss_total              62416928482000.898438
loss_critic             78021173717237.765625
loss_actor                      -59104057.502
memory_size                            4482.0 

=== epoch 1/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:23<00:00,  1.72it/s]
episodes                                  153
episode_length                       12.96732
returns                             -0.380627
return_std                           2.509198
average_reward                      -0.028294
round_time             0 days 00:19:24.520993
episodes_test                           155.0
episode_length_test                 12.903226
returns_test                          -0.6444
return_std_test                       2.85559
average_reward_test                 -0.049941
round_time_test        0 days 00:00:03.397554
round_time_total       0 days 00:19:24.522832
loss_total              88068213398568.953125
loss_critic             110085284050763.78125
loss_actor                      -69621424.636
memory_size                            4482.0 

=== epoch 1/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [18:56<00:00,  1.76it/s]
episodes                                  157
episode_length                      12.649682
returns                             -0.491651
return_std                           2.443912
average_reward                      -0.040203
round_time             0 days 00:18:57.978135
episodes_test                           150.0
episode_length_test                 13.286667
returns_test                         0.099425
return_std_test                      3.330285
average_reward_test                  0.010586
round_time_test        0 days 00:00:03.531117
round_time_total       0 days 00:18:57.979538
loss_total             123812780634537.984375
loss_critic             154765993697083.40625
loss_actor                      -82062302.628
memory_size                            4482.0 

=== epoch 1/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:10<00:00,  1.74it/s]
episodes                                  147
episode_length                      13.544218
returns                              0.446497
return_std                           4.549378
average_reward                       0.033558
round_time             0 days 00:19:11.409542
episodes_test                           149.0
episode_length_test                  13.38255
returns_test                          0.46384
return_std_test                      3.997671
average_reward_test                  0.036917
round_time_test        0 days 00:00:03.568525
round_time_total       0 days 00:19:11.410991
loss_total               180946026845175.8125
loss_critic              226182553851133.9375
loss_actor                      -96214968.268
memory_size                         4486.9415 

=== epoch 1/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:00<00:00,  1.75it/s]
episodes                                   96
episode_length                      20.552083
returns                              7.384614
return_std                          11.158625
average_reward                       0.362439
round_time             0 days 00:19:00.779264
episodes_test                           109.0
episode_length_test                 18.220183
returns_test                         5.698206
return_std_test                     10.917741
average_reward_test                  0.317465
round_time_test        0 days 00:00:03.753923
round_time_total       0 days 00:19:00.781344
loss_total               287674283470094.3125
loss_critic              359592878200061.9375
loss_actor                      -109897328.04
memory_size                         4608.9535 

=== epoch 1/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [18:35<00:00,  1.79it/s]
episodes                                  127
episode_length                      15.598425
returns                              3.339378
return_std                           7.615167
average_reward                        0.21563
round_time             0 days 00:18:36.393241
episodes_test                           130.0
episode_length_test                 15.384615
returns_test                          3.28003
return_std_test                      6.478429
average_reward_test                  0.213202
round_time_test        0 days 00:00:03.592158
round_time_total       0 days 00:18:36.395061
loss_total                  368252475504853.0
loss_critic                 460315616228671.5
loss_actor                     -121757633.056
memory_size                         4693.1775 

=== epoch 1/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:48<00:00,  1.68it/s]
episodes                                  127
episode_length                      15.685039
returns                               4.11452
return_std                           6.597149
average_reward                       0.259338
round_time             0 days 00:19:49.634972
episodes_test                           126.0
episode_length_test                  15.65873
returns_test                         3.744885
return_std_test                      5.208213
average_reward_test                  0.244691
round_time_test        0 days 00:00:03.198017
round_time_total       0 days 00:19:49.636990
loss_total                 450293852402614.25
loss_critic              562867333765267.4375
loss_actor                     -131949500.428
memory_size                          4733.692 

=== epoch 1/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:47<00:00,  1.68it/s]
episodes                                  105
episode_length                      18.771429
returns                              7.076966
return_std                            7.49785
average_reward                       0.376697
round_time             0 days 00:19:48.192795
episodes_test                           114.0
episode_length_test                 17.535088
returns_test                         6.117814
return_std_test                      6.062774
average_reward_test                  0.349069
round_time_test        0 days 00:00:03.282193
round_time_total       0 days 00:19:48.194683
loss_total               532054076267954.1875
loss_critic                 665067609107988.5
loss_actor                     -141580151.324
memory_size                         4784.6245 

=== epoch 1/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:40<00:00,  1.69it/s]
episodes                                  115
episode_length                      17.321739
returns                              6.004997
return_std                           6.393956
average_reward                        0.34604
round_time             0 days 00:19:41.188015
episodes_test                           112.0
episode_length_test                 17.839286
returns_test                         6.436772
return_std_test                      6.619144
average_reward_test                  0.360989
round_time_test        0 days 00:00:04.199409
round_time_total       0 days 00:19:41.189891
loss_total                620908682959388.625
loss_critic                776135860948041.75
loss_actor                      -150389027.44
memory_size                         4815.8065 

=== epoch 1/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:21<00:00,  1.72it/s]
episodes                                  104
episode_length                      19.038462
returns                              6.675424
return_std                           7.322054
average_reward                       0.350638
round_time             0 days 00:19:21.982862
episodes_test                           111.0
episode_length_test                 17.954955
returns_test                         5.964913
return_std_test                      6.913558
average_reward_test                  0.331193
round_time_test        0 days 00:00:03.781960
round_time_total       0 days 00:19:21.984802
loss_total                 717592263030472.75
loss_critic                 896990330107199.5
loss_actor                     -158895789.048
memory_size                          4857.577 

=== epoch 1/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:50<00:00,  1.68it/s]
episodes                                  126
episode_length                       15.81746
returns                              4.161245
return_std                           5.021867
average_reward                        0.26073
round_time             0 days 00:19:51.406450
episodes_test                           124.0
episode_length_test                 16.104839
returns_test                         4.433069
return_std_test                      4.714908
average_reward_test                  0.276287
round_time_test        0 days 00:00:03.663761
round_time_total       0 days 00:19:51.407920
loss_total                  787474374721536.0
loss_critic                 984342985317548.0
loss_actor                      -168137797.88
memory_size                         4884.3475 

=== epoch 1/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:02<00:00,  1.66it/s]
episodes                                  134
episode_length                      14.776119
returns                              3.733022
return_std                           4.491069
average_reward                       0.253155
round_time             0 days 00:20:03.207529
episodes_test                           140.0
episode_length_test                 14.257143
returns_test                         2.833729
return_std_test                      3.774338
average_reward_test                  0.199262
round_time_test        0 days 00:00:03.104938
round_time_total       0 days 00:20:03.209363
loss_total                  910695609856426.0
loss_critic               1138369543802978.25
loss_actor                     -178645504.656
memory_size                            4889.0 

=== epoch 1/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:35<00:00,  1.70it/s]
episodes                                  137
episode_length                      14.489051
returns                              3.137685
return_std                           3.814982
average_reward                       0.215655
round_time             0 days 00:19:36.685994
episodes_test                           137.0
episode_length_test                  14.59854
returns_test                         3.253889
return_std_test                      3.554334
average_reward_test                  0.222891
round_time_test        0 days 00:00:04.057895
round_time_total       0 days 00:19:36.687812
loss_total                 1021361969002709.0
loss_critic               1276702483527237.75
loss_actor                     -191164540.416
memory_size                          4890.479 

=== epoch 1/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:20<00:00,  1.72it/s]
episodes                                  122
episode_length                      16.360656
returns                              4.081577
return_std                           5.538462
average_reward                       0.247397
round_time             0 days 00:19:21.791647
episodes_test                           130.0
episode_length_test                 15.330769
returns_test                         3.422521
return_std_test                      4.040717
average_reward_test                  0.226532
round_time_test        0 days 00:00:02.991951
round_time_total       0 days 00:19:21.793174
loss_total                 1203015024486908.0
loss_critic               1503768793803915.25
loss_actor                     -204883275.832
memory_size                          4911.952 

=== epoch 1/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:51<00:00,  1.68it/s]
episodes                                  124
episode_length                      15.830645
returns                              3.901593
return_std                           4.410267
average_reward                       0.253137
round_time             0 days 00:19:52.858365
episodes_test                           125.0
episode_length_test                      16.0
returns_test                         4.039845
return_std_test                      5.043021
average_reward_test                   0.25249
round_time_test        0 days 00:00:03.821367
round_time_total       0 days 00:19:52.859789
loss_total                1416352206109540.25
loss_critic                1770440257209106.5
loss_actor                     -220540389.848
memory_size                           4950.56 

=== epoch 1/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:18<00:00,  1.64it/s]
episodes                                  120
episode_length                         16.575
returns                              4.755286
return_std                           4.658484
average_reward                       0.285094
round_time             0 days 00:20:19.876417
episodes_test                           118.0
episode_length_test                 16.822034
returns_test                         4.279184
return_std_test                      4.861672
average_reward_test                   0.25482
round_time_test        0 days 00:00:03.136643
round_time_total       0 days 00:20:19.877894
loss_total                 1684170959390507.0
loss_critic               2105213682605621.25
loss_actor                     -236096767.696
memory_size                         4968.2765 

=== epoch 1/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:51<00:00,  1.68it/s]
episodes                                  121
episode_length                      16.363636
returns                              4.407164
return_std                           5.698724
average_reward                       0.267411
round_time             0 days 00:19:52.276841
episodes_test                           126.0
episode_length_test                 15.825397
returns_test                         3.683807
return_std_test                      4.697859
average_reward_test                  0.235226
round_time_test        0 days 00:00:03.057980
round_time_total       0 days 00:19:52.278695
loss_total                1916743544517689.25
loss_critic                2395929407014306.0
loss_actor                     -253401819.512
memory_size                         4991.6305 

=== epoch 1/10 ===== round 47/50 =====================================
100%|██████████| 2000/2000 [19:29<00:00,  1.71it/s]
episodes                                  114
episode_length                      17.394737
returns                              4.947668
return_std                           5.928366
average_reward                       0.285144
round_time             0 days 00:19:31.047691
episodes_test                           101.0
episode_length_test                 19.792079
returns_test                         6.010456
return_std_test                      5.210261
average_reward_test                  0.303951
round_time_test        0 days 00:00:02.993310
round_time_total       0 days 00:19:31.049736
loss_total                 2212497931192238.0
loss_critic                2765622378460348.5
loss_actor                     -269662911.952
memory_size                         5014.7725 

=== epoch 1/10 ===== round 48/50 =====================================
100%|██████████| 2000/2000 [19:58<00:00,  1.67it/s]
episodes                                  118
episode_length                      16.813559
returns                              4.836537
return_std                              4.172
average_reward                       0.289038
round_time             0 days 00:19:59.874974
episodes_test                           107.0
episode_length_test                  18.64486
returns_test                         5.607569
return_std_test                      7.464351
average_reward_test                   0.30115
round_time_test        0 days 00:00:04.006379
round_time_total       0 days 00:19:59.876828
loss_total                 2436027850097164.5
loss_critic                3045034767457190.0
loss_actor                     -284468101.928
memory_size                            5024.0 

=== epoch 1/10 ===== round 49/50 =====================================
100%|██████████| 2000/2000 [19:48<00:00,  1.68it/s]
episodes                                  109
episode_length                      18.018349
returns                              5.258792
return_std                           5.805234
average_reward                       0.294305
round_time             0 days 00:19:49.972812
episodes_test                           111.0
episode_length_test                 17.972973
returns_test                         5.389768
return_std_test                      6.127534
average_reward_test                  0.301691
round_time_test        0 days 00:00:03.670104
round_time_total       0 days 00:19:49.974281
loss_total                 2713006119898317.0
loss_critic                3391257597251158.0
loss_actor                     -297061676.528
memory_size                          5040.244 

=== epoch 1/10 ===== round 50/50 =====================================
100%|██████████| 2000/2000 [19:53<00:00,  1.68it/s]
episodes                                   96
episode_length                      20.614583
returns                              7.723163
return_std                          10.454545
average_reward                       0.370326
round_time             0 days 00:19:54.340172
episodes_test                           115.0
episode_length_test                 17.286957
returns_test                         4.770824
return_std_test                      5.702825
average_reward_test                   0.27852
round_time_test        0 days 00:00:03.835436
round_time_total       0 days 00:19:54.342352
loss_total                 3064210032661889.0
loss_critic                3830262477501759.5
loss_actor                     -309930100.592
memory_size                         5125.6255 


=== epoch 2/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [18:47<00:00,  1.77it/s]
episodes                                   99
episode_length                      20.090909
returns                              6.226641
return_std                           7.057581
average_reward                       0.312071
round_time             0 days 00:18:47.889965
episodes_test                            96.0
episode_length_test                 20.583333
returns_test                         7.318075
return_std_test                      12.62712
average_reward_test                  0.357533
round_time_test        0 days 00:00:03.205162
round_time_total       0 days 00:18:47.891976
loss_total                 3476028786409996.5
loss_critic                4345035915974934.5
loss_actor                      -320963074.24
memory_size                         5232.7925 

=== epoch 2/10 ===== round 2/50 ======================================
100%|██████████| 2000/2000 [17:51<00:00,  1.87it/s]
episodes                                   97
episode_length                      20.515464
returns                              7.200362
return_std                            7.01857
average_reward                       0.347865
round_time             0 days 00:17:52.501103
episodes_test                           104.0
episode_length_test                 19.192308
returns_test                         6.194058
return_std_test                      6.894166
average_reward_test                   0.32338
round_time_test        0 days 00:00:03.271098
round_time_total       0 days 00:17:52.503259
loss_total                 3766911147647697.0
loss_critic                4708638868490945.0
loss_actor                      -328409833.44
memory_size                         5304.0445 

=== epoch 2/10 ===== round 3/50 ======================================
100%|██████████| 2000/2000 [18:36<00:00,  1.79it/s]
episodes                                   93
episode_length                      20.913978
returns                               7.93118
return_std                          15.663021
average_reward                        0.37235
round_time             0 days 00:18:37.570730
episodes_test                            94.0
episode_length_test                 21.148936
returns_test                         7.629015
return_std_test                     10.761514
average_reward_test                  0.364105
round_time_test        0 days 00:00:03.666477
round_time_total       0 days 00:18:37.573025
loss_total                 4044983255324164.0
loss_critic                5056229000368620.0
loss_actor                     -333984534.096
memory_size                          5446.343 

=== epoch 2/10 ===== round 4/50 ======================================
100%|██████████| 2000/2000 [18:53<00:00,  1.76it/s]
episodes                                   94
episode_length                      20.851064
returns                              7.611411
return_std                           9.121795
average_reward                       0.366303
round_time             0 days 00:18:54.812744
episodes_test                            97.0
episode_length_test                 20.350515
returns_test                         7.178068
return_std_test                      9.298614
average_reward_test                  0.358208
round_time_test        0 days 00:00:03.693800
round_time_total       0 days 00:18:54.814716
loss_total                 4285537245907124.0
loss_critic                5356921478707151.0
loss_actor                      -337304827.52
memory_size                         5550.5645 

=== epoch 2/10 ===== round 5/50 ======================================
100%|██████████| 2000/2000 [18:01<00:00,  1.85it/s]
episodes                                   86
episode_length                      23.151163
returns                             10.160227
return_std                          13.435713
average_reward                       0.435166
round_time             0 days 00:18:02.320097
episodes_test                            92.0
episode_length_test                 21.663043
returns_test                          7.17149
return_std_test                      7.263107
average_reward_test                  0.329222
round_time_test        0 days 00:00:03.927019
round_time_total       0 days 00:18:02.321575
loss_total                 4511454973506093.0
loss_critic                5639318634321936.0
loss_actor                     -339935770.992
memory_size                          5694.893 

=== epoch 2/10 ===== round 6/50 ======================================
100%|██████████| 2000/2000 [18:08<00:00,  1.84it/s]
episodes                                   95
episode_length                           21.0
returns                              8.132457
return_std                            9.37298
average_reward                       0.388241
round_time             0 days 00:18:09.118949
episodes_test                            89.0
episode_length_test                 22.269663
returns_test                         9.926064
return_std_test                     15.487447
average_reward_test                  0.450004
round_time_test        0 days 00:00:03.624127
round_time_total       0 days 00:18:09.120825
loss_total                 4737701049039585.0
loss_critic                5922126216139112.0
loss_actor                     -339984626.016
memory_size                         5863.2115 

=== epoch 2/10 ===== round 7/50 ======================================
100%|██████████| 2000/2000 [18:27<00:00,  1.81it/s]
episodes                                   84
episode_length                      23.595238
returns                             10.993202
return_std                          11.721634
average_reward                       0.465479
round_time             0 days 00:18:28.706578
episodes_test                            90.0
episode_length_test                 22.222222
returns_test                          9.33593
return_std_test                     10.215197
average_reward_test                  0.420117
round_time_test        0 days 00:00:03.410634
round_time_total       0 days 00:18:28.708402
loss_total                 4839271741001826.0
loss_critic                6049089581033193.0
loss_actor                     -335199095.536
memory_size                           6037.42 

=== epoch 2/10 ===== round 8/50 ======================================
100%|██████████| 2000/2000 [19:02<00:00,  1.75it/s]
episodes                                   93
episode_length                      21.204301
returns                              7.531959
return_std                           8.630229
average_reward                       0.357919
round_time             0 days 00:19:03.541654
episodes_test                            86.0
episode_length_test                 23.244186
returns_test                          9.40989
return_std_test                     10.217828
average_reward_test                  0.405009
round_time_test        0 days 00:00:02.978966
round_time_total       0 days 00:19:03.543527
loss_total                 4848334236148564.0
loss_critic                6060417701828887.0
loss_actor                      -329079617.28
memory_size                         6192.9415 

=== epoch 2/10 ===== round 9/50 ======================================
100%|██████████| 2000/2000 [19:21<00:00,  1.72it/s]
episodes                                  147
episode_length                      13.564626
returns                              2.008923
return_std                           3.525138
average_reward                       0.144583
round_time             0 days 00:19:22.456140
episodes_test                           126.0
episode_length_test                 15.809524
returns_test                         4.294637
return_std_test                      5.980752
average_reward_test                   0.27267
round_time_test        0 days 00:00:04.032997
round_time_total       0 days 00:19:22.458091
loss_total                 4573950687053873.0
loss_critic                5717438265963839.0
loss_actor                     -319773771.184
memory_size                            6250.0 

=== epoch 2/10 ===== round 10/50 =====================================
100%|██████████| 2000/2000 [19:36<00:00,  1.70it/s]
episodes                                  138
episode_length                      14.391304
returns                              1.949546
return_std                           3.828655
average_reward                       0.134634
round_time             0 days 00:19:37.841589
episodes_test                           145.0
episode_length_test                 13.737931
returns_test                         1.798064
return_std_test                      4.037356
average_reward_test                  0.134652
round_time_test        0 days 00:00:03.039349
round_time_total       0 days 00:19:37.843437
loss_total                 4429963798912172.0
loss_critic                5537454652330607.0
loss_actor                     -316793402.624
memory_size                         6251.8285 

=== epoch 2/10 ===== round 11/50 =====================================
100%|██████████| 2000/2000 [19:52<00:00,  1.68it/s]
episodes                                  145
episode_length                      13.751724
returns                              1.094757
return_std                           4.854508
average_reward                       0.078068
round_time             0 days 00:19:53.226396
episodes_test                           137.0
episode_length_test                 14.576642
returns_test                         1.574379
return_std_test                      4.313722
average_reward_test                  0.108584
round_time_test        0 days 00:00:03.206020
round_time_total       0 days 00:19:53.228458
loss_total                 4443615936376733.5
loss_critic                5554519827600638.0
loss_actor                     -320539593.984
memory_size                         6259.2925 

=== epoch 2/10 ===== round 12/50 =====================================
100%|██████████| 2000/2000 [19:29<00:00,  1.71it/s]
episodes                                  160
episode_length                       12.41875
returns                             -1.016531
return_std                           2.510204
average_reward                      -0.079779
round_time             0 days 00:19:30.191726
episodes_test                           147.0
episode_length_test                 13.598639
returns_test                         0.512529
return_std_test                      4.139881
average_reward_test                  0.038048
round_time_test        0 days 00:00:03.397811
round_time_total       0 days 00:19:30.193186
loss_total                 4442549198235107.5
loss_critic                5553186405510808.0
loss_actor                       -325543822.4
memory_size                            6261.0 

=== epoch 2/10 ===== round 13/50 =====================================
100%|██████████| 2000/2000 [19:51<00:00,  1.68it/s]
episodes                                  152
episode_length                      12.967105
returns                             -0.975676
return_std                           3.481903
average_reward                      -0.072927
round_time             0 days 00:19:52.475388
episodes_test                           151.0
episode_length_test                 13.172185
returns_test                        -0.508309
return_std_test                      3.518713
average_reward_test                 -0.033817
round_time_test        0 days 00:00:04.262476
round_time_total       0 days 00:19:52.476849
loss_total                 4523028562285953.0
loss_critic                5653785610230432.0
loss_actor                     -331205067.472
memory_size                          6263.569 

=== epoch 2/10 ===== round 14/50 =====================================
100%|██████████| 2000/2000 [19:14<00:00,  1.73it/s]
episodes                                  150
episode_length                      13.246667
returns                             -1.225566
return_std                             3.0184
average_reward                       -0.09289
round_time             0 days 00:19:15.053520
episodes_test                           150.0
episode_length_test                 13.246667
returns_test                        -1.159619
return_std_test                      5.135584
average_reward_test                 -0.085776
round_time_test        0 days 00:00:03.353799
round_time_total       0 days 00:19:15.054942
loss_total                 4687218602053468.0
loss_critic                5859023158211772.0
loss_actor                     -337525943.808
memory_size                           6267.44 

=== epoch 2/10 ===== round 15/50 =====================================
100%|██████████| 2000/2000 [19:59<00:00,  1.67it/s]
episodes                                  147
episode_length                      13.517007
returns                              -1.21196
return_std                           3.958418
average_reward                      -0.090154
round_time             0 days 00:20:00.668540
episodes_test                           148.0
episode_length_test                 13.439189
returns_test                        -1.474086
return_std_test                      3.030405
average_reward_test                 -0.105417
round_time_test        0 days 00:00:03.479073
round_time_total       0 days 00:20:00.670468
loss_total                 4882754675859784.0
loss_critic                6103443248473178.0
loss_actor                     -343451251.504
memory_size                          6275.856 

=== epoch 2/10 ===== round 16/50 =====================================
100%|██████████| 2000/2000 [20:05<00:00,  1.66it/s]
episodes                                  135
episode_length                      14.733333
returns                             -0.140813
return_std                           5.267658
average_reward                      -0.009258
round_time             0 days 00:20:06.650008
episodes_test                           143.0
episode_length_test                 13.937063
returns_test                        -1.346177
return_std_test                      4.384443
average_reward_test                 -0.092151
round_time_test        0 days 00:00:03.505099
round_time_total       0 days 00:20:06.651444
loss_total                 5153326765480870.0
loss_critic                6441658346977100.0
loss_actor                     -350476715.024
memory_size                         6293.4015 

=== epoch 2/10 ===== round 17/50 =====================================
100%|██████████| 2000/2000 [20:08<00:00,  1.65it/s]
episodes                                  139
episode_length                      14.309353
returns                             -0.699185
return_std                           4.610507
average_reward                      -0.048819
round_time             0 days 00:20:09.580798
episodes_test                           120.0
episode_length_test                 16.516667
returns_test                         0.840917
return_std_test                      7.949044
average_reward_test                   0.04942
round_time_test        0 days 00:00:03.373527
round_time_total       0 days 00:20:09.582286
loss_total                 5410364328605909.0
loss_critic                6762955309590774.0
loss_actor                      -359012033.52
memory_size                          6315.433 

=== epoch 2/10 ===== round 18/50 =====================================
100%|██████████| 2000/2000 [19:10<00:00,  1.74it/s]
episodes                                  130
episode_length                      15.030769
returns                              0.430663
return_std                            5.51482
average_reward                       0.033278
round_time             0 days 00:19:11.522607
episodes_test                           129.0
episode_length_test                  15.44186
returns_test                         0.015514
return_std_test                      5.998126
average_reward_test                  0.003066
round_time_test        0 days 00:00:03.114966
round_time_total       0 days 00:19:11.524559
loss_total                 5705039548132622.0
loss_critic                7131299318195028.0
loss_actor                     -366586012.864
memory_size                          6359.875 

=== epoch 2/10 ===== round 19/50 =====================================
100%|██████████| 2000/2000 [19:12<00:00,  1.73it/s]
episodes                                  134
episode_length                      14.597015
returns                             -0.956369
return_std                           4.485126
average_reward                      -0.065669
round_time             0 days 00:19:13.824648
episodes_test                           137.0
episode_length_test                 14.430657
returns_test                         0.413056
return_std_test                      4.499378
average_reward_test                  0.032363
round_time_test        0 days 00:00:03.183106
round_time_total       0 days 00:19:13.826591
loss_total                 5893756729258148.0
loss_critic                7367195788830573.0
loss_actor                     -372569109.072
memory_size                          6392.594 

=== epoch 2/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:48<00:00,  1.68it/s]
episodes                                   94
episode_length                      20.585106
returns                             -1.510637
return_std                          23.604439
average_reward                      -0.086698
round_time             0 days 00:19:49.841226
episodes_test                           127.0
episode_length_test                 15.677165
returns_test                         1.343654
return_std_test                     12.865996
average_reward_test                  0.088916
round_time_test        0 days 00:00:03.212763
round_time_total       0 days 00:19:49.843007
loss_total                 5899436254893703.0
loss_critic                7374295193341657.0
loss_actor                     -377770620.688
memory_size                          6809.291 

=== epoch 2/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:31<00:00,  1.62it/s]
episodes                                  100
episode_length                          19.89
returns                              1.416751
return_std                            7.45004
average_reward                       0.068216
round_time             0 days 00:20:32.885711
episodes_test                           120.0
episode_length_test                     16.65
returns_test                        -1.731333
return_std_test                      8.325983
average_reward_test                 -0.103103
round_time_test        0 days 00:00:03.780250
round_time_total       0 days 00:20:32.887158
loss_total                 5867754652442821.0
loss_critic                7334693195327996.0
loss_actor                      -372016646.56
memory_size                         7131.8235 

=== epoch 2/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:15<00:00,  1.57it/s]
episodes                                  106
episode_length                      18.811321
returns                              4.815818
return_std                           11.56975
average_reward                       0.256074
round_time             0 days 00:21:16.318573
episodes_test                           102.0
episode_length_test                 19.607843
returns_test                         2.815212
return_std_test                      7.398944
average_reward_test                  0.143576
round_time_test        0 days 00:00:03.076292
round_time_total       0 days 00:21:16.320715
loss_total                 5548361671312933.0
loss_critic                6935451972405297.0
loss_actor                     -358202775.856
memory_size                         7390.6645 

=== epoch 2/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:51<00:00,  1.52it/s]
episodes                                   73
episode_length                      26.657534
returns                              9.558779
return_std                          11.482135
average_reward                       0.370689
round_time             0 days 00:21:52.984489
episodes_test                            82.0
episode_length_test                 24.304878
returns_test                         7.758595
return_std_test                     11.975293
average_reward_test                  0.319658
round_time_test        0 days 00:00:03.302508
round_time_total       0 days 00:21:52.986321
loss_total                 5430267608165253.0
loss_critic                6787834399560827.0
loss_actor                     -342297685.568
memory_size                         7643.5235 

=== epoch 2/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:01<00:00,  1.58it/s]
episodes                                   91
episode_length                      21.681319
returns                              6.178334
return_std                          10.268359
average_reward                       0.284438
round_time             0 days 00:21:03.081153
episodes_test                            77.0
episode_length_test                 25.974026
returns_test                          9.36949
return_std_test                     11.068782
average_reward_test                  0.360725
round_time_test        0 days 00:00:03.182363
round_time_total       0 days 00:21:03.083047
loss_total                 5074379689431138.0
loss_critic                6342974500069442.0
loss_actor                     -318664801.232
memory_size                         7910.4745 

=== epoch 2/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:12<00:00,  1.57it/s]
episodes                                   80
episode_length                        24.3375
returns                              6.290934
return_std                           8.680732
average_reward                       0.263909
round_time             0 days 00:21:13.497026
episodes_test                            82.0
episode_length_test                 24.146341
returns_test                         6.823918
return_std_test                      8.507575
average_reward_test                  0.289171
round_time_test        0 days 00:00:03.200365
round_time_total       0 days 00:21:13.498958
loss_total                 4736040142102331.0
loss_critic                5920050076662628.0
loss_actor                     -300793750.072
memory_size                          8123.009 

=== epoch 2/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:08<00:00,  1.58it/s]
episodes                                   99
episode_length                      19.979798
returns                               5.01901
return_std                           8.640714
average_reward                       0.247598
round_time             0 days 00:21:09.400523
episodes_test                            96.0
episode_length_test                 20.822917
returns_test                         5.966813
return_std_test                      8.382727
average_reward_test                  0.286897
round_time_test        0 days 00:00:03.081453
round_time_total       0 days 00:21:09.402669
loss_total                 4515084501990244.0
loss_critic                5643855536219750.0
loss_actor                     -288531808.184
memory_size                         8257.4085 

=== epoch 2/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:40<00:00,  1.61it/s]
episodes                                  144
episode_length                      13.861111
returns                              1.689402
return_std                           5.079449
average_reward                       0.121201
round_time             0 days 00:20:41.162351
episodes_test                           136.0
episode_length_test                 14.573529
returns_test                         1.979383
return_std_test                      4.241683
average_reward_test                  0.137621
round_time_test        0 days 00:00:03.274349
round_time_total       0 days 00:20:41.164360
loss_total                 4422025495382589.5
loss_critic                5527531774772511.0
loss_actor                     -282034300.392
memory_size                         8349.5935 

=== epoch 2/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:37<00:00,  1.62it/s]
episodes                                  142
episode_length                      13.873239
returns                              1.665601
return_std                           3.601891
average_reward                       0.126468
round_time             0 days 00:20:38.941234
episodes_test                           150.0
episode_length_test                     13.24
returns_test                         1.486115
return_std_test                      3.340901
average_reward_test                  0.117972
round_time_test        0 days 00:00:04.166486
round_time_total       0 days 00:20:38.942727
loss_total                 4267729759024185.5
loss_critic                5334662109676438.0
loss_actor                     -276299020.984
memory_size                          8393.783 

=== epoch 2/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:23<00:00,  1.63it/s]
episodes                                  141
episode_length                      14.085106
returns                              1.080768
return_std                           4.265033
average_reward                       0.075691
round_time             0 days 00:20:24.398824
episodes_test                           159.0
episode_length_test                 12.578616
returns_test                         0.893214
return_std_test                      3.202776
average_reward_test                  0.071011
round_time_test        0 days 00:00:03.078029
round_time_total       0 days 00:20:24.400758
loss_total                 4084863876590469.0
loss_critic                5106079760241394.0
loss_actor                     -266180405.032
memory_size                          8427.908 

=== epoch 2/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:25<00:00,  1.63it/s]
episodes                                  139
episode_length                      14.338129
returns                              0.801654
return_std                           3.190299
average_reward                       0.053063
round_time             0 days 00:20:26.440290
episodes_test                           155.0
episode_length_test                 12.845161
returns_test                        -0.311143
return_std_test                      3.482101
average_reward_test                 -0.020367
round_time_test        0 days 00:00:03.472542
round_time_total       0 days 00:20:26.442144
loss_total                 3740560044509364.0
loss_critic                4675699973889720.0
loss_actor                     -254815066.352
memory_size                          8462.314 

=== epoch 2/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:50<00:00,  1.60it/s]
episodes                                  136
episode_length                      14.573529
returns                              1.493487
return_std                           4.327872
average_reward                       0.103194
round_time             0 days 00:20:51.685360
episodes_test                           139.0
episode_length_test                 14.359712
returns_test                         1.332723
return_std_test                      4.182211
average_reward_test                  0.094093
round_time_test        0 days 00:00:03.062932
round_time_total       0 days 00:20:51.687250
loss_total                 3364435894479094.0
loss_critic                4205544797064134.5
loss_actor                      -242792236.24
memory_size                          8500.611 

=== epoch 2/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:58<00:00,  1.59it/s]
episodes                                  134
episode_length                      14.828358
returns                              0.838996
return_std                           4.652297
average_reward                       0.058284
round_time             0 days 00:20:59.643387
episodes_test                           132.0
episode_length_test                 15.037879
returns_test                         1.436115
return_std_test                       4.24351
average_reward_test                  0.098891
round_time_test        0 days 00:00:03.372552
round_time_total       0 days 00:20:59.644844
loss_total                 3230185320213381.0
loss_critic                4037731580532228.0
loss_actor                     -232580730.216
memory_size                         8588.8325 

=== epoch 2/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:57<00:00,  1.59it/s]
episodes                                  150
episode_length                          13.26
returns                              1.301498
return_std                           2.733526
average_reward                       0.098393
round_time             0 days 00:20:58.262084
episodes_test                           137.0
episode_length_test                 14.554745
returns_test                         1.313031
return_std_test                      3.590729
average_reward_test                  0.087919
round_time_test        0 days 00:00:03.519230
round_time_total       0 days 00:20:58.264040
loss_total                 2878727676351217.5
loss_critic                3598409531610104.0
loss_actor                     -222202590.624
memory_size                         8624.7975 

=== epoch 2/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:21<00:00,  1.64it/s]
episodes                                  119
episode_length                      16.722689
returns                              2.420612
return_std                           3.813896
average_reward                       0.143621
round_time             0 days 00:20:22.230885
episodes_test                           136.0
episode_length_test                 14.573529
returns_test                         1.893662
return_std_test                      4.031254
average_reward_test                   0.13395
round_time_test        0 days 00:00:03.133250
round_time_total       0 days 00:20:22.232350
loss_total                 2644722832591814.5
loss_critic                3305903486188650.5
loss_actor                     -211068422.296
memory_size                          8677.383 

=== epoch 2/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:35<00:00,  1.54it/s]
episodes                                  124
episode_length                      15.782258
returns                              1.335614
return_std                           3.272136
average_reward                       0.096349
round_time             0 days 00:21:36.651804
episodes_test                           136.0
episode_length_test                 14.654412
returns_test                         1.671032
return_std_test                      3.849744
average_reward_test                  0.116249
round_time_test        0 days 00:00:03.286253
round_time_total       0 days 00:21:36.653545
loss_total                 2496634619311423.5
loss_critic                3120793220678680.5
loss_actor                     -204047740.992
memory_size                          8795.644 

=== epoch 2/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:44<00:00,  1.61it/s]
episodes                                  131
episode_length                      15.183206
returns                              1.962357
return_std                           3.896106
average_reward                       0.128372
round_time             0 days 00:20:45.367255
episodes_test                           121.0
episode_length_test                 16.454545
returns_test                         2.326761
return_std_test                       4.38304
average_reward_test                  0.143044
round_time_test        0 days 00:00:03.192668
round_time_total       0 days 00:20:45.368680
loss_total                 2367148357744328.5
loss_critic                2958935397243027.5
loss_actor                     -197114282.064
memory_size                          8873.287 

=== epoch 2/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:09<00:00,  1.58it/s]
episodes                                  116
episode_length                      17.086207
returns                               2.37308
return_std                           4.048742
average_reward                       0.140004
round_time             0 days 00:21:10.427434
episodes_test                           140.0
episode_length_test                     14.25
returns_test                         1.255962
return_std_test                      3.466279
average_reward_test                   0.08992
round_time_test        0 days 00:00:03.373719
round_time_total       0 days 00:21:10.429238
loss_total                 2196372158601494.5
loss_critic                2745465154144567.5
loss_actor                     -188039593.544
memory_size                         8942.4165 

=== epoch 2/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:18<00:00,  1.56it/s]
episodes                                  110
episode_length                      17.781818
returns                              0.134853
return_std                           4.198082
average_reward                       0.002838
round_time             0 days 00:21:18.837110
episodes_test                           125.0
episode_length_test                    15.984
returns_test                         0.030918
return_std_test                      4.008368
average_reward_test                  0.002349
round_time_test        0 days 00:00:04.125293
round_time_total       0 days 00:21:18.838874
loss_total                 1984706882320728.0
loss_critic                2480883563138908.0
loss_actor                     -173471085.088
memory_size                         9077.7445 

=== epoch 2/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:52<00:00,  1.52it/s]
episodes                                  112
episode_length                      17.821429
returns                              0.485539
return_std                           4.047342
average_reward                       0.027999
round_time             0 days 00:21:52.995883
episodes_test                           117.0
episode_length_test                 16.991453
returns_test                         1.263212
return_std_test                      4.040415
average_reward_test                  0.078843
round_time_test        0 days 00:00:03.744104
round_time_total       0 days 00:21:52.997732
loss_total                1695475758857715.75
loss_critic               2119344665135153.25
loss_actor                     -160463135.648
memory_size                           9155.76 

=== epoch 2/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:52<00:00,  1.52it/s]
episodes                                  149
episode_length                      13.369128
returns                              0.985019
return_std                           3.271174
average_reward                       0.072501
round_time             0 days 00:21:53.523691
episodes_test                           133.0
episode_length_test                 14.977444
returns_test                         1.540876
return_std_test                      4.296358
average_reward_test                  0.105593
round_time_test        0 days 00:00:03.903285
round_time_total       0 days 00:21:53.525460
loss_total                1463053230425505.75
loss_critic               1828816507686092.75
loss_actor                     -149203078.504
memory_size                         9221.3245 

=== epoch 2/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [22:21<00:00,  1.49it/s]
episodes                                  151
episode_length                      13.245033
returns                              1.122837
return_std                           3.008417
average_reward                       0.084774
round_time             0 days 00:22:22.741713
episodes_test                           147.0
episode_length_test                  13.55102
returns_test                         1.561246
return_std_test                      3.438167
average_reward_test                  0.116438
round_time_test        0 days 00:00:03.075691
round_time_total       0 days 00:22:22.743194
loss_total                 1276431402815979.5
loss_critic                1595539225783042.0
loss_actor                     -139811121.064
memory_size                          9265.177 

=== epoch 2/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:41<00:00,  1.54it/s]
episodes                                  169
episode_length                      11.751479
returns                              0.770135
return_std                           2.852427
average_reward                        0.06751
round_time             0 days 00:21:42.762705
episodes_test                           184.0
episode_length_test                 10.858696
returns_test                         0.703613
return_std_test                      2.177814
average_reward_test                  0.065644
round_time_test        0 days 00:00:03.087317
round_time_total       0 days 00:21:42.764546
loss_total               1107534256394993.625
loss_critic                1384417796582015.0
loss_actor                     -131329022.016
memory_size                            9282.0 

=== epoch 2/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<21:14,  1.57it/s]
100%|██████████| 2000/2000 [22:00<00:00,  1.51it/s]
episodes                                  171
episode_length                      11.643275
returns                              0.576682
return_std                            2.41396
average_reward                       0.051417
round_time             0 days 00:22:01.680743
episodes_test                           163.0
episode_length_test                 12.263804
returns_test                         1.190184
return_std_test                      3.062616
average_reward_test                  0.097445
round_time_test        0 days 00:00:03.668955
round_time_total       0 days 00:22:01.682148
loss_total               1006781526894968.875
loss_critic                1258476887206789.0
loss_actor                     -127179935.624
memory_size                          9285.577 

=== epoch 2/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<21:59,  1.51it/s]
100%|██████████| 2000/2000 [16:43<00:00,  1.99it/s]
episodes                                  165
episode_length                      12.024242
returns                               0.96841
return_std                           2.349086
average_reward                       0.080329
round_time             0 days 00:16:44.875314
episodes_test                           162.0
episode_length_test                 12.314815
returns_test                         0.991852
return_std_test                      2.497166
average_reward_test                  0.082666
round_time_test        0 days 00:00:03.157218
round_time_total       0 days 00:16:44.876637
loss_total                  930265881861161.0
loss_critic               1162832333556940.75
loss_actor                      -126321842.34
memory_size                            9286.0 

=== epoch 2/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:04,  2.07it/s]
100%|██████████| 2000/2000 [14:50<00:00,  2.25it/s]
episodes                                  173
episode_length                      11.485549
returns                              0.711038
return_std                            2.44654
average_reward                       0.060642
round_time             0 days 00:14:51.046720
episodes_test                           141.0
episode_length_test                 14.163121
returns_test                         0.341681
return_std_test                      5.773218
average_reward_test                  0.025215
round_time_test        0 days 00:00:02.945323
round_time_total       0 days 00:14:51.048017
loss_total                 855486099768737.75
loss_critic               1069357609816948.75
loss_actor                     -125554736.296
memory_size                            9286.0 

=== epoch 2/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:03,  2.21it/s]
100%|██████████| 2000/2000 [14:18<00:00,  2.33it/s]
episodes                                  177
episode_length                      11.242938
returns                              0.436409
return_std                           2.197755
average_reward                       0.037277
round_time             0 days 00:14:19.334969
episodes_test                           179.0
episode_length_test                 11.134078
returns_test                         0.287846
return_std_test                      2.245315
average_reward_test                  0.025609
round_time_test        0 days 00:00:02.903199
round_time_total       0 days 00:14:19.336474
loss_total                816048803449667.625
loss_critic              1020060991372656.625
loss_actor                     -126360265.584
memory_size                            9286.0 

=== epoch 2/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:27,  2.15it/s]
100%|██████████| 2000/2000 [13:31<00:00,  2.46it/s]
episodes                                  177
episode_length                      11.231638
returns                              0.809412
return_std                           2.467318
average_reward                       0.073225
round_time             0 days 00:13:32.412564
episodes_test                           178.0
episode_length_test                 11.185393
returns_test                         0.503776
return_std_test                      2.307108
average_reward_test                  0.045227
round_time_test        0 days 00:00:02.871261
round_time_total       0 days 00:13:32.413683
loss_total                768379805828644.875
loss_critic               960474746334478.375
loss_actor                     -124974560.028
memory_size                            9286.0 

=== epoch 2/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:02,  2.37it/s]
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                  175
episode_length                      11.388571
returns                              0.798671
return_std                           2.578932
average_reward                       0.072404
round_time             0 days 00:12:25.810818
episodes_test                           180.0
episode_length_test                 11.083333
returns_test                         0.532482
return_std_test                      2.370533
average_reward_test                  0.050205
round_time_test        0 days 00:00:02.733524
round_time_total       0 days 00:12:25.811925
loss_total                  734128834940502.0
loss_critic                917661035849056.25
loss_actor                     -122606492.484
memory_size                            9286.0 

=== epoch 2/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:01,  2.77it/s]
100%|██████████| 2000/2000 [11:36<00:00,  2.87it/s]
episodes                                  175
episode_length                          11.32
returns                              0.858164
return_std                            2.30931
average_reward                       0.074789
round_time             0 days 00:11:37.441319
episodes_test                           175.0
episode_length_test                      11.4
returns_test                         0.674446
return_std_test                      2.311765
average_reward_test                  0.061913
round_time_test        0 days 00:00:02.623158
round_time_total       0 days 00:11:37.442437
loss_total                683714873967771.625
loss_critic               854643589538381.875
loss_actor                     -120784093.624
memory_size                            9286.0 

=== epoch 2/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:18,  2.50it/s]
100%|██████████| 2000/2000 [11:31<00:00,  2.89it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
episodes                                  183
episode_length                       10.89071
returns                               0.27155
return_std                           2.018108
average_reward                       0.026155
round_time             0 days 00:11:31.804512
episodes_test                           181.0
episode_length_test                 11.033149
returns_test                         0.380596
return_std_test                      2.297231
average_reward_test                  0.036058
round_time_test        0 days 00:00:02.575408
round_time_total       0 days 00:11:31.805607
loss_total                646968076604014.625
loss_critic                 808710097000726.5
loss_actor                     -118649917.592
memory_size                           9296.45 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 3/10 ===== round 1/50 ======================================
  0%|          | 5/2000 [00:01<12:47,  2.60it/s]
100%|██████████| 2000/2000 [11:13<00:00,  2.97it/s]
episodes                                  115
episode_length                      16.765217
returns                             -4.324012
return_std                           4.894221
average_reward                      -0.219975
round_time             0 days 00:11:13.028967
episodes_test                           179.0
episode_length_test                 11.173184
returns_test                         0.322029
return_std_test                      2.309645
average_reward_test                  0.028822
round_time_test        0 days 00:00:02.562405
round_time_total       0 days 00:11:13.030078
loss_total               1080679675245625.375
loss_critic               1350849580186468.25
loss_actor                     -149675370.352
memory_size                         9317.2235 

=== epoch 3/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:53,  2.80it/s]
100%|██████████| 2000/2000 [11:15<00:00,  2.96it/s]
episodes                                   38
episode_length                      49.921053
returns                             23.456171
return_std                          58.076691
average_reward                       0.541172
round_time             0 days 00:11:16.291990
episodes_test                            56.0
episode_length_test                 34.678571
returns_test                         5.128348
return_std_test                      50.07446
average_reward_test                  0.168593
round_time_test        0 days 00:00:02.406376
round_time_total       0 days 00:11:16.293096
loss_total                1928277177040633.75
loss_critic                2410346436244799.5
loss_actor                      -194752414.12
memory_size                         9834.6565 

=== epoch 3/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:24,  2.68it/s]
100%|██████████| 2000/2000 [11:19<00:00,  2.94it/s]
episodes                                   23
episode_length                      81.913043
returns                             98.570188
return_std                            31.6995
average_reward                       1.192408
round_time             0 days 00:11:20.204705
episodes_test                            23.0
episode_length_test                  85.73913
returns_test                       105.665315
return_std_test                      34.50321
average_reward_test                  1.226341
round_time_test        0 days 00:00:02.376192
round_time_total       0 days 00:11:20.205807
loss_total                 2292925709414301.5
loss_critic                2866157089833615.5
loss_actor                     -213636770.752
memory_size                        11020.4015 

=== epoch 3/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:24,  2.92it/s]
100%|██████████| 2000/2000 [11:17<00:00,  2.95it/s]
episodes                                   16
episode_length                          119.5
returns                                7.4417
return_std                          95.709947
average_reward                       0.100185
round_time             0 days 00:11:17.627947
episodes_test                            25.0
episode_length_test                      79.4
returns_test                        95.237137
return_std_test                     27.002189
average_reward_test                  1.193592
round_time_test        0 days 00:00:02.364231
round_time_total       0 days 00:11:17.629067
loss_total                 2350695898549321.5
loss_critic                2938369824297844.5
loss_actor                     -212860910.192
memory_size                        12329.1655 

=== epoch 3/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:18,  2.70it/s]
100%|██████████| 2000/2000 [11:15<00:00,  2.96it/s]
episodes                                   51
episode_length                      37.960784
returns                            -12.058272
return_std                          29.179352
average_reward                      -0.337955
round_time             0 days 00:11:15.554875
episodes_test                            18.0
episode_length_test                105.666667
returns_test                        -3.961645
return_std_test                     77.203874
average_reward_test                 -0.049192
round_time_test        0 days 00:00:02.370002
round_time_total       0 days 00:11:15.555966
loss_total                2017215736375672.75
loss_critic                2521519634658623.5
loss_actor                     -194050389.232
memory_size                        13619.4335 

=== epoch 3/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:00,  2.56it/s]
100%|██████████| 2000/2000 [11:26<00:00,  2.91it/s]
episodes                                  139
episode_length                      14.366906
returns                             -7.325276
return_std                           6.116008
average_reward                      -0.507978
round_time             0 days 00:11:26.688218
episodes_test                            95.0
episode_length_test                 20.484211
returns_test                        -8.201945
return_std_test                     20.165589
average_reward_test                 -0.398222
round_time_test        0 days 00:00:02.452731
round_time_total       0 days 00:11:26.689338
loss_total                 1726069186314109.0
loss_critic                2157586452441989.0
loss_actor                     -174308735.016
memory_size                         14033.507 

=== epoch 3/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:58,  2.57it/s]
100%|██████████| 2000/2000 [11:16<00:00,  2.95it/s]
episodes                                  163
episode_length                      12.165644
returns                             -6.727762
return_std                           3.333039
average_reward                       -0.55179
round_time             0 days 00:11:17.323042
episodes_test                           162.0
episode_length_test                 12.314815
returns_test                        -6.723117
return_std_test                      2.622996
average_reward_test                 -0.542292
round_time_test        0 days 00:00:02.529166
round_time_total       0 days 00:11:17.324141
loss_total                1505401664553615.25
loss_critic               1881752063260491.75
loss_actor                     -168492080.512
memory_size                        14054.9015 

=== epoch 3/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:48,  2.82it/s]
100%|██████████| 2000/2000 [11:18<00:00,  2.95it/s]
episodes                                  174
episode_length                      11.442529
returns                             -6.819264
return_std                           2.357032
average_reward                      -0.598307
round_time             0 days 00:11:19.422437
episodes_test                           167.0
episode_length_test                  11.97006
returns_test                        -7.127128
return_std_test                      2.746699
average_reward_test                 -0.594747
round_time_test        0 days 00:00:02.539432
round_time_total       0 days 00:11:19.423536
loss_total                 1399793082827800.5
loss_critic                1749741340377219.0
loss_actor                      -167412151.68
memory_size                           14064.0 

=== epoch 3/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:08,  2.74it/s]
100%|██████████| 2000/2000 [11:23<00:00,  2.93it/s]
episodes                                  178
episode_length                      11.162921
returns                             -6.904654
return_std                            2.54984
average_reward                      -0.616948
round_time             0 days 00:11:24.175357
episodes_test                           178.0
episode_length_test                 11.235955
returns_test                        -6.310741
return_std_test                      3.025457
average_reward_test                 -0.561656
round_time_test        0 days 00:00:02.572501
round_time_total       0 days 00:11:24.176467
loss_total                 1318507449010880.5
loss_critic               1648134304284278.75
loss_actor                     -169737190.864
memory_size                           14064.0 

=== epoch 3/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:15,  2.71it/s]
100%|██████████| 2000/2000 [11:24<00:00,  2.92it/s]
episodes                                  168
episode_length                      11.821429
returns                             -7.433152
return_std                           3.008112
average_reward                      -0.629824
round_time             0 days 00:11:25.100321
episodes_test                           174.0
episode_length_test                 11.482759
returns_test                        -7.531279
return_std_test                      2.585468
average_reward_test                 -0.654551
round_time_test        0 days 00:00:02.565224
round_time_total       0 days 00:11:25.101416
loss_total                1327574377704718.25
loss_critic                1659467971040379.0
loss_actor                      -175297405.24
memory_size                        14071.7025 

=== epoch 3/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:57,  2.78it/s]
100%|██████████| 2000/2000 [11:20<00:00,  2.94it/s]
episodes                                  174
episode_length                       11.41954
returns                             -6.862428
return_std                           2.950875
average_reward                      -0.602107
round_time             0 days 00:11:21.032953
episodes_test                           169.0
episode_length_test                 11.804734
returns_test                        -6.843947
return_std_test                      2.824442
average_reward_test                 -0.577772
round_time_test        0 days 00:00:02.605278
round_time_total       0 days 00:11:21.034069
loss_total                1371571049992290.25
loss_critic                1714463812276453.5
loss_actor                      -182311791.84
memory_size                           14090.0 

=== epoch 3/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:16,  2.71it/s]
100%|██████████| 2000/2000 [11:24<00:00,  2.92it/s]
episodes                                  162
episode_length                      12.283951
returns                             -6.567293
return_std                           3.173087
average_reward                      -0.534271
round_time             0 days 00:11:25.077548
episodes_test                           167.0
episode_length_test                 11.916168
returns_test                        -7.358162
return_std_test                      2.739972
average_reward_test                 -0.615253
round_time_test        0 days 00:00:02.559774
round_time_total       0 days 00:11:25.078649
loss_total                1439527326860705.75
loss_critic                1799409151521063.0
loss_actor                     -189593052.072
memory_size                         14093.705 

=== epoch 3/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:56,  2.57it/s]
100%|██████████| 2000/2000 [11:23<00:00,  2.92it/s]
episodes                                  169
episode_length                      11.739645
returns                             -7.266026
return_std                           2.661218
average_reward                      -0.621274
round_time             0 days 00:11:24.448028
episodes_test                           162.0
episode_length_test                 12.271605
returns_test                        -7.976709
return_std_test                      2.663772
average_reward_test                 -0.651091
round_time_test        0 days 00:00:02.520996
round_time_total       0 days 00:11:24.449127
loss_total                 1498521402729300.0
loss_critic               1873151746910453.75
loss_actor                      -196265150.12
memory_size                        14096.0215 

=== epoch 3/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:53,  2.58it/s]
100%|██████████| 2000/2000 [11:28<00:00,  2.90it/s]
episodes                                  172
episode_length                      11.598837
returns                             -6.262064
return_std                           2.828474
average_reward                      -0.539273
round_time             0 days 00:11:29.036216
episodes_test                           176.0
episode_length_test                 11.335227
returns_test                        -6.696746
return_std_test                      2.300978
average_reward_test                   -0.5897
round_time_test        0 days 00:00:02.575559
round_time_total       0 days 00:11:29.037406
loss_total                1564269100895567.75
loss_critic               1955336361087074.25
loss_actor                     -202256426.128
memory_size                           14097.0 

=== epoch 3/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:47,  2.60it/s]
100%|██████████| 2000/2000 [11:23<00:00,  2.92it/s]
episodes                                  160
episode_length                         12.425
returns                             -5.174127
return_std                           3.856843
average_reward                      -0.419409
round_time             0 days 00:11:24.384641
episodes_test                           166.0
episode_length_test                 12.024096
returns_test                        -5.889935
return_std_test                      3.384339
average_reward_test                 -0.488274
round_time_test        0 days 00:00:02.540983
round_time_total       0 days 00:11:24.385746
loss_total                1639748667915108.25
loss_critic               2049685815734304.75
loss_actor                     -207520749.936
memory_size                         14101.329 

=== epoch 3/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:20<00:00,  2.94it/s]
episodes                                  150
episode_length                      13.233333
returns                              -5.98686
return_std                           3.685398
average_reward                      -0.452593
round_time             0 days 00:11:21.071256
episodes_test                           149.0
episode_length_test                 13.315436
returns_test                        -5.995493
return_std_test                      3.882678
average_reward_test                 -0.449492
round_time_test        0 days 00:00:02.536131
round_time_total       0 days 00:11:21.072394
loss_total                1709980219244281.75
loss_critic                2137475249891967.0
loss_actor                     -212743426.528
memory_size                         14107.471 

=== epoch 3/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:18<00:00,  2.95it/s]
episodes                                  156
episode_length                       12.75641
returns                             -5.072632
return_std                           4.212931
average_reward                      -0.397585
round_time             0 days 00:11:18.642922
episodes_test                           153.0
episode_length_test                 13.039216
returns_test                        -5.383045
return_std_test                      3.596176
average_reward_test                 -0.410713
round_time_test        0 days 00:00:02.545505
round_time_total       0 days 00:11:18.644015
loss_total                1783672548047716.25
loss_critic               2229590659264675.75
loss_actor                     -218171033.448
memory_size                           14110.0 

=== epoch 3/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:22<00:00,  2.93it/s]
episodes                                  155
episode_length                      12.832258
returns                             -5.129658
return_std                           4.080462
average_reward                      -0.400936
round_time             0 days 00:11:22.895605
episodes_test                           163.0
episode_length_test                 12.245399
returns_test                        -4.265755
return_std_test                      3.628961
average_reward_test                 -0.347898
round_time_test        0 days 00:00:02.563594
round_time_total       0 days 00:11:22.896710
loss_total                 1857110484243710.0
loss_critic                2321388074828824.5
loss_actor                      -222078925.52
memory_size                        14114.4655 

=== epoch 3/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:16<00:00,  2.95it/s]
episodes                                  154
episode_length                      12.915584
returns                             -5.272301
return_std                           3.684212
average_reward                      -0.408109
round_time             0 days 00:11:17.458024
episodes_test                           157.0
episode_length_test                  12.66242
returns_test                        -5.184636
return_std_test                      3.631184
average_reward_test                 -0.408514
round_time_test        0 days 00:00:02.625732
round_time_total       0 days 00:11:17.459116
loss_total                1942359309400670.25
loss_critic                2427949102076526.5
loss_actor                      -227609655.72
memory_size                           14119.0 

=== epoch 3/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:13<00:00,  2.97it/s]
episodes                                  154
episode_length                      12.896104
returns                             -5.076277
return_std                           3.596134
average_reward                      -0.391075
round_time             0 days 00:11:13.950646
episodes_test                           168.0
episode_length_test                 11.863095
returns_test                        -5.960632
return_std_test                      3.419141
average_reward_test                 -0.499494
round_time_test        0 days 00:00:02.531625
round_time_total       0 days 00:11:13.951744
loss_total                2003755346182537.25
loss_critic                2504694146480996.5
loss_actor                     -231326599.864
memory_size                           14119.0 

=== epoch 3/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:17<00:00,  2.95it/s]
episodes                                  160
episode_length                       12.36875
returns                             -4.216186
return_std                            3.69074
average_reward                       -0.34093
round_time             0 days 00:11:17.565973
episodes_test                           163.0
episode_length_test                 12.208589
returns_test                        -3.942797
return_std_test                      3.700348
average_reward_test                 -0.321648
round_time_test        0 days 00:00:02.552159
round_time_total       0 days 00:11:17.567080
loss_total                 2062144002320236.5
loss_critic                2577679961938723.0
loss_actor                     -235125011.424
memory_size                           14119.0 

=== epoch 3/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:14<00:00,  2.97it/s]
episodes                                  164
episode_length                      12.146341
returns                             -3.299403
return_std                           3.558776
average_reward                      -0.272731
round_time             0 days 00:11:14.605363
episodes_test                           171.0
episode_length_test                 11.660819
returns_test                        -3.554065
return_std_test                      4.021518
average_reward_test                 -0.301261
round_time_test        0 days 00:00:02.565723
round_time_total       0 days 00:11:14.606470
loss_total                2169003004433268.75
loss_critic                2711253713783095.5
loss_actor                     -238999373.512
memory_size                           14119.0 

=== epoch 3/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:17<00:00,  2.95it/s]
episodes                                  155
episode_length                      12.845161
returns                             -3.132945
return_std                           4.146382
average_reward                      -0.246311
round_time             0 days 00:11:18.211395
episodes_test                           154.0
episode_length_test                 12.922078
returns_test                        -2.854451
return_std_test                      3.911547
average_reward_test                 -0.216107
round_time_test        0 days 00:00:02.551929
round_time_total       0 days 00:11:18.212496
loss_total                 2290897380607787.0
loss_critic                2863621680142483.5
loss_actor                     -246472283.184
memory_size                           14119.0 

=== epoch 3/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:23<00:00,  2.93it/s]
episodes                                  154
episode_length                      12.922078
returns                              -3.14243
return_std                            3.77452
average_reward                      -0.245633
round_time             0 days 00:11:23.569680
episodes_test                           159.0
episode_length_test                  12.54717
returns_test                        -2.551828
return_std_test                      3.547314
average_reward_test                   -0.2024
round_time_test        0 days 00:00:02.558576
round_time_total       0 days 00:11:23.570765
loss_total                 2443094312309752.0
loss_critic                3053867840500138.0
loss_actor                     -255035077.192
memory_size                           14119.0 

=== epoch 3/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:17<00:00,  2.95it/s]
episodes                                  143
episode_length                      13.916084
returns                             -3.496079
return_std                            4.29806
average_reward                      -0.251383
round_time             0 days 00:11:18.381382
episodes_test                           149.0
episode_length_test                  13.38255
returns_test                        -3.181055
return_std_test                      3.827465
average_reward_test                 -0.236539
round_time_test        0 days 00:00:02.509722
round_time_total       0 days 00:11:18.382476
loss_total                 2635791675229733.0
loss_critic                3294739538471026.5
loss_actor                      -263483427.64
memory_size                         14122.175 

=== epoch 3/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:16<00:00,  2.95it/s]
episodes                                  144
episode_length                      13.708333
returns                             -3.824812
return_std                           4.084406
average_reward                      -0.276371
round_time             0 days 00:11:17.352453
episodes_test                           150.0
episode_length_test                 13.293333
returns_test                        -3.733004
return_std_test                      4.548894
average_reward_test                 -0.278969
round_time_test        0 days 00:00:02.525251
round_time_total       0 days 00:11:17.353552
loss_total                 2811826761581461.5
loss_critic                3514783390245060.5
loss_actor                     -272756438.768
memory_size                           14124.0 

=== epoch 3/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:16<00:00,  2.96it/s]
episodes                                  147
episode_length                      13.537415
returns                             -2.288094
return_std                           3.920078
average_reward                      -0.166897
round_time             0 days 00:11:16.461306
episodes_test                           137.0
episode_length_test                 14.562044
returns_test                        -2.658473
return_std_test                      4.855067
average_reward_test                 -0.180145
round_time_test        0 days 00:00:02.526249
round_time_total       0 days 00:11:16.462404
loss_total                 3010107224765759.5
loss_critic                3762633962724262.0
loss_actor                     -281644241.824
memory_size                           14124.0 

=== epoch 3/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:15<00:00,  2.96it/s]
episodes                                  143
episode_length                      13.874126
returns                             -1.844688
return_std                           3.903113
average_reward                        -0.1341
round_time             0 days 00:11:15.684821
episodes_test                           152.0
episode_length_test                 13.059211
returns_test                        -1.811327
return_std_test                      3.717986
average_reward_test                 -0.135759
round_time_test        0 days 00:00:02.572917
round_time_total       0 days 00:11:15.685907
loss_total                 3214971163876786.0
loss_critic                4018713890547302.5
loss_actor                      -290165628.88
memory_size                        14127.9965 

=== epoch 3/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:15<00:00,  2.96it/s]
episodes                                  126
episode_length                      15.746032
returns                             -1.402399
return_std                           4.521604
average_reward                      -0.087843
round_time             0 days 00:11:16.174768
episodes_test                           130.0
episode_length_test                 15.353846
returns_test                        -2.006614
return_std_test                       4.50842
average_reward_test                 -0.129552
round_time_test        0 days 00:00:02.515169
round_time_total       0 days 00:11:16.175865
loss_total                 3458665754973438.0
loss_critic                4323332114519949.5
loss_actor                      -297684603.96
memory_size                        14148.0855 

=== epoch 3/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:20<00:00,  2.94it/s]
episodes                                  141
episode_length                      14.085106
returns                             -1.463511
return_std                           4.135374
average_reward                      -0.102421
round_time             0 days 00:11:21.220057
episodes_test                           148.0
episode_length_test                 13.445946
returns_test                        -1.059002
return_std_test                      3.659924
average_reward_test                 -0.078888
round_time_test        0 days 00:00:02.503644
round_time_total       0 days 00:11:21.221140
loss_total                 3693646871743430.5
loss_critic                4617058510138507.0
loss_actor                     -306839484.704
memory_size                         14160.619 

=== epoch 3/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:16<00:00,  2.96it/s]
episodes                                  135
episode_length                      14.762963
returns                             -1.429816
return_std                           4.692388
average_reward                      -0.099107
round_time             0 days 00:11:17.076898
episodes_test                           146.0
episode_length_test                 13.691781
returns_test                        -1.709631
return_std_test                      3.940976
average_reward_test                 -0.124304
round_time_test        0 days 00:00:02.544756
round_time_total       0 days 00:11:17.077984
loss_total                 3876182159337193.5
loss_critic                4845227622650610.0
loss_actor                     -315536067.472
memory_size                         14168.215 

=== epoch 3/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:14<00:00,  2.97it/s]
episodes                                  123
episode_length                      16.178862
returns                             -0.895087
return_std                            4.29862
average_reward                      -0.057079
round_time             0 days 00:11:14.668384
episodes_test                           123.0
episode_length_test                 16.186992
returns_test                        -1.398338
return_std_test                      4.001683
average_reward_test                 -0.081712
round_time_test        0 days 00:00:02.517894
round_time_total       0 days 00:11:14.669468
loss_total                 4150132783311749.0
loss_critic                5187665894649627.0
loss_actor                     -322930204.208
memory_size                         14196.881 

=== epoch 3/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:17<00:00,  2.95it/s]
episodes                                  138
episode_length                      14.413043
returns                             -1.602708
return_std                           4.269437
average_reward                      -0.111373
round_time             0 days 00:11:17.476716
episodes_test                           141.0
episode_length_test                 14.184397
returns_test                        -0.600827
return_std_test                      4.202025
average_reward_test                 -0.042358
round_time_test        0 days 00:00:02.553657
round_time_total       0 days 00:11:17.477816
loss_total                 4368303209609953.5
loss_critic                5460378920106852.0
loss_actor                     -331610327.664
memory_size                           14208.0 

=== epoch 3/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:17<00:00,  2.95it/s]
episodes                                  136
episode_length                         14.625
returns                             -2.228256
return_std                           4.033058
average_reward                      -0.150037
round_time             0 days 00:11:17.846173
episodes_test                           137.0
episode_length_test                 14.554745
returns_test                        -2.702423
return_std_test                      3.953721
average_reward_test                 -0.184926
round_time_test        0 days 00:00:02.512288
round_time_total       0 days 00:11:17.847257
loss_total                 4595016437529051.0
loss_critic                5743770457320980.0
loss_actor                     -340014110.544
memory_size                         14219.633 

=== epoch 3/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:13<00:00,  2.97it/s]
episodes                                  118
episode_length                      16.779661
returns                             -2.053897
return_std                           4.495421
average_reward                       -0.12372
round_time             0 days 00:11:14.330432
episodes_test                           118.0
episode_length_test                 16.940678
returns_test                        -2.579853
return_std_test                      4.444419
average_reward_test                 -0.151838
round_time_test        0 days 00:00:02.507346
round_time_total       0 days 00:11:14.331507
loss_total                 4724950829608468.0
loss_critic                5906188441967657.0
loss_actor                     -345685984.192
memory_size                        14257.7075 

=== epoch 3/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:20<00:00,  2.94it/s]
episodes                                  113
episode_length                       17.40708
returns                             -1.604327
return_std                           4.711711
average_reward                      -0.094782
round_time             0 days 00:11:20.662695
episodes_test                           121.0
episode_length_test                 16.512397
returns_test                        -1.472852
return_std_test                      4.609292
average_reward_test                  -0.08847
round_time_test        0 days 00:00:02.490553
round_time_total       0 days 00:11:20.663788
loss_total                 4915581126942130.0
loss_critic                6144476304826696.0
loss_actor                     -351498184.528
memory_size                        14297.2105 

=== epoch 3/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:19<00:00,  2.94it/s]
episodes                                  134
episode_length                      14.738806
returns                             -3.118282
return_std                           4.359554
average_reward                      -0.206814
round_time             0 days 00:11:19.806225
episodes_test                           134.0
episode_length_test                  14.91791
returns_test                        -2.237346
return_std_test                      4.065334
average_reward_test                 -0.149365
round_time_test        0 days 00:00:02.492000
round_time_total       0 days 00:11:19.807311
loss_total                 5104339579347927.0
loss_critic                6380424369360863.0
loss_actor                     -354991120.544
memory_size                         14308.573 

=== epoch 3/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:18<00:00,  2.95it/s]
episodes                                  117
episode_length                      16.700855
returns                               -3.2528
return_std                           4.837046
average_reward                       -0.18276
round_time             0 days 00:11:19.372254
episodes_test                           108.0
episode_length_test                 18.444444
returns_test                        -2.468897
return_std_test                      4.999811
average_reward_test                 -0.129716
round_time_test        0 days 00:00:02.475056
round_time_total       0 days 00:11:19.373364
loss_total                 5208884415333138.0
loss_critic                6511105411574137.0
loss_actor                     -358306270.608
memory_size                         14323.006 

=== epoch 3/10 ===== round 39/50 =====================================
100%|██████████| 2000/2000 [11:21<00:00,  2.94it/s]
episodes                                  120
episode_length                         16.475
returns                             -1.683232
return_std                           5.653132
average_reward                      -0.102381
round_time             0 days 00:11:21.699394
episodes_test                           122.0
episode_length_test                 16.286885
returns_test                         -2.50568
return_std_test                      4.738106
average_reward_test                  -0.14981
round_time_test        0 days 00:00:02.494359
round_time_total       0 days 00:11:21.700489
loss_total                 5251213011057639.0
loss_critic                6564016152069014.0
loss_actor                     -361674932.016
memory_size                         14326.768 

=== epoch 3/10 ===== round 40/50 =====================================
100%|██████████| 2000/2000 [11:16<00:00,  2.96it/s]
episodes                                  107
episode_length                      18.654206
returns                             -1.513507
return_std                           6.431343
average_reward                       -0.08106
round_time             0 days 00:11:16.928678
episodes_test                           124.0
episode_length_test                 16.080645
returns_test                        -1.237992
return_std_test                      5.531794
average_reward_test                 -0.075503
round_time_test        0 days 00:00:02.487430
round_time_total       0 days 00:11:16.929764
loss_total                 5352281294623474.0
loss_critic                6690351502952759.0
loss_actor                     -364459342.304
memory_size                        14335.5625 

=== epoch 3/10 ===== round 41/50 =====================================
100%|██████████| 2000/2000 [11:18<00:00,  2.95it/s]
episodes                                  107
episode_length                      18.439252
returns                             -1.785271
return_std                            5.63086
average_reward                      -0.098525
round_time             0 days 00:11:19.372918
episodes_test                            90.0
episode_length_test                 22.166667
returns_test                        -2.027677
return_std_test                      5.339397
average_reward_test                 -0.089945
round_time_test        0 days 00:00:02.464144
round_time_total       0 days 00:11:19.374015
loss_total                 5565197406309450.0
loss_critic                6956496641134166.0
loss_actor                     -367278803.952
memory_size                        14394.4735 

=== epoch 3/10 ===== round 42/50 =====================================
100%|██████████| 2000/2000 [11:21<00:00,  2.93it/s]
episodes                                  134
episode_length                       14.80597
returns                             -2.922359
return_std                           4.504748
average_reward                         -0.193
round_time             0 days 00:11:22.265438
episodes_test                           114.0
episode_length_test                 17.464912
returns_test                        -2.006569
return_std_test                      5.391054
average_reward_test                 -0.115588
round_time_test        0 days 00:00:02.463518
round_time_total       0 days 00:11:22.266520
loss_total                 5761614896995238.0
loss_critic                7202018504592065.0
loss_actor                     -372237279.408
memory_size                        14418.1325 

=== epoch 3/10 ===== round 43/50 =====================================
100%|██████████| 2000/2000 [11:23<00:00,  2.93it/s]
episodes                                  136
episode_length                      14.610294
returns                             -2.198473
return_std                           4.723624
average_reward                      -0.152099
round_time             0 days 00:11:24.086271
episodes_test                           129.0
episode_length_test                 15.472868
returns_test                        -2.396864
return_std_test                      4.884964
average_reward_test                 -0.152812
round_time_test        0 days 00:00:02.477541
round_time_total       0 days 00:11:24.087363
loss_total                 5905925870148649.0
loss_critic                7382407215782560.0
loss_actor                     -377197456.944
memory_size                           14420.0 

=== epoch 3/10 ===== round 44/50 =====================================
100%|██████████| 2000/2000 [11:19<00:00,  2.94it/s]
episodes                                  114
episode_length                      17.412281
returns                             -1.577668
return_std                           6.580883
average_reward                      -0.094046
round_time             0 days 00:11:20.420561
episodes_test                           141.0
episode_length_test                 14.177305
returns_test                         -2.60299
return_std_test                      4.439868
average_reward_test                 -0.183026
round_time_test        0 days 00:00:02.533874
round_time_total       0 days 00:11:20.421657
loss_total                 6077731713042088.0
loss_critic                7597164512973685.0
loss_actor                     -382316401.648
memory_size                        14431.6035 

=== epoch 3/10 ===== round 45/50 =====================================
100%|██████████| 2000/2000 [11:17<00:00,  2.95it/s]
episodes                                  130
episode_length                      15.276923
returns                              -2.37255
return_std                           5.754172
average_reward                      -0.151198
round_time             0 days 00:11:18.443182
episodes_test                           115.0
episode_length_test                 17.234783
returns_test                        -0.276072
return_std_test                        6.5005
average_reward_test                 -0.018164
round_time_test        0 days 00:00:02.478913
round_time_total       0 days 00:11:18.444286
loss_total                 6392480369605607.0
loss_critic                7990600319736218.0
loss_actor                     -388937410.976
memory_size                        14472.2565 

=== epoch 3/10 ===== round 46/50 =====================================
100%|██████████| 2000/2000 [11:20<00:00,  2.94it/s]
episodes                                  140
episode_length                      14.214286
returns                             -2.264588
return_std                           4.604426
average_reward                      -0.160406
round_time             0 days 00:11:21.428285
episodes_test                           125.0
episode_length_test                    15.896
returns_test                        -0.931426
return_std_test                      4.998725
average_reward_test                 -0.058639
round_time_test        0 days 00:00:02.509174
round_time_total       0 days 00:11:21.429369
loss_total                 6636606248835875.0
loss_critic                8295757667868082.0
loss_actor                      -395734503.44
memory_size                        14480.6725 

=== epoch 3/10 ===== round 47/50 =====================================
100%|██████████| 2000/2000 [11:23<00:00,  2.93it/s]
episodes                                  133
episode_length                       14.93985
returns                             -2.424213
return_std                           4.846528
average_reward                      -0.157986
round_time             0 days 00:11:23.735978
episodes_test                           146.0
episode_length_test                 13.630137
returns_test                        -1.575831
return_std_test                      5.107543
average_reward_test                 -0.114385
round_time_test        0 days 00:00:02.566753
round_time_total       0 days 00:11:23.737097
loss_total                 6869346758846054.0
loss_critic                8586683307729617.0
loss_actor                     -402337928.288
memory_size                         14488.253 

=== epoch 3/10 ===== round 48/50 =====================================
100%|██████████| 2000/2000 [11:18<00:00,  2.95it/s]
episodes                                  153
episode_length                      12.882353
returns                              -3.62025
return_std                           4.116092
average_reward                      -0.280351
round_time             0 days 00:11:18.639047
episodes_test                           138.0
episode_length_test                 14.442029
returns_test                        -1.900014
return_std_test                      4.696392
average_reward_test                 -0.130753
round_time_test        0 days 00:00:02.511225
round_time_total       0 days 00:11:18.640118
loss_total                 7145977831242072.0
loss_critic                8932472144852419.0
loss_actor                     -407641614.848
memory_size                           14494.0 

=== epoch 3/10 ===== round 49/50 =====================================
100%|██████████| 2000/2000 [11:27<00:00,  2.91it/s]
episodes                                  158
episode_length                      12.626582
returns                             -2.974274
return_std                           3.622373
average_reward                      -0.234534
round_time             0 days 00:11:28.479056
episodes_test                           155.0
episode_length_test                 12.864516
returns_test                        -3.521139
return_std_test                      3.345584
average_reward_test                  -0.27253
round_time_test        0 days 00:00:02.510863
round_time_total       0 days 00:11:28.480138
loss_total                 7356133429403976.0
loss_critic                9195166636045238.0
loss_actor                     -416745227.472
memory_size                           14494.0 

=== epoch 3/10 ===== round 50/50 =====================================
100%|██████████| 2000/2000 [11:16<00:00,  2.95it/s]
episodes                                  155
episode_length                      12.864516
returns                             -2.248859
return_std                           3.917759
average_reward                      -0.175068
round_time             0 days 00:11:17.534546
episodes_test                           169.0
episode_length_test                 11.786982
returns_test                        -2.383275
return_std_test                      3.419736
average_reward_test                 -0.198976
round_time_test        0 days 00:00:02.580336
round_time_total       0 days 00:11:17.535634
loss_total                 7680184005475434.0
loss_critic                9600229836119344.0
loss_actor                     -426006142.096
memory_size                           14494.0 


=== epoch 4/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                  167
episode_length                      11.946108
returns                             -2.875985
return_std                           3.696581
average_reward                      -0.238353
round_time             0 days 00:11:02.716035
episodes_test                           155.0
episode_length_test                 12.858065
returns_test                        -2.859073
return_std_test                      4.201811
average_reward_test                 -0.217959
round_time_test        0 days 00:00:02.531392
round_time_total       0 days 00:11:02.717155
loss_total                 7897654662214451.0
loss_critic                9872068158737612.0
loss_actor                      -435847918.16
memory_size                           14494.0 

=== epoch 4/10 ===== round 2/50 ======================================
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                  181
episode_length                      10.966851
returns                             -2.720332
return_std                           2.810127
average_reward                      -0.245905
round_time             0 days 00:11:04.051450
episodes_test                           169.0
episode_length_test                 11.822485
returns_test                        -2.565645
return_std_test                      3.390491
average_reward_test                 -0.216232
round_time_test        0 days 00:00:02.539896
round_time_total       0 days 00:11:04.052548
loss_total                 8281091369573810.0
loss_critic               10351364040520892.0
loss_actor                      -442592524.48
memory_size                           14494.0 

=== epoch 4/10 ===== round 3/50 ======================================
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                  177
episode_length                      11.242938
returns                               -2.5079
return_std                           3.006593
average_reward                      -0.223251
round_time             0 days 00:11:03.786685
episodes_test                           175.0
episode_length_test                 11.405714
returns_test                        -2.065233
return_std_test                      2.874372
average_reward_test                 -0.180473
round_time_test        0 days 00:00:02.533216
round_time_total       0 days 00:11:03.787776
loss_total                 8634030560458572.0
loss_critic               10792538018339094.0
loss_actor                      -452716522.96
memory_size                           14494.0 

=== epoch 4/10 ===== round 4/50 ======================================
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  176
episode_length                      11.244318
returns                             -2.651741
return_std                           3.226785
average_reward                      -0.235915
round_time             0 days 00:11:04.628250
episodes_test                           185.0
episode_length_test                 10.778378
returns_test                        -2.735978
return_std_test                      3.302946
average_reward_test                 -0.250836
round_time_test        0 days 00:00:02.568035
round_time_total       0 days 00:11:04.629353
loss_total                 9173243876956176.0
loss_critic               11466554641966170.0
loss_actor                     -461142160.704
memory_size                           14494.0 

=== epoch 4/10 ===== round 5/50 ======================================
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                  171
episode_length                      11.637427
returns                             -2.253593
return_std                           3.027635
average_reward                      -0.191107
round_time             0 days 00:11:02.686810
episodes_test                           177.0
episode_length_test                 11.276836
returns_test                        -1.920526
return_std_test                      3.049928
average_reward_test                 -0.167628
round_time_test        0 days 00:00:02.633937
round_time_total       0 days 00:11:02.687905
loss_total                 9432677110696116.0
loss_critic               11790846173420454.0
loss_actor                     -469384280.976
memory_size                           14494.0 

=== epoch 4/10 ===== round 6/50 ======================================
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                  176
episode_length                      11.289773
returns                             -2.613295
return_std                           2.973674
average_reward                      -0.233641
round_time             0 days 00:11:03.372454
episodes_test                           167.0
episode_length_test                 11.922156
returns_test                        -2.624379
return_std_test                      3.374711
average_reward_test                 -0.219295
round_time_test        0 days 00:00:02.547638
round_time_total       0 days 00:11:03.373600
loss_total                 9759658565672370.0
loss_critic               12199573004623020.0
loss_actor                     -478898448.016
memory_size                           14494.0 

=== epoch 4/10 ===== round 7/50 ======================================
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                  171
episode_length                      11.654971
returns                             -2.060199
return_std                           2.980085
average_reward                      -0.178839
round_time             0 days 00:11:04.318130
episodes_test                           179.0
episode_length_test                 11.117318
returns_test                        -3.203508
return_std_test                      2.865078
average_reward_test                 -0.286388
round_time_test        0 days 00:00:02.567136
round_time_total       0 days 00:11:04.319219
loss_total                10344599622185386.0
loss_critic               12930749306943570.0
loss_actor                     -490710350.624
memory_size                           14494.0 

=== epoch 4/10 ===== round 8/50 ======================================
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  171
episode_length                      11.672515
returns                             -2.718845
return_std                           3.539496
average_reward                      -0.232859
round_time             0 days 00:11:07.237411
episodes_test                           170.0
episode_length_test                 11.694118
returns_test                        -2.346436
return_std_test                      3.343249
average_reward_test                 -0.197313
round_time_test        0 days 00:00:02.585499
round_time_total       0 days 00:11:07.238506
loss_total                10992221818902806.0
loss_critic               13740277036432228.0
loss_actor                     -502808765.248
memory_size                           14494.0 

=== epoch 4/10 ===== round 9/50 ======================================
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  166
episode_length                           12.0
returns                             -1.514478
return_std                            4.02758
average_reward                      -0.125357
round_time             0 days 00:11:09.333084
episodes_test                           174.0
episode_length_test                 11.408046
returns_test                        -2.174547
return_std_test                      3.147755
average_reward_test                 -0.189713
round_time_test        0 days 00:00:02.646898
round_time_total       0 days 00:11:09.334184
loss_total                11569540759761716.0
loss_critic               14461925698312340.0
loss_actor                      -516245655.44
memory_size                          14494.45 

=== epoch 4/10 ===== round 10/50 =====================================
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  179
episode_length                      11.111732
returns                             -2.588555
return_std                           2.715109
average_reward                      -0.231798
round_time             0 days 00:11:04.565141
episodes_test                           178.0
episode_length_test                 11.235955
returns_test                        -2.025362
return_std_test                      2.941214
average_reward_test                 -0.180257
round_time_test        0 days 00:00:02.588034
round_time_total       0 days 00:11:04.566246
loss_total                11987331893285618.0
loss_critic               14984164610217608.0
loss_actor                     -531518158.112
memory_size                           14495.0 

=== epoch 4/10 ===== round 11/50 =====================================
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                  186
episode_length                      10.715054
returns                             -2.311625
return_std                           2.916958
average_reward                      -0.215809
round_time             0 days 00:11:02.248868
episodes_test                           173.0
episode_length_test                 11.543353
returns_test                        -2.380704
return_std_test                      3.214911
average_reward_test                  -0.20542
round_time_test        0 days 00:00:02.540841
round_time_total       0 days 00:11:02.249956
loss_total                12732246969607520.0
loss_critic               15915308450586820.0
loss_actor                     -544599307.664
memory_size                           14495.0 

=== epoch 4/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:03<00:00,  3.02it/s]
episodes                                  190
episode_length                      10.442105
returns                             -2.574467
return_std                           2.343257
average_reward                      -0.245118
round_time             0 days 00:11:03.461377
episodes_test                           185.0
episode_length_test                 10.794595
returns_test                        -2.563117
return_std_test                      2.833111
average_reward_test                 -0.235395
round_time_test        0 days 00:00:02.589801
round_time_total       0 days 00:11:03.462458
loss_total                13695994656863552.0
loss_critic               17119993033652174.0
loss_actor                     -558754320.896
memory_size                           14495.0 

=== epoch 4/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [10:59<00:00,  3.03it/s]
episodes                                  187
episode_length                      10.625668
returns                             -2.265709
return_std                           2.487497
average_reward                      -0.212955
round_time             0 days 00:10:59.623095
episodes_test                           180.0
episode_length_test                 11.038889
returns_test                         -2.56018
return_std_test                        2.7042
average_reward_test                 -0.230039
round_time_test        0 days 00:00:02.589848
round_time_total       0 days 00:10:59.624196
loss_total                14251600325864588.0
loss_critic               17814500099334602.0
loss_actor                     -572844446.928
memory_size                           14495.0 

=== epoch 4/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
episodes                                  178
episode_length                      11.196629
returns                             -2.908658
return_std                           3.039983
average_reward                      -0.260616
round_time             0 days 00:11:06.255560
episodes_test                           178.0
episode_length_test                 11.196629
returns_test                        -2.624104
return_std_test                      3.209497
average_reward_test                 -0.233884
round_time_test        0 days 00:00:02.575438
round_time_total       0 days 00:11:06.256655
loss_total                14957669849516474.0
loss_critic               18697086986417604.0
loss_actor                     -586265138.288
memory_size                           14495.0 

=== epoch 4/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                  180
episode_length                      11.022222
returns                             -2.613141
return_std                           3.034109
average_reward                      -0.238665
round_time             0 days 00:11:01.944249
episodes_test                           178.0
episode_length_test                 11.191011
returns_test                        -2.831257
return_std_test                       3.49289
average_reward_test                 -0.250195
round_time_test        0 days 00:00:02.558552
round_time_total       0 days 00:11:01.945339
loss_total                15831795784707212.0
loss_critic               19789744403661192.0
loss_actor                     -600611241.952
memory_size                           14495.0 

=== epoch 4/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                  179
episode_length                      11.094972
returns                             -2.061422
return_std                           3.202446
average_reward                       -0.18785
round_time             0 days 00:11:02.994060
episodes_test                           178.0
episode_length_test                 11.235955
returns_test                        -2.792855
return_std_test                      3.556027
average_reward_test                 -0.248564
round_time_test        0 days 00:00:02.593962
round_time_total       0 days 00:11:02.995150
loss_total                16885431735070752.0
loss_critic               21106789291955060.0
loss_actor                     -616167997.504
memory_size                           14495.0 

=== epoch 4/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:03<00:00,  3.02it/s]
episodes                                  174
episode_length                      11.402299
returns                             -2.416052
return_std                            2.90134
average_reward                      -0.206406
round_time             0 days 00:11:03.816090
episodes_test                           172.0
episode_length_test                 11.616279
returns_test                        -2.267887
return_std_test                      2.852023
average_reward_test                 -0.194293
round_time_test        0 days 00:00:02.549256
round_time_total       0 days 00:11:03.817186
loss_total                17845106048633930.0
loss_critic               22306382185385428.0
loss_actor                     -633414675.744
memory_size                           14495.0 

=== epoch 4/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                  184
episode_length                      10.804348
returns                             -2.246024
return_std                           2.852365
average_reward                      -0.207539
round_time             0 days 00:11:12.228921
episodes_test                           169.0
episode_length_test                 11.786982
returns_test                        -2.107254
return_std_test                      3.429878
average_reward_test                 -0.181819
round_time_test        0 days 00:00:02.513186
round_time_total       0 days 00:11:12.230008
loss_total                18947672439552736.0
loss_critic               23684590145747552.0
loss_actor                      -649530998.24
memory_size                           14495.0 

=== epoch 4/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                  183
episode_length                      10.852459
returns                             -2.259887
return_std                           2.552143
average_reward                      -0.206649
round_time             0 days 00:11:03.183485
episodes_test                           183.0
episode_length_test                  10.89071
returns_test                        -2.275915
return_std_test                       2.65649
average_reward_test                 -0.208268
round_time_test        0 days 00:00:02.582339
round_time_total       0 days 00:11:03.184586
loss_total                19502461759556420.0
loss_critic               24378076770150120.0
loss_actor                     -665126242.592
memory_size                           14495.0 

=== epoch 4/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  193
episode_length                      10.295337
returns                             -2.190897
return_std                           2.602716
average_reward                      -0.213075
round_time             0 days 00:11:08.258769
episodes_test                           198.0
episode_length_test                 10.080808
returns_test                        -2.579852
return_std_test                      2.282395
average_reward_test                 -0.254879
round_time_test        0 days 00:00:02.572612
round_time_total       0 days 00:11:08.259854
loss_total                20556132474727432.0
loss_critic               25695165176394808.0
loss_actor                     -676112096.256
memory_size                           14495.0 

=== epoch 4/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  186
episode_length                      10.731183
returns                             -2.803068
return_std                           2.812427
average_reward                      -0.261001
round_time             0 days 00:11:11.169726
episodes_test                           184.0
episode_length_test                 10.858696
returns_test                        -2.156359
return_std_test                      3.060747
average_reward_test                 -0.197893
round_time_test        0 days 00:00:02.581749
round_time_total       0 days 00:11:11.170831
loss_total                21394733789983276.0
loss_critic               26743416766509088.0
loss_actor                      -692014260.48
memory_size                           14495.0 

=== epoch 4/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  178
episode_length                      11.162921
returns                             -2.709546
return_std                             3.1847
average_reward                      -0.244322
round_time             0 days 00:11:05.369326
episodes_test                           177.0
episode_length_test                 11.237288
returns_test                        -2.230389
return_std_test                      2.653883
average_reward_test                 -0.193176
round_time_test        0 days 00:00:02.541455
round_time_total       0 days 00:11:05.370410
loss_total                22724754782721932.0
loss_critic               28405942981058624.0
loss_actor                     -709687350.624
memory_size                           14495.0 

=== epoch 4/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  187
episode_length                      10.641711
returns                             -2.614731
return_std                           2.461559
average_reward                      -0.245043
round_time             0 days 00:11:07.305615
episodes_test                           173.0
episode_length_test                 11.549133
returns_test                        -2.966093
return_std_test                      2.942383
average_reward_test                 -0.255667
round_time_test        0 days 00:00:02.536602
round_time_total       0 days 00:11:07.306711
loss_total                23234805278057692.0
loss_critic               29043506132037928.0
loss_actor                     -721000043.616
memory_size                           14495.0 

=== epoch 4/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                  186
episode_length                      10.731183
returns                             -2.555653
return_std                           2.636759
average_reward                      -0.238893
round_time             0 days 00:11:03.043428
episodes_test                           183.0
episode_length_test                 10.896175
returns_test                        -2.393331
return_std_test                      2.581912
average_reward_test                 -0.217026
round_time_test        0 days 00:00:02.616660
round_time_total       0 days 00:11:03.044510
loss_total                24392167119034252.0
loss_critic               30490208376484528.0
loss_actor                     -734444007.296
memory_size                           14495.0 

=== epoch 4/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                  172
episode_length                       11.52907
returns                              -1.96634
return_std                           2.992367
average_reward                      -0.165199
round_time             0 days 00:11:10.378023
episodes_test                           160.0
episode_length_test                   12.4125
returns_test                        -1.252652
return_std_test                      4.054424
average_reward_test                 -0.100513
round_time_test        0 days 00:00:02.542945
round_time_total       0 days 00:11:10.379103
loss_total                25743049752550836.0
loss_critic               32178811621605376.0
loss_actor                     -753769099.328
memory_size                           14495.0 

=== epoch 4/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
episodes                                  164
episode_length                      12.109756
returns                             -2.184834
return_std                           3.503095
average_reward                      -0.183738
round_time             0 days 00:11:06.222342
episodes_test                           168.0
episode_length_test                  11.89881
returns_test                        -1.744522
return_std_test                       3.20975
average_reward_test                 -0.146037
round_time_test        0 days 00:00:02.555999
round_time_total       0 days 00:11:06.223408
loss_total                27450295114509844.0
loss_critic               34312868339086524.0
loss_actor                     -776475455.584
memory_size                           14495.0 

=== epoch 4/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  178
episode_length                      11.213483
returns                             -1.834445
return_std                           2.623185
average_reward                      -0.161576
round_time             0 days 00:11:07.379839
episodes_test                           180.0
episode_length_test                 11.111111
returns_test                        -2.001902
return_std_test                      2.836702
average_reward_test                 -0.180171
round_time_test        0 days 00:00:02.568572
round_time_total       0 days 00:11:07.380933
loss_total                28870619313454712.0
loss_critic               36088273511867480.0
loss_actor                     -795950244.608
memory_size                           14495.0 

=== epoch 4/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  173
episode_length                      11.473988
returns                             -2.307237
return_std                           3.480023
average_reward                      -0.200696
round_time             0 days 00:11:07.681681
episodes_test                           172.0
episode_length_test                 11.616279
returns_test                        -2.355691
return_std_test                      3.071047
average_reward_test                 -0.202069
round_time_test        0 days 00:00:02.560178
round_time_total       0 days 00:11:07.682765
loss_total                30209244758043264.0
loss_critic               37761555290356976.0
loss_actor                     -810978269.088
memory_size                           14495.0 

=== epoch 4/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                  192
episode_length                      10.369792
returns                             -2.218169
return_std                            2.70857
average_reward                      -0.214019
round_time             0 days 00:11:05.766672
episodes_test                           183.0
episode_length_test                 10.912568
returns_test                         -2.92596
return_std_test                      2.568079
average_reward_test                 -0.267213
round_time_test        0 days 00:00:02.530483
round_time_total       0 days 00:11:05.767769
loss_total                31711118711412752.0
loss_critic               39638897697440664.0
loss_actor                     -828471597.824
memory_size                        14496.7235 

=== epoch 4/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                  185
episode_length                      10.778378
returns                             -2.331308
return_std                           2.794123
average_reward                       -0.21784
round_time             0 days 00:11:02.908528
episodes_test                           192.0
episode_length_test                 10.380208
returns_test                        -2.370232
return_std_test                      2.253149
average_reward_test                 -0.227534
round_time_test        0 days 00:00:02.569376
round_time_total       0 days 00:11:02.909631
loss_total                33457869548706332.0
loss_critic               41822336182585920.0
loss_actor                     -845500288.672
memory_size                           14497.0 

=== epoch 4/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
episodes                                  179
episode_length                      11.089385
returns                             -2.212983
return_std                           2.966547
average_reward                      -0.201837
round_time             0 days 00:11:06.217219
episodes_test                           190.0
episode_length_test                 10.505263
returns_test                        -2.326723
return_std_test                      2.599489
average_reward_test                 -0.219256
round_time_test        0 days 00:00:02.561331
round_time_total       0 days 00:11:06.218300
loss_total                35108670613401632.0
loss_critic               43885837542244744.0
loss_actor                     -866553392.224
memory_size                           14497.0 

=== epoch 4/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                  193
episode_length                      10.316062
returns                             -2.340291
return_std                           2.596899
average_reward                      -0.222939
round_time             0 days 00:11:04.393807
episodes_test                           191.0
episode_length_test                 10.460733
returns_test                        -2.089517
return_std_test                      2.422073
average_reward_test                 -0.199186
round_time_test        0 days 00:00:02.588987
round_time_total       0 days 00:11:04.394887
loss_total                37180162139091896.0
loss_critic               46475201863189792.0
loss_actor                     -889931653.376
memory_size                           14497.0 

=== epoch 4/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                  178
episode_length                       11.11236
returns                             -2.312243
return_std                            2.98197
average_reward                      -0.205179
round_time             0 days 00:11:05.762253
episodes_test                           197.0
episode_length_test                 10.126904
returns_test                        -2.387297
return_std_test                      2.378681
average_reward_test                 -0.236291
round_time_test        0 days 00:00:02.585472
round_time_total       0 days 00:11:05.763329
loss_total                38866716810171912.0
loss_critic               48583395191168176.0
loss_actor                      -913360477.76
memory_size                           14497.0 

=== epoch 4/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                  184
episode_length                      10.847826
returns                              -2.38265
return_std                           2.569302
average_reward                      -0.219797
round_time             0 days 00:11:00.545527
episodes_test                           191.0
episode_length_test                 10.455497
returns_test                        -2.517609
return_std_test                      2.708707
average_reward_test                 -0.240201
round_time_test        0 days 00:00:02.607565
round_time_total       0 days 00:11:00.546603
loss_total                40444354210518008.0
loss_critic               50555441816107224.0
loss_actor                     -929596648.192
memory_size                           14497.0 

=== epoch 4/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
episodes                                  184
episode_length                      10.766304
returns                             -2.819188
return_std                            2.51191
average_reward                      -0.262942
round_time             0 days 00:11:06.167820
episodes_test                           183.0
episode_length_test                 10.923497
returns_test                        -2.685975
return_std_test                      2.742809
average_reward_test                 -0.245414
round_time_test        0 days 00:00:02.598646
round_time_total       0 days 00:11:06.168899
loss_total                42252762658252000.0
loss_critic               52815952396310024.0
loss_actor                     -948968442.816
memory_size                           14497.0 

=== epoch 4/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                  191
episode_length                      10.450262
returns                             -2.735699
return_std                           2.558236
average_reward                      -0.261301
round_time             0 days 00:11:02.699896
episodes_test                           195.0
episode_length_test                 10.194872
returns_test                        -3.014935
return_std_test                      2.018477
average_reward_test                 -0.292666
round_time_test        0 days 00:00:02.573721
round_time_total       0 days 00:11:02.700968
loss_total                43971616408426512.0
loss_critic               54964519567385168.0
loss_actor                     -972852093.536
memory_size                           14497.0 

=== epoch 4/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                  184
episode_length                      10.836957
returns                             -2.819464
return_std                           3.000129
average_reward                       -0.26062
round_time             0 days 00:11:03.321500
episodes_test                           193.0
episode_length_test                 10.310881
returns_test                        -3.229419
return_std_test                      2.360513
average_reward_test                 -0.310851
round_time_test        0 days 00:00:02.592319
round_time_total       0 days 00:11:03.322578
loss_total                46055629044758608.0
loss_critic               57569535295154552.0
loss_actor                    -1000469835.072
memory_size                           14497.0 

=== epoch 4/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [24:26<00:00,  1.36it/s]
episodes                                  185
episode_length                           10.8
returns                             -2.694984
return_std                            3.24299
average_reward                      -0.248731
round_time             0 days 00:24:27.031840
episodes_test                           181.0
episode_length_test                 11.049724
returns_test                        -2.995434
return_std_test                      2.655253
average_reward_test                 -0.271087
round_time_test        0 days 00:00:02.565817
round_time_total       0 days 00:24:27.033746
loss_total                48121709488255272.0
loss_critic               60152135814091896.0
loss_actor                    -1019901457.728
memory_size                           14497.0 

=== epoch 4/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:50<00:00,  1.87it/s]
episodes                                  188
episode_length                      10.537234
returns                             -2.399813
return_std                           2.481116
average_reward                      -0.229663
round_time             0 days 00:17:53.031755
episodes_test                           185.0
episode_length_test                 10.745946
returns_test                        -2.162517
return_std_test                       2.67212
average_reward_test                 -0.199756
round_time_test        0 days 00:00:08.380791
round_time_total       0 days 00:17:53.032841
loss_total                51605473654864872.0
loss_critic               64506840963029664.0
loss_actor                    -1043857571.488
memory_size                           14497.0 

=== epoch 4/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:31<00:00,  2.89it/s]
episodes                                  179
episode_length                      11.156425
returns                             -2.524676
return_std                           3.202483
average_reward                      -0.225976
round_time             0 days 00:11:31.887206
episodes_test                           180.0
episode_length_test                 11.105556
returns_test                         -2.33308
return_std_test                      3.790336
average_reward_test                  -0.20957
round_time_test        0 days 00:00:02.573721
round_time_total       0 days 00:11:31.888283
loss_total                53597833661564056.0
loss_critic               66997290911542544.0
loss_actor                    -1074134963.744
memory_size                           14497.0 

=== epoch 4/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  172
episode_length                      11.593023
returns                             -2.182872
return_std                           3.637225
average_reward                      -0.190214
round_time             0 days 00:11:07.761756
episodes_test                           164.0
episode_length_test                 12.164634
returns_test                        -1.543966
return_std_test                      3.710113
average_reward_test                 -0.124004
round_time_test        0 days 00:00:02.540591
round_time_total       0 days 00:11:07.762841
loss_total                55814676188281960.0
loss_critic               69768344063631688.0
loss_actor                    -1092726524.864
memory_size                        14497.9085 

=== epoch 4/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
episodes                                  192
episode_length                       10.40625
returns                             -2.849156
return_std                           2.501462
average_reward                      -0.273236
round_time             0 days 00:11:06.478981
episodes_test                           172.0
episode_length_test                 11.581395
returns_test                        -2.697726
return_std_test                      2.880285
average_reward_test                 -0.228571
round_time_test        0 days 00:00:02.522550
round_time_total       0 days 00:11:06.480049
loss_total                57558037512547464.0
loss_critic               71947545678966688.0
loss_actor                    -1106324234.784
memory_size                           14498.0 

=== epoch 4/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
episodes                                  186
episode_length                      10.704301
returns                             -2.544804
return_std                           2.538101
average_reward                      -0.237953
round_time             0 days 00:11:06.454502
episodes_test                           185.0
episode_length_test                 10.751351
returns_test                        -2.312031
return_std_test                      2.835417
average_reward_test                 -0.215663
round_time_test        0 days 00:00:02.558628
round_time_total       0 days 00:11:06.455576
loss_total                61311430703347400.0
loss_critic               76639287045596912.0
loss_actor                    -1136910429.792
memory_size                           14498.0 

=== epoch 4/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  182
episode_length                      10.923077
returns                             -2.775778
return_std                           3.029386
average_reward                      -0.255072
round_time             0 days 00:11:06.910885
episodes_test                           185.0
episode_length_test                 10.767568
returns_test                        -2.614218
return_std_test                      2.344116
average_reward_test                 -0.241204
round_time_test        0 days 00:00:02.567277
round_time_total       0 days 00:11:06.911970
loss_total                65594681784904184.0
loss_critic               81993350780236592.0
loss_actor                    -1167552103.392
memory_size                         14500.847 

=== epoch 4/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  173
episode_length                      11.549133
returns                             -2.331388
return_std                           3.206799
average_reward                      -0.203133
round_time             0 days 00:11:04.794767
episodes_test                           175.0
episode_length_test                 11.388571
returns_test                        -2.669777
return_std_test                      2.924694
average_reward_test                 -0.230893
round_time_test        0 days 00:00:02.548140
round_time_total       0 days 00:11:04.795834
loss_total                69037869898672376.0
loss_critic               86297335874396880.0
loss_actor                     -1198469684.48
memory_size                        14501.7035 

=== epoch 4/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  170
episode_length                      11.747059
returns                              -2.11448
return_std                           6.061848
average_reward                      -0.178296
round_time             0 days 00:11:04.920397
episodes_test                           176.0
episode_length_test                 11.318182
returns_test                        -2.673081
return_std_test                      2.677254
average_reward_test                 -0.232234
round_time_test        0 days 00:00:02.523901
round_time_total       0 days 00:11:04.921476
loss_total                72217037496131776.0
loss_critic               90271295255257024.0
loss_actor                    -1232374115.008
memory_size                         14518.478 

=== epoch 4/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  171
episode_length                      11.584795
returns                             -1.999444
return_std                           3.694751
average_reward                      -0.171262
round_time             0 days 00:11:04.856066
episodes_test                           186.0
episode_length_test                 10.731183
returns_test                        -2.999611
return_std_test                      2.530064
average_reward_test                 -0.278353
round_time_test        0 days 00:00:02.586797
round_time_total       0 days 00:11:04.857156
loss_total                76313263508450960.0
loss_critic               95391577778172192.0
loss_actor                    -1259163499.328
memory_size                        14529.0075 

=== epoch 4/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  179
episode_length                      11.111732
returns                             -2.005345
return_std                            2.90951
average_reward                       -0.18074
round_time             0 days 00:11:07.858193
episodes_test                           193.0
episode_length_test                 10.352332
returns_test                        -2.229908
return_std_test                      2.664602
average_reward_test                 -0.214554
round_time_test        0 days 00:00:02.584473
round_time_total       0 days 00:11:07.859283
loss_total                78411176612089296.0
loss_critic               98013969163894128.0
loss_actor                      -1283732780.8
memory_size                           14530.0 

=== epoch 4/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                  186
episode_length                       10.66129
returns                             -2.282554
return_std                           2.641853
average_reward                      -0.213033
round_time             0 days 00:11:09.475047
episodes_test                           182.0
episode_length_test                 10.978022
returns_test                        -2.510624
return_std_test                      2.764738
average_reward_test                 -0.228374
round_time_test        0 days 00:00:02.567024
round_time_total       0 days 00:11:09.476157
loss_total                79511313244579952.0
loss_critic               99389139817873808.0
loss_actor                    -1306957091.328
memory_size                           14530.0 

=== epoch 4/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
episodes                                  189
episode_length                      10.534392
returns                             -2.754681
return_std                            2.46518
average_reward                      -0.261688
round_time             0 days 00:11:03.395563
episodes_test                           190.0
episode_length_test                 10.510526
returns_test                        -2.014281
return_std_test                      2.669265
average_reward_test                 -0.190802
round_time_test        0 days 00:00:02.540272
round_time_total       0 days 00:11:03.396628
loss_total                82239623541416784.0
loss_critic              102799527780724768.0
loss_actor                    -1331692084.864
memory_size                           14530.0 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 5/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  190
episode_length                      10.515789
returns                             -2.700403
return_std                           2.522438
average_reward                      -0.255713
round_time             0 days 00:11:10.841982
episodes_test                           189.0
episode_length_test                 10.566138
returns_test                        -2.859279
return_std_test                      2.392877
average_reward_test                 -0.269013
round_time_test        0 days 00:00:02.560058
round_time_total       0 days 00:11:10.843099
loss_total                86816730506487392.0
loss_critic              108520911159840208.0
loss_actor                    -1365165808.512
memory_size                           14530.0 

=== epoch 5/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                  181
episode_length                      10.966851
returns                             -2.410113
return_std                           3.136236
average_reward                      -0.220764
round_time             0 days 00:11:05.938753
episodes_test                           183.0
episode_length_test                 10.928962
returns_test                        -2.722396
return_std_test                      2.914288
average_reward_test                 -0.249099
round_time_test        0 days 00:00:02.593977
round_time_total       0 days 00:11:05.939848
loss_total                92109539182402080.0
loss_critic              115136922080163920.0
loss_actor                     -1394764821.12
memory_size                           14530.0 

=== epoch 5/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  187
episode_length                      10.679144
returns                             -2.656264
return_std                           2.933565
average_reward                      -0.250606
round_time             0 days 00:11:06.548914
episodes_test                           172.0
episode_length_test                 11.593023
returns_test                        -2.392915
return_std_test                      3.350225
average_reward_test                 -0.206829
round_time_test        0 days 00:00:02.563803
round_time_total       0 days 00:11:06.549994
loss_total                97340261415256064.0
loss_critic              121675324611117456.0
loss_actor                    -1422437806.592
memory_size                           14530.0 

=== epoch 5/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                  183
episode_length                      10.907104
returns                             -1.907118
return_std                           3.653602
average_reward                      -0.172961
round_time             0 days 00:11:03.336316
episodes_test                           176.0
episode_length_test                 11.357955
returns_test                        -2.266073
return_std_test                      3.117269
average_reward_test                 -0.198958
round_time_test        0 days 00:00:02.539924
round_time_total       0 days 00:11:03.337411
loss_total                98784547980043088.0
loss_critic              123480683016280336.0
loss_actor                    -1441282908.096
memory_size                         14532.519 

=== epoch 5/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                  179
episode_length                      11.139665
returns                             -2.654667
return_std                           2.728952
average_reward                      -0.238779
round_time             0 days 00:11:02.237527
episodes_test                           187.0
episode_length_test                 10.657754
returns_test                        -1.670641
return_std_test                      2.548432
average_reward_test                 -0.154134
round_time_test        0 days 00:00:02.559160
round_time_total       0 days 00:11:02.238621
loss_total               101425919410361600.0
loss_critic              126782397047285744.0
loss_actor                     -1465173415.68
memory_size                           14533.0 

=== epoch 5/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  178
episode_length                      11.224719
returns                               -2.2146
return_std                           2.825091
average_reward                        -0.1964
round_time             0 days 00:11:04.486907
episodes_test                           184.0
episode_length_test                 10.853261
returns_test                        -2.399605
return_std_test                      2.754834
average_reward_test                 -0.219889
round_time_test        0 days 00:00:02.575047
round_time_total       0 days 00:11:04.488066
loss_total               107925593922706864.0
loss_critic              134906990342067712.0
loss_actor                    -1492002639.616
memory_size                           14533.0 

=== epoch 5/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                  183
episode_length                      10.825137
returns                              -2.50152
return_std                           3.242538
average_reward                       -0.23507
round_time             0 days 00:11:03.432735
episodes_test                           183.0
episode_length_test                 10.874317
returns_test                        -2.205269
return_std_test                        3.6556
average_reward_test                 -0.200171
round_time_test        0 days 00:00:02.553219
round_time_total       0 days 00:11:03.433818
loss_total               111637898329617072.0
loss_critic              139547370393828320.0
loss_actor                    -1526581881.472
memory_size                           14533.0 

=== epoch 5/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  184
episode_length                      10.820652
returns                             -2.426005
return_std                           2.699786
average_reward                      -0.223554
round_time             0 days 00:11:07.762939
episodes_test                           177.0
episode_length_test                 11.299435
returns_test                        -2.617109
return_std_test                      3.893427
average_reward_test                 -0.231614
round_time_test        0 days 00:00:02.577991
round_time_total       0 days 00:11:07.764019
loss_total               117598676018584880.0
loss_critic              146998342468530880.0
loss_actor                    -1561935916.096
memory_size                           14533.0 

=== epoch 5/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  173
episode_length                      11.526012
returns                             -2.006262
return_std                           3.245584
average_reward                       -0.17424
round_time             0 days 00:11:05.400386
episodes_test                           170.0
episode_length_test                 11.723529
returns_test                        -2.189253
return_std_test                      3.766717
average_reward_test                 -0.185421
round_time_test        0 days 00:00:02.541398
round_time_total       0 days 00:11:05.401486
loss_total               122239184349207456.0
loss_critic              152798977787856672.0
loss_actor                    -1601031993.408
memory_size                        14533.2745 

=== epoch 5/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                  174
episode_length                       11.37931
returns                             -1.524406
return_std                           4.845583
average_reward                      -0.130761
round_time             0 days 00:11:04.113223
episodes_test                           160.0
episode_length_test                  12.44375
returns_test                        -1.772358
return_std_test                       4.28876
average_reward_test                 -0.142039
round_time_test        0 days 00:00:02.551874
round_time_total       0 days 00:11:04.114317
loss_total               127684792156034176.0
loss_critic              159605987323320192.0
loss_actor                    -1631469633.472
memory_size                        14558.8345 

=== epoch 5/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                  176
episode_length                        11.3125
returns                             -2.156394
return_std                           3.253663
average_reward                      -0.193041
round_time             0 days 00:11:05.694799
episodes_test                           191.0
episode_length_test                 10.465969
returns_test                        -2.559073
return_std_test                      2.565831
average_reward_test                    -0.244
round_time_test        0 days 00:00:02.556578
round_time_total       0 days 00:11:05.695893
loss_total               133479001926186048.0
loss_critic              166848749451989760.0
loss_actor                    -1667963072.256
memory_size                           14561.0 

=== epoch 5/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                  172
episode_length                      11.563953
returns                             -1.938183
return_std                           4.660211
average_reward                      -0.165684
round_time             0 days 00:11:04.363749
episodes_test                           188.0
episode_length_test                 10.595745
returns_test                        -2.192101
return_std_test                      3.760175
average_reward_test                 -0.206704
round_time_test        0 days 00:00:02.577687
round_time_total       0 days 00:11:04.364850
loss_total               137817349735126656.0
loss_critic              172271684344430464.0
loss_actor                    -1705676426.944
memory_size                        14582.3585 

=== epoch 5/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  181
episode_length                      10.966851
returns                             -3.034972
return_std                           2.762164
average_reward                       -0.27547
round_time             0 days 00:11:04.930327
episodes_test                           189.0
episode_length_test                 10.529101
returns_test                        -2.724221
return_std_test                      2.176966
average_reward_test                 -0.253059
round_time_test        0 days 00:00:02.590619
round_time_total       0 days 00:11:04.931408
loss_total               142808593303896848.0
loss_critic              178510738596281984.0
loss_actor                     -1738944739.52
memory_size                           14590.0 

=== epoch 5/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  167
episode_length                      11.868263
returns                             -1.776682
return_std                           3.160159
average_reward                      -0.154311
round_time             0 days 00:11:06.606048
episodes_test                           180.0
episode_length_test                      11.1
returns_test                        -2.559061
return_std_test                      2.514924
average_reward_test                 -0.229584
round_time_test        0 days 00:00:02.584092
round_time_total       0 days 00:11:06.607204
loss_total               150361149595044544.0
loss_critic              187951433826535744.0
loss_actor                    -1777687726.848
memory_size                           14590.0 

=== epoch 5/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  164
episode_length                      12.134146
returns                             -1.977854
return_std                           3.303187
average_reward                      -0.164392
round_time             0 days 00:11:07.889829
episodes_test                           188.0
episode_length_test                 10.638298
returns_test                        -2.224108
return_std_test                      2.972069
average_reward_test                 -0.209066
round_time_test        0 days 00:00:02.583491
round_time_total       0 days 00:11:07.890924
loss_total               155113669856703008.0
loss_critic              193892083932954880.0
loss_actor                    -1806453451.392
memory_size                           14590.0 

=== epoch 5/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  173
episode_length                      11.514451
returns                             -2.181689
return_std                           5.207831
average_reward                      -0.190223
round_time             0 days 00:11:06.962141
episodes_test                           172.0
episode_length_test                 11.604651
returns_test                        -1.972979
return_std_test                      3.878786
average_reward_test                 -0.168589
round_time_test        0 days 00:00:02.548898
round_time_total       0 days 00:11:06.963223
loss_total               159658883189202496.0
loss_critic              199573600464629920.0
loss_actor                     -1833006278.72
memory_size                        14593.8225 

=== epoch 5/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
episodes                                  177
episode_length                      11.254237
returns                             -2.584946
return_std                           3.062843
average_reward                      -0.225953
round_time             0 days 00:11:06.407924
episodes_test                           171.0
episode_length_test                 11.631579
returns_test                         -2.34693
return_std_test                      2.844485
average_reward_test                 -0.197947
round_time_test        0 days 00:00:02.603141
round_time_total       0 days 00:11:06.409019
loss_total               167499569768731520.0
loss_critic              209374458537643616.0
loss_actor                     -1865765858.88
memory_size                           14612.0 

=== epoch 5/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                  163
episode_length                      12.190184
returns                             -2.072314
return_std                           3.478609
average_reward                      -0.170374
round_time             0 days 00:11:06.041458
episodes_test                           175.0
episode_length_test                 11.388571
returns_test                        -2.366808
return_std_test                      3.425418
average_reward_test                 -0.208891
round_time_test        0 days 00:00:02.562069
round_time_total       0 days 00:11:06.042558
loss_total               174259550724416352.0
loss_critic              217824434666751392.0
loss_actor                    -1906379266.304
memory_size                        14613.5075 

=== epoch 5/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:03<00:00,  3.02it/s]
episodes                                  172
episode_length                      11.563953
returns                              -2.89681
return_std                           3.083226
average_reward                      -0.251439
round_time             0 days 00:11:03.543776
episodes_test                           175.0
episode_length_test                 11.417143
returns_test                        -2.522054
return_std_test                      3.488388
average_reward_test                 -0.219996
round_time_test        0 days 00:00:02.563022
round_time_total       0 days 00:11:03.544865
loss_total               179416738331495488.0
loss_critic              224270919158420480.0
loss_actor                    -1941541028.736
memory_size                           14615.0 

=== epoch 5/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  172
episode_length                      11.598837
returns                             -2.518295
return_std                           3.183874
average_reward                      -0.218636
round_time             0 days 00:11:07.385674
episodes_test                           180.0
episode_length_test                 11.094444
returns_test                        -2.551785
return_std_test                      3.059926
average_reward_test                 -0.229416
round_time_test        0 days 00:00:02.634716
round_time_total       0 days 00:11:07.386766
loss_total               189494014712211296.0
loss_critic              236867514612840384.0
loss_actor                    -1974439020.864
memory_size                         14621.748 

=== epoch 5/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
episodes                                  170
episode_length                      11.641176
returns                             -2.579138
return_std                           3.943125
average_reward                      -0.221198
round_time             0 days 00:11:06.348189
episodes_test                           158.0
episode_length_test                 12.613924
returns_test                        -1.951257
return_std_test                      4.636769
average_reward_test                 -0.154781
round_time_test        0 days 00:00:02.531376
round_time_total       0 days 00:11:06.349282
loss_total               194906945644419616.0
loss_critic              243633677800285664.0
loss_actor                    -2013699712.896
memory_size                         14628.075 

=== epoch 5/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  174
episode_length                      11.471264
returns                              -2.47164
return_std                           3.023486
average_reward                        -0.2144
round_time             0 days 00:11:10.689798
episodes_test                           180.0
episode_length_test                 11.088889
returns_test                        -2.926438
return_std_test                      3.074486
average_reward_test                 -0.261423
round_time_test        0 days 00:00:02.568357
round_time_total       0 days 00:11:10.690876
loss_total               203060617747917440.0
loss_critic              253825767525931040.0
loss_actor                    -2046831614.336
memory_size                           14634.0 

=== epoch 5/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  176
episode_length                      11.284091
returns                             -2.036081
return_std                           3.630543
average_reward                      -0.178956
round_time             0 days 00:11:08.815019
episodes_test                           171.0
episode_length_test                 11.631579
returns_test                         -2.35117
return_std_test                      3.336639
average_reward_test                 -0.199666
round_time_test        0 days 00:00:02.544285
round_time_total       0 days 00:11:08.816129
loss_total               206817953456702080.0
loss_critic              258522437516783520.0
loss_actor                     -2079570117.76
memory_size                        14634.1725 

=== epoch 5/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  167
episode_length                      11.874251
returns                             -2.430224
return_std                            3.34242
average_reward                      -0.202727
round_time             0 days 00:11:08.973886
episodes_test                           171.0
episode_length_test                 11.625731
returns_test                        -2.162521
return_std_test                      3.148272
average_reward_test                 -0.185196
round_time_test        0 days 00:00:02.547945
round_time_total       0 days 00:11:08.974980
loss_total               214045855783432736.0
loss_critic              267557315048313440.0
loss_actor                    -2116448017.152
memory_size                           14636.0 

=== epoch 5/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                  164
episode_length                       11.97561
returns                             -2.019431
return_std                            3.99556
average_reward                      -0.156614
round_time             0 days 00:11:09.504378
episodes_test                           172.0
episode_length_test                 11.610465
returns_test                        -2.524742
return_std_test                      3.050951
average_reward_test                 -0.216037
round_time_test        0 days 00:00:02.547981
round_time_total       0 days 00:11:09.505470
loss_total               222553346429486624.0
loss_critic              278191678247432896.0
loss_actor                     -2150693113.28
memory_size                        14642.0995 

=== epoch 5/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                  172
episode_length                      11.453488
returns                             -3.174817
return_std                           2.970905
average_reward                      -0.254458
round_time             0 days 00:11:10.471389
episodes_test                           168.0
episode_length_test                  11.85119
returns_test                        -1.874116
return_std_test                      3.454884
average_reward_test                 -0.158212
round_time_test        0 days 00:00:02.579563
round_time_total       0 days 00:11:10.472465
loss_total               227608689139228096.0
loss_critic              284510856309265920.0
loss_actor                    -2189130252.736
memory_size                        14668.8405 

=== epoch 5/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  165
episode_length                      11.987879
returns                             -1.792221
return_std                           6.638368
average_reward                      -0.141722
round_time             0 days 00:11:07.776234
episodes_test                           162.0
episode_length_test                 12.339506
returns_test                        -2.429821
return_std_test                      4.181087
average_reward_test                 -0.196311
round_time_test        0 days 00:00:02.529716
round_time_total       0 days 00:11:07.777314
loss_total               238196505918039904.0
loss_critic              297745627369216960.0
loss_actor                     -2230398327.36
memory_size                        14681.6765 

=== epoch 5/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  168
episode_length                      11.827381
returns                             -2.745769
return_std                           3.862659
average_reward                      -0.229827
round_time             0 days 00:11:08.250361
episodes_test                           156.0
episode_length_test                 12.794872
returns_test                        -1.665853
return_std_test                      4.520021
average_reward_test                 -0.129424
round_time_test        0 days 00:00:02.534737
round_time_total       0 days 00:11:08.251441
loss_total               248624536979146528.0
loss_critic              310780666090373504.0
loss_actor                    -2263790821.824
memory_size                          14703.94 

=== epoch 5/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  158
episode_length                      12.601266
returns                             -1.854408
return_std                           6.616977
average_reward                      -0.149055
round_time             0 days 00:11:07.393426
episodes_test                           183.0
episode_length_test                 10.912568
returns_test                        -2.262671
return_std_test                      3.322444
average_reward_test                 -0.205675
round_time_test        0 days 00:00:02.559772
round_time_total       0 days 00:11:07.394508
loss_total               255167419168505984.0
loss_critic              318959268464684928.0
loss_actor                    -2296230638.976
memory_size                        14725.4845 

=== epoch 5/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  160
episode_length                        12.4375
returns                             -2.203636
return_std                           5.060435
average_reward                      -0.178156
round_time             0 days 00:11:07.614195
episodes_test                           166.0
episode_length_test                 11.981928
returns_test                        -2.445731
return_std_test                      3.675308
average_reward_test                 -0.199917
round_time_test        0 days 00:00:02.527030
round_time_total       0 days 00:11:07.615292
loss_total               262713938120762464.0
loss_critic              328392417280633344.0
loss_actor                    -2332333931.776
memory_size                        14765.5035 

=== epoch 5/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  174
episode_length                      11.408046
returns                             -3.002975
return_std                            3.92977
average_reward                      -0.261091
round_time             0 days 00:11:06.844981
episodes_test                           161.0
episode_length_test                 12.360248
returns_test                        -2.443756
return_std_test                      6.898825
average_reward_test                 -0.196337
round_time_test        0 days 00:00:02.561476
round_time_total       0 days 00:11:06.846063
loss_total               271259251395581184.0
loss_critic              339074058549886720.0
loss_actor                    -2361688209.536
memory_size                         14772.019 

=== epoch 5/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:14<00:00,  2.97it/s]
episodes                                  173
episode_length                      11.520231
returns                             -2.661633
return_std                           3.577051
average_reward                      -0.228379
round_time             0 days 00:11:14.734213
episodes_test                           184.0
episode_length_test                 10.847826
returns_test                        -3.029024
return_std_test                      2.491726
average_reward_test                  -0.27762
round_time_test        0 days 00:00:02.569248
round_time_total       0 days 00:11:14.735297
loss_total               274895609319577888.0
loss_critic              343619506146008640.0
loss_actor                     -2390653735.68
memory_size                        14775.4665 

=== epoch 5/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
episodes                                  183
episode_length                      10.885246
returns                             -2.910264
return_std                           2.870672
average_reward                      -0.269003
round_time             0 days 00:11:06.337930
episodes_test                           169.0
episode_length_test                 11.804734
returns_test                        -1.851195
return_std_test                      9.804261
average_reward_test                 -0.157406
round_time_test        0 days 00:00:02.535908
round_time_total       0 days 00:11:06.339029
loss_total               285968523292760352.0
loss_critic              357460647678868224.0
loss_actor                    -2421523627.264
memory_size                           14776.0 

=== epoch 5/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:15<00:00,  2.96it/s]
episodes                                  170
episode_length                           11.7
returns                             -2.111222
return_std                           4.320349
average_reward                      -0.181134
round_time             0 days 00:11:15.737062
episodes_test                           171.0
episode_length_test                 11.625731
returns_test                        -3.196498
return_std_test                      3.123727
average_reward_test                 -0.273777
round_time_test        0 days 00:00:02.556807
round_time_total       0 days 00:11:15.738160
loss_total               292450833335006016.0
loss_critic              365563535483467712.0
loss_actor                    -2457849439.616
memory_size                        14778.1315 

=== epoch 5/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  188
episode_length                      10.590426
returns                             -2.487742
return_std                           2.854646
average_reward                      -0.235572
round_time             0 days 00:11:09.296734
episodes_test                           203.0
episode_length_test                  9.817734
returns_test                         -2.68636
return_std_test                      3.283063
average_reward_test                 -0.273799
round_time_test        0 days 00:00:02.637818
round_time_total       0 days 00:11:09.297813
loss_total               295750610406627840.0
loss_critic              369688256752664960.0
loss_actor                    -2483543180.032
memory_size                           14782.0 

=== epoch 5/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  177
episode_length                      11.225989
returns                             -2.630027
return_std                           2.772645
average_reward                      -0.235791
round_time             0 days 00:11:07.400019
episodes_test                           169.0
episode_length_test                 11.786982
returns_test                        -1.957874
return_std_test                      2.931851
average_reward_test                  -0.16674
round_time_test        0 days 00:00:02.551725
round_time_total       0 days 00:11:07.401100
loss_total               288167862556973600.0
loss_critic              360209821769335296.0
loss_actor                    -2500507761.536
memory_size                         14782.161 

=== epoch 5/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  187
episode_length                      10.641711
returns                             -3.113716
return_std                           2.216716
average_reward                      -0.290391
round_time             0 days 00:11:08.983273
episodes_test                           186.0
episode_length_test                 10.752688
returns_test                        -2.778354
return_std_test                      2.477367
average_reward_test                 -0.258387
round_time_test        0 days 00:00:02.597082
round_time_total       0 days 00:11:08.984352
loss_total               298026995375472128.0
loss_critic              372533737680789312.0
loss_actor                    -2526222069.504
memory_size                           14783.0 

=== epoch 5/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  2.99it/s]
episodes                                  179
episode_length                      11.100559
returns                             -2.772405
return_std                           5.957577
average_reward                      -0.250493
round_time             0 days 00:11:08.480660
episodes_test                           174.0
episode_length_test                 11.431034
returns_test                        -2.799925
return_std_test                      3.917184
average_reward_test                  -0.24419
round_time_test        0 days 00:00:02.540482
round_time_total       0 days 00:11:08.481748
loss_total               310169914851754752.0
loss_critic              387712387043859392.0
loss_actor                    -2558205702.272
memory_size                        14789.1985 

=== epoch 5/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                  171
episode_length                      11.637427
returns                             -2.264614
return_std                           4.814855
average_reward                      -0.193779
round_time             0 days 00:11:09.524971
episodes_test                           179.0
episode_length_test                 11.067039
returns_test                        -2.905026
return_std_test                      2.608168
average_reward_test                 -0.262875
round_time_test        0 days 00:00:02.549334
round_time_total       0 days 00:11:09.526046
loss_total               321820692985721920.0
loss_critic              402275859230281984.0
loss_actor                     -2601189226.24
memory_size                         14809.234 

=== epoch 5/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  190
episode_length                      10.515789
returns                             -2.946083
return_std                            2.58683
average_reward                      -0.279978
round_time             0 days 00:11:07.438250
episodes_test                           176.0
episode_length_test                 11.340909
returns_test                        -2.726234
return_std_test                      3.670328
average_reward_test                 -0.238652
round_time_test        0 days 00:00:02.556418
round_time_total       0 days 00:11:07.439323
loss_total               335661259688748544.0
loss_critic              419576567430824064.0
loss_actor                     -2644971132.16
memory_size                           14827.0 

=== epoch 5/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  199
episode_length                           10.0
returns                               -2.8643
return_std                           2.402035
average_reward                      -0.285754
round_time             0 days 00:11:09.034476
episodes_test                           189.0
episode_length_test                 10.560847
returns_test                        -3.134081
return_std_test                      2.492985
average_reward_test                 -0.294103
round_time_test        0 days 00:00:02.564666
round_time_total       0 days 00:11:09.035545
loss_total               350558318484653056.0
loss_critic              438197890818330560.0
loss_actor                    -2678800999.936
memory_size                           14827.0 

=== epoch 5/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                  186
episode_length                      10.741935
returns                             -3.117122
return_std                           3.265195
average_reward                      -0.288891
round_time             0 days 00:11:12.379019
episodes_test                           186.0
episode_length_test                 10.715054
returns_test                        -3.080162
return_std_test                      2.656776
average_reward_test                 -0.284777
round_time_test        0 days 00:00:02.560133
round_time_total       0 days 00:11:12.380102
loss_total               364506961824865856.0
loss_critic              455633694522223872.0
loss_actor                    -2722794508.928
memory_size                         14835.829 

=== epoch 5/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  176
episode_length                      11.261364
returns                             -2.500634
return_std                           4.534993
average_reward                      -0.221883
round_time             0 days 00:11:09.061727
episodes_test                           181.0
episode_length_test                      11.0
returns_test                        -2.800559
return_std_test                      2.669385
average_reward_test                 -0.249487
round_time_test        0 days 00:00:02.559502
round_time_total       0 days 00:11:09.062790
loss_total               365746844305408896.0
loss_critic              457183547970795072.0
loss_actor                    -2751051675.008
memory_size                         14862.474 

=== epoch 5/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                  184
episode_length                      10.847826
returns                             -2.625032
return_std                           2.927955
average_reward                      -0.241794
round_time             0 days 00:11:09.588089
episodes_test                           172.0
episode_length_test                 11.575581
returns_test                        -3.010509
return_std_test                      3.794487
average_reward_test                 -0.260314
round_time_test        0 days 00:00:02.545470
round_time_total       0 days 00:11:09.589167
loss_total               363552698298754112.0
loss_critic              454440865465697856.0
loss_actor                    -2771366245.504
memory_size                           14867.0 

=== epoch 5/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  189
episode_length                      10.444444
returns                             -2.441565
return_std                           2.271614
average_reward                      -0.228572
round_time             0 days 00:11:11.523665
episodes_test                           176.0
episode_length_test                 11.284091
returns_test                        -2.360768
return_std_test                       5.20559
average_reward_test                 -0.208651
round_time_test        0 days 00:00:02.569611
round_time_total       0 days 00:11:11.524749
loss_total               351786251829307776.0
loss_critic              439732807405733440.0
loss_actor                    -2757710286.848
memory_size                           14867.0 

=== epoch 5/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  192
episode_length                      10.354167
returns                              -2.36483
return_std                           2.323553
average_reward                      -0.226476
round_time             0 days 00:11:11.398559
episodes_test                           187.0
episode_length_test                 10.668449
returns_test                        -2.285676
return_std_test                      2.342942
average_reward_test                 -0.211154
round_time_test        0 days 00:00:02.579906
round_time_total       0 days 00:11:11.399658
loss_total               349915019833541376.0
loss_critic              437393767374518208.0
loss_actor                    -2751997073.792
memory_size                           14867.0 

=== epoch 5/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  186
episode_length                      10.682796
returns                             -2.606999
return_std                           2.451226
average_reward                       -0.24588
round_time             0 days 00:11:06.897578
episodes_test                           194.0
episode_length_test                 10.268041
returns_test                        -2.700046
return_std_test                      2.296438
average_reward_test                 -0.261969
round_time_test        0 days 00:00:02.580304
round_time_total       0 days 00:11:06.898663
loss_total               360710471413946880.0
loss_critic              450888081695406272.0
loss_actor                    -2795986597.504
memory_size                           14867.0 

=== epoch 5/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:13<00:00,  2.97it/s]
episodes                                  185
episode_length                      10.756757
returns                             -2.605759
return_std                           2.568875
average_reward                      -0.242512
round_time             0 days 00:11:13.775764
episodes_test                           197.0
episode_length_test                 10.152284
returns_test                        -2.533464
return_std_test                      2.742553
average_reward_test                 -0.249546
round_time_test        0 days 00:00:02.611376
round_time_total       0 days 00:11:13.776863
loss_total               377826680257678144.0
loss_critic              472283342318426112.0
loss_actor                    -2836049485.568
memory_size                           14867.0 

=== epoch 5/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:12<00:00,  2.98it/s]
episodes                                  196
episode_length                      10.183673
returns                             -2.600215
return_std                           3.424507
average_reward                      -0.253772
round_time             0 days 00:11:12.654498
episodes_test                           181.0
episode_length_test                 11.038674
returns_test                        -2.955337
return_std_test                      3.576182
average_reward_test                 -0.266894
round_time_test        0 days 00:00:02.561568
round_time_total       0 days 00:11:12.655569
loss_total               395469058093074112.0
loss_critic              494336313892190272.0
loss_actor                    -2889111990.784
memory_size                         14869.087 

=== epoch 5/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
episodes                                  200
episode_length                          9.945
returns                             -2.992303
return_std                           1.924317
average_reward                      -0.299992
round_time             0 days 00:11:08.960822
episodes_test                           192.0
episode_length_test                  10.40625
returns_test                        -2.895417
return_std_test                      2.174093
average_reward_test                 -0.277176
round_time_test        0 days 00:00:02.589496
round_time_total       0 days 00:11:08.961914
loss_total               409467635259941056.0
loss_critic              511834535432378368.0
loss_actor                    -2947084392.192
memory_size                           14871.0 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 6/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  198
episode_length                       10.10101
returns                             -2.655897
return_std                           2.005418
average_reward                      -0.262934
round_time             0 days 00:11:04.431979
episodes_test                           194.0
episode_length_test                 10.252577
returns_test                        -2.937847
return_std_test                      2.628106
average_reward_test                 -0.284298
round_time_test        0 days 00:00:02.568466
round_time_total       0 days 00:11:04.433114
loss_total               436136455624630208.0
loss_critic              545170560103334528.0
loss_actor                    -3006690861.696
memory_size                           14871.0 

=== epoch 6/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                  201
episode_length                       9.905473
returns                             -2.850765
return_std                           3.208633
average_reward                      -0.287107
round_time             0 days 00:11:05.962518
episodes_test                           203.0
episode_length_test                  9.837438
returns_test                           -2.952
return_std_test                      2.079925
average_reward_test                  -0.29885
round_time_test        0 days 00:00:02.601394
round_time_total       0 days 00:11:05.963619
loss_total               449168745431425856.0
loss_critic              561460922086951232.0
loss_actor                    -3073299629.824
memory_size                        14871.6105 

=== epoch 6/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
episodes                                  192
episode_length                      10.390625
returns                              -3.10154
return_std                           2.422838
average_reward                      -0.297965
round_time             0 days 00:11:06.338099
episodes_test                           194.0
episode_length_test                 10.298969
returns_test                        -2.995491
return_std_test                      2.716927
average_reward_test                 -0.289643
round_time_test        0 days 00:00:02.574910
round_time_total       0 days 00:11:06.339186
loss_total               472416442908252544.0
loss_critic              590520543586166016.0
loss_actor                    -3137794345.088
memory_size                           14874.0 

=== epoch 6/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                  186
episode_length                      10.666667
returns                             -2.584757
return_std                           2.745189
average_reward                      -0.241444
round_time             0 days 00:11:03.393317
episodes_test                           181.0
episode_length_test                 11.022099
returns_test                        -2.879594
return_std_test                      3.113824
average_reward_test                 -0.259845
round_time_test        0 days 00:00:02.557598
round_time_total       0 days 00:11:03.394416
loss_total               484808414234369088.0
loss_critic              606010507421689088.0
loss_actor                    -3197565990.528
memory_size                           14874.0 

=== epoch 6/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  193
episode_length                      10.341969
returns                             -2.732233
return_std                           2.309998
average_reward                      -0.264877
round_time             0 days 00:11:09.364618
episodes_test                           183.0
episode_length_test                 10.874317
returns_test                        -1.695051
return_std_test                      4.769131
average_reward_test                 -0.155645
round_time_test        0 days 00:00:02.618158
round_time_total       0 days 00:11:09.365714
loss_total               508324005258287104.0
loss_critic              635404996066295168.0
loss_actor                    -3256444391.808
memory_size                           14874.0 

=== epoch 6/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  202
episode_length                       9.861386
returns                              -2.94544
return_std                           2.511407
average_reward                      -0.299985
round_time             0 days 00:11:06.586101
episodes_test                           195.0
episode_length_test                 10.235897
returns_test                         -2.36526
return_std_test                      3.387651
average_reward_test                 -0.229523
round_time_test        0 days 00:00:02.580088
round_time_total       0 days 00:11:06.587186
loss_total               530858328046140672.0
loss_critic              663572898332415104.0
loss_actor                    -3314597738.112
memory_size                           14874.0 

=== epoch 6/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
episodes                                  196
episode_length                      10.153061
returns                              -2.62581
return_std                           2.520742
average_reward                      -0.258074
round_time             0 days 00:11:06.151869
episodes_test                           202.0
episode_length_test                  9.886139
returns_test                         -3.09471
return_std_test                      2.194969
average_reward_test                 -0.312278
round_time_test        0 days 00:00:02.548318
round_time_total       0 days 00:11:06.152954
loss_total               544704899611663296.0
loss_critic              680881112821530624.0
loss_actor                     -3385072830.72
memory_size                           14874.0 

=== epoch 6/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                  208
episode_length                       9.610577
returns                              -3.06271
return_std                           1.953981
average_reward                      -0.319252
round_time             0 days 00:11:10.047740
episodes_test                           196.0
episode_length_test                 10.188776
returns_test                        -2.863876
return_std_test                      2.353236
average_reward_test                  -0.27936
round_time_test        0 days 00:00:02.633276
round_time_total       0 days 00:11:10.048827
loss_total               566435751554510208.0
loss_critic              708044677452662784.0
loss_actor                    -3433208070.016
memory_size                           14874.0 

=== epoch 6/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  194
episode_length                      10.283505
returns                             -2.549804
return_std                           2.244756
average_reward                      -0.246395
round_time             0 days 00:11:10.932089
episodes_test                           198.0
episode_length_test                 10.080808
returns_test                        -2.393834
return_std_test                      4.075865
average_reward_test                 -0.236721
round_time_test        0 days 00:00:02.581319
round_time_total       0 days 00:11:10.933189
loss_total               584542632138783104.0
loss_critic              730678277275692032.0
loss_actor                    -3483027437.568
memory_size                        14885.8875 

=== epoch 6/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  192
episode_length                      10.307292
returns                             -2.985189
return_std                           2.267518
average_reward                      -0.286932
round_time             0 days 00:11:06.960607
episodes_test                           186.0
episode_length_test                  10.72043
returns_test                        -2.164632
return_std_test                      7.217778
average_reward_test                 -0.202155
round_time_test        0 days 00:00:02.640677
round_time_total       0 days 00:11:06.961734
loss_total               607025002602792192.0
loss_critic              758781239812390144.0
loss_actor                    -3535043414.656
memory_size                           14889.0 

=== epoch 6/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  194
episode_length                      10.293814
returns                             -2.848898
return_std                           2.474745
average_reward                      -0.275961
round_time             0 days 00:11:05.184296
episodes_test                           183.0
episode_length_test                 10.896175
returns_test                         -2.18747
return_std_test                      4.594747
average_reward_test                 -0.201177
round_time_test        0 days 00:00:02.549928
round_time_total       0 days 00:11:05.185392
loss_total               628102442655294720.0
loss_critic              785128039931705344.0
loss_actor                     -3582730211.84
memory_size                           14889.0 

=== epoch 6/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                  189
episode_length                       10.52381
returns                             -2.703226
return_std                           2.852821
average_reward                       -0.25624
round_time             0 days 00:11:04.394176
episodes_test                           190.0
episode_length_test                 10.473684
returns_test                        -2.268078
return_std_test                      4.605393
average_reward_test                  -0.21512
round_time_test        0 days 00:00:02.566259
round_time_total       0 days 00:11:04.395263
loss_total               639853569599166464.0
loss_critic              799816948205670656.0
loss_actor                    -3633820226.432
memory_size                          14917.75 

=== epoch 6/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  184
episode_length                      10.793478
returns                             -2.354268
return_std                           3.850488
average_reward                      -0.215339
round_time             0 days 00:11:05.208721
episodes_test                           178.0
episode_length_test                 11.202247
returns_test                        -2.391543
return_std_test                      4.529215
average_reward_test                  -0.21328
round_time_test        0 days 00:00:02.544793
round_time_total       0 days 00:11:05.209807
loss_total               676959227812452992.0
loss_critic              846199020137981440.0
loss_actor                    -3684805033.856
memory_size                        14931.7475 

=== epoch 6/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  198
episode_length                      10.030303
returns                             -3.089008
return_std                           1.941523
average_reward                      -0.307464
round_time             0 days 00:11:08.821728
episodes_test                           194.0
episode_length_test                 10.262887
returns_test                        -2.997485
return_std_test                      2.854721
average_reward_test                 -0.287745
round_time_test        0 days 00:00:02.607955
round_time_total       0 days 00:11:08.822821
loss_total               694905580661143296.0
loss_critic              868631961902145024.0
loss_actor                     -3754360539.52
memory_size                           14945.0 

=== epoch 6/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  197
episode_length                      10.116751
returns                             -2.902588
return_std                           3.504174
average_reward                      -0.288607
round_time             0 days 00:11:05.015454
episodes_test                           194.0
episode_length_test                 10.268041
returns_test                        -2.995203
return_std_test                      2.710108
average_reward_test                 -0.290795
round_time_test        0 days 00:00:02.585821
round_time_total       0 days 00:11:05.016541
loss_total               714363300250386432.0
loss_critic              892954109945590016.0
loss_actor                     -3825682868.48
memory_size                         14947.996 

=== epoch 6/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  197
episode_length                      10.126904
returns                             -2.593482
return_std                           2.435795
average_reward                      -0.258438
round_time             0 days 00:11:05.425164
episodes_test                           207.0
episode_length_test                  9.661836
returns_test                        -3.316855
return_std_test                      1.851032
average_reward_test                 -0.343294
round_time_test        0 days 00:00:02.589062
round_time_total       0 days 00:11:05.426255
loss_total               748038930661201920.0
loss_critic              935048647495252992.0
loss_actor                    -3897370546.048
memory_size                           14952.0 

=== epoch 6/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  184
episode_length                      10.869565
returns                             -2.372122
return_std                           5.158141
average_reward                      -0.218235
round_time             0 days 00:11:07.382066
episodes_test                           203.0
episode_length_test                  9.842365
returns_test                        -2.721081
return_std_test                      3.858543
average_reward_test                 -0.275122
round_time_test        0 days 00:00:02.584359
round_time_total       0 days 00:11:07.383145
loss_total               780385717973254400.0
loss_critic              975482131594516352.0
loss_actor                    -3976780707.712
memory_size                        14964.1555 

=== epoch 6/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                  196
episode_length                      10.188776
returns                             -2.720347
return_std                           2.222469
average_reward                      -0.266503
round_time             0 days 00:11:05.687528
episodes_test                           193.0
episode_length_test                 10.321244
returns_test                        -2.631313
return_std_test                      3.805032
average_reward_test                 -0.254335
round_time_test        0 days 00:00:02.609328
round_time_total       0 days 00:11:05.688621
loss_total               803784236100013312.0
loss_critic             1004730278303777152.0
loss_actor                    -4045930795.904
memory_size                           14981.0 

=== epoch 6/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  198
episode_length                      10.050505
returns                             -3.040138
return_std                           2.097865
average_reward                       -0.30248
round_time             0 days 00:11:04.947366
episodes_test                           183.0
episode_length_test                 10.923497
returns_test                        -2.649162
return_std_test                      2.586976
average_reward_test                 -0.241949
round_time_test        0 days 00:00:02.538094
round_time_total       0 days 00:11:04.948454
loss_total               835591402423916160.0
loss_critic             1044489234619517952.0
loss_actor                    -4097101562.368
memory_size                           14981.0 

=== epoch 6/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  195
episode_length                      10.174359
returns                             -2.781391
return_std                            3.69383
average_reward                       -0.26612
round_time             0 days 00:11:09.214065
episodes_test                           196.0
episode_length_test                 10.178571
returns_test                        -2.898351
return_std_test                      2.484147
average_reward_test                 -0.283502
round_time_test        0 days 00:00:02.578828
round_time_total       0 days 00:11:09.215156
loss_total               861086238974832256.0
loss_critic             1076357779590903552.0
loss_actor                    -4177732388.736
memory_size                         14989.702 

=== epoch 6/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  194
episode_length                      10.247423
returns                             -2.744748
return_std                           3.572012
average_reward                      -0.267139
round_time             0 days 00:11:11.076583
episodes_test                           186.0
episode_length_test                 10.725806
returns_test                        -2.755031
return_std_test                      3.973348
average_reward_test                 -0.254243
round_time_test        0 days 00:00:02.544206
round_time_total       0 days 00:11:11.077673
loss_total               895675428024931584.0
loss_critic             1119594266687359104.0
loss_actor                    -4247532539.648
memory_size                        15003.1775 

=== epoch 6/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                  190
episode_length                      10.468421
returns                              -2.48267
return_std                           5.158368
average_reward                      -0.236148
round_time             0 days 00:11:03.890667
episodes_test                           192.0
episode_length_test                 10.369792
returns_test                        -2.907005
return_std_test                      2.581555
average_reward_test                 -0.280593
round_time_test        0 days 00:00:02.576248
round_time_total       0 days 00:11:03.891780
loss_total               926730232351929984.0
loss_critic             1158412771511991552.0
loss_actor                    -4311761094.144
memory_size                        15026.7995 

=== epoch 6/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  193
episode_length                      10.316062
returns                             -2.391717
return_std                           5.652742
average_reward                      -0.231339
round_time             0 days 00:11:06.796594
episodes_test                           189.0
episode_length_test                 10.529101
returns_test                        -2.608584
return_std_test                      4.416296
average_reward_test                 -0.248064
round_time_test        0 days 00:00:02.547025
round_time_total       0 days 00:11:06.797690
loss_total               964691097765931904.0
loss_critic             1205863851965233920.0
loss_actor                    -4389778556.928
memory_size                         15053.928 

=== epoch 6/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                  196
episode_length                      10.163265
returns                             -2.913363
return_std                           3.087294
average_reward                      -0.286841
round_time             0 days 00:11:09.958897
episodes_test                           195.0
episode_length_test                 10.117949
returns_test                        -2.573496
return_std_test                      1.964589
average_reward_test                 -0.237227
round_time_test        0 days 00:00:02.585438
round_time_total       0 days 00:11:09.959986
loss_total               963617946431215616.0
loss_critic             1204522412921392640.0
loss_actor                    -4458611169.152
memory_size                        15085.4585 

=== epoch 6/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  192
episode_length                      10.359375
returns                              -2.53286
return_std                           3.172201
average_reward                      -0.243544
round_time             0 days 00:11:10.913648
episodes_test                           197.0
episode_length_test                 10.121827
returns_test                        -2.800542
return_std_test                      2.165072
average_reward_test                 -0.277203
round_time_test        0 days 00:00:02.624954
round_time_total       0 days 00:11:10.914738
loss_total              1026023257845465600.0
loss_critic             1282529050651606784.0
loss_actor                    -4525368223.744
memory_size                          15107.81 

=== epoch 6/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:12<00:00,  2.97it/s]
episodes                                  189
episode_length                      10.439153
returns                             -2.658319
return_std                             2.7659
average_reward                       -0.25333
round_time             0 days 00:11:12.885556
episodes_test                           197.0
episode_length_test                  10.13198
returns_test                        -3.064156
return_std_test                      2.706615
average_reward_test                 -0.299783
round_time_test        0 days 00:00:02.582404
round_time_total       0 days 00:11:12.886644
loss_total              1059806785350949376.0
loss_critic             1324758458506600704.0
loss_actor                    -4604370826.496
memory_size                        15155.8785 

=== epoch 6/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  200
episode_length                           9.98
returns                             -2.965108
return_std                           2.020353
average_reward                      -0.297326
round_time             0 days 00:11:11.207345
episodes_test                           196.0
episode_length_test                 10.193878
returns_test                        -2.975695
return_std_test                      3.939642
average_reward_test                   -0.2905
round_time_test        0 days 00:00:02.615882
round_time_total       0 days 00:11:11.208458
loss_total              1099202353434487424.0
loss_critic             1374002918497206784.0
loss_actor                    -4668271314.048
memory_size                           15166.0 

=== epoch 6/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  203
episode_length                       9.748768
returns                             -2.840416
return_std                           1.809997
average_reward                      -0.292111
round_time             0 days 00:11:08.006522
episodes_test                           200.0
episode_length_test                     9.995
returns_test                         -2.86705
return_std_test                      1.990969
average_reward_test                 -0.286275
round_time_test        0 days 00:00:02.571533
round_time_total       0 days 00:11:08.007645
loss_total              1144833966495672320.0
loss_critic             1431042433243139840.0
loss_actor                    -4741170905.856
memory_size                           15166.0 

=== epoch 6/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  200
episode_length                           9.93
returns                             -3.036192
return_std                           2.082098
average_reward                      -0.299013
round_time             0 days 00:11:06.781640
episodes_test                           201.0
episode_length_test                  9.900498
returns_test                        -2.739734
return_std_test                      4.142807
average_reward_test                 -0.273917
round_time_test        0 days 00:00:02.587853
round_time_total       0 days 00:11:06.782750
loss_total              1185511159176888320.0
loss_critic             1481888922683762944.0
loss_actor                    -4811835008.512
memory_size                           15166.0 

=== epoch 6/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:12<00:00,  2.98it/s]
episodes                                  193
episode_length                      10.321244
returns                             -2.563078
return_std                           3.843955
average_reward                      -0.249463
round_time             0 days 00:11:12.663907
episodes_test                           202.0
episode_length_test                  9.891089
returns_test                        -2.901928
return_std_test                      1.743092
average_reward_test                 -0.292182
round_time_test        0 days 00:00:02.594994
round_time_total       0 days 00:11:12.664992
loss_total              1212470915107175424.0
loss_critic             1515588617830697728.0
loss_actor                     -4902653703.68
memory_size                         15170.416 

=== epoch 6/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:13<00:00,  2.97it/s]
episodes                                  194
episode_length                      10.262887
returns                             -3.170241
return_std                             2.2981
average_reward                      -0.308863
round_time             0 days 00:11:14.311691
episodes_test                           190.0
episode_length_test                 10.431579
returns_test                        -2.464595
return_std_test                      3.767728
average_reward_test                 -0.232116
round_time_test        0 days 00:00:02.593762
round_time_total       0 days 00:11:14.312784
loss_total              1240438351393957376.0
loss_critic             1550547914168427520.0
loss_actor                    -4966074155.776
memory_size                           15189.0 

=== epoch 6/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  2.99it/s]
episodes                                  190
episode_length                      10.484211
returns                             -2.549309
return_std                           3.234728
average_reward                      -0.243497
round_time             0 days 00:11:08.390240
episodes_test                           202.0
episode_length_test                  9.876238
returns_test                        -3.255006
return_std_test                      1.863297
average_reward_test                 -0.327783
round_time_test        0 days 00:00:02.603527
round_time_total       0 days 00:11:08.391322
loss_total              1292698950261213696.0
loss_critic             1615873659720251136.0
loss_actor                    -5056241014.784
memory_size                        15197.6955 

=== epoch 6/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  201
episode_length                       9.895522
returns                              -2.83894
return_std                           2.002789
average_reward                      -0.285778
round_time             0 days 00:11:08.094779
episodes_test                           194.0
episode_length_test                 10.252577
returns_test                        -2.724506
return_std_test                      3.357525
average_reward_test                 -0.263403
round_time_test        0 days 00:00:02.585499
round_time_total       0 days 00:11:08.095860
loss_total              1348178716039825152.0
loss_critic             1685223367570580480.0
loss_actor                     -5120582594.56
memory_size                           15200.0 

=== epoch 6/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  207
episode_length                       9.589372
returns                             -2.847407
return_std                           1.770741
average_reward                      -0.297522
round_time             0 days 00:11:07.983057
episodes_test                           200.0
episode_length_test                      9.99
returns_test                        -2.702885
return_std_test                       2.22882
average_reward_test                 -0.269295
round_time_test        0 days 00:00:02.571295
round_time_total       0 days 00:11:07.984154
loss_total              1374407806661928448.0
loss_critic             1718009730878274560.0
loss_actor                     -5197237681.92
memory_size                           15200.0 

=== epoch 6/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  199
episode_length                      10.030151
returns                             -2.961047
return_std                           2.629958
average_reward                      -0.297023
round_time             0 days 00:11:05.142011
episodes_test                           210.0
episode_length_test                       9.5
returns_test                        -3.064953
return_std_test                      1.544442
average_reward_test                 -0.320869
round_time_test        0 days 00:00:02.614207
round_time_total       0 days 00:11:05.143096
loss_total              1413174540260012544.0
loss_critic             1766468145401978368.0
loss_actor                    -5276897918.976
memory_size                         15202.992 

=== epoch 6/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  206
episode_length                       9.694175
returns                             -3.186789
return_std                           2.143325
average_reward                      -0.327974
round_time             0 days 00:11:07.050024
episodes_test                           201.0
episode_length_test                  9.930348
returns_test                        -3.259604
return_std_test                      1.843123
average_reward_test                 -0.326579
round_time_test        0 days 00:00:02.603086
round_time_total       0 days 00:11:07.051112
loss_total              1460219029611180288.0
loss_critic             1825273756094505728.0
loss_actor                    -5364202827.264
memory_size                           15205.0 

=== epoch 6/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  2.99it/s]
episodes                                  201
episode_length                       9.915423
returns                              -2.83508
return_std                           1.866683
average_reward                      -0.287413
round_time             0 days 00:11:08.309808
episodes_test                           198.0
episode_length_test                 10.065657
returns_test                        -2.489672
return_std_test                      5.309239
average_reward_test                 -0.244474
round_time_test        0 days 00:00:02.586961
round_time_total       0 days 00:11:08.310881
loss_total              1515217928652307968.0
loss_critic             1894022377980360192.0
loss_actor                    -5436013432.832
memory_size                           15205.0 

=== epoch 6/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  209
episode_length                       9.507177
returns                             -2.960461
return_std                            1.59631
average_reward                      -0.311428
round_time             0 days 00:11:05.158210
episodes_test                           203.0
episode_length_test                  9.827586
returns_test                         -2.95115
return_std_test                      1.967411
average_reward_test                 -0.297409
round_time_test        0 days 00:00:02.612036
round_time_total       0 days 00:11:05.159326
loss_total              1572072441495897088.0
loss_critic             1965090518652594176.0
loss_actor                    -5519221716.736
memory_size                           15205.0 

=== epoch 6/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                  203
episode_length                       9.832512
returns                              -3.02128
return_std                            2.07553
average_reward                      -0.308459
round_time             0 days 00:11:09.590393
episodes_test                           215.0
episode_length_test                  9.297674
returns_test                         -3.26449
return_std_test                      1.755653
average_reward_test                 -0.350525
round_time_test        0 days 00:00:02.620056
round_time_total       0 days 00:11:09.591484
loss_total              1614982243776272128.0
loss_critic             2018727769746421248.0
loss_actor                    -5624815535.872
memory_size                           15205.0 

=== epoch 6/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  203
episode_length                       9.773399
returns                              -2.94383
return_std                           2.244952
average_reward                      -0.301315
round_time             0 days 00:11:06.645714
episodes_test                           204.0
episode_length_test                  9.779412
returns_test                        -2.769968
return_std_test                      3.530615
average_reward_test                  -0.28041
round_time_test        0 days 00:00:02.574517
round_time_total       0 days 00:11:06.646799
loss_total              1672324467044133120.0
loss_critic             2090405549466902784.0
loss_actor                    -5727906005.248
memory_size                           15205.0 

=== epoch 6/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                  204
episode_length                        9.77451
returns                             -3.061403
return_std                           1.841972
average_reward                      -0.313633
round_time             0 days 00:11:09.976147
episodes_test                           209.0
episode_length_test                  9.555024
returns_test                        -3.213552
return_std_test                      2.034972
average_reward_test                 -0.334753
round_time_test        0 days 00:00:02.632782
round_time_total       0 days 00:11:09.977243
loss_total              1723833651916846336.0
loss_critic             2154792029411038208.0
loss_actor                    -5830778837.504
memory_size                           15205.0 

=== epoch 6/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                  202
episode_length                       9.836634
returns                             -3.141462
return_std                           3.361118
average_reward                      -0.319496
round_time             0 days 00:11:04.246502
episodes_test                           210.0
episode_length_test                  9.495238
returns_test                        -3.126054
return_std_test                      1.777908
average_reward_test                 -0.325674
round_time_test        0 days 00:00:02.623603
round_time_total       0 days 00:11:04.247575
loss_total              1805963828185343744.0
loss_critic             2257454746332160768.0
loss_actor                    -5926513128.448
memory_size                        15218.2655 

=== epoch 6/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  208
episode_length                       9.576923
returns                             -3.046978
return_std                           1.782408
average_reward                      -0.318145
round_time             0 days 00:11:07.971532
episodes_test                           208.0
episode_length_test                  9.567308
returns_test                         -3.00621
return_std_test                      1.847107
average_reward_test                 -0.313039
round_time_test        0 days 00:00:02.651636
round_time_total       0 days 00:11:07.972608
loss_total              1849252244202366976.0
loss_critic             2311565264768596992.0
loss_actor                    -6017097452.544
memory_size                           15224.0 

=== epoch 6/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  203
episode_length                       9.793103
returns                             -3.037185
return_std                           1.894414
average_reward                      -0.307844
round_time             0 days 00:11:08.772451
episodes_test                           196.0
episode_length_test                 10.204082
returns_test                        -2.956856
return_std_test                      2.124491
average_reward_test                 -0.289772
round_time_test        0 days 00:00:02.583744
round_time_total       0 days 00:11:08.773627
loss_total              1898128829712810752.0
loss_critic             2372660995866377728.0
loss_actor                    -6120969011.712
memory_size                         15225.314 

=== epoch 6/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  212
episode_length                       9.396226
returns                              -2.85877
return_std                           1.864449
average_reward                      -0.302852
round_time             0 days 00:11:10.919917
episodes_test                           214.0
episode_length_test                  9.308411
returns_test                        -2.641321
return_std_test                      2.103911
average_reward_test                 -0.279625
round_time_test        0 days 00:00:02.585813
round_time_total       0 days 00:11:10.920990
loss_total              1980099080348600576.0
loss_critic             2475123806609904640.0
loss_actor                    -6209876042.496
memory_size                           15233.0 

=== epoch 6/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                  200
episode_length                          9.945
returns                             -2.407118
return_std                           5.840644
average_reward                      -0.243634
round_time             0 days 00:11:09.543706
episodes_test                           199.0
episode_length_test                 10.050251
returns_test                        -2.670062
return_std_test                      4.301444
average_reward_test                 -0.265671
round_time_test        0 days 00:00:02.596329
round_time_total       0 days 00:11:09.544787
loss_total              2042047017362605056.0
loss_critic             2552558726777898496.0
loss_actor                    -6325035439.104
memory_size                        15236.5925 

=== epoch 6/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  207
episode_length                       9.623188
returns                             -2.724491
return_std                           2.716989
average_reward                      -0.284641
round_time             0 days 00:11:08.624809
episodes_test                           211.0
episode_length_test                  9.450237
returns_test                        -2.662508
return_std_test                      2.770849
average_reward_test                 -0.280231
round_time_test        0 days 00:00:02.592490
round_time_total       0 days 00:11:08.625891
loss_total              2100656979770674688.0
loss_critic             2625821179779395584.0
loss_actor                    -6427427946.496
memory_size                        15250.4035 

=== epoch 6/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                  208
episode_length                       9.548077
returns                             -2.905863
return_std                           1.854566
average_reward                       -0.30661
round_time             0 days 00:11:10.154500
episodes_test                           210.0
episode_length_test                  9.509524
returns_test                        -2.847365
return_std_test                      1.634207
average_reward_test                 -0.298347
round_time_test        0 days 00:00:02.618371
round_time_total       0 days 00:11:10.155599
loss_total              2215691135617517824.0
loss_critic             2769613870649464320.0
loss_actor                    -6509857665.024
memory_size                           15270.0 

=== epoch 6/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  204
episode_length                        9.77451
returns                             -2.680336
return_std                           3.724497
average_reward                      -0.272797
round_time             0 days 00:11:11.265630
episodes_test                           213.0
episode_length_test                  9.370892
returns_test                        -2.884516
return_std_test                      1.751532
average_reward_test                   -0.3063
round_time_test        0 days 00:00:02.613797
round_time_total       0 days 00:11:11.266708
loss_total              2270652846031478272.0
loss_critic             2838316011969744896.0
loss_actor                    -6607696242.944
memory_size                        15281.6675 

=== epoch 6/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
episodes                                  203
episode_length                       9.807882
returns                             -2.723388
return_std                            2.21949
average_reward                      -0.278298
round_time             0 days 00:11:10.077435
episodes_test                           219.0
episode_length_test                  9.118721
returns_test                        -3.073343
return_std_test                      1.589054
average_reward_test                 -0.335532
round_time_test        0 days 00:00:02.630264
round_time_total       0 days 00:11:10.078507
loss_total              2332991186536077824.0
loss_critic             2916238931647669248.0
loss_actor                    -6726679908.864
memory_size                           15295.0 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 7/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                  209
episode_length                       9.569378
returns                             -2.870927
return_std                           1.733254
average_reward                      -0.300012
round_time             0 days 00:11:11.229433
episodes_test                           208.0
episode_length_test                  9.591346
returns_test                         -2.75145
return_std_test                      1.949561
average_reward_test                 -0.285621
round_time_test        0 days 00:00:02.605683
round_time_total       0 days 00:11:11.230578
loss_total              2427999610190505472.0
loss_critic             3034999461267244032.0
loss_actor                    -6831511264.512
memory_size                           15295.0 

=== epoch 7/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  210
episode_length                       9.495238
returns                             -3.020408
return_std                            1.76516
average_reward                      -0.319291
round_time             0 days 00:11:08.827946
episodes_test                           205.0
episode_length_test                  9.741463
returns_test                        -2.507976
return_std_test                      1.720056
average_reward_test                 -0.256893
round_time_test        0 days 00:00:02.587182
round_time_total       0 days 00:11:08.829036
loss_total              2503697565439907328.0
loss_critic             3129621904972513792.0
loss_actor                    -6945939936.256
memory_size                           15295.0 

=== epoch 7/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                  209
episode_length                       9.545455
returns                              -2.76842
return_std                           2.165387
average_reward                      -0.287933
round_time             0 days 00:11:12.196936
episodes_test                           206.0
episode_length_test                  9.703883
returns_test                        -2.839276
return_std_test                      1.832111
average_reward_test                 -0.291959
round_time_test        0 days 00:00:02.594640
round_time_total       0 days 00:11:12.198033
loss_total              2579747543682131968.0
loss_critic             3224684372750183424.0
loss_actor                    -7074523163.392
memory_size                           15295.0 

=== epoch 7/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:13<00:00,  2.97it/s]
episodes                                  205
episode_length                       9.712195
returns                              -2.59442
return_std                           3.079556
average_reward                      -0.267738
round_time             0 days 00:11:14.115889
episodes_test                           211.0
episode_length_test                  9.440758
returns_test                        -3.131475
return_std_test                      1.679167
average_reward_test                 -0.328986
round_time_test        0 days 00:00:02.605667
round_time_total       0 days 00:11:14.116967
loss_total              2680477469657278464.0
loss_critic             3350596777801049600.0
loss_actor                    -7199926282.752
memory_size                         15295.955 

=== epoch 7/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:12<00:00,  2.97it/s]
episodes                                  206
episode_length                       9.679612
returns                              -2.53171
return_std                           3.749195
average_reward                      -0.263623
round_time             0 days 00:11:12.900841
episodes_test                           210.0
episode_length_test                  9.509524
returns_test                        -3.032893
return_std_test                      1.807094
average_reward_test                  -0.31738
round_time_test        0 days 00:00:02.605077
round_time_total       0 days 00:11:12.901933
loss_total              2737702223473650176.0
loss_critic             3422127720475240960.0
loss_actor                    -7302116023.552
memory_size                        15328.3535 

=== epoch 7/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:12<00:00,  2.98it/s]
episodes                                  205
episode_length                       9.726829
returns                             -2.648506
return_std                           2.342331
average_reward                      -0.272435
round_time             0 days 00:11:12.570883
episodes_test                           201.0
episode_length_test                  9.900498
returns_test                        -2.650131
return_std_test                      2.237166
average_reward_test                 -0.264813
round_time_test        0 days 00:00:02.566893
round_time_total       0 days 00:11:12.571972
loss_total              2830781226185241088.0
loss_critic             3538476472232642048.0
loss_actor                    -7397944564.992
memory_size                           15332.0 

=== epoch 7/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  208
episode_length                         9.5625
returns                             -2.945028
return_std                           1.831047
average_reward                      -0.303171
round_time             0 days 00:11:10.980193
episodes_test                           207.0
episode_length_test                  9.657005
returns_test                        -2.801676
return_std_test                      1.778042
average_reward_test                 -0.289603
round_time_test        0 days 00:00:02.620818
round_time_total       0 days 00:11:10.981291
loss_total              2928553632371782656.0
loss_critic             3660691975456102912.0
loss_actor                    -7511780943.104
memory_size                           15332.0 

=== epoch 7/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [24:43<00:00,  1.35it/s]
episodes                                  204
episode_length                       9.754902
returns                             -2.735932
return_std                           1.888308
average_reward                       -0.28211
round_time             0 days 00:24:43.651761
episodes_test                           208.0
episode_length_test                  9.610577
returns_test                        -3.192452
return_std_test                      1.792026
average_reward_test                 -0.331488
round_time_test        0 days 00:00:02.613887
round_time_total       0 days 00:24:43.653943
loss_total              2959821274215451136.0
loss_critic             3699776533713513472.0
loss_actor                    -7596380124.416
memory_size                           15332.0 

=== epoch 7/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:28<00:00,  2.30it/s]
episodes                                  201
episode_length                       9.915423
returns                             -2.438662
return_std                           2.656562
average_reward                      -0.247073
round_time             0 days 00:14:30.489197
episodes_test                           211.0
episode_length_test                  9.473934
returns_test                        -3.010137
return_std_test                      1.564839
average_reward_test                 -0.317145
round_time_test        0 days 00:00:07.307707
round_time_total       0 days 00:14:30.490276
loss_total              2981218034391942656.0
loss_critic             3726522478861771264.0
loss_actor                    -7690810819.328
memory_size                         15334.805 

=== epoch 7/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:57<00:00,  2.79it/s]
episodes                                  210
episode_length                       9.514286
returns                             -2.919153
return_std                           1.751839
average_reward                      -0.305855
round_time             0 days 00:11:57.516965
episodes_test                           214.0
episode_length_test                  9.294393
returns_test                        -2.773395
return_std_test                      1.791613
average_reward_test                 -0.296358
round_time_test        0 days 00:00:02.627822
round_time_total       0 days 00:11:57.518089
loss_total              3116047050982509568.0
loss_critic             3895058743745939968.0
loss_actor                    -7784530918.144
memory_size                           15335.0 

=== epoch 7/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:58<00:00,  2.79it/s]
episodes                                  202
episode_length                       9.811881
returns                             -2.405893
return_std                           4.497887
average_reward                      -0.244326
round_time             0 days 00:11:58.550581
episodes_test                           213.0
episode_length_test                  9.375587
returns_test                        -2.865326
return_std_test                      1.822046
average_reward_test                 -0.305696
round_time_test        0 days 00:00:02.605865
round_time_total       0 days 00:11:58.551801
loss_total              3222775498381134336.0
loss_critic             4028469303973473280.0
loss_actor                    -7860359764.736
memory_size                         15345.759 

=== epoch 7/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:45<00:00,  2.83it/s]
episodes                                  210
episode_length                       9.471429
returns                             -2.860238
return_std                             1.6935
average_reward                      -0.301568
round_time             0 days 00:11:45.998743
episodes_test                           208.0
episode_length_test                  9.591346
returns_test                        -2.612248
return_std_test                       2.02046
average_reward_test                 -0.273415
round_time_test        0 days 00:00:02.641118
round_time_total       0 days 00:11:45.999851
loss_total              3310260712562311168.0
loss_critic             4137825820282605056.0
loss_actor                    -7982615213.056
memory_size                           15364.0 

=== epoch 7/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:00<00:00,  2.78it/s]
episodes                                  210
episode_length                       9.485714
returns                             -2.945313
return_std                           1.835611
average_reward                      -0.310726
round_time             0 days 00:12:01.061006
episodes_test                           211.0
episode_length_test                  9.440758
returns_test                        -2.755653
return_std_test                      2.124222
average_reward_test                 -0.287906
round_time_test        0 days 00:00:02.664273
round_time_total       0 days 00:12:01.062085
loss_total              3388645290131624960.0
loss_critic             4235806541179095552.0
loss_actor                    -8112171721.984
memory_size                           15364.0 

=== epoch 7/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  204
episode_length                       9.784314
returns                             -2.810465
return_std                           2.342194
average_reward                      -0.287464
round_time             0 days 00:11:07.707633
episodes_test                           209.0
episode_length_test                  9.564593
returns_test                        -2.682148
return_std_test                      2.206101
average_reward_test                 -0.279915
round_time_test        0 days 00:00:02.600653
round_time_total       0 days 00:11:07.708715
loss_total              3477955409184995328.0
loss_critic             4347444188939246592.0
loss_actor                    -8230500579.072
memory_size                           15364.0 

=== epoch 7/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  210
episode_length                       9.461905
returns                             -2.668978
return_std                           2.224738
average_reward                      -0.283985
round_time             0 days 00:11:10.708927
episodes_test                           209.0
episode_length_test                  9.535885
returns_test                         -2.57185
return_std_test                      1.835238
average_reward_test                 -0.270012
round_time_test        0 days 00:00:02.603446
round_time_total       0 days 00:11:10.710004
loss_total              3571737758146161152.0
loss_critic             4464672122804241408.0
loss_actor                    -8348024286.208
memory_size                           15364.0 

=== epoch 7/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  2.99it/s]
episodes                                  210
episode_length                       9.509524
returns                             -2.932046
return_std                           1.810106
average_reward                      -0.309805
round_time             0 days 00:11:08.343701
episodes_test                           209.0
episode_length_test                  9.550239
returns_test                        -2.830142
return_std_test                      2.093138
average_reward_test                  -0.29404
round_time_test        0 days 00:00:02.617925
round_time_total       0 days 00:11:08.344776
loss_total              3690929400227672576.0
loss_critic             4613661673765453824.0
loss_actor                    -8469889640.192
memory_size                           15364.0 

=== epoch 7/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:57<00:00,  2.79it/s]
episodes                                  200
episode_length                          9.995
returns                             -2.172251
return_std                           8.248387
average_reward                      -0.216749
round_time             0 days 00:11:57.849729
episodes_test                           203.0
episode_length_test                  9.832512
returns_test                        -2.716665
return_std_test                      1.723512
average_reward_test                 -0.274056
round_time_test        0 days 00:00:02.593101
round_time_total       0 days 00:11:57.850789
loss_total              3788488823762009088.0
loss_critic             4735610952204121088.0
loss_actor                 -8611674350.591999
memory_size                         15372.833 

=== epoch 7/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:21<00:00,  2.93it/s]
episodes                                  207
episode_length                       9.574879
returns                             -2.498173
return_std                           2.238357
average_reward                      -0.258001
round_time             0 days 00:11:22.389536
episodes_test                           212.0
episode_length_test                  9.400943
returns_test                        -2.660868
return_std_test                      1.929655
average_reward_test                 -0.281471
round_time_test        0 days 00:00:02.639795
round_time_total       0 days 00:11:22.390628
loss_total              3931676156515172864.0
loss_critic             4914595110792592384.0
loss_actor                 -8733313361.408001
memory_size                        15416.1825 

=== epoch 7/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:15<00:00,  2.96it/s]
episodes                                  203
episode_length                        9.82266
returns                             -2.164818
return_std                           8.093717
average_reward                      -0.221771
round_time             0 days 00:11:15.993658
episodes_test                           210.0
episode_length_test                  9.495238
returns_test                        -2.581034
return_std_test                      1.736256
average_reward_test                 -0.269499
round_time_test        0 days 00:00:02.602635
round_time_total       0 days 00:11:15.994928
loss_total              4039721420373987840.0
loss_critic             5049651693184500736.0
loss_actor                    -8866016162.048
memory_size                          15450.55 

=== epoch 7/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:19<00:00,  2.94it/s]
episodes                                  199
episode_length                       9.984925
returns                             -2.284514
return_std                            3.55578
average_reward                      -0.227725
round_time             0 days 00:11:20.281791
episodes_test                           217.0
episode_length_test                  9.184332
returns_test                        -2.605784
return_std_test                      1.600619
average_reward_test                 -0.282989
round_time_test        0 days 00:00:02.640318
round_time_total       0 days 00:11:20.282998
loss_total              4214063255980236288.0
loss_critic             5267578980502537216.0
loss_actor                      -8962242726.4
memory_size                         15498.357 

=== epoch 7/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:47<00:00,  2.42it/s]
episodes                                  211
episode_length                       9.450237
returns                             -2.798823
return_std                           1.721921
average_reward                      -0.298686
round_time             0 days 00:13:48.066100
episodes_test                           204.0
episode_length_test                  9.764706
returns_test                        -2.434822
return_std_test                      4.275258
average_reward_test                 -0.248065
round_time_test        0 days 00:00:02.667248
round_time_total       0 days 00:13:48.067525
loss_total              4289833749636674560.0
loss_critic             5362292087359652864.0
loss_actor                    -9094402985.472
memory_size                           15504.0 

=== epoch 7/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:31<00:00,  1.55it/s]
episodes                                  199
episode_length                       10.01005
returns                             -2.129103
return_std                            3.95106
average_reward                      -0.211798
round_time             0 days 00:21:31.876443
episodes_test                           208.0
episode_length_test                  9.576923
returns_test                        -2.837846
return_std_test                      1.734395
average_reward_test                 -0.292346
round_time_test        0 days 00:00:03.384797
round_time_total       0 days 00:21:31.877934
loss_total              4419755494838607360.0
loss_critic             5524694272495610880.0
loss_actor                 -9197045676.544001
memory_size                        15524.4225 

=== epoch 7/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:45,  1.68it/s]
100%|██████████| 2000/2000 [20:38<00:00,  1.61it/s]
episodes                                  209
episode_length                       9.502392
returns                             -2.623315
return_std                           1.800613
average_reward                      -0.275773
round_time             0 days 00:20:39.928742
episodes_test                           225.0
episode_length_test                  8.875556
returns_test                        -2.740781
return_std_test                      1.490788
average_reward_test                 -0.307459
round_time_test        0 days 00:00:03.699441
round_time_total       0 days 00:20:39.930664
loss_total              4549531981394731008.0
loss_critic             5686914873286241280.0
loss_actor                    -9316841808.128
memory_size                           15534.0 

=== epoch 7/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<23:50,  1.40it/s]
100%|██████████| 2000/2000 [22:35<00:00,  1.48it/s]
episodes                                  207
episode_length                       9.652174
returns                             -2.490198
return_std                           3.800215
average_reward                      -0.258978
round_time             0 days 00:22:36.020608
episodes_test                           200.0
episode_length_test                     9.975
returns_test                        -2.377938
return_std_test                      3.306995
average_reward_test                 -0.235866
round_time_test        0 days 00:00:04.180715
round_time_total       0 days 00:22:36.022896
loss_total              4646163793216920576.0
loss_critic             5807704637411143680.0
loss_actor                 -9432832921.087999
memory_size                         15542.295 

=== epoch 7/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 1/2000 [00:00<29:00,  1.15it/s]
100%|██████████| 2000/2000 [22:45<00:00,  1.46it/s]
episodes                                  198
episode_length                      10.035354
returns                             -2.621702
return_std                           3.370836
average_reward                      -0.260745
round_time             0 days 00:22:47.314306
episodes_test                           203.0
episode_length_test                  9.719212
returns_test                        -2.442897
return_std_test                      4.302334
average_reward_test                 -0.243196
round_time_test        0 days 00:00:03.391973
round_time_total       0 days 00:22:47.316034
loss_total              4791123364029149184.0
loss_critic             5988904111199991808.0
loss_actor                     -9523988956.16
memory_size                          15573.46 

=== epoch 7/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<25:33,  1.30it/s]
100%|██████████| 2000/2000 [22:37<00:00,  1.47it/s]
episodes                                  210
episode_length                       9.480952
returns                             -2.815701
return_std                           1.658554
average_reward                       -0.29618
round_time             0 days 00:22:38.976249
episodes_test                           195.0
episode_length_test                 10.230769
returns_test                        -2.305543
return_std_test                      4.631877
average_reward_test                 -0.224368
round_time_test        0 days 00:00:03.517533
round_time_total       0 days 00:22:38.978056
loss_total              4955973153617240064.0
loss_critic             6194966340462752768.0
loss_actor                    -9623283236.864
memory_size                           15578.0 

=== epoch 7/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:02<22:00,  1.51it/s]
100%|██████████| 2000/2000 [22:40<00:00,  1.47it/s]
episodes                                  207
episode_length                       9.623188
returns                             -2.692607
return_std                           1.608673
average_reward                      -0.280691
round_time             0 days 00:22:41.560023
episodes_test                           209.0
episode_length_test                  9.555024
returns_test                        -2.769582
return_std_test                      1.738958
average_reward_test                 -0.288113
round_time_test        0 days 00:00:04.726159
round_time_total       0 days 00:22:41.561856
loss_total              4947742333404205056.0
loss_critic             6184677813031796736.0
loss_actor                 -9754722112.511999
memory_size                           15578.0 

=== epoch 7/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<22:17,  1.49it/s]
100%|██████████| 2000/2000 [21:55<00:00,  1.52it/s]
episodes                                  205
episode_length                       9.721951
returns                             -2.677684
return_std                           1.767311
average_reward                       -0.27649
round_time             0 days 00:21:56.747646
episodes_test                           203.0
episode_length_test                  9.837438
returns_test                        -2.225194
return_std_test                      5.215672
average_reward_test                 -0.224457
round_time_test        0 days 00:00:04.243310
round_time_total       0 days 00:21:56.749541
loss_total              5084718079635493888.0
loss_critic             6355897487205202944.0
loss_actor                    -9886178093.056
memory_size                           15578.0 

=== epoch 7/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:30,  1.62it/s]
100%|██████████| 2000/2000 [22:50<00:00,  1.46it/s]
episodes                                  206
episode_length                       9.669903
returns                              -2.57487
return_std                           2.446671
average_reward                      -0.266608
round_time             0 days 00:22:51.899304
episodes_test                           213.0
episode_length_test                  9.361502
returns_test                        -2.824596
return_std_test                      1.706446
average_reward_test                 -0.300578
round_time_test        0 days 00:00:03.620696
round_time_total       0 days 00:22:51.901784
loss_total              5282610150116115456.0
loss_critic             6603262579738385408.0
loss_actor                    -10005547000.32
memory_size                           15578.0 

=== epoch 7/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<22:42,  1.47it/s]
100%|██████████| 2000/2000 [23:10<00:00,  1.44it/s]
episodes                                  207
episode_length                       9.637681
returns                             -2.704966
return_std                           1.889818
average_reward                      -0.281611
round_time             0 days 00:23:12.206374
episodes_test                           210.0
episode_length_test                   9.52381
returns_test                        -2.772871
return_std_test                      1.656516
average_reward_test                 -0.291151
round_time_test        0 days 00:00:03.883206
round_time_total       0 days 00:23:12.208980
loss_total              5377208880516669440.0
loss_critic             6721510985257245696.0
loss_actor                   -10136191233.024
memory_size                           15578.0 

=== epoch 7/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<25:17,  1.32it/s]
100%|██████████| 2000/2000 [22:35<00:00,  1.48it/s]
episodes                                  209
episode_length                       9.516746
returns                             -2.893122
return_std                           1.655504
average_reward                      -0.304836
round_time             0 days 00:22:37.457490
episodes_test                           206.0
episode_length_test                  9.684466
returns_test                        -2.252678
return_std_test                      2.611299
average_reward_test                 -0.231996
round_time_test        0 days 00:00:03.551361
round_time_total       0 days 00:22:37.460286
loss_total              5485097912466191360.0
loss_critic             6856372270607122432.0
loss_actor                -10266572959.743999
memory_size                           15578.0 

=== epoch 7/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<31:17,  1.06it/s]
100%|██████████| 2000/2000 [22:18<00:00,  1.49it/s]
episodes                                  208
episode_length                         9.5625
returns                             -2.735143
return_std                           4.181725
average_reward                      -0.283646
round_time             0 days 00:22:20.656925
episodes_test                           210.0
episode_length_test                  9.519048
returns_test                        -2.668483
return_std_test                      2.247755
average_reward_test                 -0.279714
round_time_test        0 days 00:00:03.418016
round_time_total       0 days 00:22:20.659096
loss_total              5640176471204694016.0
loss_critic             7050220464657974272.0
loss_actor                -10390865645.568001
memory_size                          15587.51 

=== epoch 7/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<21:24,  1.56it/s]
100%|██████████| 2000/2000 [22:58<00:00,  1.45it/s]
episodes                                  212
episode_length                       9.386792
returns                             -2.681058
return_std                           1.993915
average_reward                       -0.28706
round_time             0 days 00:23:00.305892
episodes_test                           210.0
episode_length_test                  9.504762
returns_test                        -2.741294
return_std_test                      1.637539
average_reward_test                 -0.286941
round_time_test        0 days 00:00:03.590064
round_time_total       0 days 00:23:00.307299
loss_total              5745375900281618432.0
loss_critic             7181719750213856256.0
loss_actor                -10520325507.072001
memory_size                           15593.0 

=== epoch 7/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<21:45,  1.53it/s]
100%|██████████| 2000/2000 [23:37<00:00,  1.41it/s]
episodes                                  193
episode_length                      10.290155
returns                             -2.421944
return_std                           4.134435
average_reward                      -0.232912
round_time             0 days 00:23:38.215251
episodes_test                           204.0
episode_length_test                  9.769608
returns_test                        -2.707428
return_std_test                      2.300121
average_reward_test                  -0.27868
round_time_test        0 days 00:00:04.257302
round_time_total       0 days 00:23:38.217773
loss_total              5888416234927218688.0
loss_critic             7360520168589576192.0
loss_actor                -10629106260.992001
memory_size                          15603.56 

=== epoch 7/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<23:36,  1.41it/s]
100%|██████████| 2000/2000 [22:26<00:00,  1.49it/s]
episodes                                  211
episode_length                       9.417062
returns                             -2.755148
return_std                            1.74491
average_reward                       -0.29233
round_time             0 days 00:22:27.863035
episodes_test                           198.0
episode_length_test                 10.040404
returns_test                        -2.766516
return_std_test                      1.858382
average_reward_test                  -0.27359
round_time_test        0 days 00:00:03.732498
round_time_total       0 days 00:22:27.865931
loss_total              5798768462793625600.0
loss_critic             7248460451610107904.0
loss_actor                -10713474456.063999
memory_size                           15648.0 

=== epoch 7/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<20:41,  1.61it/s]
100%|██████████| 2000/2000 [20:45<00:00,  1.61it/s]
episodes                                  214
episode_length                       9.294393
returns                             -2.829309
return_std                           1.636767
average_reward                      -0.305122
round_time             0 days 00:20:47.435969
episodes_test                           212.0
episode_length_test                   9.40566
returns_test                        -2.588751
return_std_test                      1.506987
average_reward_test                 -0.274251
round_time_test        0 days 00:00:03.176095
round_time_total       0 days 00:20:47.437882
loss_total              6133319572602857472.0
loss_critic             7666649331252376576.0
loss_actor                   -10799361175.552
memory_size                           15648.0 

=== epoch 7/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<19:13,  1.73it/s]
100%|██████████| 2000/2000 [20:22<00:00,  1.64it/s]
episodes                                  203
episode_length                        9.79803
returns                             -2.444682
return_std                           2.995847
average_reward                      -0.251626
round_time             0 days 00:20:23.228173
episodes_test                           207.0
episode_length_test                  9.618357
returns_test                        -2.251322
return_std_test                      4.173694
average_reward_test                 -0.232975
round_time_test        0 days 00:00:03.651419
round_time_total       0 days 00:20:23.230068
loss_total              6258160169217843200.0
loss_critic             7822700089270353920.0
loss_actor                   -10887437101.056
memory_size                         15649.309 

=== epoch 7/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:52,  1.67it/s]
100%|██████████| 2000/2000 [20:34<00:00,  1.62it/s]
episodes                                  208
episode_length                       9.576923
returns                             -2.681314
return_std                           1.671253
average_reward                      -0.278204
round_time             0 days 00:20:35.189050
episodes_test                           201.0
episode_length_test                  9.930348
returns_test                        -2.669066
return_std_test                      2.226057
average_reward_test                 -0.267378
round_time_test        0 days 00:00:04.433556
round_time_total       0 days 00:20:35.190561
loss_total              6424751123669964800.0
loss_critic             8030938758781905920.0
loss_actor                -10987467841.535999
memory_size                           15659.0 

=== epoch 7/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:00,  2.08it/s]
100%|██████████| 2000/2000 [20:05<00:00,  1.66it/s]
episodes                                  201
episode_length                       9.910448
returns                             -2.177908
return_std                           5.296478
average_reward                      -0.221559
round_time             0 days 00:20:06.407400
episodes_test                           209.0
episode_length_test                  9.559809
returns_test                        -2.441243
return_std_test                      2.866853
average_reward_test                 -0.254036
round_time_test        0 days 00:00:03.256657
round_time_total       0 days 00:20:06.409235
loss_total              6499784639661834240.0
loss_critic             8124730661537044480.0
loss_actor                   -11110898846.208
memory_size                        15682.1275 

=== epoch 7/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<21:09,  1.57it/s]
100%|██████████| 2000/2000 [20:56<00:00,  1.59it/s]
episodes                                  211
episode_length                       9.393365
returns                             -2.500632
return_std                           2.478353
average_reward                      -0.261539
round_time             0 days 00:20:57.369895
episodes_test                           208.0
episode_length_test                  9.576923
returns_test                        -2.656495
return_std_test                      1.905854
average_reward_test                  -0.27412
round_time_test        0 days 00:00:03.167027
round_time_total       0 days 00:20:57.371662
loss_total              6621431649159886848.0
loss_critic             8276789411005744128.0
loss_actor                -11258603621.375999
memory_size                         15703.208 

=== epoch 7/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<24:08,  1.38it/s]
100%|██████████| 2000/2000 [21:21<00:00,  1.56it/s]
episodes                                  212
episode_length                       9.382075
returns                             -2.431757
return_std                           1.803058
average_reward                      -0.258957
round_time             0 days 00:21:22.974756
episodes_test                           217.0
episode_length_test                  9.207373
returns_test                        -2.436196
return_std_test                      1.883868
average_reward_test                 -0.263726
round_time_test        0 days 00:00:03.230127
round_time_total       0 days 00:21:22.976555
loss_total              6875379115181151232.0
loss_critic             8594223746315462656.0
loss_actor                -11387796917.247999
memory_size                           15704.0 

=== epoch 7/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:02<22:50,  1.46it/s]
100%|██████████| 2000/2000 [20:19<00:00,  1.64it/s]
episodes                                  206
episode_length                       9.684466
returns                             -2.247915
return_std                           5.199439
average_reward                      -0.232924
round_time             0 days 00:20:20.061125
episodes_test                           209.0
episode_length_test                  9.545455
returns_test                        -2.579137
return_std_test                      1.700313
average_reward_test                 -0.269502
round_time_test        0 days 00:00:03.911085
round_time_total       0 days 00:20:20.063174
loss_total              7004369040118382592.0
loss_critic             8755461154119090176.0
loss_actor                -11526522797.568001
memory_size                        15706.8525 

=== epoch 7/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:02<20:52,  1.59it/s]
100%|██████████| 2000/2000 [20:24<00:00,  1.63it/s]
episodes                                  205
episode_length                       9.687805
returns                             -2.394294
return_std                           3.037821
average_reward                      -0.245088
round_time             0 days 00:20:25.069929
episodes_test                           218.0
episode_length_test                  9.165138
returns_test                        -2.616371
return_std_test                      1.610472
average_reward_test                 -0.284306
round_time_test        0 days 00:00:03.776478
round_time_total       0 days 00:20:25.071985
loss_total              7201579389612844032.0
loss_critic             9001974075044248576.0
loss_actor                   -11655161693.184
memory_size                         15762.881 

=== epoch 7/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<23:32,  1.41it/s]
100%|██████████| 2000/2000 [20:58<00:00,  1.59it/s]
episodes                                  216
episode_length                       9.231481
returns                             -2.575332
return_std                           1.491018
average_reward                      -0.281126
round_time             0 days 00:20:59.179043
episodes_test                           210.0
episode_length_test                       9.5
returns_test                        -2.893953
return_std_test                      2.057867
average_reward_test                 -0.302784
round_time_test        0 days 00:00:03.317996
round_time_total       0 days 00:20:59.180916
loss_total              7374529519097789440.0
loss_critic             9218161738085840896.0
loss_actor                -11815492581.375999
memory_size                           15791.0 

=== epoch 7/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:50,  1.60it/s]
100%|██████████| 2000/2000 [20:54<00:00,  1.59it/s]
episodes                                  214
episode_length                       9.261682
returns                             -2.588084
return_std                           1.813158
average_reward                      -0.277854
round_time             0 days 00:20:55.641117
episodes_test                           214.0
episode_length_test                  9.336449
returns_test                        -2.673211
return_std_test                      1.654006
average_reward_test                 -0.285209
round_time_test        0 days 00:00:03.610433
round_time_total       0 days 00:20:55.642517
loss_total              7490448525532883968.0
loss_critic             9363060493398110208.0
loss_actor                -11963194444.799999
memory_size                           15791.0 

=== epoch 7/10 ===== round 46/50 =====================================
100%|██████████| 2000/2000 [20:39<00:00,  1.61it/s]
episodes                                  211
episode_length                       9.436019
returns                             -2.611421
return_std                           1.873721
average_reward                      -0.275389
round_time             0 days 00:20:40.204153
episodes_test                           203.0
episode_length_test                  9.837438
returns_test                        -2.241976
return_std_test                      4.379976
average_reward_test                 -0.227266
round_time_test        0 days 00:00:03.646256
round_time_total       0 days 00:20:40.205610
loss_total              7797679479798259712.0
loss_critic             9747099181591265280.0
loss_actor                   -12127742165.504
memory_size                           15791.0 

=== epoch 7/10 ===== round 47/50 =====================================
100%|██████████| 2000/2000 [21:33<00:00,  1.55it/s]
episodes                                  210
episode_length                        9.47619
returns                             -2.230948
return_std                           5.070959
average_reward                      -0.236663
round_time             0 days 00:21:34.679758
episodes_test                           215.0
episode_length_test                   9.27907
returns_test                        -2.307971
return_std_test                      2.362975
average_reward_test                 -0.247413
round_time_test        0 days 00:00:04.028558
round_time_total       0 days 00:21:34.682081
loss_total              7936788466432122880.0
loss_critic             9920985418233667584.0
loss_actor                -12312711333.375999
memory_size                        15840.2525 

=== epoch 7/10 ===== round 48/50 =====================================
100%|██████████| 2000/2000 [22:07<00:00,  1.51it/s]
episodes                                  212
episode_length                        9.40566
returns                             -2.979049
return_std                           1.562098
average_reward                      -0.317192
round_time             0 days 00:22:09.560139
episodes_test                           209.0
episode_length_test                  9.564593
returns_test                        -2.320102
return_std_test                      5.853575
average_reward_test                 -0.242068
round_time_test        0 days 00:00:04.152576
round_time_total       0 days 00:22:09.561956
loss_total              8236613913291165696.0
loss_critic            10295767221550432256.0
loss_actor                -12413936297.983999
memory_size                           15846.0 

=== epoch 7/10 ===== round 49/50 =====================================
100%|██████████| 2000/2000 [22:24<00:00,  1.49it/s]
episodes                                  216
episode_length                       9.226852
returns                             -2.792529
return_std                           1.592135
average_reward                       -0.30394
round_time             0 days 00:22:25.616841
episodes_test                           205.0
episode_length_test                  9.717073
returns_test                        -2.484989
return_std_test                      1.653454
average_reward_test                 -0.255664
round_time_test        0 days 00:00:04.439365
round_time_total       0 days 00:22:25.618599
loss_total              8358615287484110848.0
loss_critic            10448268928244957184.0
loss_actor                -12587311550.464001
memory_size                           15846.0 

=== epoch 7/10 ===== round 50/50 =====================================
100%|██████████| 2000/2000 [22:38<00:00,  1.47it/s]
episodes                                  215
episode_length                       9.223256
returns                             -2.744231
return_std                           2.040218
average_reward                      -0.294785
round_time             0 days 00:22:39.549007
episodes_test                           208.0
episode_length_test                  9.600962
returns_test                        -2.121637
return_std_test                      9.230624
average_reward_test                 -0.219057
round_time_test        0 days 00:00:03.557753
round_time_total       0 days 00:22:39.551530
loss_total              8619130796371215360.0
loss_critic            10773913314611535872.0
loss_actor                -12727990477.823999
memory_size                           15846.0 


=== epoch 8/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [22:39<00:00,  1.47it/s]
episodes                                  206
episode_length                       9.679612
returns                             -2.625697
return_std                            1.69145
average_reward                      -0.269609
round_time             0 days 00:22:39.487755
episodes_test                           203.0
episode_length_test                  9.847291
returns_test                        -2.515746
return_std_test                      3.677083
average_reward_test                 -0.254852
round_time_test        0 days 00:00:03.307667
round_time_total       0 days 00:22:39.490464
loss_total              8597749839217651712.0
loss_critic            10747187113015621632.0
loss_actor                     -12883998464.0
memory_size                           15846.0 

=== epoch 8/10 ===== round 2/50 ======================================
100%|██████████| 2000/2000 [22:38<00:00,  1.47it/s]
episodes                                  206
episode_length                       9.650485
returns                              -2.72141
return_std                           1.712691
average_reward                      -0.279746
round_time             0 days 00:22:40.633297
episodes_test                           202.0
episode_length_test                  9.886139
returns_test                        -2.762856
return_std_test                      1.694411
average_reward_test                 -0.278372
round_time_test        0 days 00:00:03.438773
round_time_total       0 days 00:22:40.635872
loss_total              8724322665618504704.0
loss_critic            10905403143130468352.0
loss_actor                -12980884662.271999
memory_size                        15848.8185 

=== epoch 8/10 ===== round 3/50 ======================================
100%|██████████| 2000/2000 [21:56<00:00,  1.52it/s]
episodes                                  211
episode_length                       9.312796
returns                             -2.728362
return_std                           1.412612
average_reward                      -0.285015
round_time             0 days 00:21:58.212730
episodes_test                           219.0
episode_length_test                  9.127854
returns_test                         -2.84586
return_std_test                      1.597074
average_reward_test                 -0.311251
round_time_test        0 days 00:00:04.039796
round_time_total       0 days 00:21:58.214736
loss_total              8963151761648776192.0
loss_critic            11203939497268338688.0
loss_actor                -13116568240.639999
memory_size                        15851.9595 

=== epoch 8/10 ===== round 4/50 ======================================
100%|██████████| 2000/2000 [21:29<00:00,  1.55it/s]
episodes                                  210
episode_length                       9.490476
returns                             -2.486808
return_std                           3.365693
average_reward                      -0.263816
round_time             0 days 00:21:31.777476
episodes_test                           215.0
episode_length_test                  9.260465
returns_test                        -2.421123
return_std_test                      3.456369
average_reward_test                 -0.260282
round_time_test        0 days 00:00:04.107290
round_time_total       0 days 00:21:31.779459
loss_total              9226369260265957376.0
loss_critic            11532961378915002368.0
loss_actor                   -13245077967.872
memory_size                        15859.9835 

=== epoch 8/10 ===== round 5/50 ======================================
100%|██████████| 2000/2000 [20:45<00:00,  1.61it/s]
episodes                                  216
episode_length                           9.25
returns                             -2.785814
return_std                           1.526955
average_reward                      -0.300991
round_time             0 days 00:20:46.221512
episodes_test                           206.0
episode_length_test                  9.708738
returns_test                        -2.526138
return_std_test                      1.683258
average_reward_test                 -0.260192
round_time_test        0 days 00:00:04.157997
round_time_total       0 days 00:20:46.223444
loss_total              9334529058636130304.0
loss_critic            11668161131327305728.0
loss_actor                   -13386625374.208
memory_size                           15866.0 

=== epoch 8/10 ===== round 6/50 ======================================
100%|██████████| 2000/2000 [20:23<00:00,  1.63it/s]
episodes                                  214
episode_length                       9.308411
returns                              -2.59597
return_std                           1.711996
average_reward                      -0.280269
round_time             0 days 00:20:24.465421
episodes_test                           209.0
episode_length_test                  9.555024
returns_test                        -2.456374
return_std_test                      1.550037
average_reward_test                 -0.255157
round_time_test        0 days 00:00:03.504711
round_time_total       0 days 00:20:24.466895
loss_total              9666259189910525952.0
loss_critic            12082823784682878976.0
loss_actor                    -13541354531.84
memory_size                           15866.0 

=== epoch 8/10 ===== round 7/50 ======================================
100%|██████████| 2000/2000 [18:34<00:00,  1.79it/s]
episodes                                  211
episode_length                       9.464455
returns                             -2.481477
return_std                           3.258788
average_reward                      -0.260931
round_time             0 days 00:18:35.648378
episodes_test                           212.0
episode_length_test                  9.433962
returns_test                        -2.692492
return_std_test                      1.970959
average_reward_test                 -0.285404
round_time_test        0 days 00:00:03.586176
round_time_total       0 days 00:18:35.650179
loss_total              9853451742935431168.0
loss_critic            12316814478729971712.0
loss_actor                -13691998451.200001
memory_size                          15868.43 

=== epoch 8/10 ===== round 8/50 ======================================
100%|██████████| 2000/2000 [14:27<00:00,  2.30it/s]
episodes                                  209
episode_length                       9.526316
returns                             -2.578036
return_std                           1.578992
average_reward                      -0.270565
round_time             0 days 00:14:28.824450
episodes_test                           209.0
episode_length_test                   9.54067
returns_test                        -2.618618
return_std_test                      1.594017
average_reward_test                 -0.273258
round_time_test        0 days 00:00:03.306935
round_time_total       0 days 00:14:28.825711
loss_total             10183858363888486400.0
loss_critic            12729822730474336256.0
loss_actor                   -13853852604.416
memory_size                           15875.0 

=== epoch 8/10 ===== round 9/50 ======================================
100%|██████████| 2000/2000 [13:40<00:00,  2.44it/s]
episodes                                  207
episode_length                       9.608696
returns                             -2.323338
return_std                           2.061482
average_reward                      -0.240403
round_time             0 days 00:13:41.082232
episodes_test                           212.0
episode_length_test                  9.382075
returns_test                        -2.346612
return_std_test                      4.575832
average_reward_test                 -0.247567
round_time_test        0 days 00:00:02.980603
round_time_total       0 days 00:13:41.083421
loss_total             10376855559166621696.0
loss_critic            12971069226960009216.0
loss_actor                   -14001864670.208
memory_size                           15875.0 

=== epoch 8/10 ===== round 10/50 =====================================
100%|██████████| 2000/2000 [12:35<00:00,  2.65it/s]
episodes                                  220
episode_length                           9.05
returns                             -2.669807
return_std                           1.710753
average_reward                      -0.295928
round_time             0 days 00:12:35.940115
episodes_test                           203.0
episode_length_test                  9.847291
returns_test                        -2.105373
return_std_test                      1.965027
average_reward_test                 -0.213261
round_time_test        0 days 00:00:02.771788
round_time_total       0 days 00:12:35.941251
loss_total             10596391811258886144.0
loss_critic            13245489539017322496.0
loss_actor                -14172531784.191999
memory_size                           15875.0 

=== epoch 8/10 ===== round 11/50 =====================================
100%|██████████| 2000/2000 [11:48<00:00,  2.82it/s]
episodes                                  217
episode_length                       9.170507
returns                             -2.465645
return_std                           1.552691
average_reward                      -0.269869
round_time             0 days 00:11:48.644230
episodes_test                           211.0
episode_length_test                  9.454976
returns_test                        -2.986188
return_std_test                      1.772349
average_reward_test                 -0.314045
round_time_test        0 days 00:00:02.632691
round_time_total       0 days 00:11:48.645330
loss_total             10929899386848622592.0
loss_critic            13662373994485719040.0
loss_actor                -14358603385.856001
memory_size                           15875.0 

=== epoch 8/10 ===== round 12/50 =====================================
100%|██████████| 2000/2000 [11:33<00:00,  2.88it/s]
episodes                                  215
episode_length                       9.260465
returns                             -2.683484
return_std                           1.758064
average_reward                      -0.289964
round_time             0 days 00:11:33.964303
episodes_test                           215.0
episode_length_test                   9.27907
returns_test                        -2.917332
return_std_test                       2.29047
average_reward_test                 -0.313211
round_time_test        0 days 00:00:02.633323
round_time_total       0 days 00:11:33.965455
loss_total             11217430458349346816.0
loss_critic            14021787835201652736.0
loss_actor                -14518366344.191999
memory_size                           15875.0 

=== epoch 8/10 ===== round 13/50 =====================================
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  219
episode_length                       9.077626
returns                             -2.580495
return_std                           1.676891
average_reward                      -0.281434
round_time             0 days 00:11:11.405463
episodes_test                           200.0
episode_length_test                      9.96
returns_test                        -1.858832
return_std_test                      5.489884
average_reward_test                 -0.186381
round_time_test        0 days 00:00:02.614223
round_time_total       0 days 00:11:11.406551
loss_total             11468884169601527808.0
loss_critic            14336104957739847680.0
loss_actor                -14685519157.247999
memory_size                         15875.213 

=== epoch 8/10 ===== round 14/50 =====================================
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                  211
episode_length                        9.43128
returns                             -2.703158
return_std                           2.095185
average_reward                      -0.285687
round_time             0 days 00:11:12.255793
episodes_test                           216.0
episode_length_test                  9.236111
returns_test                        -2.507262
return_std_test                      1.694242
average_reward_test                 -0.268681
round_time_test        0 days 00:00:02.630238
round_time_total       0 days 00:11:12.256881
loss_total             11628427337601454080.0
loss_critic            14535533918564386816.0
loss_actor                -14839875417.087999
memory_size                           15876.0 

=== epoch 8/10 ===== round 15/50 =====================================
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  213
episode_length                       9.356808
returns                              -2.52372
return_std                           2.050305
average_reward                      -0.271457
round_time             0 days 00:11:10.827793
episodes_test                           206.0
episode_length_test                  9.694175
returns_test                        -2.435546
return_std_test                       2.70109
average_reward_test                 -0.250004
round_time_test        0 days 00:00:02.609973
round_time_total       0 days 00:11:10.828890
loss_total             11947353240973477888.0
loss_critic            14934191310733037568.0
loss_actor                -15000622136.832001
memory_size                           15876.0 

=== epoch 8/10 ===== round 16/50 =====================================
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                  219
episode_length                       9.109589
returns                             -2.244176
return_std                           4.848558
average_reward                      -0.243588
round_time             0 days 00:11:12.030937
episodes_test                           224.0
episode_length_test                  8.924107
returns_test                        -2.643089
return_std_test                      1.821503
average_reward_test                  -0.29572
round_time_test        0 days 00:00:02.612685
round_time_total       0 days 00:11:12.032013
loss_total             12145834578836805632.0
loss_critic            15182292942672326656.0
loss_actor                    -15166643146.24
memory_size                         15885.037 

=== epoch 8/10 ===== round 17/50 =====================================
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  210
episode_length                       9.490476
returns                             -1.923564
return_std                           5.277651
average_reward                      -0.205011
round_time             0 days 00:11:08.642822
episodes_test                           213.0
episode_length_test                  9.356808
returns_test                        -2.241173
return_std_test                      2.085206
average_reward_test                 -0.238832
round_time_test        0 days 00:00:02.609104
round_time_total       0 days 00:11:08.643923
loss_total             12425668760736243712.0
loss_critic            15532085675664439296.0
loss_actor                -15293476385.280001
memory_size                         15908.646 

=== epoch 8/10 ===== round 18/50 =====================================
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  218
episode_length                        9.16055
returns                             -2.711048
return_std                           1.408947
average_reward                      -0.294924
round_time             0 days 00:11:08.272522
episodes_test                           224.0
episode_length_test                  8.924107
returns_test                        -2.845477
return_std_test                      1.624178
average_reward_test                 -0.318267
round_time_test        0 days 00:00:02.626632
round_time_total       0 days 00:11:08.273612
loss_total             12589693942064066560.0
loss_critic            15737117170363082752.0
loss_actor                -15451572227.072001
memory_size                           15933.0 

=== epoch 8/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:20,  2.70it/s]
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                  219
episode_length                        9.09589
returns                             -2.989377
return_std                           1.790526
average_reward                      -0.327856
round_time             0 days 00:11:11.803630
episodes_test                           211.0
episode_length_test                  9.473934
returns_test                        -2.528198
return_std_test                      1.638799
average_reward_test                 -0.266316
round_time_test        0 days 00:00:02.588759
round_time_total       0 days 00:11:11.804753
loss_total             12924316877436024832.0
loss_critic            16155395825542076416.0
loss_actor                   -15582212143.104
memory_size                        15933.6645 

=== epoch 8/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:03,  2.55it/s]
100%|██████████| 2000/2000 [11:13<00:00,  2.97it/s]
episodes                                  213
episode_length                       9.319249
returns                              -2.50106
return_std                            1.90216
average_reward                       -0.26651
round_time             0 days 00:11:14.272555
episodes_test                           214.0
episode_length_test                  9.341121
returns_test                        -2.645114
return_std_test                      3.564375
average_reward_test                 -0.282712
round_time_test        0 days 00:00:02.663274
round_time_total       0 days 00:11:14.273632
loss_total             13132693747037972480.0
loss_critic            16415866914915332096.0
loss_actor                   -15733281398.784
memory_size                           15934.0 

=== epoch 8/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:17,  2.71it/s]
100%|██████████| 2000/2000 [11:15<00:00,  2.96it/s]
episodes                                  211
episode_length                       9.417062
returns                             -2.415238
return_std                           1.500627
average_reward                      -0.259271
round_time             0 days 00:11:16.180184
episodes_test                           216.0
episode_length_test                   9.25463
returns_test                        -2.611295
return_std_test                      1.580859
average_reward_test                 -0.281481
round_time_test        0 days 00:00:02.610025
round_time_total       0 days 00:11:16.181290
loss_total             13231863876491919360.0
loss_critic            16539829565720469504.0
loss_actor                -15881460078.591999
memory_size                           15934.0 

=== epoch 8/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:52,  2.59it/s]
100%|██████████| 2000/2000 [11:12<00:00,  2.97it/s]
episodes                                  211
episode_length                       9.421801
returns                             -2.511369
return_std                           1.560521
average_reward                      -0.263713
round_time             0 days 00:11:12.968536
episodes_test                           212.0
episode_length_test                  9.396226
returns_test                        -2.363322
return_std_test                      1.644586
average_reward_test                 -0.248252
round_time_test        0 days 00:00:02.635718
round_time_total       0 days 00:11:12.969609
loss_total             13518285116142569472.0
loss_critic            16897856105199198208.0
loss_actor                -16012877898.752001
memory_size                           15934.0 

=== epoch 8/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<11:51,  2.81it/s]
100%|██████████| 2000/2000 [11:12<00:00,  2.97it/s]
episodes                                  225
episode_length                       8.853333
returns                             -2.660027
return_std                           1.402878
average_reward                      -0.299123
round_time             0 days 00:11:13.094436
episodes_test                           213.0
episode_length_test                  9.356808
returns_test                        -2.181778
return_std_test                       1.92835
average_reward_test                 -0.231561
round_time_test        0 days 00:00:02.617676
round_time_total       0 days 00:11:13.095519
loss_total             13819228776222664704.0
loss_critic            17274035685144041472.0
loss_actor                -16150831347.200001
memory_size                           15934.0 

=== epoch 8/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:08,  2.74it/s]
100%|██████████| 2000/2000 [11:15<00:00,  2.96it/s]
episodes                                  214
episode_length                       9.299065
returns                             -2.610137
return_std                           1.688698
average_reward                      -0.280933
round_time             0 days 00:11:15.629074
episodes_test                           219.0
episode_length_test                  9.109589
returns_test                        -2.535943
return_std_test                      1.569632
average_reward_test                 -0.275918
round_time_test        0 days 00:00:02.611875
round_time_total       0 days 00:11:15.630164
loss_total             14136672249229924352.0
loss_critic            17670840012779479040.0
loss_actor                -16299926947.327999
memory_size                           15934.0 

=== epoch 8/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:08,  2.74it/s]
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  211
episode_length                       9.436019
returns                             -2.586236
return_std                            1.75171
average_reward                      -0.273626
round_time             0 days 00:11:11.317831
episodes_test                           205.0
episode_length_test                  9.741463
returns_test                        -2.417406
return_std_test                      2.090566
average_reward_test                 -0.246957
round_time_test        0 days 00:00:02.594212
round_time_total       0 days 00:11:11.318910
loss_total             14280416274890713088.0
loss_critic            17850520021284685824.0
loss_actor                    -16467554030.08
memory_size                           15934.0 

=== epoch 8/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:25,  2.68it/s]
100%|██████████| 2000/2000 [11:16<00:00,  2.96it/s]
episodes                                  200
episode_length                          9.995
returns                               -2.4266
return_std                           1.684092
average_reward                      -0.242132
round_time             0 days 00:11:17.204345
episodes_test                           205.0
episode_length_test                  9.726829
returns_test                        -2.439425
return_std_test                      3.695263
average_reward_test                 -0.250802
round_time_test        0 days 00:00:02.625279
round_time_total       0 days 00:11:17.205426
loss_total             14696842706388011008.0
loss_critic            18371053066119507968.0
loss_actor                   -16606896717.312
memory_size                           15934.0 

=== epoch 8/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:17,  2.70it/s]
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                  206
episode_length                       9.587379
returns                             -2.401042
return_std                           3.360284
average_reward                      -0.245676
round_time             0 days 00:11:11.853308
episodes_test                           211.0
episode_length_test                  9.450237
returns_test                        -2.800668
return_std_test                      1.620514
average_reward_test                 -0.294684
round_time_test        0 days 00:00:02.607788
round_time_total       0 days 00:11:11.854436
loss_total             14794307191972372480.0
loss_critic            18492883669870141440.0
loss_actor                   -16761100022.784
memory_size                         15937.279 

=== epoch 8/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:15,  2.51it/s]
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                  216
episode_length                       9.236111
returns                             -2.821986
return_std                            1.53858
average_reward                      -0.304111
round_time             0 days 00:11:09.535606
episodes_test                           214.0
episode_length_test                  9.327103
returns_test                        -2.847858
return_std_test                      1.543851
average_reward_test                 -0.303579
round_time_test        0 days 00:00:02.579977
round_time_total       0 days 00:11:09.536681
loss_total             15164533770361026560.0
loss_critic            18955666879386943488.0
loss_actor                -16917811117.568001
memory_size                           15938.0 

=== epoch 8/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:02,  2.76it/s]
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                  215
episode_length                       9.251163
returns                             -2.589541
return_std                           1.623612
average_reward                      -0.279953
round_time             0 days 00:11:10.364196
episodes_test                           214.0
episode_length_test                  9.317757
returns_test                        -2.606548
return_std_test                      1.657965
average_reward_test                 -0.277352
round_time_test        0 days 00:00:02.623310
round_time_total       0 days 00:11:10.365309
loss_total             15472746983372118016.0
loss_critic            19340933391321481216.0
loss_actor                -17075725772.799999
memory_size                           15938.0 

=== epoch 8/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:26,  2.68it/s]
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  215
episode_length                       9.223256
returns                             -2.623157
return_std                           1.624623
average_reward                      -0.283264
round_time             0 days 00:11:10.804791
episodes_test                           224.0
episode_length_test                   8.90625
returns_test                        -2.813077
return_std_test                      1.506988
average_reward_test                 -0.313835
round_time_test        0 days 00:00:02.622352
round_time_total       0 days 00:11:10.805883
loss_total             15769204401540032512.0
loss_critic            19711505171143839744.0
loss_actor                -17250188484.096001
memory_size                           15938.0 

=== epoch 8/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:49,  2.59it/s]
100%|██████████| 2000/2000 [11:12<00:00,  2.97it/s]
episodes                                  215
episode_length                       9.218605
returns                             -2.552229
return_std                           1.514573
average_reward                      -0.272118
round_time             0 days 00:11:13.081989
episodes_test                           218.0
episode_length_test                   9.16055
returns_test                        -2.771911
return_std_test                      1.689091
average_reward_test                 -0.301243
round_time_test        0 days 00:00:02.617791
round_time_total       0 days 00:11:13.083088
loss_total             16124950651209533440.0
loss_critic            20156187971204808704.0
loss_actor                -17446579501.568001
memory_size                           15938.0 

=== epoch 8/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:02,  2.55it/s]
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                  213
episode_length                       9.323944
returns                             -2.686463
return_std                           1.503378
average_reward                      -0.288783
round_time             0 days 00:11:11.531818
episodes_test                           216.0
episode_length_test                  9.259259
returns_test                        -2.850227
return_std_test                      1.757242
average_reward_test                 -0.307825
round_time_test        0 days 00:00:02.625494
round_time_total       0 days 00:11:11.532910
loss_total             16582595912021528576.0
loss_critic            20728244533063589888.0
loss_actor                -17642529563.136002
memory_size                           15938.0 

=== epoch 8/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:26,  2.68it/s]
100%|██████████| 2000/2000 [11:13<00:00,  2.97it/s]
episodes                                  209
episode_length                       9.507177
returns                             -2.573269
return_std                           1.539943
average_reward                      -0.270774
round_time             0 days 00:11:13.630053
episodes_test                           217.0
episode_length_test                  9.193548
returns_test                        -2.641745
return_std_test                      1.523977
average_reward_test                 -0.285631
round_time_test        0 days 00:00:02.598479
round_time_total       0 days 00:11:13.631151
loss_total             16983818769932257280.0
loss_critic            21229773100779077632.0
loss_actor                -17880242380.287998
memory_size                           15938.0 

=== epoch 8/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:44,  2.83it/s]
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  207
episode_length                       9.613527
returns                             -2.487019
return_std                           3.539309
average_reward                      -0.260468
round_time             0 days 00:11:10.707652
episodes_test                           212.0
episode_length_test                  9.424528
returns_test                        -2.956265
return_std_test                      1.881549
average_reward_test                 -0.312942
round_time_test        0 days 00:00:02.606383
round_time_total       0 days 00:11:10.708721
loss_total             17348928876789049344.0
loss_critic            21686160705831485440.0
loss_actor                -18056943102.976002
memory_size                        15945.7935 

=== epoch 8/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:29,  2.66it/s]
100%|██████████| 2000/2000 [11:13<00:00,  2.97it/s]
episodes                                  212
episode_length                       9.424528
returns                             -2.474378
return_std                           1.671177
average_reward                      -0.263654
round_time             0 days 00:11:13.560724
episodes_test                           216.0
episode_length_test                      9.25
returns_test                        -2.677613
return_std_test                      1.572812
average_reward_test                 -0.288193
round_time_test        0 days 00:00:02.614632
round_time_total       0 days 00:11:13.561801
loss_total             17578231624203040768.0
loss_critic            21972789170369900544.0
loss_actor                -18277670198.271999
memory_size                           15949.0 

=== epoch 8/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:40,  2.63it/s]
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  208
episode_length                       9.596154
returns                             -2.556308
return_std                           2.977837
average_reward                      -0.265245
round_time             0 days 00:11:09.160090
episodes_test                           219.0
episode_length_test                   9.13242
returns_test                        -2.726355
return_std_test                      1.463768
average_reward_test                 -0.298536
round_time_test        0 days 00:00:02.629736
round_time_total       0 days 00:11:09.161163
loss_total             18054058007145562112.0
loss_critic            22567572135372877824.0
loss_actor                -18505304459.776001
memory_size                        15951.0475 

=== epoch 8/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:13,  2.72it/s]
100%|██████████| 2000/2000 [11:12<00:00,  2.98it/s]
episodes                                  203
episode_length                       9.778325
returns                             -2.217777
return_std                           3.243891
average_reward                      -0.224393
round_time             0 days 00:11:12.555515
episodes_test                           212.0
episode_length_test                  9.415094
returns_test                        -2.683842
return_std_test                      1.758499
average_reward_test                 -0.282874
round_time_test        0 days 00:00:02.617690
round_time_total       0 days 00:11:12.556599
loss_total             18719958250098495488.0
loss_critic            23399947393331232768.0
loss_actor                -18683509348.352001
memory_size                         15955.469 

=== epoch 8/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:10,  2.73it/s]
100%|██████████| 2000/2000 [11:16<00:00,  2.96it/s]
episodes                                  216
episode_length                       9.240741
returns                             -2.739115
return_std                           1.924923
average_reward                      -0.297461
round_time             0 days 00:11:16.599280
episodes_test                           206.0
episode_length_test                  9.674757
returns_test                        -2.752585
return_std_test                      1.755238
average_reward_test                 -0.284185
round_time_test        0 days 00:00:02.612051
round_time_total       0 days 00:11:16.600350
loss_total             18941792600102629376.0
loss_critic            23677240340457127936.0
loss_actor                -18830699595.776001
memory_size                           15959.0 

=== epoch 8/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:41,  2.84it/s]
100%|██████████| 2000/2000 [11:15<00:00,  2.96it/s]
episodes                                  213
episode_length                       9.370892
returns                               -2.6428
return_std                           1.499479
average_reward                      -0.282046
round_time             0 days 00:11:16.281782
episodes_test                           209.0
episode_length_test                  9.564593
returns_test                        -2.714758
return_std_test                      1.606758
average_reward_test                 -0.283301
round_time_test        0 days 00:00:02.603520
round_time_total       0 days 00:11:16.282931
loss_total             19466446084553752576.0
loss_critic            24333057188083929088.0
loss_actor                -18998883587.071999
memory_size                           15959.0 

=== epoch 8/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:42,  2.62it/s]
100%|██████████| 2000/2000 [11:18<00:00,  2.95it/s]
episodes                                  210
episode_length                       9.504762
returns                              -2.60684
return_std                           1.476094
average_reward                      -0.273945
round_time             0 days 00:11:19.454268
episodes_test                           212.0
episode_length_test                  9.419811
returns_test                        -2.633652
return_std_test                      1.848364
average_reward_test                 -0.278382
round_time_test        0 days 00:00:02.604831
round_time_total       0 days 00:11:19.455347
loss_total             19606354545264902144.0
loss_critic            24507942748098670592.0
loss_actor                -19224645592.063999
memory_size                           15959.0 

=== epoch 8/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:40,  2.85it/s]
100%|██████████| 2000/2000 [11:24<00:00,  2.92it/s]
episodes                                  206
episode_length                       9.616505
returns                              -2.42457
return_std                           1.584436
average_reward                      -0.247867
round_time             0 days 00:11:25.168974
episodes_test                           211.0
episode_length_test                  9.445498
returns_test                        -2.672934
return_std_test                      1.707693
average_reward_test                 -0.282653
round_time_test        0 days 00:00:02.618676
round_time_total       0 days 00:11:25.170042
loss_total             20148644984163074048.0
loss_critic            25185805799745036288.0
loss_actor                -19393457766.400002
memory_size                           15959.0 

=== epoch 8/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:27<00:00,  2.91it/s]
episodes                                  209
episode_length                       9.545455
returns                             -2.520899
return_std                           1.589136
average_reward                      -0.265869
round_time             0 days 00:11:28.279295
episodes_test                           213.0
episode_length_test                  9.356808
returns_test                        -2.493802
return_std_test                      1.737268
average_reward_test                 -0.265531
round_time_test        0 days 00:00:02.618996
round_time_total       0 days 00:11:28.280378
loss_total             20595590152543490048.0
loss_critic            25744487242868891648.0
loss_actor                    -19595623004.16
memory_size                           15959.0 

=== epoch 8/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:25<00:00,  2.92it/s]
episodes                                  209
episode_length                       9.497608
returns                             -2.539173
return_std                           1.779337
average_reward                      -0.266172
round_time             0 days 00:11:25.994360
episodes_test                           214.0
episode_length_test                   9.32243
returns_test                        -2.615795
return_std_test                       1.74187
average_reward_test                 -0.278412
round_time_test        0 days 00:00:02.609501
round_time_total       0 days 00:11:25.995574
loss_total             20738820300691759104.0
loss_critic            25923524938396508160.0
loss_actor                -19819489241.088001
memory_size                           15959.0 

=== epoch 8/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:12<00:00,  2.97it/s]
episodes                                  201
episode_length                       9.885572
returns                             -2.656864
return_std                           2.275815
average_reward                      -0.271553
round_time             0 days 00:11:13.142348
episodes_test                           216.0
episode_length_test                   9.25463
returns_test                        -2.806653
return_std_test                      1.453192
average_reward_test                 -0.302628
round_time_test        0 days 00:00:02.642499
round_time_total       0 days 00:11:13.143413
loss_total             21006915148055560192.0
loss_critic            26258643486537424896.0
loss_actor                -19964466137.088001
memory_size                           15959.0 

=== epoch 8/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:26<00:00,  2.91it/s]
episodes                                  209
episode_length                       9.526316
returns                             -2.582076
return_std                           1.702572
average_reward                      -0.268334
round_time             0 days 00:11:26.771350
episodes_test                           203.0
episode_length_test                  9.852217
returns_test                        -2.652602
return_std_test                       1.95216
average_reward_test                 -0.269239
round_time_test        0 days 00:00:02.611827
round_time_total       0 days 00:11:26.772462
loss_total             21418046418140028928.0
loss_critic            26772557564419207168.0
loss_actor                -20218837794.816002
memory_size                           15959.0 

=== epoch 8/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:26<00:00,  2.48it/s]
episodes                                  206
episode_length                       9.660194
returns                             -2.589399
return_std                           1.691399
average_reward                      -0.268469
round_time             0 days 00:13:26.902044
episodes_test                           207.0
episode_length_test                  9.618357
returns_test                        -2.362886
return_std_test                      3.855341
average_reward_test                  -0.24288
round_time_test        0 days 00:00:02.674765
round_time_total       0 days 00:13:26.903843
loss_total             22328562500598476800.0
loss_critic            27910702609664823296.0
loss_actor                -20446132258.816002
memory_size                           15959.0 

=== epoch 8/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [22:58<00:00,  1.45it/s]
episodes                                  211
episode_length                       9.398104
returns                             -2.528885
return_std                           1.570102
average_reward                      -0.267843
round_time             0 days 00:22:59.607649
episodes_test                           212.0
episode_length_test                  9.415094
returns_test                        -2.817936
return_std_test                      2.100149
average_reward_test                 -0.296439
round_time_test        0 days 00:00:03.663416
round_time_total       0 days 00:22:59.610233
loss_total             22417011240973967360.0
loss_critic            28021263567088746496.0
loss_actor                   -20648487655.424
memory_size                           15959.0 

=== epoch 8/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [23:42<00:00,  1.41it/s]
episodes                                  210
episode_length                       9.504762
returns                             -2.617078
return_std                           1.599494
average_reward                      -0.273722
round_time             0 days 00:23:44.138153
episodes_test                           216.0
episode_length_test                   9.24537
returns_test                        -2.596242
return_std_test                      2.222718
average_reward_test                   -0.2804
round_time_test        0 days 00:00:04.129044
round_time_total       0 days 00:23:44.140788
loss_total             22849629596235411456.0
loss_critic            28562036534530170880.0
loss_actor                -20844768613.375999
memory_size                           15959.0 

=== epoch 8/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [23:23<00:00,  1.42it/s]
episodes                                  218
episode_length                       9.142202
returns                             -2.657165
return_std                           1.472718
average_reward                      -0.292724
round_time             0 days 00:23:25.771079
episodes_test                           213.0
episode_length_test                  9.389671
returns_test                        -2.595518
return_std_test                      1.513016
average_reward_test                 -0.276423
round_time_test        0 days 00:00:04.504753
round_time_total       0 days 00:23:25.773617
loss_total             23517415609931878400.0
loss_critic            29396769009628794880.0
loss_actor                -21044883126.271999
memory_size                           15959.0 

=== epoch 8/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [23:22<00:00,  1.43it/s]
episodes                                  212
episode_length                       9.419811
returns                             -2.556922
return_std                           3.225248
average_reward                      -0.270278
round_time             0 days 00:23:24.160566
episodes_test                           207.0
episode_length_test                  9.637681
returns_test                        -2.529397
return_std_test                       1.62025
average_reward_test                 -0.261183
round_time_test        0 days 00:00:03.352931
round_time_total       0 days 00:23:24.163346
loss_total             23825362674883473408.0
loss_critic            29781702832297074688.0
loss_actor                   -21238842285.056
memory_size                        15961.9025 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 9/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [22:51<00:00,  1.46it/s]
episodes                                  222
episode_length                       9.009009
returns                             -2.757712
return_std                           1.452016
average_reward                      -0.306106
round_time             0 days 00:22:51.785731
episodes_test                           210.0
episode_length_test                       9.5
returns_test                        -2.595511
return_std_test                      1.784406
average_reward_test                 -0.273452
round_time_test        0 days 00:00:04.291030
round_time_total       0 days 00:22:51.788527
loss_total             24595564336871702528.0
loss_critic            30744454923698057216.0
loss_actor                -21386721483.776001
memory_size                           15965.0 

=== epoch 9/10 ===== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [24:17<00:00,  1.37it/s]
episodes                                  213
episode_length                       9.366197
returns                             -2.531765
return_std                            1.77193
average_reward                      -0.269061
round_time             0 days 00:24:19.509424
episodes_test                           219.0
episode_length_test                  9.091324
returns_test                        -2.799462
return_std_test                      1.516175
average_reward_test                 -0.304452
round_time_test        0 days 00:00:03.562586
round_time_total       0 days 00:24:19.511995
loss_total             24706988072823607296.0
loss_critic            30883734567387095040.0
loss_actor                -21593690488.832001
memory_size                           15965.0 

=== epoch 9/10 ===== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [23:45<00:00,  1.40it/s]
episodes                                  212
episode_length                       9.382075
returns                              -2.77959
return_std                           1.438609
average_reward                      -0.296232
round_time             0 days 00:23:47.042923
episodes_test                           215.0
episode_length_test                  9.269767
returns_test                        -2.656772
return_std_test                      1.633263
average_reward_test                 -0.287337
round_time_test        0 days 00:00:04.511469
round_time_total       0 days 00:23:47.045537
loss_total             25366167121180151808.0
loss_critic            31707708386353991680.0
loss_actor                -21780512353.279999
memory_size                           15965.0 

=== epoch 9/10 ===== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [23:05<00:00,  1.44it/s]
episodes                                  214
episode_length                        9.28972
returns                             -2.739949
return_std                           1.571654
average_reward                      -0.294885
round_time             0 days 00:23:08.002147
episodes_test                           217.0
episode_length_test                  9.179724
returns_test                        -2.788545
return_std_test                      1.442157
average_reward_test                 -0.302077
round_time_test        0 days 00:00:03.481769
round_time_total       0 days 00:23:08.004926
loss_total             25690713409822187520.0
loss_critic            32113391220218503168.0
loss_actor                -21995323374.591999
memory_size                           15965.0 

=== epoch 9/10 ===== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [23:32<00:00,  1.42it/s]
episodes                                  214
episode_length                       9.313084
returns                             -2.562383
return_std                           1.393631
average_reward                      -0.274717
round_time             0 days 00:23:34.128401
episodes_test                           218.0
episode_length_test                  9.146789
returns_test                        -2.562484
return_std_test                      2.027604
average_reward_test                 -0.280072
round_time_test        0 days 00:00:03.868052
round_time_total       0 days 00:23:34.130689
loss_total             26723754783420616704.0
loss_critic            33404692903406559232.0
loss_actor                -22156431480.832001
memory_size                           15965.0 

=== epoch 9/10 ===== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [22:35<00:00,  1.48it/s]
episodes                                  214
episode_length                       9.313084
returns                             -2.668792
return_std                           1.658732
average_reward                      -0.287574
round_time             0 days 00:22:37.364054
episodes_test                           216.0
episode_length_test                  9.222222
returns_test                        -2.769969
return_std_test                      1.455702
average_reward_test                 -0.299213
round_time_test        0 days 00:00:03.773706
round_time_total       0 days 00:22:37.366954
loss_total             26805653910188261376.0
loss_critic            33507066870861783040.0
loss_actor                -22386184465.408001
memory_size                           15965.0 

=== epoch 9/10 ===== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [23:14<00:00,  1.43it/s]
episodes                                  214
episode_length                       9.285047
returns                             -2.680094
return_std                           2.082889
average_reward                      -0.286319
round_time             0 days 00:23:16.168216
episodes_test                           212.0
episode_length_test                  9.396226
returns_test                        -2.685585
return_std_test                       1.36657
average_reward_test                 -0.285803
round_time_test        0 days 00:00:03.985199
round_time_total       0 days 00:23:16.170847
loss_total             27168832674568208384.0
loss_critic            33961040255967969280.0
loss_actor                   -22556007542.784
memory_size                           15965.0 

=== epoch 9/10 ===== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [23:21<00:00,  1.43it/s]
episodes                                  209
episode_length                       9.535885
returns                             -2.802019
return_std                           1.603496
average_reward                      -0.294435
round_time             0 days 00:23:23.303698
episodes_test                           218.0
episode_length_test                  9.146789
returns_test                         -2.86285
return_std_test                      1.581931
average_reward_test                 -0.310604
round_time_test        0 days 00:00:03.379880
round_time_total       0 days 00:23:23.306162
loss_total             27510792579907653632.0
loss_critic            34388490125925609472.0
loss_actor                    -22737021291.52
memory_size                           15965.0 

=== epoch 9/10 ===== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [24:02<00:00,  1.39it/s]
episodes                                  216
episode_length                       9.189815
returns                             -2.790263
return_std                           1.432227
average_reward                      -0.303821
round_time             0 days 00:24:04.268470
episodes_test                           211.0
episode_length_test                  9.454976
returns_test                        -2.688747
return_std_test                      2.021633
average_reward_test                 -0.283424
round_time_test        0 days 00:00:04.231340
round_time_total       0 days 00:24:04.271237
loss_total             27827552543641812992.0
loss_critic            34784440098529087488.0
loss_actor                   -22985150942.208
memory_size                           15965.0 

=== epoch 9/10 ===== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [28:00<00:00,  1.19it/s]
episodes                                  220
episode_length                       9.081818
returns                             -2.807681
return_std                           2.403233
average_reward                      -0.308038
round_time             0 days 00:28:02.359808
episodes_test                           213.0
episode_length_test                  9.389671
returns_test                        -2.691946
return_std_test                       1.43916
average_reward_test                 -0.286692
round_time_test        0 days 00:00:04.056625
round_time_total       0 days 00:28:02.362016
loss_total             28594278174248181760.0
loss_critic            35742847096998473728.0
loss_actor                   -23101380942.848
memory_size                           15965.0 

=== epoch 9/10 ===== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [30:27<00:00,  1.09it/s]
episodes                                  206
episode_length                       9.616505
returns                             -2.667715
return_std                           1.666041
average_reward                      -0.275884
round_time             0 days 00:30:30.270863
episodes_test                           215.0
episode_length_test                   9.27907
returns_test                        -2.742217
return_std_test                      1.677921
average_reward_test                 -0.296169
round_time_test        0 days 00:00:06.065524
round_time_total       0 days 00:30:30.272918
loss_total             29048760547201245184.0
loss_critic            36310950063395962880.0
loss_actor                -23364506921.984001
memory_size                           15965.0 

=== epoch 9/10 ===== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:56<00:00,  1.52it/s]
episodes                                  212
episode_length                       9.391509
returns                             -2.686308
return_std                           1.647487
average_reward                      -0.284232
round_time             0 days 00:21:59.017312
episodes_test                           213.0
episode_length_test                  9.370892
returns_test                        -2.743015
return_std_test                      1.619163
average_reward_test                 -0.290371
round_time_test        0 days 00:00:04.072327
round_time_total       0 days 00:21:59.019264
loss_total             29814295135547678720.0
loss_critic            37267868289586233344.0
loss_actor                -23607291758.591999
memory_size                        15968.5925 

=== epoch 9/10 ===== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:49<00:00,  1.60it/s]
episodes                                  208
episode_length                       9.591346
returns                             -2.683071
return_std                           1.751748
average_reward                      -0.281311
round_time             0 days 00:20:50.936668
episodes_test                           217.0
episode_length_test                  9.193548
returns_test                        -2.742249
return_std_test                      1.576254
average_reward_test                 -0.295432
round_time_test        0 days 00:00:03.389761
round_time_total       0 days 00:20:50.939330
loss_total             30576021773728759808.0
loss_critic            38220026563913605120.0
loss_actor                -23780875991.040001
memory_size                           15971.0 

=== epoch 9/10 ===== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [21:30<00:00,  1.55it/s]
episodes                                  212
episode_length                       9.353774
returns                             -2.835459
return_std                            1.50918
average_reward                      -0.304983
round_time             0 days 00:21:32.448751
episodes_test                           217.0
episode_length_test                  9.175115
returns_test                        -2.768269
return_std_test                       1.47974
average_reward_test                 -0.299379
round_time_test        0 days 00:00:03.493579
round_time_total       0 days 00:21:32.450207
loss_total             30694814637408329728.0
loss_critic            38368517652961992704.0
loss_actor                   -24053105511.424
memory_size                           15971.0 

=== epoch 9/10 ===== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:28<00:00,  1.63it/s]
episodes                                  214
episode_length                       9.327103
returns                             -3.026846
return_std                           1.767883
average_reward                      -0.323116
round_time             0 days 00:20:29.375570
episodes_test                           219.0
episode_length_test                   9.13242
returns_test                        -2.953291
return_std_test                      1.655974
average_reward_test                 -0.323385
round_time_test        0 days 00:00:03.753460
round_time_total       0 days 00:20:29.377595
loss_total             31798060867616182272.0
loss_critic            39747575356437372928.0
loss_actor                -24271762974.720001
memory_size                           15971.0 

=== epoch 9/10 ===== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:03<00:00,  1.66it/s]
episodes                                  218
episode_length                        9.12844
returns                             -2.855509
return_std                           1.445244
average_reward                      -0.311898
round_time             0 days 00:20:04.773788
episodes_test                           211.0
episode_length_test                  9.478673
returns_test                        -2.872264
return_std_test                       1.62846
average_reward_test                 -0.303024
round_time_test        0 days 00:00:03.651531
round_time_total       0 days 00:20:04.775781
loss_total             32151896309287997440.0
loss_critic            40189869709516996608.0
loss_actor                -24499445859.327999
memory_size                           15971.0 

=== epoch 9/10 ===== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:00<00:00,  1.67it/s]
episodes                                  213
episode_length                       9.342723
returns                             -2.783951
return_std                           1.317081
average_reward                       -0.29801
round_time             0 days 00:20:01.867626
episodes_test                           218.0
episode_length_test                  9.142202
returns_test                         -2.99559
return_std_test                      1.527006
average_reward_test                  -0.32791
round_time_test        0 days 00:00:04.656219
round_time_total       0 days 00:20:01.869694
loss_total             32640075613559361536.0
loss_critic            40800093837176135680.0
loss_actor                -24709711334.400002
memory_size                           15971.0 

=== epoch 9/10 ===== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:24<00:00,  1.63it/s]
episodes                                  208
episode_length                       9.591346
returns                              -2.89572
return_std                           1.654904
average_reward                       -0.30244
round_time             0 days 00:20:25.281832
episodes_test                           207.0
episode_length_test                   9.63285
returns_test                         -2.63941
return_std_test                      1.765744
average_reward_test                 -0.273846
round_time_test        0 days 00:00:04.463385
round_time_total       0 days 00:20:25.283761
loss_total             33533787407773790208.0
loss_critic            41917233538231451648.0
loss_actor                -24833107440.639999
memory_size                           15971.0 

=== epoch 9/10 ===== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:36<00:00,  1.62it/s]
episodes                                  210
episode_length                       9.442857
returns                              -3.05451
return_std                           1.667318
average_reward                      -0.321225
round_time             0 days 00:20:37.544534
episodes_test                           213.0
episode_length_test                  9.375587
returns_test                        -2.996958
return_std_test                      1.406048
average_reward_test                 -0.318632
round_time_test        0 days 00:00:03.164353
round_time_total       0 days 00:20:37.546446
loss_total             34084694281693212672.0
loss_critic            42605867140801208320.0
loss_actor                   -25050832835.584
memory_size                           15971.0 

=== epoch 9/10 ===== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:19<00:00,  1.64it/s]
episodes                                  213
episode_length                       9.342723
returns                             -2.719663
return_std                           1.423555
average_reward                      -0.292251
round_time             0 days 00:20:19.999881
episodes_test                           214.0
episode_length_test                  9.308411
returns_test                        -3.027555
return_std_test                      1.463163
average_reward_test                 -0.323616
round_time_test        0 days 00:00:04.059764
round_time_total       0 days 00:20:20.001770
loss_total             33929851159764656128.0
loss_critic            42412313267252699136.0
loss_actor                   -25284520845.312
memory_size                           15971.0 

=== epoch 9/10 ===== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:59<00:00,  1.67it/s]
episodes                                  218
episode_length                       9.146789
returns                             -2.687476
return_std                           1.414392
average_reward                      -0.293612
round_time             0 days 00:20:00.476833
episodes_test                           217.0
episode_length_test                  9.198157
returns_test                        -2.737654
return_std_test                      1.673826
average_reward_test                 -0.295736
round_time_test        0 days 00:00:03.382271
round_time_total       0 days 00:20:00.478611
loss_total             34875999201443786752.0
loss_critic            43594998258053832704.0
loss_actor                   -25526185714.688
memory_size                           15971.0 

=== epoch 9/10 ===== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:08<00:00,  1.66it/s]
episodes                                  218
episode_length                       9.155963
returns                              -2.83728
return_std                           1.599685
average_reward                      -0.310598
round_time             0 days 00:20:09.055560
episodes_test                           215.0
episode_length_test                  9.255814
returns_test                        -2.639909
return_std_test                      1.546758
average_reward_test                  -0.28349
round_time_test        0 days 00:00:03.266290
round_time_total       0 days 00:20:09.057058
loss_total             35498929785201545216.0
loss_critic            44373661442258739200.0
loss_actor                -25762843421.695999
memory_size                           15971.0 

=== epoch 9/10 ===== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:04<00:00,  1.66it/s]
episodes                                  215
episode_length                        9.24186
returns                              -2.69311
return_std                           1.562413
average_reward                      -0.292632
round_time             0 days 00:20:05.725917
episodes_test                           219.0
episode_length_test                  9.105023
returns_test                        -2.613311
return_std_test                      1.475973
average_reward_test                  -0.28577
round_time_test        0 days 00:00:03.220255
round_time_total       0 days 00:20:05.727713
loss_total             36761523692853248000.0
loss_critic            45951903808818864128.0
loss_actor                -25983540027.391998
memory_size                           15971.0 

=== epoch 9/10 ===== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:56<00:00,  1.67it/s]
episodes                                  213
episode_length                       9.380282
returns                             -2.696999
return_std                           1.808704
average_reward                      -0.288243
round_time             0 days 00:19:57.733557
episodes_test                           217.0
episode_length_test                  9.202765
returns_test                        -2.760932
return_std_test                      1.436543
average_reward_test                 -0.299014
round_time_test        0 days 00:00:03.451819
round_time_total       0 days 00:19:57.735436
loss_total             37136746319862243328.0
loss_critic            46420932106255286272.0
loss_actor                   -26159725278.208
memory_size                           15971.0 

=== epoch 9/10 ===== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:22<00:00,  1.64it/s]
episodes                                  209
episode_length                       9.555024
returns                             -2.752216
return_std                           1.720074
average_reward                      -0.286128
round_time             0 days 00:20:22.936380
episodes_test                           210.0
episode_length_test                  9.490476
returns_test                         -2.73378
return_std_test                       1.69702
average_reward_test                 -0.288575
round_time_test        0 days 00:00:04.450927
round_time_total       0 days 00:20:22.938281
loss_total             37364583790598529024.0
loss_critic            46705728972025995264.0
loss_actor                -26420514302.976002
memory_size                           15971.0 

=== epoch 9/10 ===== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:31<00:00,  1.62it/s]
episodes                                  220
episode_length                       9.054545
returns                             -2.854926
return_std                           1.529254
average_reward                      -0.317141
round_time             0 days 00:20:32.339159
episodes_test                           208.0
episode_length_test                  9.610577
returns_test                        -2.733891
return_std_test                      1.524874
average_reward_test                 -0.283807
round_time_test        0 days 00:00:03.127418
round_time_total       0 days 00:20:32.340925
loss_total             37857279154712788992.0
loss_critic            47321598105631842304.0
loss_actor                   -26567689097.216
memory_size                           15971.0 

=== epoch 9/10 ===== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:06<00:00,  1.66it/s]
episodes                                  217
episode_length                        9.16129
returns                             -2.861627
return_std                           1.512267
average_reward                       -0.30983
round_time             0 days 00:20:07.166201
episodes_test                           215.0
episode_length_test                  9.283721
returns_test                        -2.832117
return_std_test                      1.572508
average_reward_test                 -0.302473
round_time_test        0 days 00:00:03.207289
round_time_total       0 days 00:20:07.168132
loss_total             38705206384521478144.0
loss_critic            48381507147015872512.0
loss_actor                -26718128191.487999
memory_size                           15971.0 

=== epoch 9/10 ===== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:28<00:00,  1.63it/s]
episodes                                  210
episode_length                       9.514286
returns                              -2.66235
return_std                           1.692256
average_reward                       -0.27971
round_time             0 days 00:20:29.841697
episodes_test                           223.0
episode_length_test                  8.946188
returns_test                        -2.642254
return_std_test                      1.781377
average_reward_test                 -0.292322
round_time_test        0 days 00:00:03.333126
round_time_total       0 days 00:20:29.843719
loss_total             39425410937635561472.0
loss_critic            49281762820747575296.0
loss_actor                    -26968073016.32
memory_size                           15971.0 

=== epoch 9/10 ===== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:35<00:00,  1.62it/s]
episodes                                  213
episode_length                       9.309859
returns                             -2.655105
return_std                           1.473738
average_reward                      -0.285499
round_time             0 days 00:20:36.082074
episodes_test                           211.0
episode_length_test                  9.469194
returns_test                          -2.7984
return_std_test                      1.681639
average_reward_test                 -0.294338
round_time_test        0 days 00:00:03.248773
round_time_total       0 days 00:20:36.083568
loss_total             39901878118978412544.0
loss_critic            49877346791103946752.0
loss_actor                -27210347036.672001
memory_size                           15971.0 

=== epoch 9/10 ===== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:02<00:00,  1.66it/s]
episodes                                  210
episode_length                       9.480952
returns                             -2.907734
return_std                           1.645496
average_reward                      -0.307935
round_time             0 days 00:20:03.351056
episodes_test                           212.0
episode_length_test                  9.396226
returns_test                         -2.60037
return_std_test                      1.652827
average_reward_test                 -0.276945
round_time_test        0 days 00:00:03.788089
round_time_total       0 days 00:20:03.352882
loss_total             39947528737304240128.0
loss_critic            49934410034461851648.0
loss_actor                -27533926111.231998
memory_size                           15971.0 

=== epoch 9/10 ===== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:49<00:00,  1.60it/s]
episodes                                  223
episode_length                       8.950673
returns                             -2.870782
return_std                           1.530542
average_reward                        -0.3199
round_time             0 days 00:20:50.966261
episodes_test                           217.0
episode_length_test                  9.184332
returns_test                         -2.92422
return_std_test                      1.387198
average_reward_test                 -0.317844
round_time_test        0 days 00:00:03.383617
round_time_total       0 days 00:20:50.967691
loss_total             41586747250170544128.0
loss_critic            51983433172108754944.0
loss_actor                -27774901615.616001
memory_size                           15971.0 

=== epoch 9/10 ===== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:13<00:00,  1.73it/s]
episodes                                  214
episode_length                       9.313084
returns                             -2.567903
return_std                           1.617611
average_reward                      -0.277192
round_time             0 days 00:19:14.829600
episodes_test                           214.0
episode_length_test                  9.317757
returns_test                        -2.573584
return_std_test                       1.38921
average_reward_test                 -0.275585
round_time_test        0 days 00:00:04.259006
round_time_total       0 days 00:19:14.831395
loss_total             42038696187804508160.0
loss_critic            52548369273438879744.0
loss_actor                -28037650877.439999
memory_size                           15971.0 

=== epoch 9/10 ===== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:45<00:00,  2.26it/s]
episodes                                  217
episode_length                       9.170507
returns                             -2.619641
return_std                           1.439747
average_reward                      -0.286766
round_time             0 days 00:14:46.594310
episodes_test                           216.0
episode_length_test                  9.217593
returns_test                        -2.352323
return_std_test                      1.512053
average_reward_test                 -0.253634
round_time_test        0 days 00:00:03.103931
round_time_total       0 days 00:14:46.595609
loss_total             43558062378785366016.0
loss_critic            54447577032574631936.0
loss_actor                   -28335567241.216
memory_size                           15971.0 

=== epoch 9/10 ===== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:07<00:00,  2.54it/s]
episodes                                  209
episode_length                       9.516746
returns                             -2.439254
return_std                           3.797302
average_reward                      -0.255026
round_time             0 days 00:13:08.417321
episodes_test                           212.0
episode_length_test                  9.419811
returns_test                        -2.821858
return_std_test                      1.459623
average_reward_test                 -0.298056
round_time_test        0 days 00:00:02.944178
round_time_total       0 days 00:13:08.418507
loss_total             44338571319207133184.0
loss_critic            55423213196556967936.0
loss_actor                -28581109221.375999
memory_size                         15990.825 

=== epoch 9/10 ===== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:12<00:00,  2.73it/s]
episodes                                  214
episode_length                       9.327103
returns                             -2.770696
return_std                           1.547535
average_reward                      -0.297201
round_time             0 days 00:12:12.728963
episodes_test                           221.0
episode_length_test                  9.036199
returns_test                        -2.987941
return_std_test                      1.549336
average_reward_test                 -0.330258
round_time_test        0 days 00:00:02.726540
round_time_total       0 days 00:12:12.730106
loss_total             45087926349721821184.0
loss_critic            56359906985799835648.0
loss_actor                -28817421009.919998
memory_size                           15996.0 

=== epoch 9/10 ===== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:48<00:00,  2.82it/s]
episodes                                  211
episode_length                       9.421801
returns                             -2.512237
return_std                           1.561441
average_reward                      -0.267959
round_time             0 days 00:11:48.596902
episodes_test                           213.0
episode_length_test                  9.361502
returns_test                        -2.723078
return_std_test                      1.621227
average_reward_test                 -0.291122
round_time_test        0 days 00:00:02.671142
round_time_total       0 days 00:11:48.597983
loss_total             45876821571267747840.0
loss_critic            57346025965659406336.0
loss_actor                -29004643639.296001
memory_size                           15996.0 

=== epoch 9/10 ===== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:27<00:00,  2.91it/s]
episodes                                  220
episode_length                       9.063636
returns                             -2.545857
return_std                           1.489319
average_reward                      -0.280998
round_time             0 days 00:11:27.724443
episodes_test                           212.0
episode_length_test                  9.429245
returns_test                        -2.481388
return_std_test                      1.773898
average_reward_test                 -0.262594
round_time_test        0 days 00:00:02.601938
round_time_total       0 days 00:11:27.725524
loss_total             45784093067917918208.0
loss_critic            57230115339908096000.0
loss_actor                -29279237462.015999
memory_size                           15996.0 

=== epoch 9/10 ===== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                  211
episode_length                        9.42654
returns                             -2.670166
return_std                           1.690612
average_reward                      -0.282649
round_time             0 days 00:11:12.371851
episodes_test                           211.0
episode_length_test                  9.464455
returns_test                        -2.376697
return_std_test                      2.612218
average_reward_test                 -0.249432
round_time_test        0 days 00:00:02.616012
round_time_total       0 days 00:11:12.372927
loss_total             47622495850793558016.0
loss_critic            59528118806476734464.0
loss_actor                -29471338703.872002
memory_size                           15996.0 

=== epoch 9/10 ===== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  218
episode_length                       9.133028
returns                             -2.783987
return_std                           1.558527
average_reward                       -0.30793
round_time             0 days 00:11:11.262467
episodes_test                           218.0
episode_length_test                  9.165138
returns_test                        -2.700206
return_std_test                      1.472506
average_reward_test                 -0.293508
round_time_test        0 days 00:00:02.619774
round_time_total       0 days 00:11:11.263561
loss_total             48147856796347801600.0
loss_critic            60184819995428929536.0
loss_actor                   -29683922501.632
memory_size                           15996.0 

=== epoch 9/10 ===== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:12<00:00,  2.97it/s]
episodes                                  215
episode_length                       9.265116
returns                             -2.872892
return_std                            1.51537
average_reward                      -0.309019
round_time             0 days 00:11:13.420784
episodes_test                           216.0
episode_length_test                  9.226852
returns_test                         -2.61162
return_std_test                      1.706112
average_reward_test                 -0.280512
round_time_test        0 days 00:00:02.618978
round_time_total       0 days 00:11:13.421882
loss_total             48747645133142523904.0
loss_critic            60934555343717130240.0
loss_actor                -29885054362.624001
memory_size                           15996.0 

=== epoch 9/10 ===== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                  215
episode_length                       9.251163
returns                              -2.76978
return_std                           1.694895
average_reward                      -0.298828
round_time             0 days 00:11:12.208907
episodes_test                           211.0
episode_length_test                  9.473934
returns_test                        -2.755162
return_std_test                      1.341586
average_reward_test                 -0.290288
round_time_test        0 days 00:00:02.632443
round_time_total       0 days 00:11:12.209985
loss_total             49219130268695371776.0
loss_critic            61523911775321530368.0
loss_actor                   -30102692071.424
memory_size                           15996.0 

=== epoch 9/10 ===== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:13<00:00,  2.97it/s]
episodes                                  218
episode_length                       9.151376
returns                             -2.785971
return_std                           1.404286
average_reward                      -0.304383
round_time             0 days 00:11:13.843062
episodes_test                           222.0
episode_length_test                  8.977477
returns_test                        -2.777317
return_std_test                      1.452903
average_reward_test                 -0.309406
round_time_test        0 days 00:00:02.627866
round_time_total       0 days 00:11:13.844121
loss_total             49981629302937288704.0
loss_critic            62477035583860686848.0
loss_actor                -30332710337.535999
memory_size                           15996.0 

=== epoch 9/10 ===== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  210
episode_length                        9.47619
returns                             -2.772205
return_std                           1.769192
average_reward                      -0.293423
round_time             0 days 00:11:09.259269
episodes_test                           210.0
episode_length_test                  9.509524
returns_test                        -2.792638
return_std_test                      1.861185
average_reward_test                 -0.292167
round_time_test        0 days 00:00:02.609629
round_time_total       0 days 00:11:09.260331
loss_total             50331059616693362688.0
loss_critic            62913823447193600000.0
loss_actor                    -30529002260.48
memory_size                           15996.0 

=== epoch 9/10 ===== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:15<00:00,  2.96it/s]
episodes                                  212
episode_length                       9.400943
returns                             -2.649192
return_std                           1.738179
average_reward                      -0.283038
round_time             0 days 00:11:16.038270
episodes_test                           208.0
episode_length_test                  9.600962
returns_test                        -2.677179
return_std_test                      1.615426
average_reward_test                 -0.277046
round_time_test        0 days 00:00:02.575025
round_time_total       0 days 00:11:16.039343
loss_total             51509096474314874880.0
loss_critic            64386369510836715520.0
loss_actor                -30758372636.672001
memory_size                           15996.0 

=== epoch 9/10 ===== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  213
episode_length                       9.338028
returns                             -2.740215
return_std                           1.476071
average_reward                      -0.292645
round_time             0 days 00:11:08.231992
episodes_test                           217.0
episode_length_test                  9.198157
returns_test                        -2.700382
return_std_test                      1.442341
average_reward_test                 -0.292503
round_time_test        0 days 00:00:02.613484
round_time_total       0 days 00:11:08.233060
loss_total             52104026877367508992.0
loss_critic            65130032476307038208.0
loss_actor                -31010151452.672001
memory_size                           15996.0 

=== epoch 9/10 ===== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:13<00:00,  2.97it/s]
episodes                                  214
episode_length                       9.261682
returns                             -2.823948
return_std                           1.458766
average_reward                      -0.302325
round_time             0 days 00:11:14.283320
episodes_test                           211.0
episode_length_test                   9.43128
returns_test                        -2.644834
return_std_test                      1.795993
average_reward_test                 -0.279007
round_time_test        0 days 00:00:02.616355
round_time_total       0 days 00:11:14.284391
loss_total             52956334316310167552.0
loss_critic            66195416762340974592.0
loss_actor                -31218341633.023998
memory_size                           15996.0 

=== epoch 9/10 ===== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:13<00:00,  2.97it/s]
episodes                                  208
episode_length                       9.591346
returns                             -2.814938
return_std                            1.71824
average_reward                      -0.294523
round_time             0 days 00:11:14.293300
episodes_test                           208.0
episode_length_test                  9.586538
returns_test                        -2.747025
return_std_test                      1.648936
average_reward_test                 -0.283867
round_time_test        0 days 00:00:02.588289
round_time_total       0 days 00:11:14.294363
loss_total             54443960788308901888.0
loss_critic            68054949787330764800.0
loss_actor                -31546502890.495998
memory_size                           15996.0 

=== epoch 9/10 ===== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  210
episode_length                       9.485714
returns                             -2.747847
return_std                           1.634188
average_reward                      -0.290656
round_time             0 days 00:11:09.300490
episodes_test                           213.0
episode_length_test                  9.342723
returns_test                        -3.000154
return_std_test                       1.66052
average_reward_test                 -0.318683
round_time_test        0 days 00:00:02.630280
round_time_total       0 days 00:11:09.301548
loss_total             54731475531924512768.0
loss_critic            68414343244338069504.0
loss_actor                -31740789498.880001
memory_size                           15996.0 

=== epoch 9/10 ===== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  218
episode_length                       9.146789
returns                              -2.89004
return_std                           1.536573
average_reward                      -0.317864
round_time             0 days 00:11:10.593797
episodes_test                           208.0
episode_length_test                  9.610577
returns_test                        -2.934587
return_std_test                      1.865591
average_reward_test                 -0.304808
round_time_test        0 days 00:00:02.693839
round_time_total       0 days 00:11:10.594882
loss_total             56105782640655818752.0
loss_critic            70132227161588293632.0
loss_actor                   -32050066189.312
memory_size                           15996.0 

=== epoch 9/10 ===== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  2.99it/s]
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
episodes                                  218
episode_length                       9.155963
returns                             -2.717949
return_std                           1.359113
average_reward                      -0.295867
round_time             0 days 00:11:08.264384
episodes_test                           213.0
episode_length_test                  9.342723
returns_test                        -2.881934
return_std_test                      1.633378
average_reward_test                 -0.303486
round_time_test        0 days 00:00:02.660262
round_time_total       0 days 00:11:08.265452
loss_total             56786292316296044544.0
loss_critic            70982864246655287296.0
loss_actor                -32287946534.911999
memory_size                           15996.0 


<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 10/10 ==== round 1/50 ======================================
100%|██████████| 2000/2000 [11:14<00:00,  2.97it/s]
episodes                                  217
episode_length                       9.207373
returns                             -2.607373
return_std                           1.449341
average_reward                      -0.282261
round_time             0 days 00:11:14.376750
episodes_test                           211.0
episode_length_test                   9.43128
returns_test                        -2.689374
return_std_test                      1.608297
average_reward_test                 -0.284743
round_time_test        0 days 00:00:02.619041
round_time_total       0 days 00:11:14.377856
loss_total             57731317887994855424.0
loss_critic            72164146138023837696.0
loss_actor                -32597024014.335999
memory_size                           15996.0 

=== epoch 10/10 ==== round 2/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:13<00:00,  2.97it/s]
episodes                                  216
episode_length                       9.226852
returns                             -2.803135
return_std                           1.492092
average_reward                      -0.304978
round_time             0 days 00:11:13.854556
episodes_test                           217.0
episode_length_test                  9.184332
returns_test                        -2.705823
return_std_test                      1.534774
average_reward_test                 -0.292945
round_time_test        0 days 00:00:02.596457
round_time_total       0 days 00:11:13.855646
loss_total             58325173109138137088.0
loss_critic            72906465139095445504.0
loss_actor                   -32809745953.792
memory_size                           15996.0 

=== epoch 10/10 ==== round 3/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:14<00:00,  2.96it/s]
episodes                                  210
episode_length                       9.480952
returns                              -2.59071
return_std                           1.873069
average_reward                      -0.273721
round_time             0 days 00:11:15.199819
episodes_test                           212.0
episode_length_test                  9.433962
returns_test                        -2.775318
return_std_test                       1.46863
average_reward_test                 -0.294184
round_time_test        0 days 00:00:02.638749
round_time_total       0 days 00:11:15.200915
loss_total             59409536056857894912.0
loss_critic            74261918746573176832.0
loss_actor                -33172727847.936001
memory_size                           15996.0 

=== epoch 10/10 ==== round 4/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                  224
episode_length                       8.892857
returns                              -2.83228
return_std                           1.393878
average_reward                      -0.318555
round_time             0 days 00:11:11.781240
episodes_test                           222.0
episode_length_test                  8.977477
returns_test                        -2.690727
return_std_test                      1.653888
average_reward_test                 -0.299966
round_time_test        0 days 00:00:02.643801
round_time_total       0 days 00:11:11.782331
loss_total             60857369643390173184.0
loss_critic            76071710702800486400.0
loss_actor                -33396604119.040001
memory_size                           15996.0 

=== epoch 10/10 ==== round 5/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:14<00:00,  2.96it/s]
episodes                                  215
episode_length                       9.255814
returns                             -2.657058
return_std                           1.598737
average_reward                      -0.286965
round_time             0 days 00:11:15.177510
episodes_test                           220.0
episode_length_test                  9.063636
returns_test                        -2.931076
return_std_test                      1.351379
average_reward_test                 -0.321629
round_time_test        0 days 00:00:02.660589
round_time_total       0 days 00:11:15.178597
loss_total             61501493246703108096.0
loss_critic            76876865262879309824.0
loss_actor                   -33766336206.848
memory_size                           15996.0 

=== epoch 10/10 ==== round 6/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  220
episode_length                       9.031818
returns                             -2.647323
return_std                            1.43394
average_reward                      -0.291795
round_time             0 days 00:11:11.126034
episodes_test                           220.0
episode_length_test                  9.081818
returns_test                        -3.015944
return_std_test                       1.39456
average_reward_test                 -0.331153
round_time_test        0 days 00:00:02.642900
round_time_total       0 days 00:11:11.127129
loss_total             63483262536822620160.0
loss_critic            79354076824538857472.0
loss_actor                   -33897890597.888
memory_size                           15996.0 

=== epoch 10/10 ==== round 7/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                  214
episode_length                       9.280374
returns                             -2.745298
return_std                           1.378045
average_reward                      -0.294951
round_time             0 days 00:11:09.837375
episodes_test                           214.0
episode_length_test                   9.32243
returns_test                        -2.760503
return_std_test                      1.351307
average_reward_test                 -0.295366
round_time_test        0 days 00:00:02.617959
round_time_total       0 days 00:11:09.838467
loss_total             64101043655728226304.0
loss_critic            80126303234028535808.0
loss_actor                -34145763799.040001
memory_size                           15996.0 

=== epoch 10/10 ==== round 8/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:12<00:00,  2.97it/s]
episodes                                  208
episode_length                       9.557692
returns                             -2.634359
return_std                           1.615508
average_reward                      -0.277605
round_time             0 days 00:11:13.502525
episodes_test                           211.0
episode_length_test                  9.445498
returns_test                        -2.868627
return_std_test                       1.51436
average_reward_test                  -0.30407
round_time_test        0 days 00:00:02.618008
round_time_total       0 days 00:11:13.503613
loss_total             64973848830351654912.0
loss_critic            81217309611598102528.0
loss_actor                -34525405344.767998
memory_size                           15996.0 

=== epoch 10/10 ==== round 9/50 ======================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:12<00:00,  2.97it/s]
episodes                                  211
episode_length                       9.459716
returns                             -2.902151
return_std                            1.47721
average_reward                      -0.305317
round_time             0 days 00:11:13.238022
episodes_test                           216.0
episode_length_test                  9.259259
returns_test                        -2.518496
return_std_test                      1.860901
average_reward_test                 -0.271998
round_time_test        0 days 00:00:02.618011
round_time_total       0 days 00:11:13.239161
loss_total             65330533645411737600.0
loss_critic            81663165640456257536.0
loss_actor                   -34879981729.792
memory_size                           15996.0 

=== epoch 10/10 ==== round 10/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:16<00:00,  2.96it/s]
episodes                                  222
episode_length                       8.986486
returns                             -2.810698
return_std                           1.634301
average_reward                      -0.313783
round_time             0 days 00:11:16.950678
episodes_test                           218.0
episode_length_test                  9.174312
returns_test                        -2.979195
return_std_test                      1.284138
average_reward_test                 -0.324732
round_time_test        0 days 00:00:02.634163
round_time_total       0 days 00:11:16.951758
loss_total             67556326442834116608.0
loss_critic            84445406547486588928.0
loss_actor                -35101628639.232002
memory_size                           15996.0 

=== epoch 10/10 ==== round 11/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                  216
episode_length                           9.25
returns                             -2.827228
return_std                           1.893325
average_reward                      -0.304427
round_time             0 days 00:11:12.019687
episodes_test                           213.0
episode_length_test                  9.366197
returns_test                        -2.744938
return_std_test                      1.578886
average_reward_test                 -0.292002
round_time_test        0 days 00:00:02.621287
round_time_total       0 days 00:11:12.020772
loss_total             69304132431039725568.0
loss_critic            86630164015701180416.0
loss_actor                -35390384132.096001
memory_size                           15996.0 

=== epoch 10/10 ==== round 12/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  218
episode_length                        9.12844
returns                             -2.744839
return_std                           1.819961
average_reward                       -0.29972
round_time             0 days 00:11:08.704083
episodes_test                           216.0
episode_length_test                  9.240741
returns_test                        -2.923706
return_std_test                      1.406666
average_reward_test                 -0.315121
round_time_test        0 days 00:00:02.610107
round_time_total       0 days 00:11:08.705164
loss_total             69121669764488167424.0
loss_critic            86402085729378418688.0
loss_actor                -35728476329.984001
memory_size                           15996.0 

=== epoch 10/10 ==== round 13/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  216
episode_length                       9.226852
returns                             -2.988829
return_std                           1.581712
average_reward                      -0.324971
round_time             0 days 00:11:07.454937
episodes_test                           218.0
episode_length_test                  9.142202
returns_test                        -2.970303
return_std_test                      1.561504
average_reward_test                 -0.324647
round_time_test        0 days 00:00:02.618385
round_time_total       0 days 00:11:07.456029
loss_total             71012631664153403392.0
loss_critic            88765788063003148288.0
loss_actor                   -36025695009.792
memory_size                           15996.0 

=== epoch 10/10 ==== round 14/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                  210
episode_length                       9.495238
returns                             -2.732201
return_std                            1.53008
average_reward                       -0.28937
round_time             0 days 00:11:10.111880
episodes_test                           213.0
episode_length_test                  9.356808
returns_test                        -2.797715
return_std_test                      1.606016
average_reward_test                 -0.296432
round_time_test        0 days 00:00:02.625896
round_time_total       0 days 00:11:10.112953
loss_total             71186402511517237248.0
loss_critic            88983001588397965312.0
loss_actor                -36267053186.047997
memory_size                           15996.0 

=== epoch 10/10 ==== round 15/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  217
episode_length                        9.18894
returns                              -2.78269
return_std                           1.494544
average_reward                       -0.30165
round_time             0 days 00:11:11.398502
episodes_test                           214.0
episode_length_test                  9.336449
returns_test                        -2.616517
return_std_test                      1.754344
average_reward_test                 -0.279348
round_time_test        0 days 00:00:02.617814
round_time_total       0 days 00:11:11.399593
loss_total             73341535428376567808.0
loss_critic            91676917694477385728.0
loss_actor                -36595726895.103996
memory_size                           15996.0 

=== epoch 10/10 ==== round 16/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  212
episode_length                       9.419811
returns                             -2.886518
return_std                           1.407776
average_reward                      -0.307765
round_time             0 days 00:11:08.676897
episodes_test                           214.0
episode_length_test                  9.317757
returns_test                        -2.593222
return_std_test                      1.487214
average_reward_test                 -0.277772
round_time_test        0 days 00:00:02.584613
round_time_total       0 days 00:11:08.677969
loss_total             74708447490730049536.0
loss_critic            93385557704112078848.0
loss_actor                   -36831874782.208
memory_size                           15996.0 

=== epoch 10/10 ==== round 17/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  218
episode_length                       9.165138
returns                             -3.006888
return_std                           1.570681
average_reward                       -0.32721
round_time             0 days 00:11:10.889867
episodes_test                           210.0
episode_length_test                  9.495238
returns_test                        -2.544811
return_std_test                      4.035892
average_reward_test                 -0.266465
round_time_test        0 days 00:00:02.618129
round_time_total       0 days 00:11:10.890941
loss_total             75566882580542078976.0
loss_critic            94458601564727836672.0
loss_actor                -37209163899.903999
memory_size                           15996.0 

=== epoch 10/10 ==== round 18/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:13<00:00,  2.97it/s]
episodes                                  212
episode_length                       9.358491
returns                             -3.053032
return_std                           1.708762
average_reward                      -0.326756
round_time             0 days 00:11:13.633704
episodes_test                           217.0
episode_length_test                  9.207373
returns_test                        -2.911842
return_std_test                      1.665876
average_reward_test                 -0.314821
round_time_test        0 days 00:00:02.604808
round_time_total       0 days 00:11:13.634784
loss_total             76899100931718905856.0
loss_critic            96123874443225743360.0
loss_actor                -37412156618.751999
memory_size                           15996.0 

=== epoch 10/10 ==== round 19/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  216
episode_length                        9.24537
returns                             -2.710613
return_std                           1.523039
average_reward                      -0.294219
round_time             0 days 00:11:08.005128
episodes_test                           213.0
episode_length_test                  9.389671
returns_test                         -2.79934
return_std_test                       1.56107
average_reward_test                  -0.29813
round_time_test        0 days 00:00:02.584247
round_time_total       0 days 00:11:08.006203
loss_total             77483506362518994944.0
loss_critic            96854381310066049024.0
loss_actor                -37740526965.760002
memory_size                           15996.0 

=== epoch 10/10 ==== round 20/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                  223
episode_length                       8.950673
returns                             -2.968207
return_std                           1.389405
average_reward                      -0.329182
round_time             0 days 00:11:11.524616
episodes_test                           222.0
episode_length_test                  8.968468
returns_test                        -2.746867
return_std_test                      1.828327
average_reward_test                 -0.304555
round_time_test        0 days 00:00:02.610913
round_time_total       0 days 00:11:11.525698
loss_total             78925832440332156928.0
loss_critic            98657288839712538624.0
loss_actor                -37970993426.431999
memory_size                           15996.0 

=== epoch 10/10 ==== round 21/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:12<00:00,  2.98it/s]
episodes                                   213
episode_length                        9.333333
returns                              -2.875165
return_std                            1.519913
average_reward                       -0.307429
round_time              0 days 00:11:12.677156
episodes_test                            225.0
episode_length_test                   8.888889
returns_test                         -3.136699
return_std_test                       1.367689
average_reward_test                  -0.352879
round_time_test         0 days 00:00:02.611029
round_time_total        0 days 00:11:12.678238
loss_total              80063574891148935168.0
loss_critic            100079466982673219584.0
loss_actor                 -38379787174.912003
memory_size                            15996.0 

=== epoch 10/10 ==== round 22/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:13<00:00,  2.97it/s]
episodes                                   212
episode_length                        9.396226
returns                              -2.955177
return_std                            1.499472
average_reward                       -0.315399
round_time              0 days 00:11:14.235065
episodes_test                            207.0
episode_length_test                   9.628019
returns_test                         -2.786683
return_std_test                       2.105449
average_reward_test                  -0.288627
round_time_test         0 days 00:00:02.591886
round_time_total        0 days 00:11:14.236134
loss_total              82920087257780207616.0
loss_critic            103650107346679201792.0
loss_actor                 -38541912080.384003
memory_size                            15996.0 

=== epoch 10/10 ==== round 23/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:12<00:00,  2.98it/s]
episodes                                   214
episode_length                        9.313084
returns                              -3.016494
return_std                              1.3918
average_reward                       -0.326072
round_time              0 days 00:11:12.561232
episodes_test                            213.0
episode_length_test                   9.352113
returns_test                         -2.843087
return_std_test                       1.206701
average_reward_test                  -0.304595
round_time_test         0 days 00:00:02.612332
round_time_total        0 days 00:11:12.562323
loss_total              82342495854030438400.0
loss_critic            102928118059968724992.0
loss_actor                 -38904517060.608002
memory_size                            15996.0 

=== epoch 10/10 ==== round 24/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                   213
episode_length                        9.342723
returns                              -2.815966
return_std                            1.317966
average_reward                       -0.301347
round_time              0 days 00:11:11.850257
episodes_test                            215.0
episode_length_test                   9.265116
returns_test                         -2.868693
return_std_test                       1.698602
average_reward_test                  -0.307308
round_time_test         0 days 00:00:02.590909
round_time_total        0 days 00:11:11.851335
loss_total              84371019799143677952.0
loss_critic            105463772875224334336.0
loss_actor                 -39221015554.047997
memory_size                            15996.0 

=== epoch 10/10 ==== round 25/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                   207
episode_length                        9.608696
returns                              -2.741014
return_std                            1.829199
average_reward                       -0.284219
round_time              0 days 00:11:09.023962
episodes_test                            203.0
episode_length_test                   9.827586
returns_test                          -1.84648
return_std_test                       8.413021
average_reward_test                  -0.185439
round_time_test         0 days 00:00:02.597748
round_time_total        0 days 00:11:09.025033
loss_total              84659344439725768704.0
loss_critic            105824178664132214784.0
loss_actor                 -39400159746.047997
memory_size                            15996.0 

=== epoch 10/10 ==== round 26/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                   216
episode_length                        9.217593
returns                              -2.653963
return_std                            1.508449
average_reward                       -0.289661
round_time              0 days 00:11:08.067138
episodes_test                            220.0
episode_length_test                   9.090909
returns_test                         -2.630542
return_std_test                       1.695718
average_reward_test                   -0.28936
round_time_test         0 days 00:00:02.607223
round_time_total        0 days 00:11:08.068224
loss_total              86783058704069263360.0
loss_critic            108478821485902921728.0
loss_actor                    -39678813605.888
memory_size                            15996.0 

=== epoch 10/10 ==== round 27/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  2.99it/s]
episodes                                   209
episode_length                         9.54067
returns                               -2.87166
return_std                            1.617733
average_reward                       -0.301542
round_time              0 days 00:11:08.293352
episodes_test                            211.0
episode_length_test                   9.469194
returns_test                         -2.725504
return_std_test                       1.658124
average_reward_test                  -0.286804
round_time_test         0 days 00:00:02.584413
round_time_total        0 days 00:11:08.294422
loss_total              87463191502857617408.0
loss_critic            109328987434635460608.0
loss_actor                 -40064923713.536003
memory_size                            15996.0 

=== epoch 10/10 ==== round 28/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                   213
episode_length                        9.352113
returns                              -2.541253
return_std                             1.74247
average_reward                       -0.270642
round_time              0 days 00:11:11.999924
episodes_test                            218.0
episode_length_test                    9.16055
returns_test                         -2.770427
return_std_test                       1.438494
average_reward_test                  -0.300616
round_time_test         0 days 00:00:02.634315
round_time_total        0 days 00:11:12.001001
loss_total              90219602656889716736.0
loss_critic            112774501378687418368.0
loss_actor                 -40255489644.543999
memory_size                            15996.0 

=== epoch 10/10 ==== round 29/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                   208
episode_length                        9.591346
returns                              -2.630812
return_std                            1.776245
average_reward                       -0.275355
round_time              0 days 00:11:11.709329
episodes_test                            204.0
episode_length_test                   9.803922
returns_test                         -2.631178
return_std_test                        2.08364
average_reward_test                   -0.26838
round_time_test         0 days 00:00:02.627131
round_time_total        0 days 00:11:11.710408
loss_total              91142451737663750144.0
loss_critic            113928062828748439552.0
loss_actor                 -40496878424.064003
memory_size                            15996.0 

=== epoch 10/10 ==== round 30/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                   213
episode_length                        9.323944
returns                              -2.692914
return_std                            1.651994
average_reward                       -0.287727
round_time              0 days 00:11:10.751478
episodes_test                            215.0
episode_length_test                   9.274419
returns_test                         -3.048506
return_std_test                       1.458031
average_reward_test                  -0.328954
round_time_test         0 days 00:00:02.623404
round_time_total        0 days 00:11:10.752552
loss_total              93006199775113232384.0
loss_critic            116257747643700789248.0
loss_actor                 -40877820776.447998
memory_size                            15996.0 

=== epoch 10/10 ==== round 31/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                   208
episode_length                          9.5625
returns                              -2.546379
return_std                             1.60638
average_reward                       -0.267705
round_time              0 days 00:11:06.957114
episodes_test                            213.0
episode_length_test                   9.370892
returns_test                         -2.574598
return_std_test                       1.607519
average_reward_test                    -0.2725
round_time_test         0 days 00:00:02.611133
round_time_total        0 days 00:11:06.958185
loss_total              94232353909190230016.0
loss_critic            117790440402968821760.0
loss_actor                 -41127437135.872002
memory_size                            15996.0 

=== epoch 10/10 ==== round 32/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                   213
episode_length                        9.342723
returns                              -2.742816
return_std                            1.768075
average_reward                       -0.291703
round_time              0 days 00:11:08.135770
episodes_test                            221.0
episode_length_test                   9.036199
returns_test                         -2.831659
return_std_test                       1.780412
average_reward_test                  -0.312512
round_time_test         0 days 00:00:02.632774
round_time_total        0 days 00:11:08.136838
loss_total              95599051421091004416.0
loss_critic            119498812204334104576.0
loss_actor                 -41334632368.127998
memory_size                            15996.0 

=== epoch 10/10 ==== round 33/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                   208
episode_length                        9.572115
returns                              -2.761029
return_std                            1.460943
average_reward                       -0.289561
round_time              0 days 00:11:11.073003
episodes_test                            208.0
episode_length_test                   9.610577
returns_test                         -2.738512
return_std_test                       1.659178
average_reward_test                  -0.284314
round_time_test         0 days 00:00:02.595689
round_time_total        0 days 00:11:11.074074
loss_total              96386986690645688320.0
loss_critic            120483731187648479232.0
loss_actor                 -41635546238.975998
memory_size                            15996.0 

=== epoch 10/10 ==== round 34/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                   207
episode_length                        9.584541
returns                              -2.558425
return_std                            1.946577
average_reward                       -0.267091
round_time              0 days 00:11:10.151796
episodes_test                            213.0
episode_length_test                   9.375587
returns_test                         -2.597434
return_std_test                       5.027303
average_reward_test                  -0.275838
round_time_test         0 days 00:00:02.601295
round_time_total        0 days 00:11:10.152863
loss_total              98719529264411721728.0
loss_critic            123399409388775653376.0
loss_actor                 -41940430837.760002
memory_size                            15996.0 

=== epoch 10/10 ==== round 35/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:13<00:00,  2.97it/s]
episodes                                   217
episode_length                        9.211982
returns                              -2.719375
return_std                            1.461465
average_reward                       -0.295486
round_time              0 days 00:11:13.578384
episodes_test                            207.0
episode_length_test                   9.628019
returns_test                         -2.729196
return_std_test                       1.504937
average_reward_test                  -0.281917
round_time_test         0 days 00:00:02.646149
round_time_total        0 days 00:11:13.579451
loss_total              98671726659937779712.0
loss_critic            123339656185272598528.0
loss_actor                 -42305605226.496002
memory_size                            15996.0 

=== epoch 10/10 ==== round 36/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:14<00:00,  2.96it/s]
episodes                                   218
episode_length                        9.155963
returns                              -2.576207
return_std                            1.462149
average_reward                       -0.281065
round_time              0 days 00:11:15.158584
episodes_test                            211.0
episode_length_test                   9.464455
returns_test                         -2.825978
return_std_test                       1.701519
average_reward_test                  -0.297557
round_time_test         0 days 00:00:02.617340
round_time_total        0 days 00:11:15.159657
loss_total             100730611022705082368.0
loss_critic            125913261625125273600.0
loss_actor                 -42636624848.896004
memory_size                            15996.0 

=== epoch 10/10 ==== round 37/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                   220
episode_length                        9.036364
returns                              -2.575082
return_std                            1.824024
average_reward                       -0.287109
round_time              0 days 00:11:08.911101
episodes_test                            218.0
episode_length_test                   9.137615
returns_test                         -2.697587
return_std_test                        1.50339
average_reward_test                  -0.294263
round_time_test         0 days 00:00:02.641887
round_time_total        0 days 00:11:08.912188
loss_total             101681803486843256832.0
loss_critic            127102252111702048768.0
loss_actor                 -43046267932.671997
memory_size                            15996.0 

=== epoch 10/10 ==== round 38/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:12<00:00,  2.97it/s]
episodes                                   219
episode_length                        9.127854
returns                              -2.880805
return_std                            1.379568
average_reward                       -0.315739
round_time              0 days 00:11:13.008914
episodes_test                            216.0
episode_length_test                   9.231481
returns_test                         -2.752509
return_std_test                       1.632528
average_reward_test                  -0.296658
round_time_test         0 days 00:00:02.675534
round_time_total        0 days 00:11:13.009972
loss_total             104534554931070779392.0
loss_critic            130668191434578640896.0
loss_actor                 -43449667993.599998
memory_size                            15996.0 

=== epoch 10/10 ==== round 39/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                   218
episode_length                         9.16055
returns                              -2.680662
return_std                            1.495424
average_reward                       -0.291437
round_time              0 days 00:11:11.834291
episodes_test                            213.0
episode_length_test                   9.366197
returns_test                         -2.650899
return_std_test                        1.34842
average_reward_test                  -0.281178
round_time_test         0 days 00:00:02.630106
round_time_total        0 days 00:11:11.835363
loss_total             106899657480658845696.0
loss_critic            133624569558341828608.0
loss_actor                 -43934000683.008003
memory_size                            15996.0 

=== epoch 10/10 ==== round 40/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                   214
episode_length                        9.303738
returns                              -2.643764
return_std                            1.643662
average_reward                       -0.284638
round_time              0 days 00:11:07.815159
episodes_test                            219.0
episode_length_test                   9.118721
returns_test                         -2.751389
return_std_test                       1.583904
average_reward_test                  -0.300989
round_time_test         0 days 00:00:02.595472
round_time_total        0 days 00:11:07.816231
loss_total             107735696942480637952.0
loss_critic            134669618843562721280.0
loss_actor                 -44321762217.984001
memory_size                            15996.0 

=== epoch 10/10 ==== round 41/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                   213
episode_length                        9.333333
returns                              -2.785057
return_std                            1.555836
average_reward                       -0.297489
round_time              0 days 00:11:09.105126
episodes_test                            216.0
episode_length_test                   9.240741
returns_test                         -2.591543
return_std_test                       1.510983
average_reward_test                  -0.278277
round_time_test         0 days 00:00:02.616670
round_time_total        0 days 00:11:09.106193
loss_total             110533811355944665088.0
loss_critic            138167261805966934016.0
loss_actor                 -44590114922.496002
memory_size                            15996.0 

=== epoch 10/10 ==== round 42/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                   207
episode_length                        9.623188
returns                              -2.794869
return_std                            1.688343
average_reward                       -0.289689
round_time              0 days 00:11:09.943447
episodes_test                            214.0
episode_length_test                   9.345794
returns_test                         -2.666665
return_std_test                       1.525844
average_reward_test                  -0.285333
round_time_test         0 days 00:00:02.673015
round_time_total        0 days 00:11:09.944507
loss_total             111316199392524632064.0
loss_critic            139145246930306973696.0
loss_actor                 -44968748013.568001
memory_size                            15996.0 

=== epoch 10/10 ==== round 43/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:18<00:00,  2.95it/s]
episodes                                   210
episode_length                        9.461905
returns                              -2.779966
return_std                             1.61179
average_reward                       -0.295547
round_time              0 days 00:11:18.630936
episodes_test                            210.0
episode_length_test                   9.519048
returns_test                         -2.655143
return_std_test                       1.485714
average_reward_test                  -0.278298
round_time_test         0 days 00:00:02.588772
round_time_total        0 days 00:11:18.631998
loss_total             114741865350695174144.0
loss_critic            143427329357404323840.0
loss_actor                 -45267854301.183998
memory_size                            15996.0 

=== epoch 10/10 ==== round 44/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                   211
episode_length                        9.464455
returns                              -2.629176
return_std                            1.435797
average_reward                       -0.278209
round_time              0 days 00:11:11.838070
episodes_test                            211.0
episode_length_test                   9.445498
returns_test                         -2.502061
return_std_test                       1.681721
average_reward_test                  -0.264183
round_time_test         0 days 00:00:02.631325
round_time_total        0 days 00:11:11.839137
loss_total             115472939869000630272.0
loss_critic            144341172338984992768.0
loss_actor                 -45652789366.783997
memory_size                            15996.0 

=== epoch 10/10 ==== round 45/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   216
episode_length                        9.217593
returns                              -2.677034
return_std                            1.796396
average_reward                       -0.291023
round_time              0 days 00:11:05.029147
episodes_test                            223.0
episode_length_test                   8.959641
returns_test                          -2.76012
return_std_test                       1.300963
average_reward_test                  -0.306755
round_time_test         0 days 00:00:02.616765
round_time_total        0 days 00:11:05.030216
loss_total             115068168776016396288.0
loss_critic            143835208505327730688.0
loss_actor                 -46101064837.120003
memory_size                            15996.0 

=== epoch 10/10 ==== round 46/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:19<00:00,  2.94it/s]
episodes                                   215
episode_length                        9.246512
returns                              -2.589121
return_std                            1.372465
average_reward                       -0.281794
round_time              0 days 00:11:20.149052
episodes_test                            210.0
episode_length_test                   9.514286
returns_test                         -2.600124
return_std_test                       1.487745
average_reward_test                  -0.272025
round_time_test         0 days 00:00:02.590599
round_time_total        0 days 00:11:20.150130
loss_total             117374858097471930368.0
loss_critic            146718570181748719616.0
loss_actor                 -46460473364.480003
memory_size                            15996.0 

=== epoch 10/10 ==== round 47/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                   210
episode_length                        9.514286
returns                              -2.763033
return_std                            1.598165
average_reward                       -0.290204
round_time              0 days 00:11:11.922937
episodes_test                            212.0
episode_length_test                   9.429245
returns_test                         -2.936803
return_std_test                       1.894184
average_reward_test                  -0.310853
round_time_test         0 days 00:00:02.586882
round_time_total        0 days 00:11:11.924000
loss_total             119844634409110978560.0
loss_critic            149805790545459019776.0
loss_actor                 -46612673804.288002
memory_size                            15996.0 

=== epoch 10/10 ==== round 48/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   210
episode_length                        9.457143
returns                              -2.554757
return_std                            1.639886
average_reward                       -0.272961
round_time              0 days 00:11:04.893602
episodes_test                            198.0
episode_length_test                  10.050505
returns_test                         -2.503352
return_std_test                       1.485779
average_reward_test                  -0.249372
round_time_test         0 days 00:00:02.561675
round_time_total        0 days 00:11:04.894702
loss_total             120960989840341942272.0
loss_critic            151201234694310002688.0
loss_actor                 -46918153156.608002
memory_size                            15996.0 

=== epoch 10/10 ==== round 49/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   213
episode_length                        9.366197
returns                              -2.808827
return_std                            1.626317
average_reward                       -0.299687
round_time              0 days 00:11:05.334073
episodes_test                            209.0
episode_length_test                   9.521531
returns_test                         -2.849961
return_std_test                       1.603258
average_reward_test                  -0.296298
round_time_test         0 days 00:00:02.639733
round_time_total        0 days 00:11:05.335135
loss_total             123942410487668588544.0
loss_critic            154928010502231359488.0
loss_actor                 -47190959951.872002
memory_size                            15996.0 

=== epoch 10/10 ==== round 50/50 =====================================
<DatasetOffice_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                   213
episode_length                        9.370892
returns                              -2.775604
return_std                            1.409922
average_reward                       -0.296528
round_time              0 days 00:11:05.482171
episodes_test                            215.0
episode_length_test                   9.246512
returns_test                         -2.676935
return_std_test                       1.549576
average_reward_test                  -0.287563
round_time_test         0 days 00:00:02.588168
round_time_total        0 days 00:11:05.483228
loss_total             123223482819123200000.0
loss_critic            154029350960392634368.0
loss_actor                    -47455978940.416
memory_size                            15996.0 


