Logging to logs/HopperFH-v0/exp-4/fkl/2024_08_11_05_54_52
--2024-08-11 05:56:27.475159 UTC--
| Itration            | 0        |
| Real Det Return     | 223      |
| Real Sto Return     | 214      |
| Reward Loss         | 3.4e+05  |
| Running Env Steps   | 0        |
| Running Forward KL  | 20       |
| Running Reverse KL  | 237      |
| Running Update Time | 0        |
----------------------------------
--2024-08-11 05:57:43.931187 UTC--
| Itration            | 1        |
| Real Det Return     | 264      |
| Real Sto Return     | 225      |
| Reward Loss         | 2.4e+05  |
| Running Env Steps   | 5000     |
| Running Forward KL  | 19.2     |
| Running Reverse KL  | 236      |
| Running Update Time | 1        |
----------------------------------
--2024-08-11 05:58:43.486797 UTC--
| Itration            | 2        |
| Real Det Return     | 267      |
| Real Sto Return     | 257      |
| Reward Loss         | 4.77e+05 |
| Running Env Steps   | 10000    |
| Running Forward KL  | 19.1     |
| Running Reverse KL  | 238      |
| Running Update Time | 2        |
----------------------------------
--2024-08-11 05:59:44.734115 UTC--
| Itration            | 3        |
| Real Det Return     | 301      |
| Real Sto Return     | 271      |
| Reward Loss         | 7.54e+05 |
| Running Env Steps   | 15000    |
| Running Forward KL  | 19.1     |
| Running Reverse KL  | 233      |
| Running Update Time | 3        |
----------------------------------
--2024-08-11 06:00:47.044272 UTC--
| Itration            | 4        |
| Real Det Return     | 350      |
| Real Sto Return     | 357      |
| Reward Loss         | 9.24e+05 |
| Running Env Steps   | 20000    |
| Running Forward KL  | 18.8     |
| Running Reverse KL  | 206      |
| Running Update Time | 4        |
----------------------------------
--2024-08-11 06:01:51.633734 UTC--
| Itration            | 5        |
| Real Det Return     | 435      |
| Real Sto Return     | 412      |
| Reward Loss         | 1.33e+06 |
| Running Env Steps   | 25000    |
| Running Forward KL  | 19.2     |
| Running Reverse KL  | 200      |
| Running Update Time | 5        |
----------------------------------
--2024-08-11 06:02:57.766840 UTC--
| Itration            | 6        |
| Real Det Return     | 472      |
| Real Sto Return     | 413      |
| Reward Loss         | 9.82e+05 |
| Running Env Steps   | 30000    |
| Running Forward KL  | 19.5     |
| Running Reverse KL  | 202      |
| Running Update Time | 6        |
----------------------------------
--2024-08-11 06:04:11.700576 UTC--
| Itration            | 7        |
| Real Det Return     | 878      |
| Real Sto Return     | 446      |
| Reward Loss         | 1.14e+06 |
| Running Env Steps   | 35000    |
| Running Forward KL  | 19       |
| Running Reverse KL  | 179      |
| Running Update Time | 7        |
----------------------------------
--2024-08-11 06:05:36.827411 UTC--
| Itration            | 8        |
| Real Det Return     | 956      |
| Real Sto Return     | 671      |
| Reward Loss         | 3.84e+05 |
| Running Env Steps   | 40000    |
| Running Forward KL  | 19.5     |
| Running Reverse KL  | 114      |
| Running Update Time | 8        |
----------------------------------
--2024-08-11 06:07:22.825562 UTC--
| Itration            | 9        |
| Real Det Return     | 1.03e+03 |
| Real Sto Return     | 911      |
| Reward Loss         | 4.05e+05 |
| Running Env Steps   | 45000    |
| Running Forward KL  | 19.6     |
| Running Reverse KL  | 13.3     |
| Running Update Time | 9        |
----------------------------------
--2024-08-11 06:09:13.838156 UTC--
| Itration            | 10       |
| Real Det Return     | 1.04e+03 |
| Real Sto Return     | 917      |
| Reward Loss         | 5.76e+05 |
| Running Env Steps   | 50000    |
| Running Forward KL  | 19       |
| Running Reverse KL  | 34.5     |
| Running Update Time | 10       |
----------------------------------
--2024-08-11 06:11:07.790271 UTC--
| Itration            | 11       |
| Real Det Return     | 1.01e+03 |
| Real Sto Return     | 913      |
| Reward Loss         | 3.18e+05 |
| Running Env Steps   | 55000    |
| Running Forward KL  | 20       |
| Running Reverse KL  | 28.6     |
| Running Update Time | 11       |
----------------------------------
--2024-08-11 06:13:00.407172 UTC--
| Itration            | 12       |
| Real Det Return     | 1.08e+03 |
| Real Sto Return     | 1.03e+03 |
| Reward Loss         | 4.71e+05 |
| Running Env Steps   | 60000    |
| Running Forward KL  | 19.6     |
| Running Reverse KL  | 22.8     |
| Running Update Time | 12       |
----------------------------------
--2024-08-11 06:14:54.886952 UTC--
| Itration            | 13       |
| Real Det Return     | 1.03e+03 |
| Real Sto Return     | 1.02e+03 |
| Reward Loss         | 3.42e+05 |
| Running Env Steps   | 65000    |
| Running Forward KL  | 20       |
| Running Reverse KL  | 13.4     |
| Running Update Time | 13       |
----------------------------------
--2024-08-11 06:16:58.304888 UTC--
| Itration            | 14       |
| Real Det Return     | 1.04e+03 |
| Real Sto Return     | 1.04e+03 |
| Reward Loss         | 3.45e+05 |
| Running Env Steps   | 70000    |
| Running Forward KL  | 20.4     |
| Running Reverse KL  | 14.2     |
| Running Update Time | 14       |
----------------------------------
--2024-08-11 06:19:19.635735 UTC--
| Itration            | 15       |
| Real Det Return     | 1.09e+03 |
| Real Sto Return     | 1.05e+03 |
| Reward Loss         | 3.44e+05 |
| Running Env Steps   | 75000    |
| Running Forward KL  | 20.3     |
| Running Reverse KL  | 38       |
| Running Update Time | 15       |
----------------------------------
--2024-08-11 06:21:40.267320 UTC--
| Itration            | 16       |
| Real Det Return     | 1.06e+03 |
| Real Sto Return     | 1.04e+03 |
| Reward Loss         | 2.68e+05 |
| Running Env Steps   | 80000    |
| Running Forward KL  | 19.8     |
| Running Reverse KL  | 25.5     |
| Running Update Time | 16       |
----------------------------------
--2024-08-11 06:24:02.712703 UTC--
| Itration            | 17       |
| Real Det Return     | 1.05e+03 |
| Real Sto Return     | 1.03e+03 |
| Reward Loss         | 2.61e+05 |
| Running Env Steps   | 85000    |
| Running Forward KL  | 20.4     |
| Running Reverse KL  | 14.1     |
| Running Update Time | 17       |
----------------------------------
--2024-08-11 06:26:25.284847 UTC--
| Itration            | 18       |
| Real Det Return     | 1.06e+03 |
| Real Sto Return     | 1.05e+03 |
| Reward Loss         | 1.13e+05 |
| Running Env Steps   | 90000    |
| Running Forward KL  | 20.5     |
| Running Reverse KL  | 38.2     |
| Running Update Time | 18       |
----------------------------------
--2024-08-11 06:28:45.157132 UTC--
| Itration            | 19       |
| Real Det Return     | 1.08e+03 |
| Real Sto Return     | 1.07e+03 |
| Reward Loss         | 1.39e+05 |
| Running Env Steps   | 95000    |
| Running Forward KL  | 20.4     |
| Running Reverse KL  | 13.8     |
| Running Update Time | 19       |
----------------------------------
--2024-08-11 06:31:06.897015 UTC--
| Itration            | 20       |
| Real Det Return     | 1.03e+03 |
| Real Sto Return     | 1.05e+03 |
| Reward Loss         | 1.17e+05 |
| Running Env Steps   | 100000   |
| Running Forward KL  | 20.4     |
| Running Reverse KL  | 14.2     |
| Running Update Time | 20       |
----------------------------------
--2024-08-11 06:33:27.850221 UTC--
| Itration            | 21       |
| Real Det Return     | 1.04e+03 |
| Real Sto Return     | 1.05e+03 |
| Reward Loss         | 9.14e+04 |
| Running Env Steps   | 105000   |
| Running Forward KL  | 20       |
| Running Reverse KL  | 13.8     |
| Running Update Time | 21       |
----------------------------------
--2024-08-11 06:35:48.022332 UTC--
| Itration            | 22       |
| Real Det Return     | 1.09e+03 |
| Real Sto Return     | 1.07e+03 |
| Reward Loss         | 1.27e+04 |
| Running Env Steps   | 110000   |
| Running Forward KL  | 20.4     |
| Running Reverse KL  | 13.9     |
| Running Update Time | 22       |
----------------------------------
--2024-08-11 06:38:10.407243 UTC---
| Itration            | 23        |
| Real Det Return     | 1.08e+03  |
| Real Sto Return     | 1.07e+03  |
| Reward Loss         | -4.05e+04 |
| Running Env Steps   | 115000    |
| Running Forward KL  | 20.3      |
| Running Reverse KL  | 13.7      |
| Running Update Time | 23        |
-----------------------------------
--2024-08-11 06:40:31.431113 UTC---
| Itration            | 24        |
| Real Det Return     | 1.04e+03  |
| Real Sto Return     | 1.06e+03  |
| Reward Loss         | -7.04e+04 |
| Running Env Steps   | 120000    |
| Running Forward KL  | 19.9      |
| Running Reverse KL  | 13.7      |
| Running Update Time | 24        |
-----------------------------------
--2024-08-11 06:42:50.282335 UTC---
| Itration            | 25        |
| Real Det Return     | 1.03e+03  |
| Real Sto Return     | 1.04e+03  |
| Reward Loss         | -1.52e+05 |
| Running Env Steps   | 125000    |
| Running Forward KL  | 20.3      |
| Running Reverse KL  | 13.7      |
| Running Update Time | 25        |
-----------------------------------
--2024-08-11 06:45:13.100403 UTC--
| Itration            | 26       |
| Real Det Return     | 1.07e+03 |
| Real Sto Return     | 1.07e+03 |
| Reward Loss         | 7.98e+04 |
| Running Env Steps   | 130000   |
| Running Forward KL  | 20.2     |
| Running Reverse KL  | 39.9     |
| Running Update Time | 26       |
----------------------------------
--2024-08-11 06:47:32.961994 UTC---
| Itration            | 27        |
| Real Det Return     | 1.03e+03  |
| Real Sto Return     | 1.06e+03  |
| Reward Loss         | -2.34e+05 |
| Running Env Steps   | 135000    |
| Running Forward KL  | 20.3      |
| Running Reverse KL  | 13.6      |
| Running Update Time | 27        |
-----------------------------------
--2024-08-11 06:49:52.165030 UTC---
| Itration            | 28        |
| Real Det Return     | 1.06e+03  |
| Real Sto Return     | 1.06e+03  |
| Reward Loss         | -2.97e+05 |
| Running Env Steps   | 140000    |
| Running Forward KL  | 20.3      |
| Running Reverse KL  | 13.5      |
| Running Update Time | 28        |
-----------------------------------
--2024-08-11 06:52:14.560175 UTC---
| Itration            | 29        |
| Real Det Return     | 1.04e+03  |
| Real Sto Return     | 1.06e+03  |
| Reward Loss         | -3.57e+05 |
| Running Env Steps   | 145000    |
| Running Forward KL  | 20.5      |
| Running Reverse KL  | 13.8      |
| Running Update Time | 29        |
-----------------------------------
--2024-08-11 06:54:33.384438 UTC---
| Itration            | 30        |
| Real Det Return     | 1.03e+03  |
| Real Sto Return     | 1.07e+03  |
| Reward Loss         | -3.85e+05 |
| Running Env Steps   | 150000    |
| Running Forward KL  | 20.3      |
| Running Reverse KL  | 13.4      |
| Running Update Time | 30        |
-----------------------------------
--2024-08-11 06:56:55.358640 UTC---
| Itration            | 31        |
| Real Det Return     | 1.03e+03  |
| Real Sto Return     | 1.07e+03  |
| Reward Loss         | -4.38e+05 |
| Running Env Steps   | 155000    |
| Running Forward KL  | 20        |
| Running Reverse KL  | 13.2      |
| Running Update Time | 31        |
-----------------------------------
--2024-08-11 06:59:18.012056 UTC---
| Itration            | 32        |
| Real Det Return     | 1.04e+03  |
| Real Sto Return     | 1.06e+03  |
| Reward Loss         | -5.22e+05 |
| Running Env Steps   | 160000    |
| Running Forward KL  | 19.8      |
| Running Reverse KL  | 12.9      |
| Running Update Time | 32        |
-----------------------------------
--2024-08-11 07:01:36.891127 UTC---
| Itration            | 33        |
| Real Det Return     | 1.03e+03  |
| Real Sto Return     | 1.06e+03  |
| Reward Loss         | -5.58e+05 |
| Running Env Steps   | 165000    |
| Running Forward KL  | 20        |
| Running Reverse KL  | 13.2      |
| Running Update Time | 33        |
-----------------------------------
--2024-08-11 07:03:56.762755 UTC---
| Itration            | 34        |
| Real Det Return     | 1.03e+03  |
| Real Sto Return     | 945       |
| Reward Loss         | -3.86e+05 |
| Running Env Steps   | 170000    |
| Running Forward KL  | 20.3      |
| Running Reverse KL  | 41.4      |
| Running Update Time | 34        |
-----------------------------------
--2024-08-11 07:06:18.418529 UTC---
| Itration            | 35        |
| Real Det Return     | 1.03e+03  |
| Real Sto Return     | 1.04e+03  |
| Reward Loss         | -6.52e+05 |
| Running Env Steps   | 175000    |
| Running Forward KL  | 20.1      |
| Running Reverse KL  | 13.6      |
| Running Update Time | 35        |
-----------------------------------
--2024-08-11 07:08:35.741420 UTC---
| Itration            | 36        |
| Real Det Return     | 1.03e+03  |
| Real Sto Return     | 1.07e+03  |
| Reward Loss         | -6.75e+05 |
| Running Env Steps   | 180000    |
| Running Forward KL  | 19.6      |
| Running Reverse KL  | 12.5      |
| Running Update Time | 36        |
-----------------------------------
--2024-08-11 07:11:00.372471 UTC---
| Itration            | 37        |
| Real Det Return     | 1.02e+03  |
| Real Sto Return     | 1.05e+03  |
| Reward Loss         | -7.59e+05 |
| Running Env Steps   | 185000    |
| Running Forward KL  | 20        |
| Running Reverse KL  | 13.1      |
| Running Update Time | 37        |
-----------------------------------
--2024-08-11 07:13:22.353579 UTC---
| Itration            | 38        |
| Real Det Return     | 1.04e+03  |
| Real Sto Return     | 1.04e+03  |
| Reward Loss         | -7.68e+05 |
| Running Env Steps   | 190000    |
| Running Forward KL  | 19.7      |
| Running Reverse KL  | 13.1      |
| Running Update Time | 38        |
-----------------------------------
--2024-08-11 07:15:41.067232 UTC---
| Itration            | 39        |
| Real Det Return     | 1.04e+03  |
| Real Sto Return     | 1.06e+03  |
| Reward Loss         | -8.15e+05 |
| Running Env Steps   | 195000    |
| Running Forward KL  | 19.9      |
| Running Reverse KL  | 13        |
| Running Update Time | 39        |
-----------------------------------
--2024-08-11 07:18:04.728711 UTC--
| Itration            | 40       |
| Real Det Return     | 1.03e+03 |
| Real Sto Return     | 1.03e+03 |
| Reward Loss         | -7.9e+05 |
| Running Env Steps   | 200000   |
| Running Forward KL  | 18.7     |
| Running Reverse KL  | 22       |
| Running Update Time | 40       |
----------------------------------
--2024-08-11 07:20:23.043742 UTC--
| Itration            | 41       |
| Real Det Return     | 1.03e+03 |
| Real Sto Return     | 1.05e+03 |
| Reward Loss         | -7.9e+05 |
| Running Env Steps   | 205000   |
| Running Forward KL  | 19.1     |
| Running Reverse KL  | 55.2     |
| Running Update Time | 41       |
----------------------------------
--2024-08-11 07:22:43.365369 UTC---
| Itration            | 42        |
| Real Det Return     | 1.03e+03  |
| Real Sto Return     | 1.05e+03  |
| Reward Loss         | -9.69e+05 |
| Running Env Steps   | 210000    |
| Running Forward KL  | 19.3      |
| Running Reverse KL  | 12.9      |
| Running Update Time | 42        |
-----------------------------------
--2024-08-11 07:24:59.821974 UTC---
| Itration            | 43        |
| Real Det Return     | 1.04e+03  |
| Real Sto Return     | 878       |
| Reward Loss         | -7.87e+05 |
| Running Env Steps   | 215000    |
| Running Forward KL  | 18.9      |
| Running Reverse KL  | 54.6      |
| Running Update Time | 43        |
-----------------------------------
--2024-08-11 07:27:18.776770 UTC---
| Itration            | 44        |
| Real Det Return     | 1.04e+03  |
| Real Sto Return     | 1.09e+03  |
| Reward Loss         | -9.76e+05 |
| Running Env Steps   | 220000    |
| Running Forward KL  | 18.5      |
| Running Reverse KL  | 26.2      |
| Running Update Time | 44        |
-----------------------------------
--2024-08-11 07:29:42.187660 UTC---
| Itration            | 45        |
| Real Det Return     | 1.03e+03  |
| Real Sto Return     | 1.09e+03  |
| Reward Loss         | -1.07e+06 |
| Running Env Steps   | 225000    |
| Running Forward KL  | 19.4      |
| Running Reverse KL  | 12.6      |
| Running Update Time | 45        |
-----------------------------------
--2024-08-11 07:32:05.010126 UTC--
| Itration            | 46       |
| Real Det Return     | 1.03e+03 |
| Real Sto Return     | 1.07e+03 |
| Reward Loss         | -1.1e+06 |
| Running Env Steps   | 230000   |
| Running Forward KL  | 19.5     |
| Running Reverse KL  | 13.1     |
| Running Update Time | 46       |
----------------------------------
--2024-08-11 07:34:20.591662 UTC---
| Itration            | 47        |
| Real Det Return     | 1.02e+03  |
| Real Sto Return     | 1.06e+03  |
| Reward Loss         | -1.09e+06 |
| Running Env Steps   | 235000    |
| Running Forward KL  | 19.7      |
| Running Reverse KL  | 27.1      |
| Running Update Time | 47        |
-----------------------------------
--2024-08-11 07:36:45.863757 UTC---
| Itration            | 48        |
| Real Det Return     | 1.04e+03  |
| Real Sto Return     | 1.12e+03  |
| Reward Loss         | -1.15e+06 |
| Running Env Steps   | 240000    |
| Running Forward KL  | 19.3      |
| Running Reverse KL  | 12.5      |
| Running Update Time | 48        |
-----------------------------------
--2024-08-11 07:39:03.926365 UTC---
| Itration            | 49        |
| Real Det Return     | 1.04e+03  |
| Real Sto Return     | 980       |
| Reward Loss         | -1.27e+06 |
| Running Env Steps   | 245000    |
| Running Forward KL  | 19        |
| Running Reverse KL  | 13.6      |
| Running Update Time | 49        |
-----------------------------------
--2024-08-11 07:41:23.423596 UTC---
| Itration            | 50        |
| Real Det Return     | 1.02e+03  |
| Real Sto Return     | 1.11e+03  |
| Reward Loss         | -1.28e+06 |
| Running Env Steps   | 250000    |
| Running Forward KL  | 19.6      |
| Running Reverse KL  | 13        |
| Running Update Time | 50        |
-----------------------------------
--2024-08-11 07:43:46.499394 UTC---
| Itration            | 51        |
| Real Det Return     | 1.04e+03  |
| Real Sto Return     | 1.08e+03  |
| Reward Loss         | -1.26e+06 |
| Running Env Steps   | 255000    |
| Running Forward KL  | 19.1      |
| Running Reverse KL  | 19.1      |
| Running Update Time | 51        |
-----------------------------------
--2024-08-11 07:46:08.098049 UTC--
| Itration            | 52       |
| Real Det Return     | 1.04e+03 |
| Real Sto Return     | 1.07e+03 |
| Reward Loss         | -1.2e+06 |
| Running Env Steps   | 260000   |
| Running Forward KL  | 19.1     |
| Running Reverse KL  | 30.8     |
| Running Update Time | 52       |
----------------------------------
--2024-08-11 07:48:26.353361 UTC---
| Itration            | 53        |
| Real Det Return     | 1.05e+03  |
| Real Sto Return     | 1.1e+03   |
| Reward Loss         | -1.39e+06 |
| Running Env Steps   | 265000    |
| Running Forward KL  | 19.6      |
| Running Reverse KL  | 13.3      |
| Running Update Time | 53        |
-----------------------------------
--2024-08-11 07:50:51.401936 UTC---
| Itration            | 54        |
| Real Det Return     | 1.05e+03  |
| Real Sto Return     | 1.15e+03  |
| Reward Loss         | -1.47e+06 |
| Running Env Steps   | 270000    |
| Running Forward KL  | 18.9      |
| Running Reverse KL  | 36.7      |
| Running Update Time | 54        |
-----------------------------------
--2024-08-11 07:53:11.545630 UTC---
| Itration            | 55        |
| Real Det Return     | 1.05e+03  |
| Real Sto Return     | 1.15e+03  |
| Reward Loss         | -1.36e+06 |
| Running Env Steps   | 275000    |
| Running Forward KL  | 19        |
| Running Reverse KL  | 12.6      |
| Running Update Time | 55        |
-----------------------------------
--2024-08-11 07:55:31.634009 UTC---
| Itration            | 56        |
| Real Det Return     | 1.04e+03  |
| Real Sto Return     | 1.11e+03  |
| Reward Loss         | -1.31e+06 |
| Running Env Steps   | 280000    |
| Running Forward KL  | 18.8      |
| Running Reverse KL  | 38.5      |
| Running Update Time | 56        |
-----------------------------------
--2024-08-11 07:57:54.505680 UTC---
| Itration            | 57        |
| Real Det Return     | 1.04e+03  |
| Real Sto Return     | 1.11e+03  |
| Reward Loss         | -1.51e+06 |
| Running Env Steps   | 285000    |
| Running Forward KL  | 19.1      |
| Running Reverse KL  | 12.9      |
| Running Update Time | 57        |
-----------------------------------
--2024-08-11 08:00:13.210116 UTC---
| Itration            | 58        |
| Real Det Return     | 1.05e+03  |
| Real Sto Return     | 1.16e+03  |
| Reward Loss         | -1.46e+06 |
| Running Env Steps   | 290000    |
| Running Forward KL  | 19        |
| Running Reverse KL  | 13        |
| Running Update Time | 58        |
-----------------------------------
--2024-08-11 08:02:35.256974 UTC---
| Itration            | 59        |
| Real Det Return     | 1.05e+03  |
| Real Sto Return     | 1.09e+03  |
| Reward Loss         | -1.51e+06 |
| Running Env Steps   | 295000    |
| Running Forward KL  | 18.6      |
| Running Reverse KL  | 12.4      |
| Running Update Time | 59        |
-----------------------------------
--2024-08-11 08:04:56.884427 UTC---
| Itration            | 60        |
| Real Det Return     | 1.05e+03  |
| Real Sto Return     | 1.13e+03  |
| Reward Loss         | -1.58e+06 |
| Running Env Steps   | 300000    |
| Running Forward KL  | 19        |
| Running Reverse KL  | 12.9      |
| Running Update Time | 60        |
-----------------------------------
--2024-08-11 08:07:15.855849 UTC---
| Itration            | 61        |
| Real Det Return     | 1.05e+03  |
| Real Sto Return     | 1.19e+03  |
| Reward Loss         | -9.94e+05 |
| Running Env Steps   | 305000    |
| Running Forward KL  | 18.1      |
| Running Reverse KL  | 51.8      |
| Running Update Time | 61        |
-----------------------------------
--2024-08-11 08:09:38.082699 UTC---
| Itration            | 62        |
| Real Det Return     | 1.19e+03  |
| Real Sto Return     | 1.14e+03  |
| Reward Loss         | -1.46e+06 |
| Running Env Steps   | 310000    |
| Running Forward KL  | 18.3      |
| Running Reverse KL  | 12        |
| Running Update Time | 62        |
-----------------------------------
--2024-08-11 08:11:58.958589 UTC---
| Itration            | 63        |
| Real Det Return     | 1.06e+03  |
| Real Sto Return     | 1.16e+03  |
| Reward Loss         | -1.49e+06 |
| Running Env Steps   | 315000    |
| Running Forward KL  | 18.5      |
| Running Reverse KL  | 12.2      |
| Running Update Time | 63        |
-----------------------------------
--2024-08-11 08:14:18.526523 UTC---
| Itration            | 64        |
| Real Det Return     | 1.11e+03  |
| Real Sto Return     | 1.21e+03  |
| Reward Loss         | -1.53e+06 |
| Running Env Steps   | 320000    |
| Running Forward KL  | 18.4      |
| Running Reverse KL  | 12.1      |
| Running Update Time | 64        |
-----------------------------------
--2024-08-11 08:16:41.766332 UTC--
| Itration            | 65       |
| Real Det Return     | 1.06e+03 |
| Real Sto Return     | 1.19e+03 |
| Reward Loss         | -1.3e+06 |
| Running Env Steps   | 325000   |
| Running Forward KL  | 17.3     |
| Running Reverse KL  | 27.5     |
| Running Update Time | 65       |
----------------------------------
--2024-08-11 08:19:01.171734 UTC---
| Itration            | 66        |
| Real Det Return     | 1.13e+03  |
| Real Sto Return     | 1.23e+03  |
| Reward Loss         | -1.49e+06 |
| Running Env Steps   | 330000    |
| Running Forward KL  | 18.1      |
| Running Reverse KL  | 12.1      |
| Running Update Time | 66        |
-----------------------------------
--2024-08-11 08:21:20.315261 UTC---
| Itration            | 67        |
| Real Det Return     | 1.12e+03  |
| Real Sto Return     | 1.21e+03  |
| Reward Loss         | -1.59e+06 |
| Running Env Steps   | 335000    |
| Running Forward KL  | 18.5      |
| Running Reverse KL  | 12.6      |
| Running Update Time | 67        |
-----------------------------------
--2024-08-11 08:23:42.985630 UTC---
| Itration            | 68        |
| Real Det Return     | 1.12e+03  |
| Real Sto Return     | 1.23e+03  |
| Reward Loss         | -1.48e+06 |
| Running Env Steps   | 340000    |
| Running Forward KL  | 18.1      |
| Running Reverse KL  | 17.5      |
| Running Update Time | 68        |
-----------------------------------
--2024-08-11 08:26:01.677235 UTC---
| Itration            | 69        |
| Real Det Return     | 1.14e+03  |
| Real Sto Return     | 1.26e+03  |
| Reward Loss         | -1.67e+06 |
| Running Env Steps   | 345000    |
| Running Forward KL  | 18.3      |
| Running Reverse KL  | 32.8      |
| Running Update Time | 69        |
-----------------------------------
--2024-08-11 08:28:20.250264 UTC---
| Itration            | 70        |
| Real Det Return     | 1.15e+03  |
| Real Sto Return     | 1.09e+03  |
| Reward Loss         | -1.54e+06 |
| Running Env Steps   | 350000    |
| Running Forward KL  | 18        |
| Running Reverse KL  | 12.4      |
| Running Update Time | 70        |
-----------------------------------
--2024-08-11 08:30:39.570573 UTC---
| Itration            | 71        |
| Real Det Return     | 1.26e+03  |
| Real Sto Return     | 1.3e+03   |
| Reward Loss         | -1.57e+06 |
| Running Env Steps   | 355000    |
| Running Forward KL  | 17.8      |
| Running Reverse KL  | 89.6      |
| Running Update Time | 71        |
-----------------------------------
--2024-08-11 08:32:57.227691 UTC--
| Itration            | 72       |
| Real Det Return     | 1.13e+03 |
| Real Sto Return     | 1.28e+03 |
| Reward Loss         | -1.7e+06 |
| Running Env Steps   | 360000   |
| Running Forward KL  | 18.1     |
| Running Reverse KL  | 12.5     |
| Running Update Time | 72       |
----------------------------------
--2024-08-11 08:35:21.031505 UTC---
| Itration            | 73        |
| Real Det Return     | 1.17e+03  |
| Real Sto Return     | 1.31e+03  |
| Reward Loss         | -1.66e+06 |
| Running Env Steps   | 365000    |
| Running Forward KL  | 17.8      |
| Running Reverse KL  | 12.5      |
| Running Update Time | 73        |
-----------------------------------
--2024-08-11 08:37:39.690022 UTC---
| Itration            | 74        |
| Real Det Return     | 1.21e+03  |
| Real Sto Return     | 1.31e+03  |
| Reward Loss         | -1.69e+06 |
| Running Env Steps   | 370000    |
| Running Forward KL  | 18        |
| Running Reverse KL  | 12.5      |
| Running Update Time | 74        |
-----------------------------------
--2024-08-11 08:39:57.418117 UTC---
| Itration            | 75        |
| Real Det Return     | 1.14e+03  |
| Real Sto Return     | 1.29e+03  |
| Reward Loss         | -1.71e+06 |
| Running Env Steps   | 375000    |
| Running Forward KL  | 18.1      |
| Running Reverse KL  | 12.3      |
| Running Update Time | 75        |
-----------------------------------
--2024-08-11 08:42:19.854886 UTC---
| Itration            | 76        |
| Real Det Return     | 1.27e+03  |
| Real Sto Return     | 1.27e+03  |
| Reward Loss         | -1.75e+06 |
| Running Env Steps   | 380000    |
| Running Forward KL  | 17.9      |
| Running Reverse KL  | 12.5      |
| Running Update Time | 76        |
-----------------------------------
--2024-08-11 08:44:34.004242 UTC--
| Itration            | 77       |
| Real Det Return     | 1.33e+03 |
| Real Sto Return     | 1.24e+03 |
| Reward Loss         | -1.5e+06 |
| Running Env Steps   | 385000   |
| Running Forward KL  | 17.3     |
| Running Reverse KL  | 32.8     |
| Running Update Time | 77       |
----------------------------------
--2024-08-11 08:46:53.510061 UTC---
| Itration            | 78        |
| Real Det Return     | 1.18e+03  |
| Real Sto Return     | 1.24e+03  |
| Reward Loss         | -1.78e+06 |
| Running Env Steps   | 390000    |
| Running Forward KL  | 17.8      |
| Running Reverse KL  | 12.5      |
| Running Update Time | 78        |
-----------------------------------
--2024-08-11 08:49:14.971395 UTC---
| Itration            | 79        |
| Real Det Return     | 1.32e+03  |
| Real Sto Return     | 1.35e+03  |
| Reward Loss         | -1.75e+06 |
| Running Env Steps   | 395000    |
| Running Forward KL  | 17.2      |
| Running Reverse KL  | 12.1      |
| Running Update Time | 79        |
-----------------------------------
--2024-08-11 08:51:31.555891 UTC---
| Itration            | 80        |
| Real Det Return     | 1.39e+03  |
| Real Sto Return     | 1.33e+03  |
| Reward Loss         | -1.73e+06 |
| Running Env Steps   | 400000    |
| Running Forward KL  | 17        |
| Running Reverse KL  | 36.6      |
| Running Update Time | 80        |
-----------------------------------
--2024-08-11 08:53:49.377733 UTC---
| Itration            | 81        |
| Real Det Return     | 1.23e+03  |
| Real Sto Return     | 1.12e+03  |
| Reward Loss         | -1.74e+06 |
| Running Env Steps   | 405000    |
| Running Forward KL  | 17.3      |
| Running Reverse KL  | 37.1      |
| Running Update Time | 81        |
-----------------------------------
--2024-08-11 08:56:10.284116 UTC--
| Itration            | 82       |
| Real Det Return     | 1.28e+03 |
| Real Sto Return     | 1.4e+03  |
| Reward Loss         | -1.7e+06 |
| Running Env Steps   | 410000   |
| Running Forward KL  | 16.5     |
| Running Reverse KL  | 11.5     |
| Running Update Time | 82       |
----------------------------------
--2024-08-11 08:58:19.659211 UTC---
| Itration            | 83        |
| Real Det Return     | 1.39e+03  |
| Real Sto Return     | 966       |
| Reward Loss         | -1.51e+06 |
| Running Env Steps   | 415000    |
| Running Forward KL  | 15.6      |
| Running Reverse KL  | 19.5      |
| Running Update Time | 83        |
-----------------------------------
--2024-08-11 09:00:43.197169 UTC---
| Itration            | 84        |
| Real Det Return     | 1.26e+03  |
| Real Sto Return     | 1.4e+03   |
| Reward Loss         | -1.69e+06 |
| Running Env Steps   | 420000    |
| Running Forward KL  | 16.4      |
| Running Reverse KL  | 17.4      |
| Running Update Time | 84        |
-----------------------------------
--2024-08-11 09:03:02.408972 UTC---
| Itration            | 85        |
| Real Det Return     | 1.25e+03  |
| Real Sto Return     | 1.44e+03  |
| Reward Loss         | -1.89e+06 |
| Running Env Steps   | 425000    |
| Running Forward KL  | 16.9      |
| Running Reverse KL  | 11.9      |
| Running Update Time | 85        |
-----------------------------------
--2024-08-11 09:05:23.153022 UTC---
| Itration            | 86        |
| Real Det Return     | 1.14e+03  |
| Real Sto Return     | 1.27e+03  |
| Reward Loss         | -1.62e+06 |
| Running Env Steps   | 430000    |
| Running Forward KL  | 17.3      |
| Running Reverse KL  | 33.1      |
| Running Update Time | 86        |
-----------------------------------
--2024-08-11 09:07:33.608264 UTC---
| Itration            | 87        |
| Real Det Return     | 1.45e+03  |
| Real Sto Return     | 788       |
| Reward Loss         | -2.16e+06 |
| Running Env Steps   | 435000    |
| Running Forward KL  | 16.9      |
| Running Reverse KL  | 148       |
| Running Update Time | 87        |
-----------------------------------
--2024-08-11 09:09:49.549248 UTC---
| Itration            | 88        |
| Real Det Return     | 1.27e+03  |
| Real Sto Return     | 1.33e+03  |
| Reward Loss         | -2.04e+06 |
| Running Env Steps   | 440000    |
| Running Forward KL  | 16.5      |
| Running Reverse KL  | 81        |
| Running Update Time | 88        |
-----------------------------------
--2024-08-11 09:12:13.254361 UTC---
| Itration            | 89        |
| Real Det Return     | 1.28e+03  |
| Real Sto Return     | 1.33e+03  |
| Reward Loss         | -1.99e+06 |
| Running Env Steps   | 445000    |
| Running Forward KL  | 16.9      |
| Running Reverse KL  | 11.9      |
| Running Update Time | 89        |
-----------------------------------
--2024-08-11 09:14:33.025370 UTC--
| Itration            | 90       |
| Real Det Return     | 1.58e+03 |
| Real Sto Return     | 1.36e+03 |
| Reward Loss         | -1.8e+06 |
| Running Env Steps   | 450000   |
| Running Forward KL  | 16.4     |
| Running Reverse KL  | 12       |
| Running Update Time | 90       |
----------------------------------
--2024-08-11 09:16:51.343441 UTC---
| Itration            | 91        |
| Real Det Return     | 1.41e+03  |
| Real Sto Return     | 1.45e+03  |
| Reward Loss         | -1.73e+06 |
| Running Env Steps   | 455000    |
| Running Forward KL  | 15.9      |
| Running Reverse KL  | 11.5      |
| Running Update Time | 91        |
-----------------------------------
--2024-08-11 09:19:10.856089 UTC---
| Itration            | 92        |
| Real Det Return     | 1.53e+03  |
| Real Sto Return     | 1.29e+03  |
| Reward Loss         | -1.71e+06 |
| Running Env Steps   | 460000    |
| Running Forward KL  | 16.2      |
| Running Reverse KL  | 33.5      |
| Running Update Time | 92        |
-----------------------------------
--2024-08-11 09:21:04.496923 UTC---
| Itration            | 93        |
| Real Det Return     | 471       |
| Real Sto Return     | 1.35e+03  |
| Reward Loss         | -1.76e+06 |
| Running Env Steps   | 465000    |
| Running Forward KL  | 16.1      |
| Running Reverse KL  | 31.8      |
| Running Update Time | 93        |
-----------------------------------
--2024-08-11 09:23:14.073440 UTC---
| Itration            | 94        |
| Real Det Return     | 1.08e+03  |
| Real Sto Return     | 1.05e+03  |
| Reward Loss         | -1.51e+06 |
| Running Env Steps   | 470000    |
| Running Forward KL  | 16.6      |
| Running Reverse KL  | 75.3      |
| Running Update Time | 94        |
-----------------------------------
--2024-08-11 09:25:07.378218 UTC---
| Itration            | 95        |
| Real Det Return     | 600       |
| Real Sto Return     | 1.21e+03  |
| Reward Loss         | -1.34e+06 |
| Running Env Steps   | 475000    |
| Running Forward KL  | 15.4      |
| Running Reverse KL  | 73.1      |
| Running Update Time | 95        |
-----------------------------------
--2024-08-11 09:27:28.318335 UTC---
| Itration            | 96        |
| Real Det Return     | 1.28e+03  |
| Real Sto Return     | 1.39e+03  |
| Reward Loss         | -1.86e+06 |
| Running Env Steps   | 480000    |
| Running Forward KL  | 16.6      |
| Running Reverse KL  | 12        |
| Running Update Time | 96        |
-----------------------------------
--2024-08-11 09:29:51.000125 UTC---
| Itration            | 97        |
| Real Det Return     | 1.31e+03  |
| Real Sto Return     | 1.44e+03  |
| Reward Loss         | -1.99e+06 |
| Running Env Steps   | 485000    |
| Running Forward KL  | 16.3      |
| Running Reverse KL  | 12        |
| Running Update Time | 97        |
-----------------------------------
--2024-08-11 09:32:10.109502 UTC---
| Itration            | 98        |
| Real Det Return     | 1.38e+03  |
| Real Sto Return     | 1.46e+03  |
| Reward Loss         | -1.81e+06 |
| Running Env Steps   | 490000    |
| Running Forward KL  | 16.2      |
| Running Reverse KL  | 51.4      |
| Running Update Time | 98        |
-----------------------------------
--2024-08-11 09:34:30.875075 UTC---
| Itration            | 99        |
| Real Det Return     | 1.23e+03  |
| Real Sto Return     | 1.38e+03  |
| Reward Loss         | -1.97e+06 |
| Running Env Steps   | 495000    |
| Running Forward KL  | 16.4      |
| Running Reverse KL  | 11.8      |
| Running Update Time | 99        |
-----------------------------------
--2024-08-11 09:36:52.988417 UTC---
| Itration            | 100       |
| Real Det Return     | 1.56e+03  |
| Real Sto Return     | 1.49e+03  |
| Reward Loss         | -1.99e+06 |
| Running Env Steps   | 500000    |
| Running Forward KL  | 16.1      |
| Running Reverse KL  | 29.9      |
| Running Update Time | 100       |
-----------------------------------
--2024-08-11 09:39:11.499971 UTC---
| Itration            | 101       |
| Real Det Return     | 1.29e+03  |
| Real Sto Return     | 1.5e+03   |
| Reward Loss         | -2.07e+06 |
| Running Env Steps   | 505000    |
| Running Forward KL  | 16.3      |
| Running Reverse KL  | 11.9      |
| Running Update Time | 101       |
-----------------------------------
--2024-08-11 09:41:07.087202 UTC--
| Itration            | 102      |
| Real Det Return     | 695      |
| Real Sto Return     | 1.23e+03 |
| Reward Loss         | -1.6e+06 |
| Running Env Steps   | 510000   |
| Running Forward KL  | 14.8     |
| Running Reverse KL  | 70.6     |
| Running Update Time | 102      |
----------------------------------
--2024-08-11 09:43:26.266219 UTC---
| Itration            | 103       |
| Real Det Return     | 1.38e+03  |
| Real Sto Return     | 1.47e+03  |
| Reward Loss         | -1.97e+06 |
| Running Env Steps   | 515000    |
| Running Forward KL  | 15.9      |
| Running Reverse KL  | 13.6      |
| Running Update Time | 103       |
-----------------------------------
--2024-08-11 09:45:34.451361 UTC---
| Itration            | 104       |
| Real Det Return     | 773       |
| Real Sto Return     | 1.5e+03   |
| Reward Loss         | -1.64e+06 |
| Running Env Steps   | 520000    |
| Running Forward KL  | 15.5      |
| Running Reverse KL  | 38.4      |
| Running Update Time | 104       |
-----------------------------------
--2024-08-11 09:47:56.568848 UTC---
| Itration            | 105       |
| Real Det Return     | 1.4e+03   |
| Real Sto Return     | 1.41e+03  |
| Reward Loss         | -1.98e+06 |
| Running Env Steps   | 525000    |
| Running Forward KL  | 16        |
| Running Reverse KL  | 12        |
| Running Update Time | 105       |
-----------------------------------
--2024-08-11 09:50:12.125670 UTC---
| Itration            | 106       |
| Real Det Return     | 1.53e+03  |
| Real Sto Return     | 1.52e+03  |
| Reward Loss         | -1.95e+06 |
| Running Env Steps   | 530000    |
| Running Forward KL  | 15.8      |
| Running Reverse KL  | 11.8      |
| Running Update Time | 106       |
-----------------------------------
--2024-08-11 09:52:34.561674 UTC---
| Itration            | 107       |
| Real Det Return     | 1.44e+03  |
| Real Sto Return     | 1.56e+03  |
| Reward Loss         | -2.01e+06 |
| Running Env Steps   | 535000    |
| Running Forward KL  | 16.1      |
| Running Reverse KL  | 12.1      |
| Running Update Time | 107       |
-----------------------------------
--2024-08-11 09:54:54.801717 UTC---
| Itration            | 108       |
| Real Det Return     | 1.61e+03  |
| Real Sto Return     | 1.61e+03  |
| Reward Loss         | -2.04e+06 |
| Running Env Steps   | 540000    |
| Running Forward KL  | 16        |
| Running Reverse KL  | 11.7      |
| Running Update Time | 108       |
-----------------------------------
--2024-08-11 09:57:15.787707 UTC---
| Itration            | 109       |
| Real Det Return     | 1.56e+03  |
| Real Sto Return     | 1.64e+03  |
| Reward Loss         | -1.94e+06 |
| Running Env Steps   | 545000    |
| Running Forward KL  | 15.2      |
| Running Reverse KL  | 11.5      |
| Running Update Time | 109       |
-----------------------------------
--2024-08-11 09:59:39.403309 UTC---
| Itration            | 110       |
| Real Det Return     | 1.74e+03  |
| Real Sto Return     | 1.59e+03  |
| Reward Loss         | -2.12e+06 |
| Running Env Steps   | 550000    |
| Running Forward KL  | 15.2      |
| Running Reverse KL  | 64.8      |
| Running Update Time | 110       |
-----------------------------------
--2024-08-11 10:01:33.738753 UTC---
| Itration            | 111       |
| Real Det Return     | 574       |
| Real Sto Return     | 1.49e+03  |
| Reward Loss         | -1.78e+06 |
| Running Env Steps   | 555000    |
| Running Forward KL  | 15.5      |
| Running Reverse KL  | 48.2      |
| Running Update Time | 111       |
-----------------------------------
--2024-08-11 10:03:52.492714 UTC--
| Itration            | 112      |
| Real Det Return     | 1.83e+03 |
| Real Sto Return     | 1.5e+03  |
| Reward Loss         | -1.8e+06 |
| Running Env Steps   | 560000   |
| Running Forward KL  | 14.8     |
| Running Reverse KL  | 11       |
| Running Update Time | 112      |
----------------------------------
--2024-08-11 10:06:13.999574 UTC---
| Itration            | 113       |
| Real Det Return     | 1.62e+03  |
| Real Sto Return     | 1.62e+03  |
| Reward Loss         | -1.82e+06 |
| Running Env Steps   | 565000    |
| Running Forward KL  | 15        |
| Running Reverse KL  | 31.6      |
| Running Update Time | 113       |
-----------------------------------
--2024-08-11 10:08:29.261399 UTC---
| Itration            | 114       |
| Real Det Return     | 1.47e+03  |
| Real Sto Return     | 1.67e+03  |
| Reward Loss         | -2.03e+06 |
| Running Env Steps   | 570000    |
| Running Forward KL  | 15.3      |
| Running Reverse KL  | 11.9      |
| Running Update Time | 114       |
-----------------------------------
--2024-08-11 10:10:52.165304 UTC--
| Itration            | 115      |
| Real Det Return     | 1.63e+03 |
| Real Sto Return     | 1.66e+03 |
| Reward Loss         | -2.1e+06 |
| Running Env Steps   | 575000   |
| Running Forward KL  | 15.5     |
| Running Reverse KL  | 12       |
| Running Update Time | 115      |
----------------------------------
--2024-08-11 10:13:12.073378 UTC---
| Itration            | 116       |
| Real Det Return     | 1.63e+03  |
| Real Sto Return     | 1.66e+03  |
| Reward Loss         | -2.02e+06 |
| Running Env Steps   | 580000    |
| Running Forward KL  | 15.4      |
| Running Reverse KL  | 12        |
| Running Update Time | 116       |
-----------------------------------
--2024-08-11 10:15:33.121105 UTC---
| Itration            | 117       |
| Real Det Return     | 1.51e+03  |
| Real Sto Return     | 1.6e+03   |
| Reward Loss         | -2.18e+06 |
| Running Env Steps   | 585000    |
| Running Forward KL  | 15.9      |
| Running Reverse KL  | 12.2      |
| Running Update Time | 117       |
-----------------------------------
--2024-08-11 10:17:54.656783 UTC---
| Itration            | 118       |
| Real Det Return     | 1.56e+03  |
| Real Sto Return     | 1.69e+03  |
| Reward Loss         | -2.03e+06 |
| Running Env Steps   | 590000    |
| Running Forward KL  | 15.3      |
| Running Reverse KL  | 11.9      |
| Running Update Time | 118       |
-----------------------------------
--2024-08-11 10:20:14.277581 UTC---
| Itration            | 119       |
| Real Det Return     | 1.58e+03  |
| Real Sto Return     | 1.68e+03  |
| Reward Loss         | -2.04e+06 |
| Running Env Steps   | 595000    |
| Running Forward KL  | 15.1      |
| Running Reverse KL  | 11.8      |
| Running Update Time | 119       |
-----------------------------------
--2024-08-11 10:22:34.915721 UTC---
| Itration            | 120       |
| Real Det Return     | 1.59e+03  |
| Real Sto Return     | 1.71e+03  |
| Reward Loss         | -2.03e+06 |
| Running Env Steps   | 600000    |
| Running Forward KL  | 15.5      |
| Running Reverse KL  | 12        |
| Running Update Time | 120       |
-----------------------------------
--2024-08-11 10:24:55.455727 UTC---
| Itration            | 121       |
| Real Det Return     | 1.83e+03  |
| Real Sto Return     | 1.63e+03  |
| Reward Loss         | -2.11e+06 |
| Running Env Steps   | 605000    |
| Running Forward KL  | 15.1      |
| Running Reverse KL  | 11.7      |
| Running Update Time | 121       |
-----------------------------------
--2024-08-11 10:27:17.172725 UTC--
| Itration            | 122      |
| Real Det Return     | 1.54e+03 |
| Real Sto Return     | 1.7e+03  |
| Reward Loss         | -2e+06   |
| Running Env Steps   | 610000   |
| Running Forward KL  | 15       |
| Running Reverse KL  | 11.4     |
| Running Update Time | 122      |
----------------------------------
--2024-08-11 10:29:35.186238 UTC---
| Itration            | 123       |
| Real Det Return     | 1.86e+03  |
| Real Sto Return     | 1.65e+03  |
| Reward Loss         | -1.89e+06 |
| Running Env Steps   | 615000    |
| Running Forward KL  | 13.9      |
| Running Reverse KL  | 10.8      |
| Running Update Time | 123       |
-----------------------------------
--2024-08-11 10:31:57.465936 UTC--
| Itration            | 124      |
| Real Det Return     | 1.66e+03 |
| Real Sto Return     | 1.74e+03 |
| Reward Loss         | -2.2e+06 |
| Running Env Steps   | 620000   |
| Running Forward KL  | 15.4     |
| Running Reverse KL  | 12.6     |
| Running Update Time | 124      |
----------------------------------
--2024-08-11 10:34:18.895413 UTC---
| Itration            | 125       |
| Real Det Return     | 1.53e+03  |
| Real Sto Return     | 1.78e+03  |
| Reward Loss         | -2.13e+06 |
| Running Env Steps   | 625000    |
| Running Forward KL  | 14.9      |
| Running Reverse KL  | 12        |
| Running Update Time | 125       |
-----------------------------------
--2024-08-11 10:36:41.303817 UTC---
| Itration            | 126       |
| Real Det Return     | 1.61e+03  |
| Real Sto Return     | 1.69e+03  |
| Reward Loss         | -2.27e+06 |
| Running Env Steps   | 630000    |
| Running Forward KL  | 15.8      |
| Running Reverse KL  | 12.1      |
| Running Update Time | 126       |
-----------------------------------
--2024-08-11 10:39:02.907390 UTC---
| Itration            | 127       |
| Real Det Return     | 1.79e+03  |
| Real Sto Return     | 1.81e+03  |
| Reward Loss         | -2.15e+06 |
| Running Env Steps   | 635000    |
| Running Forward KL  | 15.3      |
| Running Reverse KL  | 12.6      |
| Running Update Time | 127       |
-----------------------------------
--2024-08-11 10:41:24.031405 UTC--
| Itration            | 128      |
| Real Det Return     | 1.78e+03 |
| Real Sto Return     | 1.75e+03 |
| Reward Loss         | -2.1e+06 |
| Running Env Steps   | 640000   |
| Running Forward KL  | 15.3     |
| Running Reverse KL  | 12.7     |
| Running Update Time | 128      |
----------------------------------
--2024-08-11 10:43:42.401872 UTC---
| Itration            | 129       |
| Real Det Return     | 1.73e+03  |
| Real Sto Return     | 1.55e+03  |
| Reward Loss         | -2.28e+06 |
| Running Env Steps   | 645000    |
| Running Forward KL  | 15.3      |
| Running Reverse KL  | 13.8      |
| Running Update Time | 129       |
-----------------------------------
--2024-08-11 10:46:03.056136 UTC---
| Itration            | 130       |
| Real Det Return     | 1.78e+03  |
| Real Sto Return     | 1.82e+03  |
| Reward Loss         | -2.28e+06 |
| Running Env Steps   | 650000    |
| Running Forward KL  | 15.3      |
| Running Reverse KL  | 12.3      |
| Running Update Time | 130       |
-----------------------------------
--2024-08-11 10:48:23.581112 UTC---
| Itration            | 131       |
| Real Det Return     | 1.75e+03  |
| Real Sto Return     | 1.76e+03  |
| Reward Loss         | -2.24e+06 |
| Running Env Steps   | 655000    |
| Running Forward KL  | 15.3      |
| Running Reverse KL  | 12.3      |
| Running Update Time | 131       |
-----------------------------------
--2024-08-11 10:50:45.004886 UTC---
| Itration            | 132       |
| Real Det Return     | 1.85e+03  |
| Real Sto Return     | 1.87e+03  |
| Reward Loss         | -2.17e+06 |
| Running Env Steps   | 660000    |
| Running Forward KL  | 15        |
| Running Reverse KL  | 12.7      |
| Running Update Time | 132       |
-----------------------------------
--2024-08-11 10:53:06.525045 UTC---
| Itration            | 133       |
| Real Det Return     | 1.85e+03  |
| Real Sto Return     | 1.88e+03  |
| Reward Loss         | -2.19e+06 |
| Running Env Steps   | 665000    |
| Running Forward KL  | 15.2      |
| Running Reverse KL  | 13        |
| Running Update Time | 133       |
-----------------------------------
--2024-08-11 10:55:27.960342 UTC---
| Itration            | 134       |
| Real Det Return     | 1.85e+03  |
| Real Sto Return     | 1.88e+03  |
| Reward Loss         | -2.08e+06 |
| Running Env Steps   | 670000    |
| Running Forward KL  | 14.7      |
| Running Reverse KL  | 12        |
| Running Update Time | 134       |
-----------------------------------
--2024-08-11 10:57:47.368127 UTC---
| Itration            | 135       |
| Real Det Return     | 1.94e+03  |
| Real Sto Return     | 1.75e+03  |
| Reward Loss         | -2.24e+06 |
| Running Env Steps   | 675000    |
| Running Forward KL  | 14.9      |
| Running Reverse KL  | 12.4      |
| Running Update Time | 135       |
-----------------------------------
--2024-08-11 11:00:09.549287 UTC--
| Itration            | 136      |
| Real Det Return     | 1.98e+03 |
| Real Sto Return     | 1.99e+03 |
| Reward Loss         | -2.1e+06 |
| Running Env Steps   | 680000   |
| Running Forward KL  | 14.5     |
| Running Reverse KL  | 12.5     |
| Running Update Time | 136      |
----------------------------------
--2024-08-11 11:02:31.290628 UTC---
| Itration            | 137       |
| Real Det Return     | 1.99e+03  |
| Real Sto Return     | 1.99e+03  |
| Reward Loss         | -2.12e+06 |
| Running Env Steps   | 685000    |
| Running Forward KL  | 14.3      |
| Running Reverse KL  | 11.8      |
| Running Update Time | 137       |
-----------------------------------
--2024-08-11 11:04:51.774872 UTC---
| Itration            | 138       |
| Real Det Return     | 2e+03     |
| Real Sto Return     | 1.84e+03  |
| Reward Loss         | -2.15e+06 |
| Running Env Steps   | 690000    |
| Running Forward KL  | 14.8      |
| Running Reverse KL  | 12.5      |
| Running Update Time | 138       |
-----------------------------------
--2024-08-11 11:07:13.205277 UTC---
| Itration            | 139       |
| Real Det Return     | 2.02e+03  |
| Real Sto Return     | 1.83e+03  |
| Reward Loss         | -2.09e+06 |
| Running Env Steps   | 695000    |
| Running Forward KL  | 14.1      |
| Running Reverse KL  | 34.4      |
| Running Update Time | 139       |
-----------------------------------
--2024-08-11 11:09:35.978121 UTC---
| Itration            | 140       |
| Real Det Return     | 2.06e+03  |
| Real Sto Return     | 2.01e+03  |
| Reward Loss         | -1.87e+06 |
| Running Env Steps   | 700000    |
| Running Forward KL  | 13.5      |
| Running Reverse KL  | 33.1      |
| Running Update Time | 140       |
-----------------------------------
--2024-08-11 11:11:57.489639 UTC---
| Itration            | 141       |
| Real Det Return     | 1.97e+03  |
| Real Sto Return     | 1.99e+03  |
| Reward Loss         | -2.09e+06 |
| Running Env Steps   | 705000    |
| Running Forward KL  | 13.8      |
| Running Reverse KL  | 12.2      |
| Running Update Time | 141       |
-----------------------------------
--2024-08-11 11:14:21.311999 UTC--
| Itration            | 142      |
| Real Det Return     | 1.98e+03 |
| Real Sto Return     | 1.97e+03 |
| Reward Loss         | -2.2e+06 |
| Running Env Steps   | 710000   |
| Running Forward KL  | 14.3     |
| Running Reverse KL  | 12       |
| Running Update Time | 142      |
----------------------------------
--2024-08-11 11:16:43.283366 UTC--
| Itration            | 143      |
| Real Det Return     | 2.08e+03 |
| Real Sto Return     | 2.09e+03 |
| Reward Loss         | -2.1e+06 |
| Running Env Steps   | 715000   |
| Running Forward KL  | 14       |
| Running Reverse KL  | 12       |
| Running Update Time | 143      |
----------------------------------
--2024-08-11 11:19:02.068734 UTC---
| Itration            | 144       |
| Real Det Return     | 1.98e+03  |
| Real Sto Return     | 1.69e+03  |
| Reward Loss         | -2.18e+06 |
| Running Env Steps   | 720000    |
| Running Forward KL  | 14.5      |
| Running Reverse KL  | 12.3      |
| Running Update Time | 144       |
-----------------------------------
--2024-08-11 11:21:17.557367 UTC---
| Itration            | 145       |
| Real Det Return     | 1.95e+03  |
| Real Sto Return     | 1.6e+03   |
| Reward Loss         | -2.44e+06 |
| Running Env Steps   | 725000    |
| Running Forward KL  | 14.2      |
| Running Reverse KL  | 89.7      |
| Running Update Time | 145       |
-----------------------------------
--2024-08-11 11:23:39.717398 UTC---
| Itration            | 146       |
| Real Det Return     | 2.03e+03  |
| Real Sto Return     | 2.05e+03  |
| Reward Loss         | -2.13e+06 |
| Running Env Steps   | 730000    |
| Running Forward KL  | 13.9      |
| Running Reverse KL  | 12        |
| Running Update Time | 146       |
-----------------------------------
--2024-08-11 11:26:02.890142 UTC---
| Itration            | 147       |
| Real Det Return     | 2.09e+03  |
| Real Sto Return     | 2.13e+03  |
| Reward Loss         | -2.09e+06 |
| Running Env Steps   | 735000    |
| Running Forward KL  | 13.7      |
| Running Reverse KL  | 11.8      |
| Running Update Time | 147       |
-----------------------------------
--2024-08-11 11:28:22.544147 UTC---
| Itration            | 148       |
| Real Det Return     | 2.02e+03  |
| Real Sto Return     | 1.96e+03  |
| Reward Loss         | -2.25e+06 |
| Running Env Steps   | 740000    |
| Running Forward KL  | 13.7      |
| Running Reverse KL  | 16.9      |
| Running Update Time | 148       |
-----------------------------------
--2024-08-11 11:30:43.111355 UTC---
| Itration            | 149       |
| Real Det Return     | 2.17e+03  |
| Real Sto Return     | 2.05e+03  |
| Reward Loss         | -2.09e+06 |
| Running Env Steps   | 745000    |
| Running Forward KL  | 13.1      |
| Running Reverse KL  | 11.1      |
| Running Update Time | 149       |
-----------------------------------
--2024-08-11 11:33:05.023583 UTC--
| Itration            | 150      |
| Real Det Return     | 2.08e+03 |
| Real Sto Return     | 2.12e+03 |
| Reward Loss         | -2.1e+06 |
| Running Env Steps   | 750000   |
| Running Forward KL  | 13.2     |
| Running Reverse KL  | 30.7     |
| Running Update Time | 150      |
----------------------------------
--2024-08-11 11:35:25.762052 UTC---
| Itration            | 151       |
| Real Det Return     | 2.23e+03  |
| Real Sto Return     | 2.23e+03  |
| Reward Loss         | -1.98e+06 |
| Running Env Steps   | 755000    |
| Running Forward KL  | 12.5      |
| Running Reverse KL  | 11.4      |
| Running Update Time | 151       |
-----------------------------------
--2024-08-11 11:37:46.057314 UTC--
| Itration            | 152      |
| Real Det Return     | 2.17e+03 |
| Real Sto Return     | 2.06e+03 |
| Reward Loss         | -2e+06   |
| Running Env Steps   | 760000   |
| Running Forward KL  | 13       |
| Running Reverse KL  | 11.7     |
| Running Update Time | 152      |
----------------------------------
--2024-08-11 11:39:58.507225 UTC---
| Itration            | 153       |
| Real Det Return     | 2.2e+03   |
| Real Sto Return     | 1.52e+03  |
| Reward Loss         | -2.16e+06 |
| Running Env Steps   | 765000    |
| Running Forward KL  | 13.6      |
| Running Reverse KL  | 67.9      |
| Running Update Time | 153       |
-----------------------------------
--2024-08-11 11:42:20.514514 UTC---
| Itration            | 154       |
| Real Det Return     | 2.11e+03  |
| Real Sto Return     | 2.14e+03  |
| Reward Loss         | -2.16e+06 |
| Running Env Steps   | 770000    |
| Running Forward KL  | 13.2      |
| Running Reverse KL  | 11.3      |
| Running Update Time | 154       |
-----------------------------------
--2024-08-11 11:44:37.656183 UTC---
| Itration            | 155       |
| Real Det Return     | 2.22e+03  |
| Real Sto Return     | 1.8e+03   |
| Reward Loss         | -2.07e+06 |
| Running Env Steps   | 775000    |
| Running Forward KL  | 12.5      |
| Running Reverse KL  | 22.1      |
| Running Update Time | 155       |
-----------------------------------
--2024-08-11 11:46:58.372548 UTC---
| Itration            | 156       |
| Real Det Return     | 2.22e+03  |
| Real Sto Return     | 2.27e+03  |
| Reward Loss         | -2.05e+06 |
| Running Env Steps   | 780000    |
| Running Forward KL  | 12.3      |
| Running Reverse KL  | 11.1      |
| Running Update Time | 156       |
-----------------------------------
--2024-08-11 11:49:20.172042 UTC---
| Itration            | 157       |
| Real Det Return     | 2.29e+03  |
| Real Sto Return     | 2.25e+03  |
| Reward Loss         | -2.17e+06 |
| Running Env Steps   | 785000    |
| Running Forward KL  | 12.8      |
| Running Reverse KL  | 26.7      |
| Running Update Time | 157       |
-----------------------------------
--2024-08-11 11:51:42.944813 UTC---
| Itration            | 158       |
| Real Det Return     | 2.45e+03  |
| Real Sto Return     | 2.34e+03  |
| Reward Loss         | -1.83e+06 |
| Running Env Steps   | 790000    |
| Running Forward KL  | 11.6      |
| Running Reverse KL  | 10.6      |
| Running Update Time | 158       |
-----------------------------------
--2024-08-11 11:54:02.311362 UTC---
| Itration            | 159       |
| Real Det Return     | 2.36e+03  |
| Real Sto Return     | 2.24e+03  |
| Reward Loss         | -1.78e+06 |
| Running Env Steps   | 795000    |
| Running Forward KL  | 10.7      |
| Running Reverse KL  | 22.4      |
| Running Update Time | 159       |
-----------------------------------
--2024-08-11 11:56:24.518676 UTC---
| Itration            | 160       |
| Real Det Return     | 2.26e+03  |
| Real Sto Return     | 2.33e+03  |
| Reward Loss         | -1.95e+06 |
| Running Env Steps   | 800000    |
| Running Forward KL  | 11.8      |
| Running Reverse KL  | 11        |
| Running Update Time | 160       |
-----------------------------------
--2024-08-11 11:58:45.477045 UTC---
| Itration            | 161       |
| Real Det Return     | 2.42e+03  |
| Real Sto Return     | 2.33e+03  |
| Reward Loss         | -1.76e+06 |
| Running Env Steps   | 805000    |
| Running Forward KL  | 10.2      |
| Running Reverse KL  | 12.5      |
| Running Update Time | 161       |
-----------------------------------
--2024-08-11 12:01:04.987917 UTC--
| Itration            | 162      |
| Real Det Return     | 2.37e+03 |
| Real Sto Return     | 2.24e+03 |
| Reward Loss         | -1.9e+06 |
| Running Env Steps   | 810000   |
| Running Forward KL  | 11.6     |
| Running Reverse KL  | 10.7     |
| Running Update Time | 162      |
----------------------------------
--2024-08-11 12:03:22.362289 UTC---
| Itration            | 163       |
| Real Det Return     | 2.09e+03  |
| Real Sto Return     | 2.4e+03   |
| Reward Loss         | -1.39e+06 |
| Running Env Steps   | 815000    |
| Running Forward KL  | 9.92      |
| Running Reverse KL  | 23.3      |
| Running Update Time | 163       |
-----------------------------------
--2024-08-11 12:05:43.339963 UTC---
| Itration            | 164       |
| Real Det Return     | 2.4e+03   |
| Real Sto Return     | 2.43e+03  |
| Reward Loss         | -1.84e+06 |
| Running Env Steps   | 820000    |
| Running Forward KL  | 11.3      |
| Running Reverse KL  | 10.2      |
| Running Update Time | 164       |
-----------------------------------
--2024-08-11 12:08:04.665381 UTC---
| Itration            | 165       |
| Real Det Return     | 2.42e+03  |
| Real Sto Return     | 2.48e+03  |
| Reward Loss         | -1.71e+06 |
| Running Env Steps   | 825000    |
| Running Forward KL  | 10.6      |
| Running Reverse KL  | 10        |
| Running Update Time | 165       |
-----------------------------------
--2024-08-11 12:10:28.392528 UTC---
| Itration            | 166       |
| Real Det Return     | 2.56e+03  |
| Real Sto Return     | 2.67e+03  |
| Reward Loss         | -1.42e+06 |
| Running Env Steps   | 830000    |
| Running Forward KL  | 8.53      |
| Running Reverse KL  | 8.33      |
| Running Update Time | 166       |
-----------------------------------
--2024-08-11 12:12:41.123700 UTC---
| Itration            | 167       |
| Real Det Return     | 2.53e+03  |
| Real Sto Return     | 1.9e+03   |
| Reward Loss         | -1.43e+06 |
| Running Env Steps   | 835000    |
| Running Forward KL  | 10.4      |
| Running Reverse KL  | 33.9      |
| Running Update Time | 167       |
-----------------------------------
--2024-08-11 12:15:03.263357 UTC---
| Itration            | 168       |
| Real Det Return     | 2.49e+03  |
| Real Sto Return     | 2.55e+03  |
| Reward Loss         | -1.71e+06 |
| Running Env Steps   | 840000    |
| Running Forward KL  | 10.6      |
| Running Reverse KL  | 9.3       |
| Running Update Time | 168       |
-----------------------------------
--2024-08-11 12:17:26.463716 UTC---
| Itration            | 169       |
| Real Det Return     | 2.54e+03  |
| Real Sto Return     | 2.59e+03  |
| Reward Loss         | -1.54e+06 |
| Running Env Steps   | 845000    |
| Running Forward KL  | 10        |
| Running Reverse KL  | 9.72      |
| Running Update Time | 169       |
-----------------------------------
--2024-08-11 12:19:46.954330 UTC---
| Itration            | 170       |
| Real Det Return     | 2.56e+03  |
| Real Sto Return     | 2.58e+03  |
| Reward Loss         | -1.51e+06 |
| Running Env Steps   | 850000    |
| Running Forward KL  | 10.2      |
| Running Reverse KL  | 9.77      |
| Running Update Time | 170       |
-----------------------------------
--2024-08-11 12:22:06.842955 UTC---
| Itration            | 171       |
| Real Det Return     | 2.62e+03  |
| Real Sto Return     | 2.55e+03  |
| Reward Loss         | -1.36e+06 |
| Running Env Steps   | 855000    |
| Running Forward KL  | 9.17      |
| Running Reverse KL  | 33.7      |
| Running Update Time | 171       |
-----------------------------------
--2024-08-11 12:24:30.528847 UTC---
| Itration            | 172       |
| Real Det Return     | 2.6e+03   |
| Real Sto Return     | 2.59e+03  |
| Reward Loss         | -1.55e+06 |
| Running Env Steps   | 860000    |
| Running Forward KL  | 10.2      |
| Running Reverse KL  | 9.81      |
| Running Update Time | 172       |
-----------------------------------
--2024-08-11 12:26:45.618071 UTC---
| Itration            | 173       |
| Real Det Return     | 2.83e+03  |
| Real Sto Return     | 2.52e+03  |
| Reward Loss         | -1.05e+06 |
| Running Env Steps   | 865000    |
| Running Forward KL  | 8.43      |
| Running Reverse KL  | 18        |
| Running Update Time | 173       |
-----------------------------------
--2024-08-11 12:29:09.217972 UTC---
| Itration            | 174       |
| Real Det Return     | 2.77e+03  |
| Real Sto Return     | 2.76e+03  |
| Reward Loss         | -9.38e+05 |
| Running Env Steps   | 870000    |
| Running Forward KL  | 8.11      |
| Running Reverse KL  | 7.44      |
| Running Update Time | 174       |
-----------------------------------
--2024-08-11 12:31:30.626498 UTC---
| Itration            | 175       |
| Real Det Return     | 2.77e+03  |
| Real Sto Return     | 2.74e+03  |
| Reward Loss         | -1.08e+06 |
| Running Env Steps   | 875000    |
| Running Forward KL  | 8.49      |
| Running Reverse KL  | 8.58      |
| Running Update Time | 175       |
-----------------------------------
--2024-08-11 12:33:48.666248 UTC---
| Itration            | 176       |
| Real Det Return     | 2.74e+03  |
| Real Sto Return     | 2.82e+03  |
| Reward Loss         | -1.17e+06 |
| Running Env Steps   | 880000    |
| Running Forward KL  | 7.28      |
| Running Reverse KL  | 7.19      |
| Running Update Time | 176       |
-----------------------------------
--2024-08-11 12:36:10.167214 UTC---
| Itration            | 177       |
| Real Det Return     | 2.84e+03  |
| Real Sto Return     | 2.62e+03  |
| Reward Loss         | -1.12e+06 |
| Running Env Steps   | 885000    |
| Running Forward KL  | 7.41      |
| Running Reverse KL  | 35.7      |
| Running Update Time | 177       |
-----------------------------------
--2024-08-11 12:38:30.474768 UTC---
| Itration            | 178       |
| Real Det Return     | 2.82e+03  |
| Real Sto Return     | 2.83e+03  |
| Reward Loss         | -9.59e+05 |
| Running Env Steps   | 890000    |
| Running Forward KL  | 6.14      |
| Running Reverse KL  | 6.83      |
| Running Update Time | 178       |
-----------------------------------
--2024-08-11 12:40:50.341752 UTC--
| Itration            | 179      |
| Real Det Return     | 2.78e+03 |
| Real Sto Return     | 2.72e+03 |
| Reward Loss         | -1.2e+06 |
| Running Env Steps   | 895000   |
| Running Forward KL  | 7.13     |
| Running Reverse KL  | 6.9      |
| Running Update Time | 179      |
----------------------------------
--2024-08-11 12:43:14.451389 UTC---
| Itration            | 180       |
| Real Det Return     | 3.04e+03  |
| Real Sto Return     | 2.95e+03  |
| Reward Loss         | -5.24e+05 |
| Running Env Steps   | 900000    |
| Running Forward KL  | 4.72      |
| Running Reverse KL  | 19.1      |
| Running Update Time | 180       |
-----------------------------------
--2024-08-11 12:45:34.046390 UTC---
| Itration            | 181       |
| Real Det Return     | 2.87e+03  |
| Real Sto Return     | 2.76e+03  |
| Reward Loss         | -1.05e+06 |
| Running Env Steps   | 905000    |
| Running Forward KL  | 7.07      |
| Running Reverse KL  | 6.59      |
| Running Update Time | 181       |
-----------------------------------
--2024-08-11 12:47:54.955018 UTC---
| Itration            | 182       |
| Real Det Return     | 2.9e+03   |
| Real Sto Return     | 2.82e+03  |
| Reward Loss         | -9.55e+05 |
| Running Env Steps   | 910000    |
| Running Forward KL  | 5.34      |
| Running Reverse KL  | 20.2      |
| Running Update Time | 182       |
-----------------------------------
--2024-08-11 12:50:19.912551 UTC---
| Itration            | 183       |
| Real Det Return     | 2.87e+03  |
| Real Sto Return     | 2.89e+03  |
| Reward Loss         | -9.44e+05 |
| Running Env Steps   | 915000    |
| Running Forward KL  | 5.24      |
| Running Reverse KL  | 11.4      |
| Running Update Time | 183       |
-----------------------------------
--2024-08-11 12:52:39.522223 UTC---
| Itration            | 184       |
| Real Det Return     | 2.59e+03  |
| Real Sto Return     | 2.58e+03  |
| Reward Loss         | -1.61e+06 |
| Running Env Steps   | 920000    |
| Running Forward KL  | 8.49      |
| Running Reverse KL  | 7.65      |
| Running Update Time | 184       |
-----------------------------------
--2024-08-11 12:54:59.511517 UTC---
| Itration            | 185       |
| Real Det Return     | 3.09e+03  |
| Real Sto Return     | 2.95e+03  |
| Reward Loss         | -2.71e+05 |
| Running Env Steps   | 925000    |
| Running Forward KL  | 4.73      |
| Running Reverse KL  | 37.6      |
| Running Update Time | 185       |
-----------------------------------
--2024-08-11 12:57:12.516102 UTC--
| Itration            | 186      |
| Real Det Return     | 3.22e+03 |
| Real Sto Return     | 2.07e+03 |
| Reward Loss         | 1.01e+05 |
| Running Env Steps   | 930000   |
| Running Forward KL  | 5.57     |
| Running Reverse KL  | 101      |
| Running Update Time | 186      |
----------------------------------
--2024-08-11 12:59:29.593109 UTC---
| Itration            | 187       |
| Real Det Return     | 2.72e+03  |
| Real Sto Return     | 2.5e+03   |
| Reward Loss         | -1.24e+06 |
| Running Env Steps   | 935000    |
| Running Forward KL  | 7.1       |
| Running Reverse KL  | 22.8      |
| Running Update Time | 187       |
-----------------------------------
--2024-08-11 13:01:50.926398 UTC---
| Itration            | 188       |
| Real Det Return     | 2.83e+03  |
| Real Sto Return     | 2.56e+03  |
| Reward Loss         | -1.49e+06 |
| Running Env Steps   | 940000    |
| Running Forward KL  | 7.76      |
| Running Reverse KL  | 27.4      |
| Running Update Time | 188       |
-----------------------------------
--2024-08-11 13:04:11.882931 UTC--
| Itration            | 189      |
| Real Det Return     | 3.05e+03 |
| Real Sto Return     | 2.98e+03 |
| Reward Loss         | -3e+05   |
| Running Env Steps   | 945000   |
| Running Forward KL  | 3.67     |
| Running Reverse KL  | 9.39     |
| Running Update Time | 189      |
----------------------------------
--2024-08-11 13:06:29.980929 UTC---
| Itration            | 190       |
| Real Det Return     | 2.96e+03  |
| Real Sto Return     | 2.79e+03  |
| Reward Loss         | -9.92e+05 |
| Running Env Steps   | 950000    |
| Running Forward KL  | 5.74      |
| Running Reverse KL  | 6.05      |
| Running Update Time | 190       |
-----------------------------------
--2024-08-11 13:08:54.956536 UTC---
| Itration            | 191       |
| Real Det Return     | 2.69e+03  |
| Real Sto Return     | 2.72e+03  |
| Reward Loss         | -1.47e+06 |
| Running Env Steps   | 955000    |
| Running Forward KL  | 8.29      |
| Running Reverse KL  | 7.63      |
| Running Update Time | 191       |
-----------------------------------
--2024-08-11 13:11:16.414916 UTC---
| Itration            | 192       |
| Real Det Return     | 3.08e+03  |
| Real Sto Return     | 3.07e+03  |
| Reward Loss         | -7.87e+05 |
| Running Env Steps   | 960000    |
| Running Forward KL  | 5.45      |
| Running Reverse KL  | 6.08      |
| Running Update Time | 192       |
-----------------------------------
--2024-08-11 13:13:36.131458 UTC---
| Itration            | 193       |
| Real Det Return     | 2.81e+03  |
| Real Sto Return     | 2.86e+03  |
| Reward Loss         | -1.33e+06 |
| Running Env Steps   | 965000    |
| Running Forward KL  | 7.23      |
| Running Reverse KL  | 6.79      |
| Running Update Time | 193       |
-----------------------------------
--2024-08-11 13:16:00.827874 UTC---
| Itration            | 194       |
| Real Det Return     | 2.81e+03  |
| Real Sto Return     | 2.84e+03  |
| Reward Loss         | -1.23e+06 |
| Running Env Steps   | 970000    |
| Running Forward KL  | 7.02      |
| Running Reverse KL  | 6.8       |
| Running Update Time | 194       |
-----------------------------------
--2024-08-11 13:18:22.909396 UTC---
| Itration            | 195       |
| Real Det Return     | 3.05e+03  |
| Real Sto Return     | 3.01e+03  |
| Reward Loss         | -9.29e+05 |
| Running Env Steps   | 975000    |
| Running Forward KL  | 5.23      |
| Running Reverse KL  | 5.65      |
| Running Update Time | 195       |
-----------------------------------
--2024-08-11 13:20:42.823300 UTC---
| Itration            | 196       |
| Real Det Return     | 2.87e+03  |
| Real Sto Return     | 2.89e+03  |
| Reward Loss         | -1.22e+06 |
| Running Env Steps   | 980000    |
| Running Forward KL  | 6.9       |
| Running Reverse KL  | 6.87      |
| Running Update Time | 196       |
-----------------------------------
--2024-08-11 13:23:07.202352 UTC---
| Itration            | 197       |
| Real Det Return     | 2.79e+03  |
| Real Sto Return     | 2.84e+03  |
| Reward Loss         | -1.31e+06 |
| Running Env Steps   | 985000    |
| Running Forward KL  | 7.28      |
| Running Reverse KL  | 29.4      |
| Running Update Time | 197       |
-----------------------------------
--2024-08-11 13:25:28.156453 UTC---
| Itration            | 198       |
| Real Det Return     | 2.9e+03   |
| Real Sto Return     | 2.94e+03  |
| Reward Loss         | -1.03e+06 |
| Running Env Steps   | 990000    |
| Running Forward KL  | 6.53      |
| Running Reverse KL  | 6.87      |
| Running Update Time | 198       |
-----------------------------------
--2024-08-11 13:27:46.924715 UTC---
| Itration            | 199       |
| Real Det Return     | 2.98e+03  |
| Real Sto Return     | 3.04e+03  |
| Reward Loss         | -3.15e+05 |
| Running Env Steps   | 995000    |
| Running Forward KL  | 4.45      |
| Running Reverse KL  | 14.7      |
| Running Update Time | 199       |
-----------------------------------
