Logging to logs/HalfCheetah-v4/UTILITY/2025_02_01_21_17_29
2025-02-01 21:21:41.120090 Eastern Standard Time
| Itration            | 0        |
| Real Det Return     | -0.93    |
| Real Sto Return     | -30.1    |
| Reward Loss         | -11      |
| Running Env Steps   | 0        |
| Running Update Time | 0        |
----------------------------------
2025-02-01 21:26:51.340728 Eastern Standard Time
| Itration            | 1        |
| Real Det Return     | -3.15    |
| Real Sto Return     | -41.7    |
| Reward Loss         | -8.09    |
| Running Env Steps   | 500      |
| Running Update Time | 1        |
----------------------------------
2025-02-01 21:33:04.049680 Eastern Standard Time
| Itration            | 2        |
| Real Det Return     | -2.11    |
| Real Sto Return     | -34.6    |
| Reward Loss         | -9.05    |
| Running Env Steps   | 1000     |
| Running Update Time | 2        |
----------------------------------
2025-02-01 21:40:04.409452 Eastern Standard Time
| Itration            | 3        |
| Real Det Return     | -3.11    |
| Real Sto Return     | -14.6    |
| Reward Loss         | 0.862    |
| Running Env Steps   | 1500     |
| Running Update Time | 3        |
----------------------------------
2025-02-01 21:47:15.183387 Eastern Standard Time
| Itration            | 4        |
| Real Det Return     | -1.79    |
| Real Sto Return     | -10.2    |
| Reward Loss         | 4.44     |
| Running Env Steps   | 2000     |
| Running Update Time | 4        |
----------------------------------
2025-02-01 21:54:36.634397 Eastern Standard Time
| Itration            | 5        |
| Real Det Return     | -3.24    |
| Real Sto Return     | -9.12    |
| Reward Loss         | -6       |
| Running Env Steps   | 2500     |
| Running Update Time | 5        |
----------------------------------
2025-02-01 22:02:06.308921 Eastern Standard Time
| Itration            | 6        |
| Real Det Return     | -2.43    |
| Real Sto Return     | 0.14     |
| Reward Loss         | 3.24     |
| Running Env Steps   | 3000     |
| Running Update Time | 6        |
----------------------------------
2025-02-01 22:10:07.932672 Eastern Standard Time
| Itration            | 7        |
| Real Det Return     | -2.95    |
| Real Sto Return     | -4.38    |
| Reward Loss         | -1.9     |
| Running Env Steps   | 3500     |
| Running Update Time | 7        |
----------------------------------
2025-02-01 22:17:58.176438 Eastern Standard Time
| Itration            | 8        |
| Real Det Return     | -2.54    |
| Real Sto Return     | -2.55    |
| Reward Loss         | -1.73    |
| Running Env Steps   | 4000     |
| Running Update Time | 8        |
----------------------------------
2025-02-01 22:25:46.568550 Eastern Standard Time
| Itration            | 9        |
| Real Det Return     | -5.63    |
| Real Sto Return     | -6.09    |
| Reward Loss         | -0.077   |
| Running Env Steps   | 4500     |
| Running Update Time | 9        |
----------------------------------
2025-02-01 22:33:25.314810 Eastern Standard Time
| Itration            | 10       |
| Real Det Return     | -4.01    |
| Real Sto Return     | 2.39     |
| Reward Loss         | 2.33     |
| Running Env Steps   | 5000     |
| Running Update Time | 10       |
----------------------------------
2025-02-01 22:41:09.209909 Eastern Standard Time
| Itration            | 11       |
| Real Det Return     | -1.89    |
| Real Sto Return     | 3.75     |
| Reward Loss         | -4.26    |
| Running Env Steps   | 5500     |
| Running Update Time | 11       |
----------------------------------
2025-02-01 22:49:09.257397 Eastern Standard Time
| Itration            | 12       |
| Real Det Return     | 13.1     |
| Real Sto Return     | 2.54     |
| Reward Loss         | -3.07    |
| Running Env Steps   | 6000     |
| Running Update Time | 12       |
----------------------------------
2025-02-01 22:56:40.780092 Eastern Standard Time
| Itration            | 13       |
| Real Det Return     | 13.2     |
| Real Sto Return     | -1.58    |
| Reward Loss         | -2.89    |
| Running Env Steps   | 6500     |
| Running Update Time | 13       |
----------------------------------
2025-02-01 23:04:17.885950 Eastern Standard Time
| Itration            | 14       |
| Real Det Return     | 0.55     |
| Real Sto Return     | 1.19     |
| Reward Loss         | -3.37    |
| Running Env Steps   | 7000     |
| Running Update Time | 14       |
----------------------------------
2025-02-01 23:12:29.392750 Eastern Standard Time
| Itration            | 15       |
| Real Det Return     | 13       |
| Real Sto Return     | -0.35    |
| Reward Loss         | -0.878   |
| Running Env Steps   | 7500     |
| Running Update Time | 15       |
----------------------------------
2025-02-01 23:20:55.725009 Eastern Standard Time
| Itration            | 16       |
| Real Det Return     | 8.25     |
| Real Sto Return     | -0.67    |
| Reward Loss         | -1.87    |
| Running Env Steps   | 8000     |
| Running Update Time | 16       |
----------------------------------
2025-02-01 23:29:54.869142 Eastern Standard Time
| Itration            | 17       |
| Real Det Return     | 1.15     |
| Real Sto Return     | -2.64    |
| Reward Loss         | -1.83    |
| Running Env Steps   | 8500     |
| Running Update Time | 17       |
----------------------------------
2025-02-01 23:38:51.183401 Eastern Standard Time
| Itration            | 18       |
| Real Det Return     | 12.4     |
| Real Sto Return     | -1.22    |
| Reward Loss         | -1.41    |
| Running Env Steps   | 9000     |
| Running Update Time | 18       |
----------------------------------
2025-02-01 23:47:28.325970 Eastern Standard Time
| Itration            | 19       |
| Real Det Return     | 11.4     |
| Real Sto Return     | -1.49    |
| Reward Loss         | 0.626    |
| Running Env Steps   | 9500     |
| Running Update Time | 19       |
----------------------------------
2025-02-01 23:56:25.308916 Eastern Standard Time
| Itration            | 20       |
| Real Det Return     | 12.5     |
| Real Sto Return     | -1.88    |
| Reward Loss         | 1.05     |
| Running Env Steps   | 10000    |
| Running Update Time | 20       |
----------------------------------
2025-02-02 00:06:27.684556 Eastern Standard Time
| Itration            | 21       |
| Real Det Return     | 7.31     |
| Real Sto Return     | -3.77    |
| Reward Loss         | -0.932   |
| Running Env Steps   | 10500    |
| Running Update Time | 21       |
----------------------------------
2025-02-02 00:15:28.003159 Eastern Standard Time
| Itration            | 22       |
| Real Det Return     | 8.23     |
| Real Sto Return     | -5.35    |
| Reward Loss         | -1.11    |
| Running Env Steps   | 11000    |
| Running Update Time | 22       |
----------------------------------
