2023-07-07 13:58:59,053 -        meta learning: [    INFO] - [INFO] checkpoint saved to: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135859
2023-07-07 13:58:59,053 -        meta learning: [    INFO] - [INFO] tensorboard dir set to: ./runs/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135859
2023-07-07 13:58:59,054 -        meta learning: [    INFO] - [ARGS]: Namespace(policy='BatchedGruMetaStdpMLPPolicy', algo='PGPE', task='SeqTask', seq_length=20, latency=24, num_cls=5, feature_dims=14, sigma=0.1, batch_size=512, hidden_dims=[128], pop_size=256, center_lr=0.01, init_std=0.04, decay_std=0.999, limit_std=0.001, std_lr=0.07, terminate_when_unhealthy=False, max_iters=12000, num_tasks=1, seed=43, num_tests=128, eval_epoch=100, eval=False, eval_with_injury=False, resume='', save=False, repeat=1, root_dir='/data/anonymous/meta', tensorboard_dir='./runs', suffix='', output_dir='/data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135859', summary_writer=<torch.utils.tensorboard.writer.SummaryWriter object at 0x7f1f40365d60>, tb_prefix='PGPE/SeqTask/BatchedGruMetaStdpMLPPolicy')
2023-07-07 13:59:02,393 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 13:59:02,461 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 13:59:10,556 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 100, best=0.72, avg=0.71, std=0.01, steps=4.137e+05
2023-07-07 13:59:14,499 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 200, best=0.80, avg=0.78, std=0.01, steps=8.233e+05
2023-07-07 13:59:18,431 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 300, best=0.86, avg=0.85, std=0.01, steps=1.233e+06
2023-07-07 13:59:22,362 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 400, best=0.89, avg=0.88, std=0.01, steps=1.642e+06
2023-07-07 13:59:26,320 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 500, best=0.91, avg=0.90, std=0.00, steps=2.052e+06
2023-07-07 13:59:30,317 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 600, best=0.93, avg=0.92, std=0.00, steps=2.462e+06
2023-07-07 13:59:34,277 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 700, best=0.95, avg=0.94, std=0.00, steps=2.871e+06
2023-07-07 13:59:38,273 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 800, best=0.96, avg=0.95, std=0.00, steps=3.281e+06
2023-07-07 13:59:42,212 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 900, best=0.96, avg=0.95, std=0.00, steps=3.690e+06
2023-07-07 13:59:46,143 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1000, best=0.96, avg=0.95, std=0.00, steps=4.100e+06
2023-07-07 13:59:50,065 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1100, best=0.96, avg=0.95, std=0.00, steps=4.510e+06
2023-07-07 13:59:53,991 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1200, best=0.96, avg=0.95, std=0.00, steps=4.919e+06
2023-07-07 13:59:57,911 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1300, best=0.96, avg=0.95, std=0.00, steps=5.329e+06
2023-07-07 14:00:01,845 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1400, best=0.96, avg=0.95, std=0.00, steps=5.738e+06
2023-07-07 14:00:05,774 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1500, best=0.96, avg=0.96, std=0.00, steps=6.148e+06
2023-07-07 14:00:09,705 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1600, best=0.97, avg=0.96, std=0.00, steps=6.558e+06
2023-07-07 14:00:13,632 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1700, best=0.98, avg=0.98, std=0.00, steps=6.967e+06
2023-07-07 14:00:17,559 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1800, best=0.99, avg=0.98, std=0.00, steps=7.377e+06
2023-07-07 14:00:21,484 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1900, best=0.99, avg=0.99, std=0.00, steps=7.786e+06
2023-07-07 14:00:25,427 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2000, best=0.99, avg=0.99, std=0.00, steps=8.196e+06
2023-07-07 14:00:29,350 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2100, best=1.00, avg=0.99, std=0.00, steps=8.606e+06
2023-07-07 14:00:33,273 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2200, best=1.00, avg=0.99, std=0.00, steps=9.015e+06
2023-07-07 14:00:37,200 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2300, best=0.99, avg=0.99, std=0.00, steps=9.425e+06
2023-07-07 14:00:41,132 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2400, best=1.00, avg=1.00, std=0.00, steps=9.834e+06
2023-07-07 14:00:45,084 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2500, best=1.00, avg=1.00, std=0.00, steps=1.024e+07
2023-07-07 14:00:49,009 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2600, best=1.00, avg=1.00, std=0.00, steps=1.065e+07
2023-07-07 14:00:52,931 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2700, best=1.00, avg=1.00, std=0.00, steps=1.106e+07
2023-07-07 14:00:56,865 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2800, best=1.00, avg=1.00, std=0.00, steps=1.147e+07
2023-07-07 14:01:00,798 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2900, best=1.00, avg=1.00, std=0.00, steps=1.188e+07
2023-07-07 14:01:04,729 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3000, best=1.00, avg=1.00, std=0.00, steps=1.229e+07
2023-07-07 14:01:08,675 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3100, best=1.00, avg=1.00, std=0.00, steps=1.270e+07
2023-07-07 14:01:12,619 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3200, best=1.00, avg=1.00, std=0.00, steps=1.311e+07
2023-07-07 14:01:16,558 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3300, best=1.00, avg=1.00, std=0.00, steps=1.352e+07
2023-07-07 14:01:20,496 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3400, best=1.00, avg=1.00, std=0.00, steps=1.393e+07
2023-07-07 14:01:24,434 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3500, best=1.00, avg=1.00, std=0.00, steps=1.434e+07
2023-07-07 14:01:28,379 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3600, best=1.00, avg=1.00, std=0.00, steps=1.475e+07
2023-07-07 14:01:32,328 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3700, best=1.00, avg=1.00, std=0.00, steps=1.516e+07
2023-07-07 14:01:36,253 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3800, best=1.00, avg=1.00, std=0.00, steps=1.557e+07
2023-07-07 14:01:40,177 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3900, best=1.00, avg=1.00, std=0.00, steps=1.598e+07
2023-07-07 14:01:44,125 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4000, best=1.00, avg=1.00, std=0.00, steps=1.639e+07
2023-07-07 14:01:48,082 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4100, best=1.00, avg=1.00, std=0.00, steps=1.680e+07
2023-07-07 14:01:52,027 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4200, best=1.00, avg=1.00, std=0.00, steps=1.721e+07
2023-07-07 14:01:55,962 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4300, best=1.00, avg=1.00, std=0.00, steps=1.762e+07
2023-07-07 14:01:59,899 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4400, best=1.00, avg=1.00, std=0.00, steps=1.803e+07
2023-07-07 14:02:03,839 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4500, best=1.00, avg=1.00, std=0.00, steps=1.844e+07
2023-07-07 14:02:07,776 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4600, best=1.00, avg=1.00, std=0.00, steps=1.885e+07
2023-07-07 14:02:11,706 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4700, best=1.00, avg=1.00, std=0.00, steps=1.926e+07
2023-07-07 14:02:15,634 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4800, best=1.00, avg=1.00, std=0.00, steps=1.966e+07
2023-07-07 14:02:19,575 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4900, best=1.00, avg=1.00, std=0.00, steps=2.007e+07
2023-07-07 14:02:23,508 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5000, best=1.00, avg=1.00, std=0.00, steps=2.048e+07
2023-07-07 14:02:27,453 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5100, best=1.00, avg=1.00, std=0.00, steps=2.089e+07
2023-07-07 14:02:31,383 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5200, best=1.00, avg=1.00, std=0.00, steps=2.130e+07
2023-07-07 14:02:35,337 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5300, best=1.00, avg=1.00, std=0.00, steps=2.171e+07
2023-07-07 14:02:39,286 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5400, best=1.00, avg=1.00, std=0.00, steps=2.212e+07
2023-07-07 14:02:43,212 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5500, best=1.00, avg=1.00, std=0.00, steps=2.253e+07
2023-07-07 14:02:47,149 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5600, best=1.00, avg=1.00, std=0.00, steps=2.294e+07
2023-07-07 14:02:51,079 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5700, best=1.00, avg=1.00, std=0.00, steps=2.335e+07
2023-07-07 14:02:55,009 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5800, best=1.00, avg=1.00, std=0.00, steps=2.376e+07
2023-07-07 14:02:58,946 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5900, best=1.00, avg=1.00, std=0.00, steps=2.417e+07
2023-07-07 14:03:02,887 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6000, best=1.00, avg=1.00, std=0.00, steps=2.458e+07
2023-07-07 14:03:06,831 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6100, best=1.00, avg=1.00, std=0.00, steps=2.499e+07
2023-07-07 14:03:10,755 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6200, best=1.00, avg=1.00, std=0.00, steps=2.540e+07
2023-07-07 14:03:14,699 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6300, best=1.00, avg=1.00, std=0.00, steps=2.581e+07
2023-07-07 14:03:18,653 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6400, best=1.00, avg=1.00, std=0.00, steps=2.622e+07
2023-07-07 14:03:22,617 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6500, best=1.00, avg=1.00, std=0.00, steps=2.663e+07
2023-07-07 14:03:26,565 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6600, best=1.00, avg=1.00, std=0.00, steps=2.704e+07
2023-07-07 14:03:30,512 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6700, best=1.00, avg=1.00, std=0.00, steps=2.745e+07
2023-07-07 14:03:34,467 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6800, best=1.00, avg=1.00, std=0.00, steps=2.786e+07
2023-07-07 14:03:38,429 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6900, best=1.00, avg=1.00, std=0.00, steps=2.827e+07
2023-07-07 14:03:42,361 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7000, best=1.00, avg=1.00, std=0.00, steps=2.868e+07
2023-07-07 14:03:46,304 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7100, best=1.00, avg=1.00, std=0.00, steps=2.909e+07
2023-07-07 14:03:50,265 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7200, best=1.00, avg=1.00, std=0.00, steps=2.950e+07
2023-07-07 14:03:54,200 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7300, best=1.00, avg=1.00, std=0.00, steps=2.990e+07
2023-07-07 14:03:58,136 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7400, best=1.00, avg=1.00, std=0.00, steps=3.031e+07
2023-07-07 14:04:02,075 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7500, best=1.00, avg=1.00, std=0.00, steps=3.072e+07
2023-07-07 14:04:06,019 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7600, best=1.00, avg=1.00, std=0.00, steps=3.113e+07
2023-07-07 14:04:09,958 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7700, best=1.00, avg=1.00, std=0.00, steps=3.154e+07
2023-07-07 14:04:13,896 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7800, best=1.00, avg=1.00, std=0.00, steps=3.195e+07
2023-07-07 14:04:17,833 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7900, best=1.00, avg=1.00, std=0.00, steps=3.236e+07
2023-07-07 14:04:21,770 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8000, best=1.00, avg=1.00, std=0.00, steps=3.277e+07
2023-07-07 14:04:25,703 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8100, best=1.00, avg=1.00, std=0.00, steps=3.318e+07
2023-07-07 14:04:29,633 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8200, best=1.00, avg=1.00, std=0.00, steps=3.359e+07
2023-07-07 14:04:33,563 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8300, best=1.00, avg=1.00, std=0.00, steps=3.400e+07
2023-07-07 14:04:37,492 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8400, best=1.00, avg=1.00, std=0.00, steps=3.441e+07
2023-07-07 14:04:41,440 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8500, best=1.00, avg=1.00, std=0.00, steps=3.482e+07
2023-07-07 14:04:45,391 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8600, best=1.00, avg=1.00, std=0.00, steps=3.523e+07
2023-07-07 14:04:49,338 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8700, best=1.00, avg=1.00, std=0.00, steps=3.564e+07
2023-07-07 14:04:53,282 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8800, best=1.00, avg=1.00, std=0.00, steps=3.605e+07
2023-07-07 14:04:57,231 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8900, best=1.00, avg=1.00, std=0.00, steps=3.646e+07
2023-07-07 14:05:01,177 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9000, best=1.00, avg=1.00, std=0.00, steps=3.687e+07
2023-07-07 14:05:05,126 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9100, best=1.00, avg=1.00, std=0.00, steps=3.728e+07
2023-07-07 14:05:09,059 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9200, best=1.00, avg=1.00, std=0.00, steps=3.769e+07
2023-07-07 14:05:12,990 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9300, best=1.00, avg=1.00, std=0.00, steps=3.810e+07
2023-07-07 14:05:16,911 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9400, best=1.00, avg=1.00, std=0.00, steps=3.851e+07
2023-07-07 14:05:20,850 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9500, best=1.00, avg=1.00, std=0.00, steps=3.892e+07
2023-07-07 14:05:24,788 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9600, best=1.00, avg=1.00, std=0.00, steps=3.933e+07
2023-07-07 14:05:28,729 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9700, best=1.00, avg=1.00, std=0.00, steps=3.974e+07
2023-07-07 14:05:32,664 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9800, best=1.00, avg=1.00, std=0.00, steps=4.014e+07
2023-07-07 14:05:36,593 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9900, best=1.00, avg=1.00, std=0.00, steps=4.055e+07
2023-07-07 14:05:40,534 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10000, best=1.00, avg=1.00, std=0.00, steps=4.096e+07
2023-07-07 14:05:44,478 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10100, best=1.00, avg=1.00, std=0.00, steps=4.137e+07
2023-07-07 14:05:48,426 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10200, best=1.00, avg=1.00, std=0.00, steps=4.178e+07
2023-07-07 14:05:52,356 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10300, best=1.00, avg=1.00, std=0.00, steps=4.219e+07
2023-07-07 14:05:56,287 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10400, best=1.00, avg=1.00, std=0.00, steps=4.260e+07
2023-07-07 14:06:00,218 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10500, best=1.00, avg=1.00, std=0.00, steps=4.301e+07
2023-07-07 14:06:04,153 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10600, best=1.00, avg=1.00, std=0.00, steps=4.342e+07
2023-07-07 14:06:08,086 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10700, best=1.00, avg=1.00, std=0.00, steps=4.383e+07
2023-07-07 14:06:12,016 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10800, best=1.00, avg=1.00, std=0.00, steps=4.424e+07
2023-07-07 14:06:15,956 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10900, best=1.00, avg=1.00, std=0.00, steps=4.465e+07
2023-07-07 14:06:19,893 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11000, best=1.00, avg=1.00, std=0.00, steps=4.506e+07
2023-07-07 14:06:23,824 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11100, best=1.00, avg=1.00, std=0.00, steps=4.547e+07
2023-07-07 14:06:27,748 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11200, best=1.00, avg=1.00, std=0.00, steps=4.588e+07
2023-07-07 14:06:31,688 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11300, best=1.00, avg=1.00, std=0.00, steps=4.629e+07
2023-07-07 14:06:35,627 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11400, best=1.00, avg=1.00, std=0.00, steps=4.670e+07
2023-07-07 14:06:39,563 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11500, best=1.00, avg=1.00, std=0.00, steps=4.711e+07
2023-07-07 14:06:43,501 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11600, best=1.00, avg=1.00, std=0.00, steps=4.752e+07
2023-07-07 14:06:47,422 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11700, best=1.00, avg=1.00, std=0.00, steps=4.793e+07
2023-07-07 14:06:51,355 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11800, best=1.00, avg=1.00, std=0.00, steps=4.834e+07
2023-07-07 14:06:55,276 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11900, best=1.00, avg=1.00, std=0.00, steps=4.875e+07
2023-07-07 14:06:59,160 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11999, best=1.00, avg=1.00, std=0.00, steps=4.915e+07
2023-07-07 14:06:59,161 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135859
2023-07-07 14:06:59,189 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 14:06:59,222 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 14:07:07,117 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 100, best=0.69, avg=0.67, std=0.01, steps=6.205e+05
2023-07-07 14:07:12,874 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 200, best=0.73, avg=0.72, std=0.01, steps=1.235e+06
2023-07-07 14:07:18,618 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 300, best=0.76, avg=0.74, std=0.01, steps=1.849e+06
2023-07-07 14:07:24,343 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 400, best=0.78, avg=0.77, std=0.01, steps=2.464e+06
2023-07-07 14:07:30,077 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 500, best=0.80, avg=0.79, std=0.01, steps=3.078e+06
2023-07-07 14:07:35,825 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 600, best=0.82, avg=0.80, std=0.01, steps=3.693e+06
2023-07-07 14:07:41,563 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 700, best=0.84, avg=0.82, std=0.01, steps=4.307e+06
2023-07-07 14:07:47,312 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 800, best=0.85, avg=0.84, std=0.01, steps=4.921e+06
2023-07-07 14:07:53,084 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 900, best=0.87, avg=0.86, std=0.00, steps=5.536e+06
2023-07-07 14:07:58,807 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1000, best=0.88, avg=0.87, std=0.00, steps=6.150e+06
2023-07-07 14:08:04,565 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1100, best=0.89, avg=0.87, std=0.00, steps=6.765e+06
2023-07-07 14:08:10,293 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1200, best=0.89, avg=0.88, std=0.00, steps=7.379e+06
2023-07-07 14:08:16,034 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1300, best=0.89, avg=0.88, std=0.00, steps=7.993e+06
2023-07-07 14:08:21,767 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1400, best=0.90, avg=0.89, std=0.00, steps=8.608e+06
2023-07-07 14:08:27,497 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1500, best=0.90, avg=0.89, std=0.00, steps=9.222e+06
2023-07-07 14:08:33,219 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1600, best=0.90, avg=0.89, std=0.00, steps=9.837e+06
2023-07-07 14:08:38,951 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1700, best=0.90, avg=0.89, std=0.00, steps=1.045e+07
2023-07-07 14:08:44,696 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1800, best=0.90, avg=0.89, std=0.00, steps=1.107e+07
2023-07-07 14:08:50,448 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1900, best=0.90, avg=0.89, std=0.00, steps=1.168e+07
2023-07-07 14:08:56,203 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2000, best=0.90, avg=0.89, std=0.00, steps=1.229e+07
2023-07-07 14:09:01,962 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2100, best=0.90, avg=0.89, std=0.00, steps=1.291e+07
2023-07-07 14:09:07,715 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2200, best=0.90, avg=0.89, std=0.00, steps=1.352e+07
2023-07-07 14:09:13,447 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2300, best=0.90, avg=0.89, std=0.00, steps=1.414e+07
2023-07-07 14:09:19,170 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2400, best=0.90, avg=0.89, std=0.00, steps=1.475e+07
2023-07-07 14:09:24,891 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2500, best=0.90, avg=0.89, std=0.00, steps=1.537e+07
2023-07-07 14:09:30,617 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2600, best=0.90, avg=0.89, std=0.00, steps=1.598e+07
2023-07-07 14:09:36,374 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2700, best=0.90, avg=0.89, std=0.00, steps=1.659e+07
2023-07-07 14:09:42,111 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2800, best=0.90, avg=0.89, std=0.00, steps=1.721e+07
2023-07-07 14:09:47,840 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2900, best=0.90, avg=0.89, std=0.00, steps=1.782e+07
2023-07-07 14:09:53,584 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3000, best=0.90, avg=0.89, std=0.00, steps=1.844e+07
2023-07-07 14:09:59,338 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3100, best=0.90, avg=0.89, std=0.00, steps=1.905e+07
2023-07-07 14:10:05,082 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3200, best=0.90, avg=0.89, std=0.00, steps=1.967e+07
2023-07-07 14:10:10,835 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3300, best=0.90, avg=0.89, std=0.00, steps=2.028e+07
2023-07-07 14:10:16,594 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3400, best=0.91, avg=0.89, std=0.00, steps=2.090e+07
2023-07-07 14:10:22,367 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3500, best=0.90, avg=0.89, std=0.00, steps=2.151e+07
2023-07-07 14:10:28,114 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3600, best=0.90, avg=0.89, std=0.00, steps=2.212e+07
2023-07-07 14:10:33,855 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3700, best=0.90, avg=0.89, std=0.00, steps=2.274e+07
2023-07-07 14:10:39,610 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3800, best=0.90, avg=0.90, std=0.00, steps=2.335e+07
2023-07-07 14:10:45,369 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3900, best=0.90, avg=0.90, std=0.00, steps=2.397e+07
2023-07-07 14:10:51,125 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4000, best=0.91, avg=0.90, std=0.00, steps=2.458e+07
2023-07-07 14:10:56,867 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4100, best=0.91, avg=0.90, std=0.00, steps=2.520e+07
2023-07-07 14:11:02,612 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4200, best=0.91, avg=0.90, std=0.00, steps=2.581e+07
2023-07-07 14:11:08,343 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4300, best=0.91, avg=0.90, std=0.00, steps=2.643e+07
2023-07-07 14:11:14,093 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4400, best=0.91, avg=0.90, std=0.00, steps=2.704e+07
2023-07-07 14:11:19,830 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4500, best=0.91, avg=0.90, std=0.00, steps=2.765e+07
2023-07-07 14:11:25,578 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4600, best=0.91, avg=0.90, std=0.00, steps=2.827e+07
2023-07-07 14:11:31,315 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4700, best=0.91, avg=0.90, std=0.00, steps=2.888e+07
2023-07-07 14:11:37,058 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4800, best=0.91, avg=0.91, std=0.00, steps=2.950e+07
2023-07-07 14:11:42,789 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4900, best=0.92, avg=0.91, std=0.00, steps=3.011e+07
2023-07-07 14:11:48,527 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5000, best=0.92, avg=0.91, std=0.00, steps=3.073e+07
2023-07-07 14:11:54,274 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5100, best=0.92, avg=0.91, std=0.00, steps=3.134e+07
2023-07-07 14:12:00,032 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5200, best=0.92, avg=0.91, std=0.00, steps=3.195e+07
2023-07-07 14:12:05,779 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5300, best=0.92, avg=0.91, std=0.00, steps=3.257e+07
2023-07-07 14:12:11,521 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5400, best=0.92, avg=0.91, std=0.00, steps=3.318e+07
2023-07-07 14:12:17,260 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5500, best=0.92, avg=0.91, std=0.00, steps=3.380e+07
2023-07-07 14:12:22,995 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5600, best=0.92, avg=0.91, std=0.00, steps=3.441e+07
2023-07-07 14:12:28,720 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5700, best=0.92, avg=0.91, std=0.00, steps=3.503e+07
2023-07-07 14:12:34,463 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5800, best=0.92, avg=0.91, std=0.00, steps=3.564e+07
2023-07-07 14:12:40,224 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5900, best=0.92, avg=0.91, std=0.00, steps=3.626e+07
2023-07-07 14:12:45,971 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6000, best=0.92, avg=0.91, std=0.00, steps=3.687e+07
2023-07-07 14:12:51,721 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6100, best=0.92, avg=0.91, std=0.00, steps=3.748e+07
2023-07-07 14:12:57,462 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6200, best=0.92, avg=0.91, std=0.00, steps=3.810e+07
2023-07-07 14:13:03,213 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6300, best=0.92, avg=0.91, std=0.00, steps=3.871e+07
2023-07-07 14:13:08,962 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6400, best=0.92, avg=0.91, std=0.00, steps=3.933e+07
2023-07-07 14:13:14,724 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6500, best=0.92, avg=0.91, std=0.00, steps=3.994e+07
2023-07-07 14:13:20,503 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6600, best=0.92, avg=0.91, std=0.00, steps=4.056e+07
2023-07-07 14:13:26,246 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6700, best=0.92, avg=0.91, std=0.00, steps=4.117e+07
2023-07-07 14:13:31,988 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6800, best=0.92, avg=0.91, std=0.00, steps=4.179e+07
2023-07-07 14:13:37,754 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6900, best=0.92, avg=0.91, std=0.00, steps=4.240e+07
2023-07-07 14:13:43,504 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7000, best=0.92, avg=0.91, std=0.00, steps=4.301e+07
2023-07-07 14:13:49,273 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7100, best=0.92, avg=0.91, std=0.00, steps=4.363e+07
2023-07-07 14:13:55,017 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7200, best=0.92, avg=0.91, std=0.00, steps=4.424e+07
2023-07-07 14:14:00,757 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7300, best=0.92, avg=0.91, std=0.00, steps=4.486e+07
2023-07-07 14:14:06,505 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7400, best=0.92, avg=0.91, std=0.00, steps=4.547e+07
2023-07-07 14:14:12,256 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7500, best=0.92, avg=0.91, std=0.00, steps=4.609e+07
2023-07-07 14:14:17,987 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7600, best=0.92, avg=0.91, std=0.00, steps=4.670e+07
2023-07-07 14:14:23,711 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7700, best=0.92, avg=0.91, std=0.00, steps=4.731e+07
2023-07-07 14:14:29,421 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7800, best=0.92, avg=0.91, std=0.00, steps=4.793e+07
2023-07-07 14:14:35,159 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7900, best=0.92, avg=0.91, std=0.00, steps=4.854e+07
2023-07-07 14:14:40,884 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8000, best=0.92, avg=0.91, std=0.00, steps=4.916e+07
2023-07-07 14:14:46,628 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8100, best=0.92, avg=0.91, std=0.00, steps=4.977e+07
2023-07-07 14:14:52,391 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8200, best=0.92, avg=0.91, std=0.00, steps=5.039e+07
2023-07-07 14:14:58,138 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8300, best=0.92, avg=0.91, std=0.00, steps=5.100e+07
2023-07-07 14:15:03,874 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8400, best=0.92, avg=0.91, std=0.00, steps=5.162e+07
2023-07-07 14:15:09,624 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8500, best=0.92, avg=0.91, std=0.00, steps=5.223e+07
2023-07-07 14:15:15,368 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8600, best=0.92, avg=0.91, std=0.00, steps=5.284e+07
2023-07-07 14:15:21,112 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8700, best=0.92, avg=0.91, std=0.00, steps=5.346e+07
2023-07-07 14:15:26,839 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8800, best=0.92, avg=0.91, std=0.00, steps=5.407e+07
2023-07-07 14:15:32,583 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8900, best=0.92, avg=0.91, std=0.00, steps=5.469e+07
2023-07-07 14:15:38,307 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9000, best=0.92, avg=0.91, std=0.00, steps=5.530e+07
2023-07-07 14:15:44,054 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9100, best=0.92, avg=0.91, std=0.00, steps=5.592e+07
2023-07-07 14:15:49,785 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9200, best=0.92, avg=0.91, std=0.00, steps=5.653e+07
2023-07-07 14:15:55,512 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9300, best=0.92, avg=0.91, std=0.00, steps=5.715e+07
2023-07-07 14:16:01,260 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9400, best=0.92, avg=0.91, std=0.00, steps=5.776e+07
2023-07-07 14:16:07,025 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9500, best=0.92, avg=0.91, std=0.00, steps=5.837e+07
2023-07-07 14:16:12,799 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9600, best=0.92, avg=0.91, std=0.00, steps=5.899e+07
2023-07-07 14:16:18,562 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9700, best=0.92, avg=0.91, std=0.00, steps=5.960e+07
2023-07-07 14:16:24,318 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9800, best=0.92, avg=0.91, std=0.00, steps=6.022e+07
2023-07-07 14:16:30,056 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9900, best=0.92, avg=0.91, std=0.00, steps=6.083e+07
2023-07-07 14:16:35,806 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10000, best=0.92, avg=0.91, std=0.00, steps=6.145e+07
2023-07-07 14:16:41,550 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10100, best=0.93, avg=0.91, std=0.00, steps=6.206e+07
2023-07-07 14:16:47,303 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10200, best=0.92, avg=0.91, std=0.00, steps=6.267e+07
2023-07-07 14:16:53,048 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10300, best=0.92, avg=0.91, std=0.00, steps=6.329e+07
2023-07-07 14:16:58,796 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10400, best=0.92, avg=0.91, std=0.00, steps=6.390e+07
2023-07-07 14:17:04,550 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10500, best=0.92, avg=0.91, std=0.00, steps=6.452e+07
2023-07-07 14:17:10,297 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10600, best=0.92, avg=0.91, std=0.00, steps=6.513e+07
2023-07-07 14:17:16,028 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10700, best=0.92, avg=0.91, std=0.00, steps=6.575e+07
2023-07-07 14:17:21,762 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10800, best=0.92, avg=0.91, std=0.00, steps=6.636e+07
2023-07-07 14:17:27,507 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10900, best=0.92, avg=0.91, std=0.00, steps=6.698e+07
2023-07-07 14:17:33,250 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11000, best=0.92, avg=0.91, std=0.00, steps=6.759e+07
2023-07-07 14:17:38,991 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11100, best=0.92, avg=0.91, std=0.00, steps=6.820e+07
2023-07-07 14:17:44,756 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11200, best=0.92, avg=0.91, std=0.00, steps=6.882e+07
2023-07-07 14:17:50,492 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11300, best=0.92, avg=0.91, std=0.00, steps=6.943e+07
2023-07-07 14:17:56,250 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11400, best=0.92, avg=0.91, std=0.00, steps=7.005e+07
2023-07-07 14:18:01,981 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11500, best=0.92, avg=0.91, std=0.00, steps=7.066e+07
2023-07-07 14:18:07,712 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11600, best=0.92, avg=0.91, std=0.00, steps=7.128e+07
2023-07-07 14:18:13,448 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11700, best=0.92, avg=0.91, std=0.00, steps=7.189e+07
2023-07-07 14:18:19,199 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11800, best=0.92, avg=0.91, std=0.00, steps=7.251e+07
2023-07-07 14:18:24,942 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11900, best=0.92, avg=0.91, std=0.00, steps=7.312e+07
2023-07-07 14:18:30,634 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11999, best=0.92, avg=0.91, std=0.00, steps=7.373e+07
2023-07-07 14:18:30,635 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135859
2023-07-07 14:18:30,661 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 14:18:30,694 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 14:18:40,426 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 100, best=0.65, avg=0.64, std=0.01, steps=8.274e+05
2023-07-07 14:18:47,979 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 200, best=0.69, avg=0.67, std=0.01, steps=1.647e+06
2023-07-07 14:18:55,532 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 300, best=0.71, avg=0.70, std=0.01, steps=2.466e+06
2023-07-07 14:19:03,063 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 400, best=0.72, avg=0.71, std=0.01, steps=3.285e+06
2023-07-07 14:19:10,601 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 500, best=0.73, avg=0.72, std=0.01, steps=4.104e+06
2023-07-07 14:19:18,140 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 600, best=0.74, avg=0.73, std=0.01, steps=4.923e+06
2023-07-07 14:19:25,689 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 700, best=0.75, avg=0.74, std=0.01, steps=5.743e+06
2023-07-07 14:19:33,237 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 800, best=0.77, avg=0.75, std=0.01, steps=6.562e+06
2023-07-07 14:19:40,794 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 900, best=0.77, avg=0.76, std=0.01, steps=7.381e+06
2023-07-07 14:19:48,338 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1000, best=0.78, avg=0.77, std=0.01, steps=8.200e+06
2023-07-07 14:19:55,883 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1100, best=0.78, avg=0.77, std=0.01, steps=9.019e+06
2023-07-07 14:20:03,437 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1200, best=0.79, avg=0.77, std=0.01, steps=9.839e+06
2023-07-07 14:20:10,973 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1300, best=0.80, avg=0.79, std=0.01, steps=1.066e+07
2023-07-07 14:20:18,501 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1400, best=0.81, avg=0.79, std=0.01, steps=1.148e+07
2023-07-07 14:20:26,055 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1500, best=0.81, avg=0.80, std=0.00, steps=1.230e+07
2023-07-07 14:20:33,590 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1600, best=0.82, avg=0.81, std=0.01, steps=1.312e+07
2023-07-07 14:20:41,162 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1700, best=0.83, avg=0.81, std=0.00, steps=1.393e+07
2023-07-07 14:20:48,731 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1800, best=0.83, avg=0.82, std=0.00, steps=1.475e+07
2023-07-07 14:20:56,287 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1900, best=0.83, avg=0.82, std=0.01, steps=1.557e+07
2023-07-07 14:21:03,843 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2000, best=0.84, avg=0.83, std=0.01, steps=1.639e+07
2023-07-07 14:21:11,450 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2100, best=0.84, avg=0.83, std=0.00, steps=1.721e+07
2023-07-07 14:21:19,004 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2200, best=0.84, avg=0.83, std=0.00, steps=1.803e+07
2023-07-07 14:21:26,544 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2300, best=0.85, avg=0.83, std=0.01, steps=1.885e+07
2023-07-07 14:21:34,104 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2400, best=0.85, avg=0.84, std=0.00, steps=1.967e+07
2023-07-07 14:21:41,660 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2500, best=0.85, avg=0.84, std=0.00, steps=2.049e+07
2023-07-07 14:21:49,229 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2600, best=0.85, avg=0.84, std=0.00, steps=2.131e+07
2023-07-07 14:21:56,800 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2700, best=0.85, avg=0.84, std=0.00, steps=2.213e+07
2023-07-07 14:22:04,360 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2800, best=0.86, avg=0.84, std=0.00, steps=2.295e+07
2023-07-07 14:22:11,928 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2900, best=0.86, avg=0.84, std=0.00, steps=2.376e+07
2023-07-07 14:22:19,499 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3000, best=0.86, avg=0.84, std=0.00, steps=2.458e+07
2023-07-07 14:22:27,056 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3100, best=0.86, avg=0.84, std=0.00, steps=2.540e+07
2023-07-07 14:22:34,608 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3200, best=0.86, avg=0.85, std=0.00, steps=2.622e+07
2023-07-07 14:22:42,160 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3300, best=0.86, avg=0.85, std=0.00, steps=2.704e+07
2023-07-07 14:22:49,707 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3400, best=0.86, avg=0.85, std=0.00, steps=2.786e+07
2023-07-07 14:22:57,255 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3500, best=0.86, avg=0.85, std=0.00, steps=2.868e+07
2023-07-07 14:23:04,805 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3600, best=0.86, avg=0.85, std=0.00, steps=2.950e+07
2023-07-07 14:23:12,341 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3700, best=0.86, avg=0.85, std=0.00, steps=3.032e+07
2023-07-07 14:23:19,903 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3800, best=0.86, avg=0.85, std=0.00, steps=3.114e+07
2023-07-07 14:23:27,433 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3900, best=0.86, avg=0.85, std=0.00, steps=3.196e+07
2023-07-07 14:23:34,964 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4000, best=0.86, avg=0.85, std=0.00, steps=3.278e+07
2023-07-07 14:23:42,506 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4100, best=0.87, avg=0.85, std=0.00, steps=3.360e+07
2023-07-07 14:23:50,038 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4200, best=0.87, avg=0.85, std=0.00, steps=3.441e+07
2023-07-07 14:23:57,561 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4300, best=0.87, avg=0.86, std=0.00, steps=3.523e+07
2023-07-07 14:24:05,098 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4400, best=0.87, avg=0.85, std=0.00, steps=3.605e+07
2023-07-07 14:24:12,668 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4500, best=0.87, avg=0.86, std=0.00, steps=3.687e+07
2023-07-07 14:24:20,242 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4600, best=0.86, avg=0.86, std=0.00, steps=3.769e+07
2023-07-07 14:24:27,780 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4700, best=0.87, avg=0.86, std=0.00, steps=3.851e+07
2023-07-07 14:24:35,344 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4800, best=0.87, avg=0.86, std=0.00, steps=3.933e+07
2023-07-07 14:24:42,913 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4900, best=0.87, avg=0.86, std=0.00, steps=4.015e+07
2023-07-07 14:24:50,477 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5000, best=0.86, avg=0.86, std=0.00, steps=4.097e+07
2023-07-07 14:24:58,074 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5100, best=0.87, avg=0.86, std=0.00, steps=4.179e+07
2023-07-07 14:25:05,639 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5200, best=0.87, avg=0.86, std=0.00, steps=4.261e+07
2023-07-07 14:25:13,200 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5300, best=0.87, avg=0.86, std=0.00, steps=4.343e+07
2023-07-07 14:25:20,771 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5400, best=0.87, avg=0.86, std=0.00, steps=4.424e+07
2023-07-07 14:25:28,334 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5500, best=0.86, avg=0.86, std=0.00, steps=4.506e+07
2023-07-07 14:25:35,886 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5600, best=0.87, avg=0.86, std=0.00, steps=4.588e+07
2023-07-07 14:25:43,416 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5700, best=0.87, avg=0.86, std=0.00, steps=4.670e+07
2023-07-07 14:25:50,957 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5800, best=0.87, avg=0.86, std=0.00, steps=4.752e+07
2023-07-07 14:25:58,520 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5900, best=0.87, avg=0.86, std=0.00, steps=4.834e+07
2023-07-07 14:26:06,088 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6000, best=0.87, avg=0.86, std=0.00, steps=4.916e+07
2023-07-07 14:26:13,623 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6100, best=0.87, avg=0.86, std=0.00, steps=4.998e+07
2023-07-07 14:26:21,176 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6200, best=0.88, avg=0.86, std=0.00, steps=5.080e+07
2023-07-07 14:26:28,720 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6300, best=0.87, avg=0.86, std=0.00, steps=5.162e+07
2023-07-07 14:26:36,281 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6400, best=0.88, avg=0.87, std=0.00, steps=5.244e+07
2023-07-07 14:26:43,830 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6500, best=0.88, avg=0.87, std=0.00, steps=5.326e+07
2023-07-07 14:26:51,377 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6600, best=0.88, avg=0.87, std=0.00, steps=5.408e+07
2023-07-07 14:26:58,929 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6700, best=0.88, avg=0.87, std=0.00, steps=5.489e+07
2023-07-07 14:27:06,479 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6800, best=0.88, avg=0.87, std=0.00, steps=5.571e+07
2023-07-07 14:27:14,022 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6900, best=0.88, avg=0.87, std=0.00, steps=5.653e+07
2023-07-07 14:27:21,552 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7000, best=0.89, avg=0.87, std=0.00, steps=5.735e+07
2023-07-07 14:27:29,094 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7100, best=0.88, avg=0.87, std=0.00, steps=5.817e+07
2023-07-07 14:27:36,639 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7200, best=0.88, avg=0.87, std=0.00, steps=5.899e+07
2023-07-07 14:27:44,183 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7300, best=0.88, avg=0.87, std=0.00, steps=5.981e+07
2023-07-07 14:27:51,732 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7400, best=0.89, avg=0.87, std=0.01, steps=6.063e+07
2023-07-07 14:27:59,287 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7500, best=0.89, avg=0.87, std=0.00, steps=6.145e+07
2023-07-07 14:28:06,834 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7600, best=0.89, avg=0.88, std=0.00, steps=6.227e+07
2023-07-07 14:28:14,379 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7700, best=0.88, avg=0.88, std=0.00, steps=6.309e+07
2023-07-07 14:28:21,947 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7800, best=0.89, avg=0.88, std=0.00, steps=6.391e+07
2023-07-07 14:28:29,496 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7900, best=0.88, avg=0.88, std=0.00, steps=6.472e+07
2023-07-07 14:28:37,046 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8000, best=0.89, avg=0.88, std=0.00, steps=6.554e+07
2023-07-07 14:28:44,598 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8100, best=0.89, avg=0.88, std=0.00, steps=6.636e+07
2023-07-07 14:28:52,144 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8200, best=0.89, avg=0.88, std=0.00, steps=6.718e+07
2023-07-07 14:28:59,684 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8300, best=0.89, avg=0.88, std=0.00, steps=6.800e+07
2023-07-07 14:29:07,232 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8400, best=0.89, avg=0.88, std=0.00, steps=6.882e+07
2023-07-07 14:29:14,773 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8500, best=0.89, avg=0.88, std=0.00, steps=6.964e+07
2023-07-07 14:29:22,323 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8600, best=0.89, avg=0.88, std=0.00, steps=7.046e+07
2023-07-07 14:29:29,861 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8700, best=0.89, avg=0.88, std=0.00, steps=7.128e+07
2023-07-07 14:29:37,408 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8800, best=0.89, avg=0.88, std=0.00, steps=7.210e+07
2023-07-07 14:29:44,935 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8900, best=0.89, avg=0.88, std=0.00, steps=7.292e+07
2023-07-07 14:29:52,469 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9000, best=0.89, avg=0.88, std=0.00, steps=7.374e+07
2023-07-07 14:30:00,032 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9100, best=0.89, avg=0.88, std=0.00, steps=7.456e+07
2023-07-07 14:30:07,595 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9200, best=0.89, avg=0.88, std=0.00, steps=7.537e+07
2023-07-07 14:30:15,120 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9300, best=0.89, avg=0.88, std=0.00, steps=7.619e+07
2023-07-07 14:30:22,641 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9400, best=0.89, avg=0.88, std=0.00, steps=7.701e+07
2023-07-07 14:30:30,162 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9500, best=0.89, avg=0.88, std=0.00, steps=7.783e+07
2023-07-07 14:30:37,700 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9600, best=0.89, avg=0.88, std=0.00, steps=7.865e+07
2023-07-07 14:30:45,260 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9700, best=0.89, avg=0.88, std=0.00, steps=7.947e+07
2023-07-07 14:30:52,817 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9800, best=0.89, avg=0.88, std=0.00, steps=8.029e+07
2023-07-07 14:31:00,376 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9900, best=0.89, avg=0.88, std=0.00, steps=8.111e+07
2023-07-07 14:31:07,951 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10000, best=0.88, avg=0.88, std=0.00, steps=8.193e+07
2023-07-07 14:31:15,509 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10100, best=0.89, avg=0.88, std=0.00, steps=8.275e+07
2023-07-07 14:31:23,065 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10200, best=0.89, avg=0.88, std=0.00, steps=8.357e+07
2023-07-07 14:31:30,633 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10300, best=0.89, avg=0.88, std=0.00, steps=8.439e+07
2023-07-07 14:31:38,190 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10400, best=0.89, avg=0.88, std=0.00, steps=8.520e+07
2023-07-07 14:31:45,737 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10500, best=0.89, avg=0.88, std=0.00, steps=8.602e+07
2023-07-07 14:31:53,274 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10600, best=0.89, avg=0.88, std=0.00, steps=8.684e+07
2023-07-07 14:32:00,810 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10700, best=0.89, avg=0.88, std=0.00, steps=8.766e+07
2023-07-07 14:32:08,366 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10800, best=0.89, avg=0.88, std=0.00, steps=8.848e+07
2023-07-07 14:32:15,917 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10900, best=0.89, avg=0.88, std=0.00, steps=8.930e+07
2023-07-07 14:32:23,478 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11000, best=0.89, avg=0.88, std=0.00, steps=9.012e+07
2023-07-07 14:32:31,035 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11100, best=0.89, avg=0.88, std=0.00, steps=9.094e+07
2023-07-07 14:32:38,588 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11200, best=0.90, avg=0.88, std=0.00, steps=9.176e+07
2023-07-07 14:32:46,128 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11300, best=0.89, avg=0.88, std=0.00, steps=9.258e+07
2023-07-07 14:32:53,672 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11400, best=0.89, avg=0.88, std=0.00, steps=9.340e+07
2023-07-07 14:33:01,216 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11500, best=0.89, avg=0.88, std=0.00, steps=9.422e+07
2023-07-07 14:33:08,753 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11600, best=0.89, avg=0.88, std=0.00, steps=9.504e+07
2023-07-07 14:33:16,292 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11700, best=0.89, avg=0.88, std=0.00, steps=9.585e+07
2023-07-07 14:33:23,810 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11800, best=0.90, avg=0.88, std=0.00, steps=9.667e+07
2023-07-07 14:33:31,345 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11900, best=0.89, avg=0.88, std=0.00, steps=9.749e+07
2023-07-07 14:33:38,796 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11999, best=0.90, avg=0.88, std=0.00, steps=9.830e+07
2023-07-07 14:33:38,797 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135859
2023-07-07 14:33:38,823 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 14:33:38,863 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 14:33:52,234 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 100, best=0.52, avg=0.50, std=0.01, steps=1.241e+06
2023-07-07 14:34:03,396 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 200, best=0.65, avg=0.64, std=0.01, steps=2.470e+06
2023-07-07 14:34:14,589 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 300, best=0.67, avg=0.65, std=0.00, steps=3.699e+06
2023-07-07 14:34:25,748 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 400, best=0.69, avg=0.67, std=0.01, steps=4.927e+06
2023-07-07 14:34:36,898 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 500, best=0.70, avg=0.69, std=0.01, steps=6.156e+06
2023-07-07 14:34:48,053 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 600, best=0.72, avg=0.71, std=0.01, steps=7.385e+06
2023-07-07 14:34:59,220 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 700, best=0.72, avg=0.71, std=0.01, steps=8.614e+06
2023-07-07 14:35:10,417 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 800, best=0.74, avg=0.72, std=0.01, steps=9.843e+06
2023-07-07 14:35:21,577 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 900, best=0.74, avg=0.72, std=0.01, steps=1.107e+07
2023-07-07 14:35:32,722 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1000, best=0.75, avg=0.73, std=0.01, steps=1.230e+07
2023-07-07 14:35:43,882 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1100, best=0.76, avg=0.74, std=0.01, steps=1.353e+07
2023-07-07 14:35:55,045 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1200, best=0.76, avg=0.74, std=0.01, steps=1.476e+07
2023-07-07 14:36:06,179 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1300, best=0.76, avg=0.75, std=0.01, steps=1.599e+07
2023-07-07 14:36:17,304 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1400, best=0.77, avg=0.75, std=0.01, steps=1.722e+07
2023-07-07 14:36:28,446 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1500, best=0.76, avg=0.75, std=0.01, steps=1.844e+07
2023-07-07 14:36:39,593 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1600, best=0.77, avg=0.75, std=0.01, steps=1.967e+07
2023-07-07 14:36:50,759 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1700, best=0.77, avg=0.76, std=0.01, steps=2.090e+07
2023-07-07 14:37:01,907 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1800, best=0.77, avg=0.75, std=0.01, steps=2.213e+07
2023-07-07 14:37:13,087 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1900, best=0.77, avg=0.75, std=0.01, steps=2.336e+07
2023-07-07 14:37:24,249 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2000, best=0.78, avg=0.76, std=0.01, steps=2.459e+07
2023-07-07 14:37:35,419 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2100, best=0.78, avg=0.76, std=0.01, steps=2.582e+07
2023-07-07 14:37:46,592 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2200, best=0.78, avg=0.76, std=0.01, steps=2.705e+07
2023-07-07 14:37:57,780 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2300, best=0.78, avg=0.76, std=0.01, steps=2.827e+07
2023-07-07 14:38:08,943 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2400, best=0.78, avg=0.77, std=0.00, steps=2.950e+07
2023-07-07 14:38:20,103 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2500, best=0.78, avg=0.77, std=0.01, steps=3.073e+07
2023-07-07 14:38:31,258 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2600, best=0.78, avg=0.77, std=0.01, steps=3.196e+07
2023-07-07 14:38:42,416 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2700, best=0.79, avg=0.77, std=0.01, steps=3.319e+07
2023-07-07 14:38:53,589 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2800, best=0.79, avg=0.77, std=0.00, steps=3.442e+07
2023-07-07 14:39:04,763 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2900, best=0.79, avg=0.78, std=0.01, steps=3.565e+07
2023-07-07 14:39:15,953 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3000, best=0.79, avg=0.78, std=0.01, steps=3.688e+07
2023-07-07 14:39:27,118 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3100, best=0.79, avg=0.77, std=0.01, steps=3.811e+07
2023-07-07 14:39:38,269 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3200, best=0.79, avg=0.78, std=0.00, steps=3.933e+07
2023-07-07 14:39:49,406 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3300, best=0.79, avg=0.78, std=0.01, steps=4.056e+07
2023-07-07 14:40:00,550 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3400, best=0.79, avg=0.78, std=0.00, steps=4.179e+07
2023-07-07 14:40:11,686 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3500, best=0.79, avg=0.78, std=0.00, steps=4.302e+07
2023-07-07 14:40:22,820 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3600, best=0.80, avg=0.78, std=0.01, steps=4.425e+07
2023-07-07 14:40:33,950 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3700, best=0.79, avg=0.78, std=0.00, steps=4.548e+07
2023-07-07 14:40:45,119 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3800, best=0.80, avg=0.78, std=0.00, steps=4.671e+07
2023-07-07 14:40:56,290 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3900, best=0.79, avg=0.78, std=0.00, steps=4.794e+07
2023-07-07 14:41:07,433 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4000, best=0.79, avg=0.78, std=0.00, steps=4.916e+07
2023-07-07 14:41:18,585 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4100, best=0.80, avg=0.78, std=0.00, steps=5.039e+07
2023-07-07 14:41:29,732 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4200, best=0.79, avg=0.78, std=0.01, steps=5.162e+07
2023-07-07 14:41:40,866 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4300, best=0.80, avg=0.78, std=0.00, steps=5.285e+07
2023-07-07 14:41:52,006 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4400, best=0.79, avg=0.79, std=0.00, steps=5.408e+07
2023-07-07 14:42:03,144 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4500, best=0.80, avg=0.78, std=0.00, steps=5.531e+07
2023-07-07 14:42:14,301 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4600, best=0.80, avg=0.79, std=0.00, steps=5.654e+07
2023-07-07 14:42:25,449 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4700, best=0.80, avg=0.78, std=0.00, steps=5.777e+07
2023-07-07 14:42:36,632 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4800, best=0.80, avg=0.79, std=0.00, steps=5.899e+07
2023-07-07 14:42:47,814 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4900, best=0.80, avg=0.79, std=0.00, steps=6.022e+07
2023-07-07 14:42:58,982 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5000, best=0.80, avg=0.79, std=0.00, steps=6.145e+07
2023-07-07 14:43:10,157 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5100, best=0.80, avg=0.79, std=0.00, steps=6.268e+07
2023-07-07 14:43:21,315 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5200, best=0.80, avg=0.79, std=0.00, steps=6.391e+07
2023-07-07 14:43:32,471 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5300, best=0.80, avg=0.79, std=0.00, steps=6.514e+07
2023-07-07 14:43:43,683 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5400, best=0.80, avg=0.79, std=0.00, steps=6.637e+07
2023-07-07 14:43:54,837 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5500, best=0.80, avg=0.79, std=0.00, steps=6.760e+07
2023-07-07 14:44:06,006 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5600, best=0.80, avg=0.79, std=0.00, steps=6.883e+07
2023-07-07 14:44:17,172 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5700, best=0.80, avg=0.79, std=0.00, steps=7.005e+07
2023-07-07 14:44:28,339 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5800, best=0.80, avg=0.79, std=0.00, steps=7.128e+07
2023-07-07 14:44:39,511 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5900, best=0.80, avg=0.79, std=0.00, steps=7.251e+07
2023-07-07 14:44:50,691 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6000, best=0.80, avg=0.79, std=0.00, steps=7.374e+07
2023-07-07 14:45:01,844 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6100, best=0.80, avg=0.79, std=0.00, steps=7.497e+07
2023-07-07 14:45:13,028 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6200, best=0.80, avg=0.79, std=0.00, steps=7.620e+07
2023-07-07 14:45:24,199 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6300, best=0.80, avg=0.79, std=0.00, steps=7.743e+07
2023-07-07 14:45:35,362 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6400, best=0.80, avg=0.79, std=0.00, steps=7.866e+07
2023-07-07 14:45:46,547 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6500, best=0.80, avg=0.79, std=0.00, steps=7.988e+07
2023-07-07 14:45:57,692 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6600, best=0.80, avg=0.79, std=0.00, steps=8.111e+07
2023-07-07 14:46:08,885 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6700, best=0.80, avg=0.79, std=0.00, steps=8.234e+07
2023-07-07 14:46:20,044 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6800, best=0.80, avg=0.79, std=0.00, steps=8.357e+07
2023-07-07 14:46:31,206 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6900, best=0.80, avg=0.79, std=0.00, steps=8.480e+07
2023-07-07 14:46:42,369 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7000, best=0.80, avg=0.79, std=0.00, steps=8.603e+07
2023-07-07 14:46:53,538 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7100, best=0.80, avg=0.79, std=0.00, steps=8.726e+07
2023-07-07 14:47:04,720 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7200, best=0.80, avg=0.79, std=0.00, steps=8.849e+07
2023-07-07 14:47:15,892 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7300, best=0.80, avg=0.79, std=0.00, steps=8.971e+07
2023-07-07 14:47:27,052 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7400, best=0.80, avg=0.79, std=0.00, steps=9.094e+07
2023-07-07 14:47:38,199 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7500, best=0.80, avg=0.79, std=0.00, steps=9.217e+07
2023-07-07 14:47:49,363 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7600, best=0.80, avg=0.79, std=0.00, steps=9.340e+07
2023-07-07 14:48:00,526 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7700, best=0.80, avg=0.79, std=0.00, steps=9.463e+07
2023-07-07 14:48:11,675 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7800, best=0.80, avg=0.79, std=0.00, steps=9.586e+07
2023-07-07 14:48:22,828 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7900, best=0.80, avg=0.79, std=0.00, steps=9.709e+07
2023-07-07 14:48:33,983 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8000, best=0.80, avg=0.79, std=0.00, steps=9.832e+07
2023-07-07 14:48:45,138 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8100, best=0.80, avg=0.79, std=0.00, steps=9.955e+07
2023-07-07 14:48:56,288 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8200, best=0.80, avg=0.79, std=0.00, steps=1.008e+08
2023-07-07 14:49:07,439 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8300, best=0.80, avg=0.79, std=0.00, steps=1.020e+08
2023-07-07 14:49:18,595 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8400, best=0.80, avg=0.79, std=0.00, steps=1.032e+08
2023-07-07 14:49:29,743 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8500, best=0.80, avg=0.79, std=0.00, steps=1.045e+08
2023-07-07 14:49:40,904 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8600, best=0.80, avg=0.79, std=0.00, steps=1.057e+08
2023-07-07 14:49:52,045 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8700, best=0.80, avg=0.79, std=0.00, steps=1.069e+08
2023-07-07 14:50:03,213 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8800, best=0.80, avg=0.79, std=0.00, steps=1.081e+08
2023-07-07 14:50:14,358 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8900, best=0.80, avg=0.79, std=0.00, steps=1.094e+08
2023-07-07 14:50:25,531 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9000, best=0.80, avg=0.79, std=0.00, steps=1.106e+08
2023-07-07 14:50:36,716 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9100, best=0.80, avg=0.79, std=0.00, steps=1.118e+08
2023-07-07 14:50:47,899 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9200, best=0.80, avg=0.79, std=0.00, steps=1.131e+08
2023-07-07 14:50:59,090 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9300, best=0.80, avg=0.79, std=0.00, steps=1.143e+08
2023-07-07 14:51:10,253 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9400, best=0.80, avg=0.79, std=0.00, steps=1.155e+08
2023-07-07 14:51:21,406 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9500, best=0.80, avg=0.79, std=0.00, steps=1.167e+08
2023-07-07 14:51:32,578 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9600, best=0.80, avg=0.79, std=0.00, steps=1.180e+08
2023-07-07 14:51:43,753 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9700, best=0.80, avg=0.79, std=0.00, steps=1.192e+08
2023-07-07 14:51:54,917 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9800, best=0.80, avg=0.79, std=0.00, steps=1.204e+08
2023-07-07 14:52:06,072 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9900, best=0.80, avg=0.79, std=0.00, steps=1.217e+08
2023-07-07 14:52:17,242 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10000, best=0.80, avg=0.79, std=0.00, steps=1.229e+08
2023-07-07 14:52:28,444 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10100, best=0.80, avg=0.79, std=0.00, steps=1.241e+08
2023-07-07 14:52:39,611 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10200, best=0.80, avg=0.79, std=0.00, steps=1.253e+08
2023-07-07 14:52:50,787 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10300, best=0.80, avg=0.79, std=0.00, steps=1.266e+08
2023-07-07 14:53:01,945 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10400, best=0.80, avg=0.79, std=0.00, steps=1.278e+08
2023-07-07 14:53:13,116 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10500, best=0.80, avg=0.79, std=0.00, steps=1.290e+08
2023-07-07 14:53:24,299 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10600, best=0.80, avg=0.79, std=0.00, steps=1.303e+08
2023-07-07 14:53:35,485 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10700, best=0.80, avg=0.79, std=0.00, steps=1.315e+08
2023-07-07 14:53:46,655 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10800, best=0.80, avg=0.79, std=0.00, steps=1.327e+08
2023-07-07 14:53:57,805 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10900, best=0.80, avg=0.79, std=0.00, steps=1.340e+08
2023-07-07 14:54:08,997 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11000, best=0.80, avg=0.79, std=0.00, steps=1.352e+08
2023-07-07 14:54:20,171 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11100, best=0.80, avg=0.79, std=0.00, steps=1.364e+08
2023-07-07 14:54:31,316 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11200, best=0.80, avg=0.79, std=0.00, steps=1.376e+08
2023-07-07 14:54:42,457 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11300, best=0.80, avg=0.79, std=0.00, steps=1.389e+08
2023-07-07 14:54:53,639 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11400, best=0.80, avg=0.79, std=0.00, steps=1.401e+08
2023-07-07 14:55:04,799 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11500, best=0.80, avg=0.79, std=0.00, steps=1.413e+08
2023-07-07 14:55:15,966 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11600, best=0.80, avg=0.79, std=0.00, steps=1.426e+08
2023-07-07 14:55:27,121 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11700, best=0.80, avg=0.79, std=0.00, steps=1.438e+08
2023-07-07 14:55:38,270 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11800, best=0.80, avg=0.79, std=0.00, steps=1.450e+08
2023-07-07 14:55:49,426 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11900, best=0.80, avg=0.79, std=0.00, steps=1.462e+08
2023-07-07 14:56:00,478 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11999, best=0.80, avg=0.79, std=0.00, steps=1.475e+08
2023-07-07 14:56:00,479 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135859
2023-07-07 14:56:00,505 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 14:56:00,539 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 14:56:10,295 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 100, best=0.65, avg=0.64, std=0.00, steps=8.274e+05
2023-07-07 14:56:17,836 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 200, best=0.71, avg=0.70, std=0.00, steps=1.647e+06
2023-07-07 14:56:25,366 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 300, best=0.74, avg=0.73, std=0.00, steps=2.466e+06
2023-07-07 14:56:32,908 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 400, best=0.76, avg=0.75, std=0.00, steps=3.285e+06
2023-07-07 14:56:40,458 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 500, best=0.78, avg=0.77, std=0.00, steps=4.104e+06
2023-07-07 14:56:48,011 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 600, best=0.80, avg=0.78, std=0.00, steps=4.923e+06
2023-07-07 14:56:55,564 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 700, best=0.81, avg=0.80, std=0.00, steps=5.743e+06
2023-07-07 14:57:03,128 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 800, best=0.82, avg=0.81, std=0.00, steps=6.562e+06
2023-07-07 14:57:10,688 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 900, best=0.83, avg=0.82, std=0.00, steps=7.381e+06
2023-07-07 14:57:18,244 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1000, best=0.84, avg=0.83, std=0.00, steps=8.200e+06
2023-07-07 14:57:25,792 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1100, best=0.84, avg=0.83, std=0.00, steps=9.019e+06
2023-07-07 14:57:33,321 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1200, best=0.84, avg=0.83, std=0.00, steps=9.839e+06
2023-07-07 14:57:40,861 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1300, best=0.84, avg=0.83, std=0.00, steps=1.066e+07
2023-07-07 14:57:48,414 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1400, best=0.85, avg=0.84, std=0.00, steps=1.148e+07
2023-07-07 14:57:55,960 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1500, best=0.85, avg=0.84, std=0.00, steps=1.230e+07
2023-07-07 14:58:03,513 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1600, best=0.85, avg=0.84, std=0.00, steps=1.312e+07
2023-07-07 14:58:11,057 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1700, best=0.86, avg=0.84, std=0.00, steps=1.393e+07
2023-07-07 14:58:18,601 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1800, best=0.85, avg=0.85, std=0.00, steps=1.475e+07
2023-07-07 14:58:26,119 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1900, best=0.86, avg=0.85, std=0.00, steps=1.557e+07
2023-07-07 14:58:33,637 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2000, best=0.86, avg=0.85, std=0.00, steps=1.639e+07
2023-07-07 14:58:41,185 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2100, best=0.87, avg=0.86, std=0.00, steps=1.721e+07
2023-07-07 14:58:48,712 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2200, best=0.87, avg=0.86, std=0.00, steps=1.803e+07
2023-07-07 14:58:56,238 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2300, best=0.87, avg=0.86, std=0.00, steps=1.885e+07
2023-07-07 14:59:03,773 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2400, best=0.87, avg=0.86, std=0.00, steps=1.967e+07
2023-07-07 14:59:11,311 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2500, best=0.87, avg=0.87, std=0.00, steps=2.049e+07
2023-07-07 14:59:18,847 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2600, best=0.88, avg=0.87, std=0.00, steps=2.131e+07
2023-07-07 14:59:26,376 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2700, best=0.88, avg=0.88, std=0.00, steps=2.213e+07
2023-07-07 14:59:33,919 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2800, best=0.89, avg=0.88, std=0.00, steps=2.295e+07
2023-07-07 14:59:41,443 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2900, best=0.89, avg=0.88, std=0.00, steps=2.376e+07
2023-07-07 14:59:48,975 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3000, best=0.89, avg=0.88, std=0.00, steps=2.458e+07
2023-07-07 14:59:56,538 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3100, best=0.90, avg=0.89, std=0.00, steps=2.540e+07
2023-07-07 15:00:04,081 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3200, best=0.90, avg=0.89, std=0.00, steps=2.622e+07
2023-07-07 15:00:11,604 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3300, best=0.90, avg=0.89, std=0.00, steps=2.704e+07
2023-07-07 15:00:19,141 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3400, best=0.90, avg=0.89, std=0.00, steps=2.786e+07
2023-07-07 15:00:26,691 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3500, best=0.90, avg=0.89, std=0.00, steps=2.868e+07
2023-07-07 15:00:34,232 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3600, best=0.90, avg=0.89, std=0.00, steps=2.950e+07
2023-07-07 15:00:41,768 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3700, best=0.90, avg=0.89, std=0.00, steps=3.032e+07
2023-07-07 15:00:49,322 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3800, best=0.90, avg=0.89, std=0.00, steps=3.114e+07
2023-07-07 15:00:56,862 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3900, best=0.90, avg=0.89, std=0.00, steps=3.196e+07
2023-07-07 15:01:04,425 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4000, best=0.90, avg=0.89, std=0.00, steps=3.278e+07
2023-07-07 15:01:11,960 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4100, best=0.90, avg=0.90, std=0.00, steps=3.360e+07
2023-07-07 15:01:19,505 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4200, best=0.90, avg=0.90, std=0.00, steps=3.441e+07
2023-07-07 15:01:27,039 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4300, best=0.90, avg=0.90, std=0.00, steps=3.523e+07
2023-07-07 15:01:34,560 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4400, best=0.90, avg=0.90, std=0.00, steps=3.605e+07
2023-07-07 15:01:42,086 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4500, best=0.90, avg=0.90, std=0.00, steps=3.687e+07
2023-07-07 15:01:49,628 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4600, best=0.91, avg=0.90, std=0.00, steps=3.769e+07
2023-07-07 15:01:57,161 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4700, best=0.90, avg=0.90, std=0.00, steps=3.851e+07
2023-07-07 15:02:04,712 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4800, best=0.91, avg=0.90, std=0.00, steps=3.933e+07
2023-07-07 15:02:12,273 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4900, best=0.91, avg=0.90, std=0.00, steps=4.015e+07
2023-07-07 15:02:19,828 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5000, best=0.90, avg=0.90, std=0.00, steps=4.097e+07
2023-07-07 15:02:27,380 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5100, best=0.90, avg=0.90, std=0.00, steps=4.179e+07
2023-07-07 15:02:34,926 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5200, best=0.90, avg=0.90, std=0.00, steps=4.261e+07
2023-07-07 15:02:42,495 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5300, best=0.90, avg=0.90, std=0.00, steps=4.343e+07
2023-07-07 15:02:50,032 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5400, best=0.91, avg=0.90, std=0.00, steps=4.424e+07
2023-07-07 15:02:57,558 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5500, best=0.91, avg=0.90, std=0.00, steps=4.506e+07
2023-07-07 15:03:05,109 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5600, best=0.91, avg=0.90, std=0.00, steps=4.588e+07
2023-07-07 15:03:12,635 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5700, best=0.90, avg=0.90, std=0.00, steps=4.670e+07
2023-07-07 15:03:20,172 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5800, best=0.91, avg=0.90, std=0.00, steps=4.752e+07
2023-07-07 15:03:27,705 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5900, best=0.91, avg=0.90, std=0.00, steps=4.834e+07
2023-07-07 15:03:35,262 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6000, best=0.91, avg=0.90, std=0.00, steps=4.916e+07
2023-07-07 15:03:42,802 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6100, best=0.91, avg=0.90, std=0.00, steps=4.998e+07
2023-07-07 15:03:50,335 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6200, best=0.91, avg=0.90, std=0.00, steps=5.080e+07
2023-07-07 15:03:57,889 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6300, best=0.91, avg=0.90, std=0.00, steps=5.162e+07
2023-07-07 15:04:05,437 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6400, best=0.91, avg=0.90, std=0.00, steps=5.244e+07
2023-07-07 15:04:12,984 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6500, best=0.91, avg=0.90, std=0.00, steps=5.326e+07
2023-07-07 15:04:20,546 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6600, best=0.90, avg=0.90, std=0.00, steps=5.408e+07
2023-07-07 15:04:28,086 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6700, best=0.91, avg=0.90, std=0.00, steps=5.489e+07
2023-07-07 15:04:35,635 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6800, best=0.91, avg=0.90, std=0.00, steps=5.571e+07
2023-07-07 15:04:43,174 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6900, best=0.91, avg=0.90, std=0.00, steps=5.653e+07
2023-07-07 15:04:50,709 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7000, best=0.91, avg=0.90, std=0.00, steps=5.735e+07
2023-07-07 15:04:58,271 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7100, best=0.91, avg=0.90, std=0.00, steps=5.817e+07
2023-07-07 15:05:05,841 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7200, best=0.91, avg=0.90, std=0.00, steps=5.899e+07
2023-07-07 15:05:13,381 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7300, best=0.91, avg=0.90, std=0.00, steps=5.981e+07
2023-07-07 15:05:20,940 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7400, best=0.91, avg=0.90, std=0.00, steps=6.063e+07
2023-07-07 15:05:28,510 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7500, best=0.91, avg=0.90, std=0.00, steps=6.145e+07
2023-07-07 15:05:36,076 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7600, best=0.91, avg=0.90, std=0.00, steps=6.227e+07
2023-07-07 15:05:43,597 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7700, best=0.91, avg=0.90, std=0.00, steps=6.309e+07
2023-07-07 15:05:51,129 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7800, best=0.91, avg=0.90, std=0.00, steps=6.391e+07
2023-07-07 15:05:58,685 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7900, best=0.91, avg=0.90, std=0.00, steps=6.472e+07
2023-07-07 15:06:06,225 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8000, best=0.91, avg=0.90, std=0.00, steps=6.554e+07
2023-07-07 15:06:13,766 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8100, best=0.91, avg=0.90, std=0.00, steps=6.636e+07
2023-07-07 15:06:21,328 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8200, best=0.91, avg=0.90, std=0.00, steps=6.718e+07
2023-07-07 15:06:28,854 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8300, best=0.91, avg=0.91, std=0.00, steps=6.800e+07
2023-07-07 15:06:36,415 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8400, best=0.91, avg=0.91, std=0.00, steps=6.882e+07
2023-07-07 15:06:43,990 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8500, best=0.92, avg=0.91, std=0.00, steps=6.964e+07
2023-07-07 15:06:51,537 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8600, best=0.92, avg=0.91, std=0.00, steps=7.046e+07
2023-07-07 15:06:59,066 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8700, best=0.92, avg=0.91, std=0.00, steps=7.128e+07
2023-07-07 15:07:06,620 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8800, best=0.92, avg=0.91, std=0.00, steps=7.210e+07
2023-07-07 15:07:14,167 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8900, best=0.92, avg=0.91, std=0.00, steps=7.292e+07
2023-07-07 15:07:21,700 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9000, best=0.92, avg=0.91, std=0.00, steps=7.374e+07
2023-07-07 15:07:29,253 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9100, best=0.92, avg=0.91, std=0.00, steps=7.456e+07
2023-07-07 15:07:36,804 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9200, best=0.92, avg=0.91, std=0.00, steps=7.537e+07
2023-07-07 15:07:44,340 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9300, best=0.92, avg=0.91, std=0.00, steps=7.619e+07
2023-07-07 15:07:51,869 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9400, best=0.92, avg=0.91, std=0.00, steps=7.701e+07
2023-07-07 15:07:59,402 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9500, best=0.92, avg=0.91, std=0.00, steps=7.783e+07
2023-07-07 15:08:06,937 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9600, best=0.92, avg=0.91, std=0.00, steps=7.865e+07
2023-07-07 15:08:14,475 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9700, best=0.92, avg=0.91, std=0.00, steps=7.947e+07
2023-07-07 15:08:22,016 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9800, best=0.92, avg=0.91, std=0.00, steps=8.029e+07
2023-07-07 15:08:29,568 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9900, best=0.92, avg=0.91, std=0.00, steps=8.111e+07
2023-07-07 15:08:37,134 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10000, best=0.92, avg=0.92, std=0.00, steps=8.193e+07
2023-07-07 15:08:44,685 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10100, best=0.92, avg=0.92, std=0.00, steps=8.275e+07
2023-07-07 15:08:52,218 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10200, best=0.92, avg=0.92, std=0.00, steps=8.357e+07
2023-07-07 15:08:59,753 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10300, best=0.93, avg=0.92, std=0.00, steps=8.439e+07
2023-07-07 15:09:07,290 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10400, best=0.93, avg=0.92, std=0.00, steps=8.520e+07
2023-07-07 15:09:14,824 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10500, best=0.93, avg=0.92, std=0.00, steps=8.602e+07
2023-07-07 15:09:22,359 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10600, best=0.93, avg=0.92, std=0.00, steps=8.684e+07
2023-07-07 15:09:29,908 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10700, best=0.93, avg=0.93, std=0.00, steps=8.766e+07
2023-07-07 15:09:37,439 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10800, best=0.94, avg=0.93, std=0.00, steps=8.848e+07
2023-07-07 15:09:44,978 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10900, best=0.94, avg=0.93, std=0.00, steps=8.930e+07
2023-07-07 15:09:52,499 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11000, best=0.94, avg=0.93, std=0.00, steps=9.012e+07
2023-07-07 15:10:00,030 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11100, best=0.94, avg=0.93, std=0.00, steps=9.094e+07
2023-07-07 15:10:07,565 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11200, best=0.94, avg=0.93, std=0.00, steps=9.176e+07
2023-07-07 15:10:15,094 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11300, best=0.94, avg=0.94, std=0.00, steps=9.258e+07
2023-07-07 15:10:22,640 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11400, best=0.94, avg=0.94, std=0.00, steps=9.340e+07
2023-07-07 15:10:30,166 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11500, best=0.94, avg=0.94, std=0.00, steps=9.422e+07
2023-07-07 15:10:37,703 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11600, best=0.94, avg=0.94, std=0.00, steps=9.504e+07
2023-07-07 15:10:45,247 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11700, best=0.94, avg=0.94, std=0.00, steps=9.585e+07
2023-07-07 15:10:52,811 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11800, best=0.94, avg=0.94, std=0.00, steps=9.667e+07
2023-07-07 15:11:00,349 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11900, best=0.94, avg=0.94, std=0.00, steps=9.749e+07
2023-07-07 15:11:07,853 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11999, best=0.95, avg=0.94, std=0.00, steps=9.830e+07
2023-07-07 15:11:07,854 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135859
2023-07-07 15:11:07,879 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 15:11:07,912 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 15:11:19,648 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 100, best=0.60, avg=0.59, std=0.00, steps=1.034e+06
2023-07-07 15:11:29,011 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 200, best=0.66, avg=0.65, std=0.00, steps=2.058e+06
2023-07-07 15:11:38,354 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 300, best=0.69, avg=0.68, std=0.00, steps=3.082e+06
2023-07-07 15:11:47,728 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 400, best=0.71, avg=0.70, std=0.00, steps=4.106e+06
2023-07-07 15:11:57,060 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 500, best=0.73, avg=0.72, std=0.00, steps=5.130e+06
2023-07-07 15:12:06,401 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 600, best=0.75, avg=0.73, std=0.00, steps=6.154e+06
2023-07-07 15:12:15,743 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 700, best=0.75, avg=0.74, std=0.00, steps=7.178e+06
2023-07-07 15:12:25,108 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 800, best=0.77, avg=0.75, std=0.00, steps=8.202e+06
2023-07-07 15:12:34,475 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 900, best=0.77, avg=0.76, std=0.00, steps=9.226e+06
2023-07-07 15:12:43,877 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1000, best=0.78, avg=0.77, std=0.00, steps=1.025e+07
2023-07-07 15:12:53,225 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1100, best=0.78, avg=0.77, std=0.00, steps=1.127e+07
2023-07-07 15:13:02,581 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1200, best=0.79, avg=0.78, std=0.00, steps=1.230e+07
2023-07-07 15:13:11,942 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1300, best=0.80, avg=0.78, std=0.00, steps=1.332e+07
2023-07-07 15:13:21,308 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1400, best=0.80, avg=0.79, std=0.00, steps=1.435e+07
2023-07-07 15:13:30,685 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1500, best=0.80, avg=0.79, std=0.00, steps=1.537e+07
2023-07-07 15:13:40,063 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1600, best=0.81, avg=0.79, std=0.00, steps=1.639e+07
2023-07-07 15:13:49,443 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1700, best=0.81, avg=0.80, std=0.00, steps=1.742e+07
2023-07-07 15:13:58,807 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1800, best=0.81, avg=0.80, std=0.00, steps=1.844e+07
2023-07-07 15:14:08,169 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1900, best=0.81, avg=0.80, std=0.00, steps=1.947e+07
2023-07-07 15:14:17,542 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2000, best=0.82, avg=0.81, std=0.00, steps=2.049e+07
2023-07-07 15:14:26,906 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2100, best=0.82, avg=0.81, std=0.00, steps=2.151e+07
2023-07-07 15:14:36,275 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2200, best=0.82, avg=0.81, std=0.00, steps=2.254e+07
2023-07-07 15:14:45,630 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2300, best=0.82, avg=0.81, std=0.00, steps=2.356e+07
2023-07-07 15:14:54,986 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2400, best=0.82, avg=0.81, std=0.00, steps=2.459e+07
2023-07-07 15:15:04,338 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2500, best=0.83, avg=0.82, std=0.00, steps=2.561e+07
2023-07-07 15:15:13,691 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2600, best=0.83, avg=0.82, std=0.00, steps=2.663e+07
2023-07-07 15:15:23,046 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2700, best=0.83, avg=0.82, std=0.00, steps=2.766e+07
2023-07-07 15:15:32,434 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2800, best=0.83, avg=0.82, std=0.00, steps=2.868e+07
2023-07-07 15:15:41,793 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2900, best=0.83, avg=0.82, std=0.00, steps=2.971e+07
2023-07-07 15:15:51,174 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3000, best=0.83, avg=0.82, std=0.00, steps=3.073e+07
2023-07-07 15:16:00,554 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3100, best=0.84, avg=0.82, std=0.00, steps=3.175e+07
2023-07-07 15:16:09,939 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3200, best=0.83, avg=0.83, std=0.00, steps=3.278e+07
2023-07-07 15:16:19,271 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3300, best=0.84, avg=0.83, std=0.00, steps=3.380e+07
2023-07-07 15:16:28,611 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3400, best=0.84, avg=0.83, std=0.00, steps=3.483e+07
2023-07-07 15:16:37,970 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3500, best=0.84, avg=0.83, std=0.00, steps=3.585e+07
2023-07-07 15:16:47,325 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3600, best=0.84, avg=0.83, std=0.00, steps=3.687e+07
2023-07-07 15:16:56,682 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3700, best=0.84, avg=0.83, std=0.00, steps=3.790e+07
2023-07-07 15:17:06,033 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3800, best=0.84, avg=0.83, std=0.00, steps=3.892e+07
2023-07-07 15:17:15,376 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3900, best=0.84, avg=0.83, std=0.00, steps=3.995e+07
2023-07-07 15:17:24,727 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4000, best=0.84, avg=0.83, std=0.00, steps=4.097e+07
2023-07-07 15:17:34,081 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4100, best=0.84, avg=0.83, std=0.00, steps=4.199e+07
2023-07-07 15:17:43,439 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4200, best=0.84, avg=0.84, std=0.00, steps=4.302e+07
2023-07-07 15:17:52,809 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4300, best=0.84, avg=0.84, std=0.00, steps=4.404e+07
2023-07-07 15:18:02,144 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4400, best=0.85, avg=0.84, std=0.00, steps=4.507e+07
2023-07-07 15:18:11,495 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4500, best=0.84, avg=0.84, std=0.00, steps=4.609e+07
2023-07-07 15:18:20,848 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4600, best=0.84, avg=0.84, std=0.00, steps=4.711e+07
2023-07-07 15:18:30,209 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4700, best=0.85, avg=0.84, std=0.00, steps=4.814e+07
2023-07-07 15:18:39,566 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4800, best=0.85, avg=0.84, std=0.00, steps=4.916e+07
2023-07-07 15:18:48,916 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4900, best=0.85, avg=0.84, std=0.00, steps=5.019e+07
2023-07-07 15:18:58,282 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5000, best=0.85, avg=0.84, std=0.00, steps=5.121e+07
2023-07-07 15:19:07,655 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5100, best=0.85, avg=0.84, std=0.00, steps=5.223e+07
2023-07-07 15:19:17,012 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5200, best=0.85, avg=0.84, std=0.00, steps=5.326e+07
2023-07-07 15:19:26,374 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5300, best=0.85, avg=0.84, std=0.00, steps=5.428e+07
2023-07-07 15:19:35,723 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5400, best=0.85, avg=0.84, std=0.00, steps=5.531e+07
2023-07-07 15:19:45,083 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5500, best=0.85, avg=0.84, std=0.00, steps=5.633e+07
2023-07-07 15:19:54,414 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5600, best=0.85, avg=0.84, std=0.00, steps=5.735e+07
2023-07-07 15:20:03,800 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5700, best=0.86, avg=0.84, std=0.00, steps=5.838e+07
2023-07-07 15:20:13,186 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5800, best=0.85, avg=0.85, std=0.00, steps=5.940e+07
2023-07-07 15:20:22,570 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5900, best=0.86, avg=0.85, std=0.00, steps=6.043e+07
2023-07-07 15:20:31,911 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6000, best=0.86, avg=0.85, std=0.00, steps=6.145e+07
2023-07-07 15:20:41,267 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6100, best=0.85, avg=0.85, std=0.00, steps=6.247e+07
2023-07-07 15:20:50,607 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6200, best=0.86, avg=0.85, std=0.00, steps=6.350e+07
2023-07-07 15:20:59,977 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6300, best=0.86, avg=0.85, std=0.00, steps=6.452e+07
2023-07-07 15:21:09,375 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6400, best=0.86, avg=0.85, std=0.00, steps=6.555e+07
2023-07-07 15:21:18,724 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6500, best=0.86, avg=0.85, std=0.00, steps=6.657e+07
2023-07-07 15:21:28,072 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6600, best=0.86, avg=0.85, std=0.00, steps=6.759e+07
2023-07-07 15:21:37,424 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6700, best=0.86, avg=0.85, std=0.00, steps=6.862e+07
2023-07-07 15:21:46,789 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6800, best=0.86, avg=0.85, std=0.00, steps=6.964e+07
2023-07-07 15:21:56,154 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6900, best=0.86, avg=0.85, std=0.00, steps=7.067e+07
2023-07-07 15:22:05,507 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7000, best=0.86, avg=0.85, std=0.00, steps=7.169e+07
2023-07-07 15:22:14,866 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7100, best=0.86, avg=0.85, std=0.00, steps=7.271e+07
2023-07-07 15:22:24,241 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7200, best=0.86, avg=0.85, std=0.00, steps=7.374e+07
2023-07-07 15:22:33,573 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7300, best=0.86, avg=0.85, std=0.00, steps=7.476e+07
2023-07-07 15:22:42,918 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7400, best=0.86, avg=0.85, std=0.00, steps=7.579e+07
2023-07-07 15:22:52,272 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7500, best=0.86, avg=0.85, std=0.00, steps=7.681e+07
2023-07-07 15:23:01,610 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7600, best=0.86, avg=0.85, std=0.00, steps=7.783e+07
2023-07-07 15:23:10,962 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7700, best=0.86, avg=0.85, std=0.00, steps=7.886e+07
2023-07-07 15:23:20,300 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7800, best=0.86, avg=0.85, std=0.00, steps=7.988e+07
2023-07-07 15:23:29,642 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7900, best=0.86, avg=0.85, std=0.00, steps=8.091e+07
2023-07-07 15:23:38,960 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8000, best=0.86, avg=0.86, std=0.00, steps=8.193e+07
2023-07-07 15:23:48,302 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8100, best=0.86, avg=0.86, std=0.00, steps=8.295e+07
2023-07-07 15:23:57,649 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8200, best=0.86, avg=0.86, std=0.00, steps=8.398e+07
2023-07-07 15:24:07,011 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8300, best=0.86, avg=0.86, std=0.00, steps=8.500e+07
2023-07-07 15:24:16,358 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8400, best=0.87, avg=0.86, std=0.00, steps=8.603e+07
2023-07-07 15:24:25,707 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8500, best=0.86, avg=0.86, std=0.00, steps=8.705e+07
2023-07-07 15:24:35,062 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8600, best=0.86, avg=0.86, std=0.00, steps=8.807e+07
2023-07-07 15:24:44,425 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8700, best=0.87, avg=0.86, std=0.00, steps=8.910e+07
2023-07-07 15:24:53,791 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8800, best=0.87, avg=0.86, std=0.00, steps=9.012e+07
2023-07-07 15:25:03,137 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8900, best=0.87, avg=0.86, std=0.00, steps=9.115e+07
2023-07-07 15:25:12,499 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9000, best=0.87, avg=0.86, std=0.00, steps=9.217e+07
2023-07-07 15:25:21,834 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9100, best=0.87, avg=0.86, std=0.00, steps=9.319e+07
2023-07-07 15:25:31,182 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9200, best=0.87, avg=0.86, std=0.00, steps=9.422e+07
2023-07-07 15:25:40,544 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9300, best=0.87, avg=0.86, std=0.00, steps=9.524e+07
2023-07-07 15:25:49,886 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9400, best=0.87, avg=0.86, std=0.00, steps=9.627e+07
2023-07-07 15:25:59,222 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9500, best=0.87, avg=0.86, std=0.00, steps=9.729e+07
2023-07-07 15:26:08,568 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9600, best=0.87, avg=0.86, std=0.00, steps=9.831e+07
2023-07-07 15:26:17,930 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9700, best=0.87, avg=0.86, std=0.00, steps=9.934e+07
2023-07-07 15:26:27,283 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9800, best=0.87, avg=0.86, std=0.00, steps=1.004e+08
2023-07-07 15:26:36,633 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9900, best=0.87, avg=0.86, std=0.00, steps=1.014e+08
2023-07-07 15:26:45,993 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10000, best=0.87, avg=0.86, std=0.00, steps=1.024e+08
2023-07-07 15:26:55,336 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10100, best=0.87, avg=0.86, std=0.00, steps=1.034e+08
2023-07-07 15:27:04,694 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10200, best=0.87, avg=0.86, std=0.00, steps=1.045e+08
2023-07-07 15:27:14,057 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10300, best=0.87, avg=0.86, std=0.00, steps=1.055e+08
2023-07-07 15:27:23,417 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10400, best=0.87, avg=0.86, std=0.00, steps=1.065e+08
2023-07-07 15:27:32,791 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10500, best=0.87, avg=0.86, std=0.00, steps=1.075e+08
2023-07-07 15:27:42,167 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10600, best=0.87, avg=0.86, std=0.00, steps=1.086e+08
2023-07-07 15:27:51,525 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10700, best=0.87, avg=0.86, std=0.00, steps=1.096e+08
2023-07-07 15:28:00,891 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10800, best=0.87, avg=0.86, std=0.00, steps=1.106e+08
2023-07-07 15:28:10,248 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10900, best=0.87, avg=0.86, std=0.00, steps=1.116e+08
2023-07-07 15:28:19,593 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11000, best=0.87, avg=0.87, std=0.00, steps=1.127e+08
2023-07-07 15:28:28,955 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11100, best=0.87, avg=0.87, std=0.00, steps=1.137e+08
2023-07-07 15:28:38,315 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11200, best=0.87, avg=0.87, std=0.00, steps=1.147e+08
2023-07-07 15:28:47,655 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11300, best=0.87, avg=0.87, std=0.00, steps=1.157e+08
2023-07-07 15:28:56,995 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11400, best=0.87, avg=0.87, std=0.00, steps=1.167e+08
2023-07-07 15:29:06,344 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11500, best=0.87, avg=0.87, std=0.00, steps=1.178e+08
2023-07-07 15:29:15,718 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11600, best=0.87, avg=0.87, std=0.00, steps=1.188e+08
2023-07-07 15:29:25,089 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11700, best=0.87, avg=0.87, std=0.00, steps=1.198e+08
2023-07-07 15:29:34,465 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11800, best=0.88, avg=0.87, std=0.00, steps=1.208e+08
2023-07-07 15:29:43,833 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11900, best=0.88, avg=0.87, std=0.00, steps=1.219e+08
2023-07-07 15:29:53,110 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11999, best=0.87, avg=0.87, std=0.00, steps=1.229e+08
2023-07-07 15:29:53,111 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135859
2023-07-07 15:29:53,136 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 15:29:53,171 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 15:30:06,596 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 100, best=0.56, avg=0.55, std=0.01, steps=1.241e+06
2023-07-07 15:30:17,785 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 200, best=0.61, avg=0.60, std=0.00, steps=2.470e+06
2023-07-07 15:30:29,002 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 300, best=0.67, avg=0.66, std=0.00, steps=3.699e+06
2023-07-07 15:30:40,178 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 400, best=0.69, avg=0.68, std=0.00, steps=4.927e+06
2023-07-07 15:30:51,363 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 500, best=0.69, avg=0.68, std=0.00, steps=6.156e+06
2023-07-07 15:31:02,536 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 600, best=0.70, avg=0.69, std=0.00, steps=7.385e+06
2023-07-07 15:31:13,737 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 700, best=0.71, avg=0.70, std=0.00, steps=8.614e+06
2023-07-07 15:31:24,911 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 800, best=0.72, avg=0.71, std=0.00, steps=9.843e+06
2023-07-07 15:31:36,081 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 900, best=0.72, avg=0.71, std=0.00, steps=1.107e+07
2023-07-07 15:31:47,248 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1000, best=0.73, avg=0.72, std=0.00, steps=1.230e+07
2023-07-07 15:31:58,408 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1100, best=0.73, avg=0.72, std=0.00, steps=1.353e+07
2023-07-07 15:32:09,586 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1200, best=0.74, avg=0.73, std=0.00, steps=1.476e+07
2023-07-07 15:32:20,739 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1300, best=0.74, avg=0.73, std=0.00, steps=1.599e+07
2023-07-07 15:32:31,909 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1400, best=0.75, avg=0.74, std=0.00, steps=1.722e+07
2023-07-07 15:32:43,087 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1500, best=0.75, avg=0.74, std=0.00, steps=1.844e+07
2023-07-07 15:32:54,243 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1600, best=0.75, avg=0.74, std=0.00, steps=1.967e+07
2023-07-07 15:33:05,417 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1700, best=0.75, avg=0.74, std=0.00, steps=2.090e+07
2023-07-07 15:33:16,586 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1800, best=0.75, avg=0.74, std=0.00, steps=2.213e+07
2023-07-07 15:33:27,775 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1900, best=0.76, avg=0.75, std=0.00, steps=2.336e+07
2023-07-07 15:33:38,949 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2000, best=0.76, avg=0.75, std=0.00, steps=2.459e+07
2023-07-07 15:33:50,113 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2100, best=0.76, avg=0.75, std=0.00, steps=2.582e+07
2023-07-07 15:34:01,263 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2200, best=0.76, avg=0.75, std=0.00, steps=2.705e+07
2023-07-07 15:34:12,415 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2300, best=0.77, avg=0.75, std=0.00, steps=2.827e+07
2023-07-07 15:34:23,578 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2400, best=0.76, avg=0.76, std=0.00, steps=2.950e+07
2023-07-07 15:34:34,744 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2500, best=0.77, avg=0.76, std=0.00, steps=3.073e+07
2023-07-07 15:34:45,886 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2600, best=0.77, avg=0.76, std=0.00, steps=3.196e+07
2023-07-07 15:34:57,034 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2700, best=0.77, avg=0.76, std=0.00, steps=3.319e+07
2023-07-07 15:35:08,190 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2800, best=0.77, avg=0.76, std=0.00, steps=3.442e+07
2023-07-07 15:35:19,349 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2900, best=0.78, avg=0.76, std=0.00, steps=3.565e+07
2023-07-07 15:35:30,501 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3000, best=0.78, avg=0.77, std=0.00, steps=3.688e+07
2023-07-07 15:35:41,658 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3100, best=0.78, avg=0.77, std=0.00, steps=3.811e+07
2023-07-07 15:35:52,831 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3200, best=0.78, avg=0.77, std=0.00, steps=3.933e+07
2023-07-07 15:36:03,988 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3300, best=0.78, avg=0.77, std=0.00, steps=4.056e+07
2023-07-07 15:36:15,148 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3400, best=0.78, avg=0.77, std=0.00, steps=4.179e+07
2023-07-07 15:36:26,288 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3500, best=0.78, avg=0.77, std=0.00, steps=4.302e+07
2023-07-07 15:36:37,433 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3600, best=0.78, avg=0.78, std=0.00, steps=4.425e+07
2023-07-07 15:36:48,605 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3700, best=0.78, avg=0.77, std=0.00, steps=4.548e+07
2023-07-07 15:36:59,793 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3800, best=0.78, avg=0.77, std=0.00, steps=4.671e+07
2023-07-07 15:37:10,948 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3900, best=0.79, avg=0.78, std=0.00, steps=4.794e+07
2023-07-07 15:37:22,101 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4000, best=0.79, avg=0.78, std=0.00, steps=4.916e+07
2023-07-07 15:37:33,248 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4100, best=0.79, avg=0.78, std=0.00, steps=5.039e+07
2023-07-07 15:37:44,396 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4200, best=0.79, avg=0.78, std=0.00, steps=5.162e+07
2023-07-07 15:37:55,563 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4300, best=0.79, avg=0.78, std=0.00, steps=5.285e+07
2023-07-07 15:38:06,748 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4400, best=0.79, avg=0.78, std=0.00, steps=5.408e+07
2023-07-07 15:38:17,935 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4500, best=0.79, avg=0.78, std=0.00, steps=5.531e+07
2023-07-07 15:38:29,110 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4600, best=0.79, avg=0.78, std=0.00, steps=5.654e+07
2023-07-07 15:38:40,278 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4700, best=0.79, avg=0.78, std=0.00, steps=5.777e+07
2023-07-07 15:38:51,460 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4800, best=0.79, avg=0.78, std=0.00, steps=5.899e+07
2023-07-07 15:39:02,614 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4900, best=0.79, avg=0.78, std=0.00, steps=6.022e+07
2023-07-07 15:39:13,786 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5000, best=0.79, avg=0.78, std=0.00, steps=6.145e+07
2023-07-07 15:39:24,953 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5100, best=0.79, avg=0.79, std=0.00, steps=6.268e+07
2023-07-07 15:39:36,131 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5200, best=0.80, avg=0.79, std=0.00, steps=6.391e+07
2023-07-07 15:39:47,280 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5300, best=0.79, avg=0.79, std=0.00, steps=6.514e+07
2023-07-07 15:39:58,461 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5400, best=0.80, avg=0.79, std=0.00, steps=6.637e+07
2023-07-07 15:40:09,628 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5500, best=0.80, avg=0.79, std=0.00, steps=6.760e+07
2023-07-07 15:40:20,769 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5600, best=0.80, avg=0.79, std=0.00, steps=6.883e+07
2023-07-07 15:40:31,922 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5700, best=0.79, avg=0.79, std=0.00, steps=7.005e+07
2023-07-07 15:40:43,105 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5800, best=0.80, avg=0.79, std=0.00, steps=7.128e+07
2023-07-07 15:40:54,293 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5900, best=0.80, avg=0.79, std=0.00, steps=7.251e+07
2023-07-07 15:41:05,480 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6000, best=0.80, avg=0.79, std=0.00, steps=7.374e+07
2023-07-07 15:41:16,672 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6100, best=0.80, avg=0.79, std=0.00, steps=7.497e+07
2023-07-07 15:41:27,840 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6200, best=0.80, avg=0.79, std=0.00, steps=7.620e+07
2023-07-07 15:41:39,035 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6300, best=0.80, avg=0.79, std=0.00, steps=7.743e+07
2023-07-07 15:41:50,200 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6400, best=0.80, avg=0.79, std=0.00, steps=7.866e+07
2023-07-07 15:42:01,369 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6500, best=0.80, avg=0.79, std=0.00, steps=7.988e+07
2023-07-07 15:42:12,525 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6600, best=0.80, avg=0.79, std=0.00, steps=8.111e+07
2023-07-07 15:42:23,722 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6700, best=0.80, avg=0.79, std=0.00, steps=8.234e+07
2023-07-07 15:42:34,893 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6800, best=0.80, avg=0.79, std=0.00, steps=8.357e+07
2023-07-07 15:42:46,071 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6900, best=0.80, avg=0.79, std=0.00, steps=8.480e+07
2023-07-07 15:42:57,270 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7000, best=0.80, avg=0.79, std=0.00, steps=8.603e+07
2023-07-07 15:43:08,453 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7100, best=0.80, avg=0.79, std=0.00, steps=8.726e+07
2023-07-07 15:43:19,605 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7200, best=0.80, avg=0.79, std=0.00, steps=8.849e+07
2023-07-07 15:43:30,787 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7300, best=0.80, avg=0.79, std=0.00, steps=8.971e+07
2023-07-07 15:43:41,952 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7400, best=0.80, avg=0.79, std=0.00, steps=9.094e+07
2023-07-07 15:43:53,096 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7500, best=0.80, avg=0.79, std=0.00, steps=9.217e+07
2023-07-07 15:44:04,284 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7600, best=0.80, avg=0.79, std=0.00, steps=9.340e+07
2023-07-07 15:44:15,473 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7700, best=0.80, avg=0.79, std=0.00, steps=9.463e+07
2023-07-07 15:44:26,641 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7800, best=0.80, avg=0.79, std=0.00, steps=9.586e+07
2023-07-07 15:44:37,819 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7900, best=0.80, avg=0.79, std=0.00, steps=9.709e+07
2023-07-07 15:44:48,983 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8000, best=0.80, avg=0.79, std=0.00, steps=9.832e+07
2023-07-07 15:45:00,154 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8100, best=0.80, avg=0.79, std=0.00, steps=9.955e+07
2023-07-07 15:45:11,312 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8200, best=0.80, avg=0.79, std=0.00, steps=1.008e+08
2023-07-07 15:45:22,470 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8300, best=0.81, avg=0.79, std=0.00, steps=1.020e+08
2023-07-07 15:45:33,618 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8400, best=0.80, avg=0.79, std=0.00, steps=1.032e+08
2023-07-07 15:45:44,768 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8500, best=0.80, avg=0.79, std=0.00, steps=1.045e+08
2023-07-07 15:45:55,922 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8600, best=0.80, avg=0.80, std=0.00, steps=1.057e+08
2023-07-07 15:46:07,098 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8700, best=0.80, avg=0.79, std=0.00, steps=1.069e+08
2023-07-07 15:46:18,270 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8800, best=0.80, avg=0.79, std=0.00, steps=1.081e+08
2023-07-07 15:46:29,432 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8900, best=0.80, avg=0.79, std=0.00, steps=1.094e+08
2023-07-07 15:46:40,570 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9000, best=0.80, avg=0.80, std=0.00, steps=1.106e+08
2023-07-07 15:46:51,735 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9100, best=0.80, avg=0.80, std=0.00, steps=1.118e+08
2023-07-07 15:47:02,899 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9200, best=0.80, avg=0.80, std=0.00, steps=1.131e+08
2023-07-07 15:47:14,042 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9300, best=0.81, avg=0.80, std=0.00, steps=1.143e+08
2023-07-07 15:47:25,194 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9400, best=0.80, avg=0.80, std=0.00, steps=1.155e+08
2023-07-07 15:47:36,344 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9500, best=0.81, avg=0.79, std=0.00, steps=1.167e+08
2023-07-07 15:47:47,520 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9600, best=0.80, avg=0.80, std=0.00, steps=1.180e+08
2023-07-07 15:47:58,673 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9700, best=0.80, avg=0.80, std=0.00, steps=1.192e+08
2023-07-07 15:48:09,811 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9800, best=0.80, avg=0.80, std=0.00, steps=1.204e+08
2023-07-07 15:48:20,950 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9900, best=0.80, avg=0.80, std=0.00, steps=1.217e+08
2023-07-07 15:48:32,102 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10000, best=0.81, avg=0.80, std=0.00, steps=1.229e+08
2023-07-07 15:48:43,278 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10100, best=0.81, avg=0.80, std=0.00, steps=1.241e+08
2023-07-07 15:48:54,446 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10200, best=0.80, avg=0.80, std=0.00, steps=1.253e+08
2023-07-07 15:49:05,600 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10300, best=0.80, avg=0.80, std=0.00, steps=1.266e+08
2023-07-07 15:49:16,728 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10400, best=0.81, avg=0.80, std=0.00, steps=1.278e+08
2023-07-07 15:49:27,882 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10500, best=0.80, avg=0.80, std=0.00, steps=1.290e+08
2023-07-07 15:49:39,063 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10600, best=0.81, avg=0.80, std=0.00, steps=1.303e+08
2023-07-07 15:49:50,215 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10700, best=0.81, avg=0.80, std=0.00, steps=1.315e+08
2023-07-07 15:50:01,392 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10800, best=0.81, avg=0.80, std=0.00, steps=1.327e+08
2023-07-07 15:50:12,565 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10900, best=0.81, avg=0.80, std=0.00, steps=1.340e+08
2023-07-07 15:50:23,733 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11000, best=0.81, avg=0.80, std=0.00, steps=1.352e+08
2023-07-07 15:50:34,887 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11100, best=0.81, avg=0.80, std=0.00, steps=1.364e+08
2023-07-07 15:50:46,051 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11200, best=0.81, avg=0.80, std=0.00, steps=1.376e+08
2023-07-07 15:50:57,210 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11300, best=0.81, avg=0.80, std=0.00, steps=1.389e+08
2023-07-07 15:51:08,385 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11400, best=0.81, avg=0.80, std=0.00, steps=1.401e+08
2023-07-07 15:51:19,556 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11500, best=0.81, avg=0.80, std=0.00, steps=1.413e+08
2023-07-07 15:51:30,707 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11600, best=0.81, avg=0.80, std=0.00, steps=1.426e+08
2023-07-07 15:51:41,856 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11700, best=0.81, avg=0.80, std=0.00, steps=1.438e+08
2023-07-07 15:51:53,020 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11800, best=0.81, avg=0.80, std=0.00, steps=1.450e+08
2023-07-07 15:52:04,206 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11900, best=0.81, avg=0.80, std=0.00, steps=1.462e+08
2023-07-07 15:52:15,247 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11999, best=0.81, avg=0.80, std=0.00, steps=1.475e+08
2023-07-07 15:52:15,248 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135859
2023-07-07 15:52:15,273 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 15:52:15,305 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 15:52:32,367 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 100, best=0.51, avg=0.50, std=0.01, steps=1.655e+06
2023-07-07 15:52:47,172 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 200, best=0.51, avg=0.50, std=0.01, steps=3.293e+06
2023-07-07 15:53:01,982 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 300, best=0.51, avg=0.50, std=0.01, steps=4.932e+06
2023-07-07 15:53:16,779 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 400, best=0.51, avg=0.50, std=0.01, steps=6.570e+06
2023-07-07 15:53:31,607 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 500, best=0.51, avg=0.50, std=0.01, steps=8.208e+06
2023-07-07 15:53:46,429 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 600, best=0.51, avg=0.50, std=0.01, steps=9.847e+06
2023-07-07 15:54:01,242 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 700, best=0.52, avg=0.50, std=0.01, steps=1.149e+07
2023-07-07 15:54:16,080 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 800, best=0.51, avg=0.50, std=0.01, steps=1.312e+07
2023-07-07 15:54:30,892 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 900, best=0.52, avg=0.50, std=0.01, steps=1.476e+07
2023-07-07 15:54:45,731 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1000, best=0.51, avg=0.50, std=0.01, steps=1.640e+07
2023-07-07 15:55:00,540 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1100, best=0.51, avg=0.50, std=0.01, steps=1.804e+07
2023-07-07 15:55:15,341 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1200, best=0.52, avg=0.50, std=0.01, steps=1.968e+07
2023-07-07 15:55:30,147 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1300, best=0.51, avg=0.50, std=0.01, steps=2.132e+07
2023-07-07 15:55:44,962 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1400, best=0.51, avg=0.50, std=0.01, steps=2.295e+07
2023-07-07 15:55:59,756 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1500, best=0.51, avg=0.50, std=0.00, steps=2.459e+07
2023-07-07 15:56:14,562 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1600, best=0.51, avg=0.50, std=0.01, steps=2.623e+07
2023-07-07 15:56:29,349 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1700, best=0.51, avg=0.50, std=0.01, steps=2.787e+07
2023-07-07 15:56:44,162 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1800, best=0.52, avg=0.50, std=0.01, steps=2.951e+07
2023-07-07 15:56:58,983 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1900, best=0.51, avg=0.50, std=0.00, steps=3.115e+07
2023-07-07 15:57:13,778 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2000, best=0.52, avg=0.50, std=0.01, steps=3.278e+07
2023-07-07 15:57:28,575 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2100, best=0.51, avg=0.50, std=0.01, steps=3.442e+07
2023-07-07 15:57:43,407 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2200, best=0.51, avg=0.50, std=0.01, steps=3.606e+07
2023-07-07 15:57:58,210 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2300, best=0.52, avg=0.50, std=0.01, steps=3.770e+07
2023-07-07 15:58:13,027 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2400, best=0.52, avg=0.50, std=0.01, steps=3.934e+07
2023-07-07 15:58:27,830 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2500, best=0.64, avg=0.63, std=0.00, steps=4.098e+07
2023-07-07 15:58:42,638 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2600, best=0.65, avg=0.64, std=0.00, steps=4.261e+07
2023-07-07 15:58:57,405 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2700, best=0.66, avg=0.64, std=0.00, steps=4.425e+07
2023-07-07 15:59:12,189 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2800, best=0.66, avg=0.65, std=0.00, steps=4.589e+07
2023-07-07 15:59:26,950 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2900, best=0.67, avg=0.65, std=0.00, steps=4.753e+07
2023-07-07 15:59:41,753 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3000, best=0.67, avg=0.66, std=0.00, steps=4.917e+07
2023-07-07 15:59:56,518 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3100, best=0.67, avg=0.66, std=0.00, steps=5.081e+07
2023-07-07 16:00:11,286 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3200, best=0.68, avg=0.67, std=0.00, steps=5.245e+07
2023-07-07 16:00:26,080 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3300, best=0.68, avg=0.67, std=0.00, steps=5.408e+07
2023-07-07 16:00:40,892 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3400, best=0.68, avg=0.67, std=0.00, steps=5.572e+07
2023-07-07 16:00:55,703 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3500, best=0.68, avg=0.67, std=0.00, steps=5.736e+07
2023-07-07 16:01:10,491 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3600, best=0.68, avg=0.67, std=0.00, steps=5.900e+07
2023-07-07 16:01:25,262 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3700, best=0.68, avg=0.67, std=0.00, steps=6.064e+07
2023-07-07 16:01:40,035 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3800, best=0.68, avg=0.67, std=0.00, steps=6.228e+07
2023-07-07 16:01:54,822 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3900, best=0.69, avg=0.67, std=0.00, steps=6.391e+07
2023-07-07 16:02:09,616 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4000, best=0.68, avg=0.68, std=0.00, steps=6.555e+07
2023-07-07 16:02:24,416 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4100, best=0.69, avg=0.67, std=0.00, steps=6.719e+07
2023-07-07 16:02:39,242 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4200, best=0.69, avg=0.68, std=0.00, steps=6.883e+07
2023-07-07 16:02:54,059 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4300, best=0.68, avg=0.68, std=0.00, steps=7.047e+07
2023-07-07 16:03:08,869 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4400, best=0.69, avg=0.68, std=0.00, steps=7.211e+07
2023-07-07 16:03:23,671 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4500, best=0.69, avg=0.68, std=0.00, steps=7.374e+07
2023-07-07 16:03:38,473 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4600, best=0.69, avg=0.68, std=0.00, steps=7.538e+07
2023-07-07 16:03:53,271 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4700, best=0.69, avg=0.68, std=0.00, steps=7.702e+07
2023-07-07 16:04:08,074 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4800, best=0.69, avg=0.68, std=0.00, steps=7.866e+07
2023-07-07 16:04:22,864 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4900, best=0.69, avg=0.68, std=0.00, steps=8.030e+07
2023-07-07 16:04:37,644 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5000, best=0.69, avg=0.68, std=0.00, steps=8.194e+07
2023-07-07 16:04:52,421 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5100, best=0.69, avg=0.68, std=0.00, steps=8.357e+07
2023-07-07 16:05:07,211 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5200, best=0.69, avg=0.68, std=0.00, steps=8.521e+07
2023-07-07 16:05:21,990 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5300, best=0.69, avg=0.68, std=0.00, steps=8.685e+07
2023-07-07 16:05:36,754 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5400, best=0.69, avg=0.68, std=0.00, steps=8.849e+07
2023-07-07 16:05:51,550 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5500, best=0.69, avg=0.68, std=0.00, steps=9.013e+07
2023-07-07 16:06:06,377 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5600, best=0.69, avg=0.68, std=0.00, steps=9.177e+07
2023-07-07 16:06:21,168 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5700, best=0.69, avg=0.69, std=0.00, steps=9.341e+07
2023-07-07 16:06:35,939 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5800, best=0.69, avg=0.68, std=0.00, steps=9.504e+07
2023-07-07 16:06:50,709 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5900, best=0.69, avg=0.68, std=0.00, steps=9.668e+07
2023-07-07 16:07:05,495 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6000, best=0.69, avg=0.69, std=0.00, steps=9.832e+07
2023-07-07 16:07:20,301 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6100, best=0.70, avg=0.69, std=0.00, steps=9.996e+07
2023-07-07 16:07:35,092 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6200, best=0.70, avg=0.69, std=0.00, steps=1.016e+08
2023-07-07 16:07:49,912 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6300, best=0.70, avg=0.69, std=0.00, steps=1.032e+08
2023-07-07 16:08:04,727 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6400, best=0.69, avg=0.68, std=0.00, steps=1.049e+08
2023-07-07 16:08:19,522 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6500, best=0.69, avg=0.68, std=0.00, steps=1.065e+08
2023-07-07 16:08:34,306 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6600, best=0.70, avg=0.69, std=0.00, steps=1.082e+08
2023-07-07 16:08:49,097 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6700, best=0.70, avg=0.69, std=0.00, steps=1.098e+08
2023-07-07 16:09:03,893 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6800, best=0.69, avg=0.69, std=0.00, steps=1.114e+08
2023-07-07 16:09:18,698 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6900, best=0.70, avg=0.69, std=0.00, steps=1.131e+08
2023-07-07 16:09:33,486 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7000, best=0.70, avg=0.69, std=0.00, steps=1.147e+08
2023-07-07 16:09:48,270 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7100, best=0.70, avg=0.69, std=0.00, steps=1.163e+08
2023-07-07 16:10:03,059 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7200, best=0.70, avg=0.69, std=0.00, steps=1.180e+08
2023-07-07 16:10:17,858 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7300, best=0.70, avg=0.69, std=0.00, steps=1.196e+08
2023-07-07 16:10:32,647 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7400, best=0.70, avg=0.69, std=0.00, steps=1.213e+08
2023-07-07 16:10:47,434 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7500, best=0.70, avg=0.69, std=0.00, steps=1.229e+08
2023-07-07 16:11:02,210 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7600, best=0.70, avg=0.69, std=0.00, steps=1.245e+08
2023-07-07 16:11:16,984 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7700, best=0.70, avg=0.69, std=0.00, steps=1.262e+08
2023-07-07 16:11:31,772 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7800, best=0.70, avg=0.69, std=0.00, steps=1.278e+08
2023-07-07 16:11:46,578 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7900, best=0.70, avg=0.69, std=0.00, steps=1.294e+08
2023-07-07 16:12:01,384 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8000, best=0.70, avg=0.69, std=0.00, steps=1.311e+08
2023-07-07 16:12:16,244 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8100, best=0.70, avg=0.69, std=0.00, steps=1.327e+08
2023-07-07 16:12:31,044 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8200, best=0.70, avg=0.69, std=0.00, steps=1.344e+08
2023-07-07 16:12:45,879 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8300, best=0.70, avg=0.69, std=0.00, steps=1.360e+08
2023-07-07 16:13:00,675 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8400, best=0.70, avg=0.69, std=0.00, steps=1.376e+08
2023-07-07 16:13:15,487 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8500, best=0.70, avg=0.69, std=0.00, steps=1.393e+08
2023-07-07 16:13:30,254 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8600, best=0.70, avg=0.69, std=0.00, steps=1.409e+08
2023-07-07 16:13:45,056 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8700, best=0.70, avg=0.69, std=0.00, steps=1.426e+08
2023-07-07 16:13:59,841 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8800, best=0.70, avg=0.69, std=0.00, steps=1.442e+08
2023-07-07 16:14:14,625 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8900, best=0.70, avg=0.69, std=0.00, steps=1.458e+08
2023-07-07 16:14:29,423 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9000, best=0.70, avg=0.69, std=0.00, steps=1.475e+08
2023-07-07 16:14:44,217 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9100, best=0.71, avg=0.70, std=0.00, steps=1.491e+08
2023-07-07 16:14:59,010 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9200, best=0.71, avg=0.70, std=0.00, steps=1.507e+08
2023-07-07 16:15:13,803 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9300, best=0.71, avg=0.70, std=0.00, steps=1.524e+08
2023-07-07 16:15:28,582 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9400, best=0.71, avg=0.70, std=0.00, steps=1.540e+08
2023-07-07 16:15:43,386 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9500, best=0.71, avg=0.70, std=0.00, steps=1.557e+08
2023-07-07 16:15:58,163 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9600, best=0.71, avg=0.70, std=0.00, steps=1.573e+08
2023-07-07 16:16:12,939 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9700, best=0.71, avg=0.70, std=0.00, steps=1.589e+08
2023-07-07 16:16:27,731 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9800, best=0.71, avg=0.70, std=0.00, steps=1.606e+08
2023-07-07 16:16:42,520 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9900, best=0.71, avg=0.70, std=0.00, steps=1.622e+08
2023-07-07 16:16:57,344 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10000, best=0.71, avg=0.70, std=0.00, steps=1.639e+08
2023-07-07 16:17:12,156 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10100, best=0.71, avg=0.70, std=0.00, steps=1.655e+08
2023-07-07 16:17:26,938 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10200, best=0.71, avg=0.70, std=0.00, steps=1.671e+08
2023-07-07 16:17:41,733 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10300, best=0.71, avg=0.70, std=0.00, steps=1.688e+08
2023-07-07 16:17:56,559 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10400, best=0.71, avg=0.70, std=0.00, steps=1.704e+08
2023-07-07 16:18:11,364 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10500, best=0.71, avg=0.70, std=0.00, steps=1.720e+08
2023-07-07 16:18:26,156 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10600, best=0.71, avg=0.70, std=0.00, steps=1.737e+08
2023-07-07 16:18:40,969 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10700, best=0.71, avg=0.70, std=0.00, steps=1.753e+08
2023-07-07 16:18:55,757 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10800, best=0.71, avg=0.70, std=0.00, steps=1.770e+08
2023-07-07 16:19:10,542 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10900, best=0.71, avg=0.70, std=0.00, steps=1.786e+08
2023-07-07 16:19:25,326 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11000, best=0.71, avg=0.70, std=0.00, steps=1.802e+08
2023-07-07 16:19:40,124 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11100, best=0.71, avg=0.70, std=0.00, steps=1.819e+08
2023-07-07 16:19:54,930 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11200, best=0.71, avg=0.70, std=0.00, steps=1.835e+08
2023-07-07 16:20:09,713 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11300, best=0.71, avg=0.70, std=0.00, steps=1.852e+08
2023-07-07 16:20:24,521 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11400, best=0.71, avg=0.70, std=0.00, steps=1.868e+08
2023-07-07 16:20:39,316 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11500, best=0.71, avg=0.70, std=0.00, steps=1.884e+08
2023-07-07 16:20:54,094 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11600, best=0.71, avg=0.70, std=0.00, steps=1.901e+08
2023-07-07 16:21:08,907 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11700, best=0.72, avg=0.70, std=0.00, steps=1.917e+08
2023-07-07 16:21:23,717 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11800, best=0.71, avg=0.70, std=0.00, steps=1.933e+08
2023-07-07 16:21:38,518 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11900, best=0.72, avg=0.71, std=0.00, steps=1.950e+08
2023-07-07 16:21:53,185 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11999, best=0.71, avg=0.71, std=0.00, steps=1.966e+08
2023-07-07 16:21:53,186 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135859
2023-07-07 16:21:53,211 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 16:21:53,243 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 16:22:10,306 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=1.655e+06
2023-07-07 16:22:25,118 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 200, best=0.55, avg=0.55, std=0.00, steps=3.293e+06
2023-07-07 16:22:39,988 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 300, best=0.63, avg=0.62, std=0.00, steps=4.932e+06
2023-07-07 16:22:54,768 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 400, best=0.65, avg=0.64, std=0.00, steps=6.570e+06
2023-07-07 16:23:09,554 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 500, best=0.66, avg=0.65, std=0.00, steps=8.208e+06
2023-07-07 16:23:24,328 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 600, best=0.67, avg=0.66, std=0.00, steps=9.847e+06
2023-07-07 16:23:39,113 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 700, best=0.68, avg=0.67, std=0.00, steps=1.149e+07
2023-07-07 16:23:53,977 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 800, best=0.69, avg=0.68, std=0.00, steps=1.312e+07
2023-07-07 16:24:08,845 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 900, best=0.69, avg=0.69, std=0.00, steps=1.476e+07
2023-07-07 16:24:23,639 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1000, best=0.70, avg=0.69, std=0.00, steps=1.640e+07
2023-07-07 16:24:38,422 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1100, best=0.70, avg=0.69, std=0.00, steps=1.804e+07
2023-07-07 16:24:53,296 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1200, best=0.71, avg=0.70, std=0.00, steps=1.968e+07
2023-07-07 16:25:08,146 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1300, best=0.71, avg=0.70, std=0.00, steps=2.132e+07
2023-07-07 16:25:22,974 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1400, best=0.71, avg=0.70, std=0.00, steps=2.295e+07
2023-07-07 16:25:37,804 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1500, best=0.71, avg=0.71, std=0.00, steps=2.459e+07
2023-07-07 16:25:52,709 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1600, best=0.71, avg=0.71, std=0.00, steps=2.623e+07
2023-07-07 16:26:07,495 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1700, best=0.72, avg=0.71, std=0.00, steps=2.787e+07
2023-07-07 16:26:22,307 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1800, best=0.72, avg=0.71, std=0.00, steps=2.951e+07
2023-07-07 16:26:37,115 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1900, best=0.72, avg=0.71, std=0.00, steps=3.115e+07
2023-07-07 16:26:52,005 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2000, best=0.72, avg=0.72, std=0.00, steps=3.278e+07
2023-07-07 16:27:06,825 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2100, best=0.72, avg=0.72, std=0.00, steps=3.442e+07
2023-07-07 16:27:21,614 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2200, best=0.73, avg=0.72, std=0.00, steps=3.606e+07
2023-07-07 16:27:36,421 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2300, best=0.73, avg=0.72, std=0.00, steps=3.770e+07
2023-07-07 16:27:51,212 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2400, best=0.73, avg=0.72, std=0.00, steps=3.934e+07
2023-07-07 16:28:06,000 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2500, best=0.73, avg=0.72, std=0.00, steps=4.098e+07
2023-07-07 16:28:20,810 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2600, best=0.73, avg=0.73, std=0.00, steps=4.261e+07
2023-07-07 16:28:35,599 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2700, best=0.74, avg=0.73, std=0.00, steps=4.425e+07
2023-07-07 16:28:50,381 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2800, best=0.74, avg=0.73, std=0.00, steps=4.589e+07
2023-07-07 16:29:05,176 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2900, best=0.74, avg=0.73, std=0.00, steps=4.753e+07
2023-07-07 16:29:19,977 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3000, best=0.74, avg=0.73, std=0.00, steps=4.917e+07
2023-07-07 16:29:34,790 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3100, best=0.74, avg=0.73, std=0.00, steps=5.081e+07
2023-07-07 16:29:49,613 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3200, best=0.74, avg=0.73, std=0.00, steps=5.245e+07
2023-07-07 16:30:04,410 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3300, best=0.74, avg=0.73, std=0.00, steps=5.408e+07
2023-07-07 16:30:19,219 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3400, best=0.74, avg=0.73, std=0.00, steps=5.572e+07
2023-07-07 16:30:34,051 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3500, best=0.74, avg=0.74, std=0.00, steps=5.736e+07
2023-07-07 16:30:48,858 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3600, best=0.74, avg=0.74, std=0.00, steps=5.900e+07
2023-07-07 16:31:03,646 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3700, best=0.75, avg=0.74, std=0.00, steps=6.064e+07
2023-07-07 16:31:18,443 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3800, best=0.75, avg=0.74, std=0.00, steps=6.228e+07
2023-07-07 16:31:33,236 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3900, best=0.75, avg=0.74, std=0.00, steps=6.391e+07
2023-07-07 16:31:48,050 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4000, best=0.75, avg=0.74, std=0.00, steps=6.555e+07
2023-07-07 16:32:02,847 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4100, best=0.75, avg=0.74, std=0.00, steps=6.719e+07
2023-07-07 16:32:17,675 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4200, best=0.75, avg=0.74, std=0.00, steps=6.883e+07
2023-07-07 16:32:32,472 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4300, best=0.75, avg=0.74, std=0.00, steps=7.047e+07
2023-07-07 16:32:47,306 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4400, best=0.75, avg=0.74, std=0.00, steps=7.211e+07
2023-07-07 16:33:02,117 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4500, best=0.75, avg=0.74, std=0.00, steps=7.374e+07
2023-07-07 16:33:16,916 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4600, best=0.75, avg=0.75, std=0.00, steps=7.538e+07
2023-07-07 16:33:31,734 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4700, best=0.75, avg=0.75, std=0.00, steps=7.702e+07
2023-07-07 16:33:46,608 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4800, best=0.75, avg=0.75, std=0.00, steps=7.866e+07
2023-07-07 16:34:01,422 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4900, best=0.76, avg=0.75, std=0.00, steps=8.030e+07
2023-07-07 16:34:16,213 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5000, best=0.76, avg=0.75, std=0.00, steps=8.194e+07
2023-07-07 16:34:31,014 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5100, best=0.76, avg=0.75, std=0.00, steps=8.357e+07
2023-07-07 16:34:45,796 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5200, best=0.76, avg=0.75, std=0.00, steps=8.521e+07
2023-07-07 16:35:00,606 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5300, best=0.76, avg=0.75, std=0.00, steps=8.685e+07
2023-07-07 16:35:15,421 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5400, best=0.76, avg=0.75, std=0.00, steps=8.849e+07
2023-07-07 16:35:30,244 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5500, best=0.76, avg=0.75, std=0.00, steps=9.013e+07
2023-07-07 16:35:45,030 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5600, best=0.76, avg=0.75, std=0.00, steps=9.177e+07
2023-07-07 16:35:59,909 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5700, best=0.76, avg=0.75, std=0.00, steps=9.341e+07
2023-07-07 16:36:14,723 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5800, best=0.76, avg=0.76, std=0.00, steps=9.504e+07
2023-07-07 16:36:29,516 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5900, best=0.76, avg=0.76, std=0.00, steps=9.668e+07
2023-07-07 16:36:44,291 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6000, best=0.76, avg=0.76, std=0.00, steps=9.832e+07
2023-07-07 16:36:59,077 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6100, best=0.76, avg=0.76, std=0.00, steps=9.996e+07
2023-07-07 16:37:13,861 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6200, best=0.77, avg=0.76, std=0.00, steps=1.016e+08
2023-07-07 16:37:28,769 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6300, best=0.77, avg=0.76, std=0.00, steps=1.032e+08
2023-07-07 16:37:43,559 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6400, best=0.77, avg=0.76, std=0.00, steps=1.049e+08
2023-07-07 16:37:58,334 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6500, best=0.77, avg=0.76, std=0.00, steps=1.065e+08
2023-07-07 16:38:13,122 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6600, best=0.77, avg=0.76, std=0.00, steps=1.082e+08
2023-07-07 16:38:27,911 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6700, best=0.77, avg=0.76, std=0.00, steps=1.098e+08
2023-07-07 16:38:42,677 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6800, best=0.77, avg=0.76, std=0.00, steps=1.114e+08
2023-07-07 16:38:57,496 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6900, best=0.77, avg=0.77, std=0.00, steps=1.131e+08
2023-07-07 16:39:12,401 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7000, best=0.77, avg=0.77, std=0.00, steps=1.147e+08
2023-07-07 16:39:27,204 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7100, best=0.77, avg=0.77, std=0.00, steps=1.163e+08
2023-07-07 16:39:42,007 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7200, best=0.77, avg=0.77, std=0.00, steps=1.180e+08
2023-07-07 16:39:56,776 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7300, best=0.78, avg=0.77, std=0.00, steps=1.196e+08
2023-07-07 16:40:11,561 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7400, best=0.78, avg=0.77, std=0.00, steps=1.213e+08
2023-07-07 16:40:26,358 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7500, best=0.77, avg=0.77, std=0.00, steps=1.229e+08
2023-07-07 16:40:41,166 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7600, best=0.78, avg=0.77, std=0.00, steps=1.245e+08
2023-07-07 16:40:55,961 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7700, best=0.78, avg=0.77, std=0.00, steps=1.262e+08
2023-07-07 16:41:10,781 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7800, best=0.78, avg=0.77, std=0.00, steps=1.278e+08
2023-07-07 16:41:25,594 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7900, best=0.78, avg=0.77, std=0.00, steps=1.294e+08
2023-07-07 16:41:40,394 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8000, best=0.78, avg=0.77, std=0.00, steps=1.311e+08
2023-07-07 16:41:55,175 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8100, best=0.78, avg=0.77, std=0.00, steps=1.327e+08
2023-07-07 16:42:09,949 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8200, best=0.78, avg=0.77, std=0.00, steps=1.344e+08
2023-07-07 16:42:24,771 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8300, best=0.78, avg=0.78, std=0.00, steps=1.360e+08
2023-07-07 16:42:39,586 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8400, best=0.78, avg=0.78, std=0.00, steps=1.376e+08
2023-07-07 16:42:54,488 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8500, best=0.78, avg=0.78, std=0.00, steps=1.393e+08
2023-07-07 16:43:09,271 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8600, best=0.78, avg=0.78, std=0.00, steps=1.409e+08
2023-07-07 16:43:24,050 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8700, best=0.78, avg=0.78, std=0.00, steps=1.426e+08
2023-07-07 16:43:38,832 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8800, best=0.79, avg=0.78, std=0.00, steps=1.442e+08
2023-07-07 16:43:53,632 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8900, best=0.79, avg=0.78, std=0.00, steps=1.458e+08
2023-07-07 16:44:08,515 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9000, best=0.78, avg=0.78, std=0.00, steps=1.475e+08
2023-07-07 16:44:23,271 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9100, best=0.79, avg=0.78, std=0.00, steps=1.491e+08
2023-07-07 16:44:38,061 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9200, best=0.79, avg=0.78, std=0.00, steps=1.507e+08
2023-07-07 16:44:52,845 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9300, best=0.79, avg=0.78, std=0.00, steps=1.524e+08
2023-07-07 16:45:07,612 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9400, best=0.79, avg=0.78, std=0.00, steps=1.540e+08
2023-07-07 16:45:22,540 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9500, best=0.79, avg=0.78, std=0.00, steps=1.557e+08
2023-07-07 16:45:37,334 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9600, best=0.79, avg=0.78, std=0.00, steps=1.573e+08
2023-07-07 16:45:52,106 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9700, best=0.79, avg=0.78, std=0.00, steps=1.589e+08
2023-07-07 16:46:06,884 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9800, best=0.79, avg=0.78, std=0.00, steps=1.606e+08
2023-07-07 16:46:21,658 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9900, best=0.79, avg=0.78, std=0.00, steps=1.622e+08
2023-07-07 16:46:36,440 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10000, best=0.79, avg=0.78, std=0.00, steps=1.639e+08
2023-07-07 16:46:51,335 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10100, best=0.79, avg=0.78, std=0.00, steps=1.655e+08
2023-07-07 16:47:06,132 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10200, best=0.79, avg=0.79, std=0.00, steps=1.671e+08
2023-07-07 16:47:20,930 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10300, best=0.79, avg=0.79, std=0.00, steps=1.688e+08
2023-07-07 16:47:35,748 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10400, best=0.79, avg=0.79, std=0.00, steps=1.704e+08
2023-07-07 16:47:50,602 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10500, best=0.79, avg=0.79, std=0.00, steps=1.720e+08
2023-07-07 16:48:05,390 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10600, best=0.79, avg=0.79, std=0.00, steps=1.737e+08
2023-07-07 16:48:20,170 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10700, best=0.79, avg=0.79, std=0.00, steps=1.753e+08
2023-07-07 16:48:34,956 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10800, best=0.79, avg=0.79, std=0.00, steps=1.770e+08
2023-07-07 16:48:49,771 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10900, best=0.80, avg=0.79, std=0.00, steps=1.786e+08
2023-07-07 16:49:04,544 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11000, best=0.80, avg=0.79, std=0.00, steps=1.802e+08
2023-07-07 16:49:19,323 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11100, best=0.80, avg=0.79, std=0.00, steps=1.819e+08
2023-07-07 16:49:34,097 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11200, best=0.80, avg=0.79, std=0.00, steps=1.835e+08
2023-07-07 16:49:48,932 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11300, best=0.80, avg=0.79, std=0.00, steps=1.852e+08
2023-07-07 16:50:03,778 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11400, best=0.80, avg=0.79, std=0.00, steps=1.868e+08
2023-07-07 16:50:18,561 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11500, best=0.80, avg=0.79, std=0.00, steps=1.884e+08
2023-07-07 16:50:33,343 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11600, best=0.80, avg=0.79, std=0.00, steps=1.901e+08
2023-07-07 16:50:48,118 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11700, best=0.80, avg=0.79, std=0.00, steps=1.917e+08
2023-07-07 16:51:02,910 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11800, best=0.80, avg=0.79, std=0.00, steps=1.933e+08
2023-07-07 16:51:17,686 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11900, best=0.80, avg=0.79, std=0.00, steps=1.950e+08
2023-07-07 16:51:32,296 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11999, best=0.80, avg=0.79, std=0.00, steps=1.966e+08
2023-07-07 16:51:32,296 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135859
2023-07-07 16:51:32,322 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 16:51:32,355 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 16:51:51,221 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=1.862e+06
2023-07-07 16:52:07,801 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 200, best=0.60, avg=0.59, std=0.00, steps=3.705e+06
2023-07-07 16:52:24,414 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 300, best=0.61, avg=0.61, std=0.00, steps=5.548e+06
2023-07-07 16:52:41,004 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 400, best=0.64, avg=0.64, std=0.00, steps=7.391e+06
2023-07-07 16:52:57,600 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 500, best=0.66, avg=0.65, std=0.00, steps=9.234e+06
2023-07-07 16:53:14,219 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 600, best=0.66, avg=0.66, std=0.00, steps=1.108e+07
2023-07-07 16:53:30,791 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 700, best=0.67, avg=0.67, std=0.00, steps=1.292e+07
2023-07-07 16:53:47,371 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 800, best=0.68, avg=0.67, std=0.00, steps=1.476e+07
2023-07-07 16:54:03,967 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 900, best=0.68, avg=0.67, std=0.00, steps=1.661e+07
2023-07-07 16:54:20,551 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1000, best=0.68, avg=0.68, std=0.00, steps=1.845e+07
2023-07-07 16:54:37,138 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1100, best=0.69, avg=0.68, std=0.00, steps=2.029e+07
2023-07-07 16:54:53,727 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1200, best=0.69, avg=0.68, std=0.00, steps=2.214e+07
2023-07-07 16:55:10,324 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1300, best=0.70, avg=0.69, std=0.00, steps=2.398e+07
2023-07-07 16:55:26,916 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1400, best=0.70, avg=0.69, std=0.00, steps=2.582e+07
2023-07-07 16:55:43,488 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1500, best=0.70, avg=0.69, std=0.00, steps=2.767e+07
2023-07-07 16:56:00,072 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1600, best=0.70, avg=0.69, std=0.00, steps=2.951e+07
2023-07-07 16:56:16,670 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1700, best=0.70, avg=0.69, std=0.00, steps=3.135e+07
2023-07-07 16:56:33,227 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1800, best=0.70, avg=0.70, std=0.00, steps=3.320e+07
2023-07-07 16:56:49,795 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1900, best=0.70, avg=0.70, std=0.00, steps=3.504e+07
2023-07-07 16:57:06,387 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2000, best=0.71, avg=0.70, std=0.00, steps=3.688e+07
2023-07-07 16:57:22,978 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2100, best=0.71, avg=0.70, std=0.00, steps=3.873e+07
2023-07-07 16:57:39,575 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2200, best=0.71, avg=0.70, std=0.00, steps=4.057e+07
2023-07-07 16:57:56,168 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2300, best=0.71, avg=0.70, std=0.00, steps=4.241e+07
2023-07-07 16:58:12,757 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2400, best=0.71, avg=0.70, std=0.00, steps=4.426e+07
2023-07-07 16:58:29,343 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2500, best=0.71, avg=0.70, std=0.00, steps=4.610e+07
2023-07-07 16:58:45,944 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2600, best=0.71, avg=0.71, std=0.00, steps=4.794e+07
2023-07-07 16:59:02,522 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2700, best=0.72, avg=0.71, std=0.00, steps=4.978e+07
2023-07-07 16:59:19,100 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2800, best=0.72, avg=0.71, std=0.00, steps=5.163e+07
2023-07-07 16:59:35,688 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2900, best=0.72, avg=0.71, std=0.00, steps=5.347e+07
2023-07-07 16:59:52,302 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3000, best=0.72, avg=0.71, std=0.00, steps=5.531e+07
2023-07-07 17:00:08,895 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3100, best=0.72, avg=0.71, std=0.00, steps=5.716e+07
2023-07-07 17:00:25,465 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3200, best=0.72, avg=0.71, std=0.00, steps=5.900e+07
2023-07-07 17:00:42,093 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3300, best=0.72, avg=0.71, std=0.00, steps=6.084e+07
2023-07-07 17:00:58,728 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3400, best=0.73, avg=0.72, std=0.00, steps=6.269e+07
2023-07-07 17:01:15,323 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3500, best=0.72, avg=0.72, std=0.00, steps=6.453e+07
2023-07-07 17:01:31,885 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3600, best=0.73, avg=0.72, std=0.00, steps=6.637e+07
2023-07-07 17:01:48,446 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3700, best=0.72, avg=0.72, std=0.00, steps=6.822e+07
2023-07-07 17:02:05,055 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3800, best=0.72, avg=0.72, std=0.00, steps=7.006e+07
2023-07-07 17:02:21,635 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3900, best=0.73, avg=0.72, std=0.00, steps=7.190e+07
2023-07-07 17:02:38,221 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4000, best=0.73, avg=0.72, std=0.00, steps=7.375e+07
2023-07-07 17:02:54,795 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4100, best=0.73, avg=0.72, std=0.00, steps=7.559e+07
2023-07-07 17:03:11,363 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4200, best=0.73, avg=0.72, std=0.00, steps=7.743e+07
2023-07-07 17:03:27,940 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4300, best=0.73, avg=0.72, std=0.00, steps=7.928e+07
2023-07-07 17:03:44,505 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4400, best=0.73, avg=0.72, std=0.00, steps=8.112e+07
2023-07-07 17:04:01,080 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4500, best=0.73, avg=0.72, std=0.00, steps=8.296e+07
2023-07-07 17:04:17,655 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4600, best=0.73, avg=0.72, std=0.00, steps=8.481e+07
2023-07-07 17:04:34,231 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4700, best=0.73, avg=0.72, std=0.00, steps=8.665e+07
2023-07-07 17:04:50,812 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4800, best=0.73, avg=0.72, std=0.00, steps=8.849e+07
2023-07-07 17:05:07,383 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4900, best=0.73, avg=0.72, std=0.00, steps=9.034e+07
2023-07-07 17:05:23,963 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5000, best=0.73, avg=0.72, std=0.00, steps=9.218e+07
2023-07-07 17:05:40,544 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5100, best=0.73, avg=0.72, std=0.00, steps=9.402e+07
2023-07-07 17:05:57,124 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5200, best=0.73, avg=0.73, std=0.00, steps=9.586e+07
2023-07-07 17:06:13,707 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5300, best=0.73, avg=0.73, std=0.00, steps=9.771e+07
2023-07-07 17:06:30,285 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5400, best=0.73, avg=0.73, std=0.00, steps=9.955e+07
2023-07-07 17:06:46,898 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5500, best=0.73, avg=0.73, std=0.00, steps=1.014e+08
2023-07-07 17:07:03,468 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5600, best=0.73, avg=0.73, std=0.00, steps=1.032e+08
2023-07-07 17:07:20,046 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5700, best=0.73, avg=0.73, std=0.00, steps=1.051e+08
2023-07-07 17:07:36,614 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5800, best=0.74, avg=0.73, std=0.00, steps=1.069e+08
2023-07-07 17:07:53,225 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5900, best=0.74, avg=0.73, std=0.00, steps=1.088e+08
2023-07-07 17:08:09,907 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6000, best=0.74, avg=0.73, std=0.00, steps=1.106e+08
2023-07-07 17:08:26,553 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6100, best=0.74, avg=0.73, std=0.00, steps=1.125e+08
2023-07-07 17:08:43,143 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6200, best=0.74, avg=0.73, std=0.00, steps=1.143e+08
2023-07-07 17:08:59,822 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6300, best=0.74, avg=0.73, std=0.00, steps=1.161e+08
2023-07-07 17:09:16,422 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6400, best=0.74, avg=0.73, std=0.00, steps=1.180e+08
2023-07-07 17:09:33,015 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6500, best=0.74, avg=0.73, std=0.00, steps=1.198e+08
2023-07-07 17:09:49,601 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6600, best=0.74, avg=0.73, std=0.00, steps=1.217e+08
2023-07-07 17:10:06,163 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6700, best=0.74, avg=0.73, std=0.00, steps=1.235e+08
2023-07-07 17:10:22,793 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6800, best=0.74, avg=0.73, std=0.00, steps=1.254e+08
2023-07-07 17:10:39,407 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6900, best=0.74, avg=0.73, std=0.00, steps=1.272e+08
2023-07-07 17:10:56,004 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7000, best=0.75, avg=0.73, std=0.00, steps=1.290e+08
2023-07-07 17:11:12,595 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7100, best=0.74, avg=0.74, std=0.00, steps=1.309e+08
2023-07-07 17:11:29,170 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7200, best=0.74, avg=0.74, std=0.00, steps=1.327e+08
2023-07-07 17:11:45,753 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7300, best=0.75, avg=0.74, std=0.00, steps=1.346e+08
2023-07-07 17:12:02,345 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7400, best=0.74, avg=0.74, std=0.00, steps=1.364e+08
2023-07-07 17:12:18,913 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7500, best=0.75, avg=0.74, std=0.00, steps=1.383e+08
2023-07-07 17:12:35,484 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7600, best=0.74, avg=0.74, std=0.00, steps=1.401e+08
2023-07-07 17:12:52,051 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7700, best=0.74, avg=0.74, std=0.00, steps=1.419e+08
2023-07-07 17:13:08,628 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7800, best=0.75, avg=0.74, std=0.00, steps=1.438e+08
2023-07-07 17:13:25,224 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7900, best=0.75, avg=0.74, std=0.00, steps=1.456e+08
2023-07-07 17:13:41,808 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8000, best=0.75, avg=0.74, std=0.00, steps=1.475e+08
2023-07-07 17:13:58,397 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8100, best=0.75, avg=0.74, std=0.00, steps=1.493e+08
2023-07-07 17:14:14,994 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8200, best=0.75, avg=0.74, std=0.00, steps=1.512e+08
2023-07-07 17:14:31,687 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8300, best=0.75, avg=0.74, std=0.00, steps=1.530e+08
2023-07-07 17:14:48,297 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8400, best=0.75, avg=0.74, std=0.00, steps=1.548e+08
2023-07-07 17:15:04,885 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8500, best=0.75, avg=0.74, std=0.00, steps=1.567e+08
2023-07-07 17:15:21,477 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8600, best=0.75, avg=0.74, std=0.00, steps=1.585e+08
2023-07-07 17:15:38,091 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8700, best=0.75, avg=0.74, std=0.00, steps=1.604e+08
2023-07-07 17:15:54,672 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8800, best=0.75, avg=0.74, std=0.00, steps=1.622e+08
2023-07-07 17:16:11,248 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8900, best=0.75, avg=0.74, std=0.00, steps=1.641e+08
2023-07-07 17:16:27,815 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9000, best=0.75, avg=0.74, std=0.00, steps=1.659e+08
2023-07-07 17:16:44,471 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9100, best=0.75, avg=0.75, std=0.00, steps=1.677e+08
2023-07-07 17:17:01,071 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9200, best=0.75, avg=0.75, std=0.00, steps=1.696e+08
2023-07-07 17:17:17,650 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9300, best=0.75, avg=0.75, std=0.00, steps=1.714e+08
2023-07-07 17:17:34,267 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9400, best=0.75, avg=0.75, std=0.00, steps=1.733e+08
2023-07-07 17:17:50,884 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9500, best=0.76, avg=0.75, std=0.00, steps=1.751e+08
2023-07-07 17:18:07,461 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9600, best=0.75, avg=0.75, std=0.00, steps=1.770e+08
2023-07-07 17:18:24,056 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9700, best=0.76, avg=0.75, std=0.00, steps=1.788e+08
2023-07-07 17:18:40,657 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9800, best=0.75, avg=0.75, std=0.00, steps=1.807e+08
2023-07-07 17:18:57,276 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9900, best=0.76, avg=0.75, std=0.00, steps=1.825e+08
2023-07-07 17:19:13,916 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10000, best=0.76, avg=0.75, std=0.00, steps=1.843e+08
2023-07-07 17:19:30,559 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10100, best=0.75, avg=0.75, std=0.00, steps=1.862e+08
2023-07-07 17:19:47,256 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10200, best=0.76, avg=0.75, std=0.00, steps=1.880e+08
2023-07-07 17:20:03,834 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10300, best=0.76, avg=0.75, std=0.00, steps=1.899e+08
2023-07-07 17:20:20,459 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10400, best=0.76, avg=0.75, std=0.00, steps=1.917e+08
2023-07-07 17:20:37,158 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10500, best=0.76, avg=0.75, std=0.00, steps=1.936e+08
2023-07-07 17:20:53,749 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10600, best=0.76, avg=0.75, std=0.00, steps=1.954e+08
2023-07-07 17:21:10,335 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10700, best=0.76, avg=0.75, std=0.00, steps=1.972e+08
2023-07-07 17:21:26,904 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10800, best=0.76, avg=0.75, std=0.00, steps=1.991e+08
2023-07-07 17:21:43,479 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10900, best=0.76, avg=0.75, std=0.00, steps=2.009e+08
2023-07-07 17:22:00,233 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11000, best=0.76, avg=0.75, std=0.00, steps=2.028e+08
2023-07-07 17:22:16,813 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11100, best=0.76, avg=0.75, std=0.00, steps=2.046e+08
2023-07-07 17:22:33,386 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11200, best=0.76, avg=0.75, std=0.00, steps=2.065e+08
2023-07-07 17:22:49,969 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11300, best=0.76, avg=0.76, std=0.00, steps=2.083e+08
2023-07-07 17:23:06,548 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11400, best=0.76, avg=0.76, std=0.00, steps=2.101e+08
2023-07-07 17:23:23,110 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11500, best=0.77, avg=0.76, std=0.00, steps=2.120e+08
2023-07-07 17:23:39,779 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11600, best=0.76, avg=0.76, std=0.00, steps=2.138e+08
2023-07-07 17:23:56,353 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11700, best=0.76, avg=0.76, std=0.00, steps=2.157e+08
2023-07-07 17:24:12,952 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11800, best=0.77, avg=0.76, std=0.00, steps=2.175e+08
2023-07-07 17:24:29,573 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11900, best=0.77, avg=0.76, std=0.00, steps=2.194e+08
2023-07-07 17:24:46,127 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11999, best=0.76, avg=0.76, std=0.00, steps=2.212e+08
2023-07-07 17:24:46,127 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135859
2023-07-07 17:24:46,153 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 17:24:46,184 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 17:25:06,904 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=2.068e+06
2023-07-07 17:25:25,319 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 200, best=0.58, avg=0.57, std=0.00, steps=4.116e+06
2023-07-07 17:25:43,731 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 300, best=0.60, avg=0.59, std=0.00, steps=6.164e+06
2023-07-07 17:26:02,177 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 400, best=0.62, avg=0.62, std=0.00, steps=8.212e+06
2023-07-07 17:26:20,540 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 500, best=0.63, avg=0.63, std=0.00, steps=1.026e+07
2023-07-07 17:26:38,930 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 600, best=0.64, avg=0.63, std=0.00, steps=1.231e+07
2023-07-07 17:26:57,326 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 700, best=0.65, avg=0.64, std=0.00, steps=1.436e+07
2023-07-07 17:27:15,709 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 800, best=0.65, avg=0.64, std=0.00, steps=1.640e+07
2023-07-07 17:27:34,110 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 900, best=0.65, avg=0.64, std=0.00, steps=1.845e+07
2023-07-07 17:27:52,493 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1000, best=0.65, avg=0.65, std=0.00, steps=2.050e+07
2023-07-07 17:28:10,922 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1100, best=0.66, avg=0.65, std=0.00, steps=2.255e+07
2023-07-07 17:28:29,341 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1200, best=0.66, avg=0.65, std=0.00, steps=2.460e+07
2023-07-07 17:28:47,716 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1300, best=0.66, avg=0.65, std=0.00, steps=2.664e+07
2023-07-07 17:29:06,095 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1400, best=0.67, avg=0.66, std=0.00, steps=2.869e+07
2023-07-07 17:29:24,486 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1500, best=0.66, avg=0.66, std=0.00, steps=3.074e+07
2023-07-07 17:29:42,887 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1600, best=0.67, avg=0.66, std=0.00, steps=3.279e+07
2023-07-07 17:30:01,291 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1700, best=0.67, avg=0.66, std=0.00, steps=3.484e+07
2023-07-07 17:30:19,691 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1800, best=0.67, avg=0.66, std=0.00, steps=3.688e+07
2023-07-07 17:30:38,102 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1900, best=0.67, avg=0.67, std=0.00, steps=3.893e+07
2023-07-07 17:30:56,525 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2000, best=0.67, avg=0.67, std=0.00, steps=4.098e+07
2023-07-07 17:31:14,928 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2100, best=0.68, avg=0.67, std=0.00, steps=4.303e+07
2023-07-07 17:31:33,309 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2200, best=0.68, avg=0.67, std=0.00, steps=4.508e+07
2023-07-07 17:31:51,705 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2300, best=0.68, avg=0.67, std=0.00, steps=4.712e+07
2023-07-07 17:32:10,109 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2400, best=0.68, avg=0.67, std=0.00, steps=4.917e+07
2023-07-07 17:32:28,487 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2500, best=0.68, avg=0.67, std=0.00, steps=5.122e+07
2023-07-07 17:32:46,865 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2600, best=0.68, avg=0.67, std=0.00, steps=5.327e+07
2023-07-07 17:33:05,265 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2700, best=0.68, avg=0.68, std=0.00, steps=5.532e+07
2023-07-07 17:33:23,644 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2800, best=0.69, avg=0.68, std=0.00, steps=5.736e+07
2023-07-07 17:33:42,023 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2900, best=0.68, avg=0.68, std=0.00, steps=5.941e+07
2023-07-07 17:34:00,396 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3000, best=0.68, avg=0.68, std=0.00, steps=6.146e+07
2023-07-07 17:34:18,777 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3100, best=0.69, avg=0.68, std=0.00, steps=6.351e+07
2023-07-07 17:34:37,138 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3200, best=0.69, avg=0.68, std=0.00, steps=6.556e+07
2023-07-07 17:34:55,543 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3300, best=0.69, avg=0.68, std=0.00, steps=6.760e+07
2023-07-07 17:35:13,943 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3400, best=0.69, avg=0.68, std=0.00, steps=6.965e+07
2023-07-07 17:35:32,338 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3500, best=0.69, avg=0.68, std=0.00, steps=7.170e+07
2023-07-07 17:35:50,746 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3600, best=0.69, avg=0.68, std=0.00, steps=7.375e+07
2023-07-07 17:36:09,209 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3700, best=0.69, avg=0.68, std=0.00, steps=7.580e+07
2023-07-07 17:36:27,612 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3800, best=0.69, avg=0.68, std=0.00, steps=7.784e+07
2023-07-07 17:36:45,985 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3900, best=0.69, avg=0.68, std=0.00, steps=7.989e+07
2023-07-07 17:37:04,386 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4000, best=0.69, avg=0.68, std=0.00, steps=8.194e+07
2023-07-07 17:37:22,801 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4100, best=0.69, avg=0.69, std=0.00, steps=8.399e+07
2023-07-07 17:37:41,180 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4200, best=0.69, avg=0.69, std=0.00, steps=8.604e+07
2023-07-07 17:37:59,569 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4300, best=0.69, avg=0.69, std=0.00, steps=8.808e+07
2023-07-07 17:38:17,953 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4400, best=0.69, avg=0.69, std=0.00, steps=9.013e+07
2023-07-07 17:38:36,342 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4500, best=0.69, avg=0.69, std=0.00, steps=9.218e+07
2023-07-07 17:38:54,747 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4600, best=0.70, avg=0.69, std=0.00, steps=9.423e+07
2023-07-07 17:39:13,117 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4700, best=0.70, avg=0.69, std=0.00, steps=9.628e+07
2023-07-07 17:39:31,512 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4800, best=0.70, avg=0.69, std=0.00, steps=9.832e+07
2023-07-07 17:39:49,930 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4900, best=0.70, avg=0.69, std=0.00, steps=1.004e+08
2023-07-07 17:40:08,324 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5000, best=0.70, avg=0.69, std=0.00, steps=1.024e+08
2023-07-07 17:40:26,726 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5100, best=0.70, avg=0.69, std=0.00, steps=1.045e+08
2023-07-07 17:40:45,122 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5200, best=0.70, avg=0.69, std=0.00, steps=1.065e+08
2023-07-07 17:41:03,510 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5300, best=0.70, avg=0.69, std=0.00, steps=1.086e+08
2023-07-07 17:41:21,916 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5400, best=0.70, avg=0.69, std=0.00, steps=1.106e+08
2023-07-07 17:41:40,269 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5500, best=0.70, avg=0.69, std=0.00, steps=1.127e+08
2023-07-07 17:41:58,627 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5600, best=0.70, avg=0.69, std=0.00, steps=1.147e+08
2023-07-07 17:42:16,985 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5700, best=0.70, avg=0.70, std=0.00, steps=1.168e+08
2023-07-07 17:42:35,387 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5800, best=0.70, avg=0.70, std=0.00, steps=1.188e+08
2023-07-07 17:42:53,770 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5900, best=0.70, avg=0.70, std=0.00, steps=1.209e+08
2023-07-07 17:43:12,144 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6000, best=0.70, avg=0.70, std=0.00, steps=1.229e+08
2023-07-07 17:43:30,507 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6100, best=0.71, avg=0.70, std=0.00, steps=1.249e+08
2023-07-07 17:43:48,895 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6200, best=0.71, avg=0.70, std=0.00, steps=1.270e+08
2023-07-07 17:44:07,290 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6300, best=0.71, avg=0.70, std=0.00, steps=1.290e+08
2023-07-07 17:44:25,672 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6400, best=0.71, avg=0.70, std=0.00, steps=1.311e+08
2023-07-07 17:44:44,055 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6500, best=0.71, avg=0.70, std=0.00, steps=1.331e+08
2023-07-07 17:45:02,437 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6600, best=0.71, avg=0.70, std=0.00, steps=1.352e+08
2023-07-07 17:45:20,817 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6700, best=0.71, avg=0.70, std=0.00, steps=1.372e+08
2023-07-07 17:45:39,213 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6800, best=0.71, avg=0.70, std=0.00, steps=1.393e+08
2023-07-07 17:45:57,624 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6900, best=0.71, avg=0.70, std=0.00, steps=1.413e+08
2023-07-07 17:46:16,036 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7000, best=0.71, avg=0.70, std=0.00, steps=1.434e+08
2023-07-07 17:46:34,434 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7100, best=0.71, avg=0.70, std=0.00, steps=1.454e+08
2023-07-07 17:46:52,825 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7200, best=0.71, avg=0.70, std=0.00, steps=1.475e+08
2023-07-07 17:47:11,225 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7300, best=0.71, avg=0.70, std=0.00, steps=1.495e+08
2023-07-07 17:47:29,614 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7400, best=0.71, avg=0.70, std=0.00, steps=1.516e+08
2023-07-07 17:47:48,001 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7500, best=0.71, avg=0.70, std=0.00, steps=1.536e+08
2023-07-07 17:48:06,396 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7600, best=0.71, avg=0.70, std=0.00, steps=1.557e+08
2023-07-07 17:48:24,787 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7700, best=0.71, avg=0.70, std=0.00, steps=1.577e+08
2023-07-07 17:48:43,191 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7800, best=0.71, avg=0.70, std=0.00, steps=1.598e+08
2023-07-07 17:49:01,582 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7900, best=0.71, avg=0.71, std=0.00, steps=1.618e+08
2023-07-07 17:49:19,996 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8000, best=0.71, avg=0.70, std=0.00, steps=1.639e+08
2023-07-07 17:49:38,387 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8100, best=0.71, avg=0.71, std=0.00, steps=1.659e+08
2023-07-07 17:49:56,792 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8200, best=0.71, avg=0.71, std=0.00, steps=1.680e+08
2023-07-07 17:50:15,189 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8300, best=0.71, avg=0.71, std=0.00, steps=1.700e+08
2023-07-07 17:50:33,638 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8400, best=0.72, avg=0.71, std=0.00, steps=1.721e+08
2023-07-07 17:50:52,052 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8500, best=0.72, avg=0.71, std=0.00, steps=1.741e+08
2023-07-07 17:51:10,446 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8600, best=0.71, avg=0.71, std=0.00, steps=1.761e+08
2023-07-07 17:51:28,845 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8700, best=0.72, avg=0.71, std=0.00, steps=1.782e+08
2023-07-07 17:51:47,236 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8800, best=0.72, avg=0.71, std=0.00, steps=1.802e+08
2023-07-07 17:52:05,615 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8900, best=0.72, avg=0.71, std=0.00, steps=1.823e+08
2023-07-07 17:52:24,014 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9000, best=0.72, avg=0.71, std=0.00, steps=1.843e+08
2023-07-07 17:52:42,399 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9100, best=0.72, avg=0.71, std=0.00, steps=1.864e+08
2023-07-07 17:53:00,776 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9200, best=0.72, avg=0.71, std=0.00, steps=1.884e+08
2023-07-07 17:53:19,150 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9300, best=0.72, avg=0.71, std=0.00, steps=1.905e+08
2023-07-07 17:53:37,532 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9400, best=0.72, avg=0.71, std=0.00, steps=1.925e+08
2023-07-07 17:53:55,925 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9500, best=0.72, avg=0.71, std=0.00, steps=1.946e+08
2023-07-07 17:54:14,310 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9600, best=0.72, avg=0.71, std=0.00, steps=1.966e+08
2023-07-07 17:54:32,691 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9700, best=0.72, avg=0.71, std=0.00, steps=1.987e+08
2023-07-07 17:54:51,090 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9800, best=0.72, avg=0.71, std=0.00, steps=2.007e+08
2023-07-07 17:55:09,500 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9900, best=0.72, avg=0.71, std=0.00, steps=2.028e+08
2023-07-07 17:55:27,893 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10000, best=0.72, avg=0.71, std=0.00, steps=2.048e+08
2023-07-07 17:55:46,299 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10100, best=0.72, avg=0.71, std=0.00, steps=2.069e+08
2023-07-07 17:56:04,666 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10200, best=0.72, avg=0.71, std=0.00, steps=2.089e+08
2023-07-07 17:56:23,049 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10300, best=0.72, avg=0.71, std=0.00, steps=2.110e+08
2023-07-07 17:56:41,417 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10400, best=0.72, avg=0.71, std=0.00, steps=2.130e+08
2023-07-07 17:56:59,793 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10500, best=0.72, avg=0.71, std=0.00, steps=2.151e+08
2023-07-07 17:57:18,206 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10600, best=0.72, avg=0.71, std=0.00, steps=2.171e+08
2023-07-07 17:57:36,597 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10700, best=0.72, avg=0.71, std=0.00, steps=2.192e+08
2023-07-07 17:57:54,985 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10800, best=0.72, avg=0.71, std=0.00, steps=2.212e+08
2023-07-07 17:58:13,402 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10900, best=0.72, avg=0.71, std=0.00, steps=2.233e+08
2023-07-07 17:58:31,786 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11000, best=0.72, avg=0.71, std=0.00, steps=2.253e+08
2023-07-07 17:58:50,158 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11100, best=0.72, avg=0.72, std=0.00, steps=2.273e+08
2023-07-07 17:59:08,528 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11200, best=0.72, avg=0.71, std=0.00, steps=2.294e+08
2023-07-07 17:59:26,930 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11300, best=0.72, avg=0.71, std=0.00, steps=2.314e+08
2023-07-07 17:59:45,324 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11400, best=0.72, avg=0.71, std=0.00, steps=2.335e+08
2023-07-07 18:00:03,702 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11500, best=0.72, avg=0.72, std=0.00, steps=2.355e+08
2023-07-07 18:00:22,076 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11600, best=0.72, avg=0.72, std=0.00, steps=2.376e+08
2023-07-07 18:00:40,468 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11700, best=0.72, avg=0.72, std=0.00, steps=2.396e+08
2023-07-07 18:00:58,836 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11800, best=0.72, avg=0.72, std=0.00, steps=2.417e+08
2023-07-07 18:01:17,207 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11900, best=0.72, avg=0.72, std=0.00, steps=2.437e+08
2023-07-07 18:01:35,405 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11999, best=0.72, avg=0.72, std=0.00, steps=2.458e+08
2023-07-07 18:01:35,405 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135859
2023-07-07 18:01:35,434 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 18:01:35,467 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 18:01:59,859 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=2.482e+06
2023-07-07 18:02:21,855 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 200, best=0.51, avg=0.50, std=0.00, steps=4.940e+06
2023-07-07 18:02:43,848 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 300, best=0.57, avg=0.57, std=0.00, steps=7.397e+06
2023-07-07 18:03:05,853 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 400, best=0.58, avg=0.57, std=0.00, steps=9.855e+06
2023-07-07 18:03:27,875 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 500, best=0.58, avg=0.57, std=0.00, steps=1.231e+07
2023-07-07 18:03:49,910 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 600, best=0.58, avg=0.57, std=0.00, steps=1.477e+07
2023-07-07 18:04:11,923 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 700, best=0.58, avg=0.57, std=0.00, steps=1.723e+07
2023-07-07 18:04:33,940 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 800, best=0.58, avg=0.57, std=0.00, steps=1.969e+07
2023-07-07 18:04:55,962 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 900, best=0.58, avg=0.57, std=0.00, steps=2.214e+07
2023-07-07 18:05:17,956 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1000, best=0.58, avg=0.57, std=0.00, steps=2.460e+07
2023-07-07 18:05:39,968 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1100, best=0.58, avg=0.57, std=0.00, steps=2.706e+07
2023-07-07 18:06:01,985 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1200, best=0.59, avg=0.58, std=0.00, steps=2.952e+07
2023-07-07 18:06:24,037 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1300, best=0.60, avg=0.59, std=0.00, steps=3.197e+07
2023-07-07 18:06:46,063 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1400, best=0.60, avg=0.59, std=0.00, steps=3.443e+07
2023-07-07 18:07:08,071 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1500, best=0.60, avg=0.60, std=0.00, steps=3.689e+07
2023-07-07 18:07:30,101 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1600, best=0.61, avg=0.60, std=0.00, steps=3.935e+07
2023-07-07 18:07:52,148 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1700, best=0.61, avg=0.60, std=0.00, steps=4.180e+07
2023-07-07 18:08:14,171 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1800, best=0.61, avg=0.60, std=0.00, steps=4.426e+07
2023-07-07 18:08:36,200 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1900, best=0.61, avg=0.60, std=0.00, steps=4.672e+07
2023-07-07 18:08:58,222 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2000, best=0.61, avg=0.60, std=0.00, steps=4.918e+07
2023-07-07 18:09:20,219 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2100, best=0.61, avg=0.60, std=0.00, steps=5.163e+07
2023-07-07 18:09:42,250 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2200, best=0.61, avg=0.61, std=0.00, steps=5.409e+07
2023-07-07 18:10:04,269 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2300, best=0.61, avg=0.61, std=0.00, steps=5.655e+07
2023-07-07 18:10:26,293 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2400, best=0.62, avg=0.61, std=0.00, steps=5.901e+07
2023-07-07 18:10:48,321 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2500, best=0.62, avg=0.61, std=0.00, steps=6.146e+07
2023-07-07 18:11:10,325 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2600, best=0.61, avg=0.61, std=0.00, steps=6.392e+07
2023-07-07 18:11:32,349 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2700, best=0.61, avg=0.61, std=0.00, steps=6.638e+07
2023-07-07 18:11:54,336 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2800, best=0.62, avg=0.61, std=0.00, steps=6.884e+07
2023-07-07 18:12:16,339 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2900, best=0.62, avg=0.61, std=0.00, steps=7.129e+07
2023-07-07 18:12:38,365 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3000, best=0.62, avg=0.61, std=0.00, steps=7.375e+07
2023-07-07 18:13:00,379 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3100, best=0.62, avg=0.61, std=0.00, steps=7.621e+07
2023-07-07 18:13:22,395 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3200, best=0.62, avg=0.61, std=0.00, steps=7.867e+07
2023-07-07 18:13:44,407 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3300, best=0.62, avg=0.61, std=0.00, steps=8.113e+07
2023-07-07 18:14:06,457 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3400, best=0.62, avg=0.61, std=0.00, steps=8.358e+07
2023-07-07 18:14:28,495 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3500, best=0.62, avg=0.61, std=0.00, steps=8.604e+07
2023-07-07 18:14:50,498 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3600, best=0.62, avg=0.61, std=0.00, steps=8.850e+07
2023-07-07 18:15:12,521 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3700, best=0.62, avg=0.61, std=0.00, steps=9.096e+07
2023-07-07 18:15:34,538 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3800, best=0.62, avg=0.61, std=0.00, steps=9.341e+07
2023-07-07 18:15:56,565 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3900, best=0.62, avg=0.61, std=0.00, steps=9.587e+07
2023-07-07 18:16:18,555 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4000, best=0.62, avg=0.61, std=0.00, steps=9.833e+07
2023-07-07 18:16:40,572 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4100, best=0.62, avg=0.62, std=0.00, steps=1.008e+08
2023-07-07 18:17:02,604 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4200, best=0.62, avg=0.62, std=0.00, steps=1.032e+08
2023-07-07 18:17:24,625 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4300, best=0.63, avg=0.62, std=0.00, steps=1.057e+08
2023-07-07 18:17:46,651 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4400, best=0.63, avg=0.62, std=0.00, steps=1.082e+08
2023-07-07 18:18:08,627 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4500, best=0.63, avg=0.62, std=0.00, steps=1.106e+08
2023-07-07 18:18:30,660 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4600, best=0.63, avg=0.62, std=0.00, steps=1.131e+08
2023-07-07 18:18:52,704 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4700, best=0.63, avg=0.62, std=0.00, steps=1.155e+08
2023-07-07 18:19:14,703 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4800, best=0.63, avg=0.63, std=0.00, steps=1.180e+08
2023-07-07 18:19:36,696 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4900, best=0.63, avg=0.63, std=0.00, steps=1.204e+08
2023-07-07 18:19:58,710 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5000, best=0.63, avg=0.63, std=0.00, steps=1.229e+08
2023-07-07 18:20:20,716 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5100, best=0.63, avg=0.63, std=0.00, steps=1.254e+08
2023-07-07 18:20:42,708 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5200, best=0.64, avg=0.63, std=0.00, steps=1.278e+08
2023-07-07 18:21:04,731 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5300, best=0.63, avg=0.63, std=0.00, steps=1.303e+08
2023-07-07 18:21:26,745 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5400, best=0.63, avg=0.63, std=0.00, steps=1.327e+08
2023-07-07 18:21:48,760 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5500, best=0.64, avg=0.63, std=0.00, steps=1.352e+08
2023-07-07 18:22:10,792 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5600, best=0.64, avg=0.63, std=0.00, steps=1.377e+08
2023-07-07 18:22:32,826 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5700, best=0.63, avg=0.63, std=0.00, steps=1.401e+08
2023-07-07 18:22:54,830 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5800, best=0.64, avg=0.63, std=0.00, steps=1.426e+08
2023-07-07 18:23:16,824 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5900, best=0.64, avg=0.63, std=0.00, steps=1.450e+08
2023-07-07 18:23:38,836 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6000, best=0.64, avg=0.63, std=0.00, steps=1.475e+08
2023-07-07 18:24:00,862 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6100, best=0.64, avg=0.63, std=0.00, steps=1.499e+08
2023-07-07 18:24:22,834 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6200, best=0.63, avg=0.63, std=0.00, steps=1.524e+08
2023-07-07 18:24:44,806 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6300, best=0.64, avg=0.63, std=0.00, steps=1.549e+08
2023-07-07 18:25:06,814 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6400, best=0.64, avg=0.63, std=0.00, steps=1.573e+08
2023-07-07 18:25:28,839 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6500, best=0.64, avg=0.63, std=0.00, steps=1.598e+08
2023-07-07 18:25:50,860 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6600, best=0.64, avg=0.63, std=0.00, steps=1.622e+08
2023-07-07 18:26:12,869 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6700, best=0.64, avg=0.63, std=0.00, steps=1.647e+08
2023-07-07 18:26:34,862 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6800, best=0.64, avg=0.63, std=0.00, steps=1.671e+08
2023-07-07 18:26:56,847 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6900, best=0.64, avg=0.63, std=0.00, steps=1.696e+08
2023-07-07 18:27:18,871 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7000, best=0.64, avg=0.63, std=0.00, steps=1.721e+08
2023-07-07 18:27:40,878 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7100, best=0.64, avg=0.63, std=0.00, steps=1.745e+08
2023-07-07 18:28:02,911 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7200, best=0.64, avg=0.63, std=0.00, steps=1.770e+08
2023-07-07 18:28:24,908 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7300, best=0.64, avg=0.63, std=0.00, steps=1.794e+08
2023-07-07 18:28:46,941 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7400, best=0.64, avg=0.63, std=0.00, steps=1.819e+08
2023-07-07 18:29:08,940 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7500, best=0.64, avg=0.63, std=0.00, steps=1.843e+08
2023-07-07 18:29:30,937 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7600, best=0.64, avg=0.63, std=0.00, steps=1.868e+08
2023-07-07 18:29:52,917 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7700, best=0.64, avg=0.63, std=0.00, steps=1.893e+08
2023-07-07 18:30:14,924 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7800, best=0.64, avg=0.63, std=0.00, steps=1.917e+08
2023-07-07 18:30:36,927 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7900, best=0.64, avg=0.63, std=0.00, steps=1.942e+08
2023-07-07 18:30:58,919 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8000, best=0.64, avg=0.63, std=0.00, steps=1.966e+08
2023-07-07 18:31:20,928 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8100, best=0.64, avg=0.63, std=0.00, steps=1.991e+08
2023-07-07 18:31:42,930 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8200, best=0.64, avg=0.63, std=0.00, steps=2.015e+08
2023-07-07 18:32:04,966 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8300, best=0.64, avg=0.63, std=0.00, steps=2.040e+08
2023-07-07 18:32:26,967 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8400, best=0.64, avg=0.63, std=0.00, steps=2.065e+08
2023-07-07 18:32:48,961 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8500, best=0.64, avg=0.63, std=0.00, steps=2.089e+08
2023-07-07 18:33:10,959 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8600, best=0.64, avg=0.63, std=0.00, steps=2.114e+08
2023-07-07 18:33:32,948 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8700, best=0.64, avg=0.63, std=0.00, steps=2.138e+08
2023-07-07 18:33:54,952 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8800, best=0.64, avg=0.63, std=0.00, steps=2.163e+08
2023-07-07 18:34:16,958 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8900, best=0.64, avg=0.63, std=0.00, steps=2.188e+08
2023-07-07 18:34:38,952 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9000, best=0.64, avg=0.63, std=0.00, steps=2.212e+08
2023-07-07 18:35:00,959 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9100, best=0.64, avg=0.63, std=0.00, steps=2.237e+08
2023-07-07 18:35:22,942 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9200, best=0.64, avg=0.63, std=0.00, steps=2.261e+08
2023-07-07 18:35:44,942 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9300, best=0.64, avg=0.63, std=0.00, steps=2.286e+08
2023-07-07 18:36:06,940 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9400, best=0.64, avg=0.63, std=0.00, steps=2.310e+08
2023-07-07 18:36:28,963 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9500, best=0.64, avg=0.63, std=0.00, steps=2.335e+08
2023-07-07 18:36:50,953 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9600, best=0.64, avg=0.63, std=0.00, steps=2.360e+08
2023-07-07 18:37:12,968 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9700, best=0.64, avg=0.63, std=0.00, steps=2.384e+08
2023-07-07 18:37:34,972 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9800, best=0.64, avg=0.63, std=0.00, steps=2.409e+08
2023-07-07 18:37:56,972 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9900, best=0.64, avg=0.63, std=0.00, steps=2.433e+08
2023-07-07 18:38:18,976 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10000, best=0.64, avg=0.63, std=0.00, steps=2.458e+08
2023-07-07 18:38:40,950 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10100, best=0.64, avg=0.63, std=0.00, steps=2.482e+08
2023-07-07 18:39:02,952 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10200, best=0.64, avg=0.63, std=0.00, steps=2.507e+08
2023-07-07 18:39:24,944 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10300, best=0.64, avg=0.63, std=0.00, steps=2.532e+08
2023-07-07 18:39:46,946 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10400, best=0.64, avg=0.63, std=0.00, steps=2.556e+08
2023-07-07 18:40:08,946 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10500, best=0.64, avg=0.63, std=0.00, steps=2.581e+08
2023-07-07 18:40:30,956 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10600, best=0.64, avg=0.63, std=0.00, steps=2.605e+08
2023-07-07 18:40:52,967 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10700, best=0.64, avg=0.64, std=0.00, steps=2.630e+08
2023-07-07 18:41:15,003 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10800, best=0.64, avg=0.63, std=0.00, steps=2.654e+08
2023-07-07 18:41:37,024 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10900, best=0.64, avg=0.63, std=0.00, steps=2.679e+08
2023-07-07 18:41:59,044 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11000, best=0.64, avg=0.64, std=0.00, steps=2.704e+08
2023-07-07 18:42:21,040 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11100, best=0.64, avg=0.64, std=0.00, steps=2.728e+08
2023-07-07 18:42:43,058 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11200, best=0.64, avg=0.64, std=0.00, steps=2.753e+08
2023-07-07 18:43:05,072 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11300, best=0.64, avg=0.64, std=0.00, steps=2.777e+08
2023-07-07 18:43:27,093 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11400, best=0.64, avg=0.64, std=0.00, steps=2.802e+08
2023-07-07 18:43:49,089 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11500, best=0.64, avg=0.64, std=0.00, steps=2.826e+08
2023-07-07 18:44:11,076 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11600, best=0.64, avg=0.64, std=0.00, steps=2.851e+08
2023-07-07 18:44:33,092 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11700, best=0.64, avg=0.64, std=0.00, steps=2.876e+08
2023-07-07 18:44:55,106 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11800, best=0.64, avg=0.64, std=0.00, steps=2.900e+08
2023-07-07 18:45:17,117 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11900, best=0.64, avg=0.64, std=0.00, steps=2.925e+08
2023-07-07 18:45:38,934 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11999, best=0.64, avg=0.64, std=0.00, steps=2.949e+08
2023-07-07 18:45:38,935 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135859
2023-07-07 18:45:38,961 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 18:45:38,994 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 18:46:11,067 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=3.310e+06
2023-07-07 18:46:40,322 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 200, best=0.51, avg=0.50, std=0.00, steps=6.586e+06
2023-07-07 18:47:09,539 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 300, best=0.56, avg=0.55, std=0.00, steps=9.863e+06
2023-07-07 18:47:38,754 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 400, best=0.56, avg=0.55, std=0.00, steps=1.314e+07
2023-07-07 18:48:07,980 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 500, best=0.57, avg=0.57, std=0.00, steps=1.642e+07
2023-07-07 18:48:37,195 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 600, best=0.59, avg=0.58, std=0.00, steps=1.969e+07
2023-07-07 18:49:06,398 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 700, best=0.60, avg=0.59, std=0.00, steps=2.297e+07
2023-07-07 18:49:35,630 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 800, best=0.61, avg=0.60, std=0.00, steps=2.625e+07
2023-07-07 18:50:04,887 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 900, best=0.61, avg=0.60, std=0.00, steps=2.952e+07
2023-07-07 18:50:34,154 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1000, best=0.61, avg=0.61, std=0.00, steps=3.280e+07
2023-07-07 18:51:03,399 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1100, best=0.62, avg=0.61, std=0.00, steps=3.608e+07
2023-07-07 18:51:32,621 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1200, best=0.62, avg=0.61, std=0.00, steps=3.935e+07
2023-07-07 18:52:01,847 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1300, best=0.62, avg=0.62, std=0.00, steps=4.263e+07
2023-07-07 18:52:31,067 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1400, best=0.62, avg=0.62, std=0.00, steps=4.591e+07
2023-07-07 18:53:00,280 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1500, best=0.62, avg=0.62, std=0.00, steps=4.918e+07
2023-07-07 18:53:29,501 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1600, best=0.63, avg=0.62, std=0.00, steps=5.246e+07
2023-07-07 18:53:58,729 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1700, best=0.63, avg=0.62, std=0.00, steps=5.574e+07
2023-07-07 18:54:27,989 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1800, best=0.63, avg=0.62, std=0.00, steps=5.902e+07
2023-07-07 18:54:57,228 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1900, best=0.63, avg=0.63, std=0.00, steps=6.229e+07
2023-07-07 18:55:26,470 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2000, best=0.63, avg=0.63, std=0.00, steps=6.557e+07
2023-07-07 18:55:55,684 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2100, best=0.63, avg=0.63, std=0.00, steps=6.885e+07
2023-07-07 18:56:24,952 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2200, best=0.63, avg=0.63, std=0.00, steps=7.212e+07
2023-07-07 18:56:54,192 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2300, best=0.63, avg=0.63, std=0.00, steps=7.540e+07
2023-07-07 18:57:23,422 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2400, best=0.63, avg=0.63, std=0.00, steps=7.868e+07
2023-07-07 18:57:52,655 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2500, best=0.64, avg=0.63, std=0.00, steps=8.195e+07
2023-07-07 18:58:21,908 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2600, best=0.64, avg=0.63, std=0.00, steps=8.523e+07
2023-07-07 18:58:51,121 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2700, best=0.64, avg=0.63, std=0.00, steps=8.851e+07
2023-07-07 18:59:20,353 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2800, best=0.64, avg=0.63, std=0.00, steps=9.178e+07
2023-07-07 18:59:49,594 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2900, best=0.64, avg=0.63, std=0.00, steps=9.506e+07
2023-07-07 19:00:18,836 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3000, best=0.64, avg=0.63, std=0.00, steps=9.834e+07
2023-07-07 19:00:48,068 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3100, best=0.64, avg=0.64, std=0.00, steps=1.016e+08
2023-07-07 19:01:17,315 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3200, best=0.64, avg=0.64, std=0.00, steps=1.049e+08
2023-07-07 19:01:46,539 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3300, best=0.64, avg=0.64, std=0.00, steps=1.082e+08
2023-07-07 19:02:15,808 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3400, best=0.64, avg=0.64, std=0.00, steps=1.114e+08
2023-07-07 19:02:45,069 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3500, best=0.64, avg=0.64, std=0.00, steps=1.147e+08
2023-07-07 19:03:14,287 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3600, best=0.64, avg=0.64, std=0.00, steps=1.180e+08
2023-07-07 19:03:43,498 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3700, best=0.64, avg=0.64, std=0.00, steps=1.213e+08
2023-07-07 19:04:12,741 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3800, best=0.64, avg=0.64, std=0.00, steps=1.246e+08
2023-07-07 19:04:41,967 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3900, best=0.64, avg=0.64, std=0.00, steps=1.278e+08
2023-07-07 19:05:11,178 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4000, best=0.65, avg=0.64, std=0.00, steps=1.311e+08
2023-07-07 19:05:40,431 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4100, best=0.65, avg=0.64, std=0.00, steps=1.344e+08
2023-07-07 19:06:09,652 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4200, best=0.65, avg=0.64, std=0.00, steps=1.377e+08
2023-07-07 19:06:38,871 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4300, best=0.65, avg=0.64, std=0.00, steps=1.409e+08
2023-07-07 19:07:08,144 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4400, best=0.65, avg=0.64, std=0.00, steps=1.442e+08
2023-07-07 19:07:37,374 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4500, best=0.65, avg=0.64, std=0.00, steps=1.475e+08
2023-07-07 19:08:06,588 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4600, best=0.65, avg=0.64, std=0.00, steps=1.508e+08
2023-07-07 19:08:35,788 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4700, best=0.65, avg=0.64, std=0.00, steps=1.540e+08
2023-07-07 19:09:04,986 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4800, best=0.65, avg=0.64, std=0.00, steps=1.573e+08
2023-07-07 19:09:34,175 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4900, best=0.65, avg=0.64, std=0.00, steps=1.606e+08
2023-07-07 19:10:03,441 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5000, best=0.65, avg=0.64, std=0.00, steps=1.639e+08
2023-07-07 19:10:32,700 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5100, best=0.65, avg=0.64, std=0.00, steps=1.671e+08
2023-07-07 19:11:01,918 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5200, best=0.65, avg=0.65, std=0.00, steps=1.704e+08
2023-07-07 19:11:31,131 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5300, best=0.65, avg=0.65, std=0.00, steps=1.737e+08
2023-07-07 19:12:00,365 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5400, best=0.65, avg=0.65, std=0.00, steps=1.770e+08
2023-07-07 19:12:29,623 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5500, best=0.65, avg=0.65, std=0.00, steps=1.803e+08
2023-07-07 19:12:58,853 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5600, best=0.65, avg=0.65, std=0.00, steps=1.835e+08
2023-07-07 19:13:28,088 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5700, best=0.65, avg=0.65, std=0.00, steps=1.868e+08
2023-07-07 19:13:57,332 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5800, best=0.65, avg=0.65, std=0.00, steps=1.901e+08
2023-07-07 19:14:26,566 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5900, best=0.65, avg=0.65, std=0.00, steps=1.934e+08
2023-07-07 19:14:55,852 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6000, best=0.65, avg=0.65, std=0.00, steps=1.966e+08
2023-07-07 19:15:25,110 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6100, best=0.65, avg=0.65, std=0.00, steps=1.999e+08
2023-07-07 19:15:54,342 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6200, best=0.65, avg=0.65, std=0.00, steps=2.032e+08
2023-07-07 19:16:23,557 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6300, best=0.65, avg=0.65, std=0.00, steps=2.065e+08
2023-07-07 19:16:52,790 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6400, best=0.65, avg=0.65, std=0.00, steps=2.097e+08
2023-07-07 19:17:22,020 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6500, best=0.66, avg=0.65, std=0.00, steps=2.130e+08
2023-07-07 19:17:51,265 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6600, best=0.66, avg=0.65, std=0.00, steps=2.163e+08
2023-07-07 19:18:20,510 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6700, best=0.66, avg=0.65, std=0.00, steps=2.196e+08
2023-07-07 19:18:49,737 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6800, best=0.66, avg=0.65, std=0.00, steps=2.229e+08
2023-07-07 19:19:18,950 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6900, best=0.66, avg=0.65, std=0.00, steps=2.261e+08
2023-07-07 19:19:48,177 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7000, best=0.66, avg=0.65, std=0.00, steps=2.294e+08
2023-07-07 19:20:17,390 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7100, best=0.66, avg=0.65, std=0.00, steps=2.327e+08
2023-07-07 19:20:46,613 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7200, best=0.66, avg=0.65, std=0.00, steps=2.360e+08
2023-07-07 19:21:15,842 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7300, best=0.66, avg=0.65, std=0.00, steps=2.392e+08
2023-07-07 19:21:45,066 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7400, best=0.66, avg=0.65, std=0.00, steps=2.425e+08
2023-07-07 19:22:14,303 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7500, best=0.66, avg=0.65, std=0.00, steps=2.458e+08
2023-07-07 19:22:43,530 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7600, best=0.66, avg=0.65, std=0.00, steps=2.491e+08
2023-07-07 19:23:12,746 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7700, best=0.66, avg=0.65, std=0.00, steps=2.523e+08
2023-07-07 19:23:41,969 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7800, best=0.66, avg=0.65, std=0.00, steps=2.556e+08
2023-07-07 19:24:11,201 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7900, best=0.66, avg=0.65, std=0.00, steps=2.589e+08
2023-07-07 19:24:40,412 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8000, best=0.66, avg=0.65, std=0.00, steps=2.622e+08
2023-07-07 19:25:09,640 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8100, best=0.66, avg=0.66, std=0.00, steps=2.655e+08
2023-07-07 19:25:38,844 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8200, best=0.66, avg=0.66, std=0.00, steps=2.687e+08
2023-07-07 19:26:08,094 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8300, best=0.66, avg=0.66, std=0.00, steps=2.720e+08
2023-07-07 19:26:37,317 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8400, best=0.66, avg=0.66, std=0.00, steps=2.753e+08
2023-07-07 19:27:06,509 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8500, best=0.66, avg=0.66, std=0.00, steps=2.786e+08
2023-07-07 19:27:35,727 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8600, best=0.66, avg=0.66, std=0.00, steps=2.818e+08
2023-07-07 19:28:04,934 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8700, best=0.66, avg=0.66, std=0.00, steps=2.851e+08
2023-07-07 19:28:34,173 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8800, best=0.66, avg=0.66, std=0.00, steps=2.884e+08
2023-07-07 19:29:03,414 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8900, best=0.66, avg=0.66, std=0.00, steps=2.917e+08
2023-07-07 19:29:32,662 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9000, best=0.66, avg=0.66, std=0.00, steps=2.949e+08
2023-07-07 19:30:01,867 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9100, best=0.66, avg=0.66, std=0.00, steps=2.982e+08
2023-07-07 19:30:31,099 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9200, best=0.66, avg=0.66, std=0.00, steps=3.015e+08
2023-07-07 19:31:00,315 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9300, best=0.66, avg=0.66, std=0.00, steps=3.048e+08
2023-07-07 19:31:29,535 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9400, best=0.66, avg=0.66, std=0.00, steps=3.081e+08
2023-07-07 19:31:58,788 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9500, best=0.66, avg=0.66, std=0.00, steps=3.113e+08
2023-07-07 19:32:28,009 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9600, best=0.66, avg=0.66, std=0.00, steps=3.146e+08
2023-07-07 19:32:57,245 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9700, best=0.66, avg=0.66, std=0.00, steps=3.179e+08
2023-07-07 19:33:26,489 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9800, best=0.67, avg=0.66, std=0.00, steps=3.212e+08
2023-07-07 19:33:55,719 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9900, best=0.66, avg=0.66, std=0.00, steps=3.244e+08
2023-07-07 19:34:24,927 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10000, best=0.66, avg=0.66, std=0.00, steps=3.277e+08
2023-07-07 19:34:54,155 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10100, best=0.66, avg=0.66, std=0.00, steps=3.310e+08
2023-07-07 19:35:23,398 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10200, best=0.66, avg=0.66, std=0.00, steps=3.343e+08
2023-07-07 19:35:52,631 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10300, best=0.66, avg=0.66, std=0.00, steps=3.375e+08
2023-07-07 19:36:21,859 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10400, best=0.67, avg=0.66, std=0.00, steps=3.408e+08
2023-07-07 19:36:51,090 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10500, best=0.66, avg=0.66, std=0.00, steps=3.441e+08
2023-07-07 19:37:20,346 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10600, best=0.66, avg=0.66, std=0.00, steps=3.474e+08
2023-07-07 19:37:49,628 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10700, best=0.67, avg=0.66, std=0.00, steps=3.507e+08
2023-07-07 19:38:18,895 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10800, best=0.66, avg=0.66, std=0.00, steps=3.539e+08
2023-07-07 19:38:48,115 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10900, best=0.66, avg=0.66, std=0.00, steps=3.572e+08
2023-07-07 19:39:17,359 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11000, best=0.67, avg=0.66, std=0.00, steps=3.605e+08
2023-07-07 19:39:46,571 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11100, best=0.67, avg=0.66, std=0.00, steps=3.638e+08
2023-07-07 19:40:15,811 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11200, best=0.67, avg=0.66, std=0.00, steps=3.670e+08
2023-07-07 19:40:45,073 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11300, best=0.67, avg=0.66, std=0.00, steps=3.703e+08
2023-07-07 19:41:14,326 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11400, best=0.67, avg=0.66, std=0.00, steps=3.736e+08
2023-07-07 19:41:43,568 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11500, best=0.67, avg=0.66, std=0.00, steps=3.769e+08
2023-07-07 19:42:12,759 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11600, best=0.67, avg=0.66, std=0.00, steps=3.801e+08
2023-07-07 19:42:42,005 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11700, best=0.67, avg=0.66, std=0.00, steps=3.834e+08
2023-07-07 19:43:11,299 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11800, best=0.67, avg=0.66, std=0.00, steps=3.867e+08
2023-07-07 19:43:40,539 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11900, best=0.67, avg=0.66, std=0.00, steps=3.900e+08
2023-07-07 19:44:09,461 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11999, best=0.67, avg=0.66, std=0.00, steps=3.932e+08
2023-07-07 19:44:09,462 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135859
2023-07-07 19:44:09,489 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 19:44:09,532 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 19:44:43,068 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=3.516e+06
2023-07-07 19:45:14,119 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 200, best=0.52, avg=0.51, std=0.00, steps=6.998e+06
2023-07-07 19:45:45,179 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 300, best=0.55, avg=0.54, std=0.00, steps=1.048e+07
2023-07-07 19:46:16,236 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 400, best=0.55, avg=0.55, std=0.00, steps=1.396e+07
2023-07-07 19:46:47,249 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 500, best=0.56, avg=0.55, std=0.00, steps=1.744e+07
2023-07-07 19:47:18,263 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 600, best=0.57, avg=0.56, std=0.00, steps=2.092e+07
2023-07-07 19:47:49,314 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 700, best=0.58, avg=0.57, std=0.00, steps=2.441e+07
2023-07-07 19:48:20,369 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 800, best=0.58, avg=0.58, std=0.00, steps=2.789e+07
2023-07-07 19:48:51,382 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 900, best=0.59, avg=0.59, std=0.00, steps=3.137e+07
2023-07-07 19:49:22,450 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1000, best=0.60, avg=0.59, std=0.00, steps=3.485e+07
2023-07-07 19:49:53,474 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1100, best=0.60, avg=0.59, std=0.00, steps=3.833e+07
2023-07-07 19:50:24,505 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1200, best=0.60, avg=0.60, std=0.00, steps=4.181e+07
2023-07-07 19:50:55,580 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1300, best=0.60, avg=0.60, std=0.00, steps=4.530e+07
2023-07-07 19:51:26,621 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1400, best=0.60, avg=0.60, std=0.00, steps=4.878e+07
2023-07-07 19:51:57,651 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1500, best=0.61, avg=0.60, std=0.00, steps=5.226e+07
2023-07-07 19:52:28,660 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1600, best=0.61, avg=0.60, std=0.00, steps=5.574e+07
2023-07-07 19:52:59,674 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1700, best=0.61, avg=0.60, std=0.00, steps=5.922e+07
2023-07-07 19:53:30,690 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1800, best=0.61, avg=0.61, std=0.00, steps=6.270e+07
2023-07-07 19:54:01,723 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1900, best=0.61, avg=0.61, std=0.00, steps=6.619e+07
2023-07-07 19:54:32,794 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2000, best=0.61, avg=0.61, std=0.00, steps=6.967e+07
2023-07-07 19:55:03,778 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2100, best=0.62, avg=0.61, std=0.00, steps=7.315e+07
2023-07-07 19:55:34,821 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2200, best=0.62, avg=0.61, std=0.00, steps=7.663e+07
2023-07-07 19:56:05,912 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2300, best=0.62, avg=0.61, std=0.00, steps=8.011e+07
2023-07-07 19:56:36,940 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2400, best=0.62, avg=0.61, std=0.00, steps=8.359e+07
2023-07-07 19:57:07,977 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2500, best=0.62, avg=0.61, std=0.00, steps=8.707e+07
2023-07-07 19:57:39,028 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2600, best=0.62, avg=0.61, std=0.00, steps=9.056e+07
2023-07-07 19:58:10,028 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2700, best=0.62, avg=0.62, std=0.00, steps=9.404e+07
2023-07-07 19:58:41,058 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2800, best=0.62, avg=0.62, std=0.00, steps=9.752e+07
2023-07-07 19:59:12,081 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2900, best=0.62, avg=0.62, std=0.00, steps=1.010e+08
2023-07-07 19:59:43,094 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3000, best=0.62, avg=0.62, std=0.00, steps=1.045e+08
2023-07-07 20:00:14,100 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3100, best=0.62, avg=0.62, std=0.00, steps=1.080e+08
2023-07-07 20:00:45,142 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3200, best=0.62, avg=0.62, std=0.00, steps=1.114e+08
2023-07-07 20:01:16,170 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3300, best=0.63, avg=0.62, std=0.00, steps=1.149e+08
2023-07-07 20:01:47,184 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3400, best=0.63, avg=0.62, std=0.00, steps=1.184e+08
2023-07-07 20:02:18,220 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3500, best=0.62, avg=0.62, std=0.00, steps=1.219e+08
2023-07-07 20:02:49,322 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3600, best=0.63, avg=0.62, std=0.00, steps=1.254e+08
2023-07-07 20:03:20,364 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3700, best=0.63, avg=0.62, std=0.00, steps=1.289e+08
2023-07-07 20:03:51,373 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3800, best=0.63, avg=0.62, std=0.00, steps=1.323e+08
2023-07-07 20:04:22,430 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3900, best=0.63, avg=0.62, std=0.00, steps=1.358e+08
2023-07-07 20:04:53,517 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4000, best=0.63, avg=0.62, std=0.00, steps=1.393e+08
2023-07-07 20:05:24,577 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4100, best=0.63, avg=0.62, std=0.00, steps=1.428e+08
2023-07-07 20:05:55,593 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4200, best=0.63, avg=0.63, std=0.00, steps=1.463e+08
2023-07-07 20:06:26,640 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4300, best=0.63, avg=0.63, std=0.00, steps=1.497e+08
2023-07-07 20:06:57,672 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4400, best=0.63, avg=0.63, std=0.00, steps=1.532e+08
2023-07-07 20:07:28,689 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4500, best=0.63, avg=0.63, std=0.00, steps=1.567e+08
2023-07-07 20:07:59,744 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4600, best=0.63, avg=0.63, std=0.00, steps=1.602e+08
2023-07-07 20:08:30,772 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4700, best=0.63, avg=0.63, std=0.00, steps=1.637e+08
2023-07-07 20:09:01,792 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4800, best=0.63, avg=0.63, std=0.00, steps=1.672e+08
2023-07-07 20:09:32,812 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4900, best=0.63, avg=0.63, std=0.00, steps=1.706e+08
2023-07-07 20:10:03,843 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5000, best=0.63, avg=0.63, std=0.00, steps=1.741e+08
2023-07-07 20:10:34,873 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5100, best=0.63, avg=0.63, std=0.00, steps=1.776e+08
2023-07-07 20:11:05,926 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5200, best=0.63, avg=0.63, std=0.00, steps=1.811e+08
2023-07-07 20:11:36,972 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5300, best=0.64, avg=0.63, std=0.00, steps=1.846e+08
2023-07-07 20:12:07,995 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5400, best=0.64, avg=0.63, std=0.00, steps=1.880e+08
2023-07-07 20:12:39,023 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5500, best=0.63, avg=0.63, std=0.00, steps=1.915e+08
2023-07-07 20:13:10,089 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5600, best=0.64, avg=0.63, std=0.00, steps=1.950e+08
2023-07-07 20:13:41,150 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5700, best=0.64, avg=0.63, std=0.00, steps=1.985e+08
2023-07-07 20:14:12,204 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5800, best=0.64, avg=0.63, std=0.00, steps=2.020e+08
2023-07-07 20:14:43,225 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5900, best=0.64, avg=0.63, std=0.00, steps=2.054e+08
2023-07-07 20:15:14,313 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6000, best=0.64, avg=0.63, std=0.00, steps=2.089e+08
2023-07-07 20:15:45,358 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6100, best=0.64, avg=0.63, std=0.00, steps=2.124e+08
2023-07-07 20:16:16,365 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6200, best=0.64, avg=0.63, std=0.00, steps=2.159e+08
2023-07-07 20:16:47,423 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6300, best=0.64, avg=0.63, std=0.00, steps=2.194e+08
2023-07-07 20:17:18,506 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6400, best=0.64, avg=0.63, std=0.00, steps=2.229e+08
2023-07-07 20:17:49,575 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6500, best=0.64, avg=0.63, std=0.00, steps=2.263e+08
2023-07-07 20:18:20,646 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6600, best=0.64, avg=0.63, std=0.00, steps=2.298e+08
2023-07-07 20:18:51,708 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6700, best=0.64, avg=0.63, std=0.00, steps=2.333e+08
2023-07-07 20:19:22,744 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6800, best=0.64, avg=0.63, std=0.00, steps=2.368e+08
2023-07-07 20:19:53,760 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6900, best=0.64, avg=0.64, std=0.00, steps=2.403e+08
2023-07-07 20:20:24,828 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7000, best=0.64, avg=0.64, std=0.00, steps=2.437e+08
2023-07-07 20:20:55,836 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7100, best=0.64, avg=0.64, std=0.00, steps=2.472e+08
2023-07-07 20:21:26,884 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7200, best=0.64, avg=0.64, std=0.00, steps=2.507e+08
2023-07-07 20:21:57,905 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7300, best=0.64, avg=0.64, std=0.00, steps=2.542e+08
2023-07-07 20:22:28,933 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7400, best=0.64, avg=0.64, std=0.00, steps=2.577e+08
2023-07-07 20:22:59,991 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7500, best=0.65, avg=0.64, std=0.00, steps=2.612e+08
2023-07-07 20:23:31,079 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7600, best=0.64, avg=0.64, std=0.00, steps=2.646e+08
2023-07-07 20:24:02,108 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7700, best=0.64, avg=0.64, std=0.00, steps=2.681e+08
2023-07-07 20:24:33,202 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7800, best=0.64, avg=0.64, std=0.00, steps=2.716e+08
2023-07-07 20:25:04,277 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7900, best=0.64, avg=0.64, std=0.00, steps=2.751e+08
2023-07-07 20:25:35,308 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8000, best=0.64, avg=0.64, std=0.00, steps=2.786e+08
2023-07-07 20:26:06,350 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8100, best=0.64, avg=0.64, std=0.00, steps=2.820e+08
2023-07-07 20:26:37,394 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8200, best=0.64, avg=0.64, std=0.00, steps=2.855e+08
2023-07-07 20:27:08,441 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8300, best=0.64, avg=0.64, std=0.00, steps=2.890e+08
2023-07-07 20:27:39,475 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8400, best=0.64, avg=0.64, std=0.00, steps=2.925e+08
2023-07-07 20:28:10,546 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8500, best=0.64, avg=0.64, std=0.00, steps=2.960e+08
2023-07-07 20:28:41,587 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8600, best=0.64, avg=0.64, std=0.00, steps=2.995e+08
2023-07-07 20:29:12,624 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8700, best=0.65, avg=0.64, std=0.00, steps=3.029e+08
2023-07-07 20:29:43,800 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8800, best=0.64, avg=0.64, std=0.00, steps=3.064e+08
2023-07-07 20:30:14,885 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8900, best=0.64, avg=0.64, std=0.00, steps=3.099e+08
2023-07-07 20:30:45,960 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9000, best=0.65, avg=0.64, std=0.00, steps=3.134e+08
2023-07-07 20:31:17,018 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9100, best=0.64, avg=0.64, std=0.00, steps=3.169e+08
2023-07-07 20:31:48,058 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9200, best=0.65, avg=0.64, std=0.00, steps=3.203e+08
2023-07-07 20:32:19,108 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9300, best=0.65, avg=0.64, std=0.00, steps=3.238e+08
2023-07-07 20:32:50,152 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9400, best=0.65, avg=0.64, std=0.00, steps=3.273e+08
2023-07-07 20:33:21,185 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9500, best=0.65, avg=0.64, std=0.00, steps=3.308e+08
2023-07-07 20:33:52,182 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9600, best=0.65, avg=0.64, std=0.00, steps=3.343e+08
2023-07-07 20:34:23,188 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9700, best=0.65, avg=0.64, std=0.00, steps=3.378e+08
2023-07-07 20:34:54,222 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9800, best=0.65, avg=0.64, std=0.00, steps=3.412e+08
2023-07-07 20:35:25,237 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9900, best=0.65, avg=0.64, std=0.00, steps=3.447e+08
2023-07-07 20:35:56,259 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10000, best=0.65, avg=0.64, std=0.00, steps=3.482e+08
2023-07-07 20:36:27,289 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10100, best=0.65, avg=0.64, std=0.00, steps=3.517e+08
2023-07-07 20:36:58,296 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10200, best=0.65, avg=0.64, std=0.00, steps=3.552e+08
2023-07-07 20:37:29,318 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10300, best=0.65, avg=0.64, std=0.00, steps=3.586e+08
2023-07-07 20:38:00,341 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10400, best=0.65, avg=0.64, std=0.00, steps=3.621e+08
2023-07-07 20:38:31,397 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10500, best=0.65, avg=0.64, std=0.00, steps=3.656e+08
2023-07-07 20:39:02,455 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10600, best=0.65, avg=0.64, std=0.00, steps=3.691e+08
2023-07-07 20:39:33,569 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10700, best=0.65, avg=0.64, std=0.00, steps=3.726e+08
2023-07-07 20:40:04,675 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10800, best=0.65, avg=0.64, std=0.00, steps=3.760e+08
2023-07-07 20:40:35,765 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10900, best=0.65, avg=0.64, std=0.00, steps=3.795e+08
2023-07-07 20:41:06,829 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11000, best=0.65, avg=0.64, std=0.00, steps=3.830e+08
2023-07-07 20:41:37,876 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11100, best=0.65, avg=0.64, std=0.00, steps=3.865e+08
2023-07-07 20:42:08,927 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11200, best=0.65, avg=0.64, std=0.00, steps=3.900e+08
2023-07-07 20:42:39,987 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11300, best=0.65, avg=0.64, std=0.00, steps=3.935e+08
2023-07-07 20:43:11,017 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11400, best=0.65, avg=0.64, std=0.00, steps=3.969e+08
2023-07-07 20:43:42,076 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11500, best=0.65, avg=0.64, std=0.00, steps=4.004e+08
2023-07-07 20:44:13,053 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11600, best=0.65, avg=0.64, std=0.00, steps=4.039e+08
2023-07-07 20:44:44,090 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11700, best=0.65, avg=0.65, std=0.00, steps=4.074e+08
2023-07-07 20:45:15,110 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11800, best=0.65, avg=0.64, std=0.00, steps=4.109e+08
2023-07-07 20:45:46,123 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11900, best=0.65, avg=0.65, std=0.00, steps=4.143e+08
2023-07-07 20:46:16,884 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11999, best=0.65, avg=0.65, std=0.00, steps=4.178e+08
2023-07-07 20:46:16,885 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135859
2023-07-07 20:46:16,911 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 20:46:16,946 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 20:46:52,202 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=3.723e+06
2023-07-07 20:47:25,111 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 200, best=0.51, avg=0.50, std=0.00, steps=7.410e+06
2023-07-07 20:47:58,258 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 300, best=0.51, avg=0.50, std=0.00, steps=1.110e+07
2023-07-07 20:48:31,296 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 400, best=0.51, avg=0.50, std=0.00, steps=1.478e+07
2023-07-07 20:49:04,265 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 500, best=0.51, avg=0.50, std=0.00, steps=1.847e+07
2023-07-07 20:49:37,368 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 600, best=0.51, avg=0.50, std=0.00, steps=2.216e+07
2023-07-07 20:50:10,459 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 700, best=0.51, avg=0.50, std=0.00, steps=2.584e+07
2023-07-07 20:50:43,617 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 800, best=0.51, avg=0.50, std=0.00, steps=2.953e+07
2023-07-07 20:51:16,734 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 900, best=0.51, avg=0.50, std=0.00, steps=3.321e+07
2023-07-07 20:51:50,030 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1000, best=0.51, avg=0.50, std=0.00, steps=3.690e+07
2023-07-07 20:52:23,277 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1100, best=0.51, avg=0.50, std=0.00, steps=4.059e+07
2023-07-07 20:52:56,385 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1200, best=0.51, avg=0.50, std=0.00, steps=4.427e+07
2023-07-07 20:53:29,581 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1300, best=0.51, avg=0.50, std=0.00, steps=4.796e+07
2023-07-07 20:54:02,686 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1400, best=0.51, avg=0.50, std=0.00, steps=5.165e+07
2023-07-07 20:54:35,904 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1500, best=0.51, avg=0.50, std=0.00, steps=5.533e+07
2023-07-07 20:55:09,001 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1600, best=0.51, avg=0.50, std=0.00, steps=5.902e+07
2023-07-07 20:55:42,217 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1700, best=0.54, avg=0.53, std=0.00, steps=6.271e+07
2023-07-07 20:56:15,463 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1800, best=0.55, avg=0.54, std=0.00, steps=6.639e+07
2023-07-07 20:56:48,564 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1900, best=0.55, avg=0.55, std=0.00, steps=7.008e+07
2023-07-07 20:57:21,652 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2000, best=0.55, avg=0.55, std=0.00, steps=7.376e+07
2023-07-07 20:57:54,731 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2100, best=0.55, avg=0.55, std=0.00, steps=7.745e+07
2023-07-07 20:58:27,785 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2200, best=0.55, avg=0.55, std=0.00, steps=8.114e+07
2023-07-07 20:59:00,875 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2300, best=0.55, avg=0.55, std=0.00, steps=8.482e+07
2023-07-07 20:59:34,010 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2400, best=0.55, avg=0.55, std=0.00, steps=8.851e+07
2023-07-07 21:00:07,305 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2500, best=0.55, avg=0.55, std=0.00, steps=9.220e+07
2023-07-07 21:00:40,527 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2600, best=0.55, avg=0.55, std=0.00, steps=9.588e+07
2023-07-07 21:01:13,786 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2700, best=0.57, avg=0.56, std=0.00, steps=9.957e+07
2023-07-07 21:01:46,898 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2800, best=0.58, avg=0.57, std=0.00, steps=1.033e+08
2023-07-07 21:02:19,993 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2900, best=0.58, avg=0.58, std=0.00, steps=1.069e+08
2023-07-07 21:02:53,067 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3000, best=0.59, avg=0.58, std=0.00, steps=1.106e+08
2023-07-07 21:03:26,251 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3100, best=0.59, avg=0.58, std=0.00, steps=1.143e+08
2023-07-07 21:03:59,422 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3200, best=0.59, avg=0.59, std=0.00, steps=1.180e+08
2023-07-07 21:04:32,610 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3300, best=0.59, avg=0.59, std=0.00, steps=1.217e+08
2023-07-07 21:05:05,765 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3400, best=0.60, avg=0.59, std=0.00, steps=1.254e+08
2023-07-07 21:05:38,863 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3500, best=0.60, avg=0.59, std=0.00, steps=1.291e+08
2023-07-07 21:06:11,979 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3600, best=0.60, avg=0.59, std=0.00, steps=1.327e+08
2023-07-07 21:06:45,208 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3700, best=0.60, avg=0.60, std=0.00, steps=1.364e+08
2023-07-07 21:07:18,320 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3800, best=0.60, avg=0.60, std=0.00, steps=1.401e+08
2023-07-07 21:07:51,406 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3900, best=0.60, avg=0.60, std=0.00, steps=1.438e+08
2023-07-07 21:08:24,568 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4000, best=0.60, avg=0.60, std=0.00, steps=1.475e+08
2023-07-07 21:08:57,678 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4100, best=0.61, avg=0.60, std=0.00, steps=1.512e+08
2023-07-07 21:09:30,954 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4200, best=0.61, avg=0.60, std=0.00, steps=1.549e+08
2023-07-07 21:10:04,281 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4300, best=0.61, avg=0.60, std=0.00, steps=1.586e+08
2023-07-07 21:10:37,437 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4400, best=0.61, avg=0.60, std=0.00, steps=1.622e+08
2023-07-07 21:11:10,589 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4500, best=0.61, avg=0.60, std=0.00, steps=1.659e+08
2023-07-07 21:11:43,817 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4600, best=0.61, avg=0.60, std=0.00, steps=1.696e+08
2023-07-07 21:12:16,977 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4700, best=0.61, avg=0.60, std=0.00, steps=1.733e+08
2023-07-07 21:12:50,225 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4800, best=0.61, avg=0.60, std=0.00, steps=1.770e+08
2023-07-07 21:13:23,365 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4900, best=0.61, avg=0.61, std=0.00, steps=1.807e+08
2023-07-07 21:13:56,488 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5000, best=0.61, avg=0.61, std=0.00, steps=1.844e+08
2023-07-07 21:14:29,645 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5100, best=0.61, avg=0.61, std=0.00, steps=1.880e+08
2023-07-07 21:15:02,847 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5200, best=0.61, avg=0.61, std=0.00, steps=1.917e+08
2023-07-07 21:15:35,898 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5300, best=0.61, avg=0.61, std=0.00, steps=1.954e+08
2023-07-07 21:16:09,089 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5400, best=0.61, avg=0.61, std=0.00, steps=1.991e+08
2023-07-07 21:16:42,191 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5500, best=0.62, avg=0.61, std=0.00, steps=2.028e+08
2023-07-07 21:17:15,255 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5600, best=0.61, avg=0.61, std=0.00, steps=2.065e+08
2023-07-07 21:17:48,424 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5700, best=0.62, avg=0.61, std=0.00, steps=2.102e+08
2023-07-07 21:18:21,495 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5800, best=0.62, avg=0.61, std=0.00, steps=2.138e+08
2023-07-07 21:18:54,580 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5900, best=0.62, avg=0.61, std=0.00, steps=2.175e+08
2023-07-07 21:19:27,690 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6000, best=0.62, avg=0.61, std=0.00, steps=2.212e+08
2023-07-07 21:20:00,886 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6100, best=0.62, avg=0.61, std=0.00, steps=2.249e+08
2023-07-07 21:20:33,943 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6200, best=0.62, avg=0.61, std=0.00, steps=2.286e+08
2023-07-07 21:21:07,082 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6300, best=0.62, avg=0.62, std=0.00, steps=2.323e+08
2023-07-07 21:21:40,221 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6400, best=0.62, avg=0.62, std=0.00, steps=2.360e+08
2023-07-07 21:22:13,414 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6500, best=0.62, avg=0.62, std=0.00, steps=2.397e+08
2023-07-07 21:22:46,633 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6600, best=0.62, avg=0.62, std=0.00, steps=2.433e+08
2023-07-07 21:23:19,830 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6700, best=0.62, avg=0.62, std=0.00, steps=2.470e+08
2023-07-07 21:23:52,984 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6800, best=0.62, avg=0.62, std=0.00, steps=2.507e+08
2023-07-07 21:24:26,202 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6900, best=0.62, avg=0.62, std=0.00, steps=2.544e+08
2023-07-07 21:24:59,356 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7000, best=0.62, avg=0.62, std=0.00, steps=2.581e+08
2023-07-07 21:25:32,538 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7100, best=0.62, avg=0.62, std=0.00, steps=2.618e+08
2023-07-07 21:26:05,833 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7200, best=0.63, avg=0.62, std=0.00, steps=2.655e+08
2023-07-07 21:26:39,137 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7300, best=0.63, avg=0.62, std=0.00, steps=2.691e+08
2023-07-07 21:27:12,156 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7400, best=0.62, avg=0.62, std=0.00, steps=2.728e+08
2023-07-07 21:27:45,184 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7500, best=0.63, avg=0.62, std=0.00, steps=2.765e+08
2023-07-07 21:28:18,179 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7600, best=0.63, avg=0.62, std=0.00, steps=2.802e+08
2023-07-07 21:28:51,315 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7700, best=0.63, avg=0.62, std=0.00, steps=2.839e+08
2023-07-07 21:29:24,499 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7800, best=0.63, avg=0.62, std=0.00, steps=2.876e+08
2023-07-07 21:29:57,707 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7900, best=0.63, avg=0.62, std=0.00, steps=2.913e+08
2023-07-07 21:30:30,982 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8000, best=0.63, avg=0.62, std=0.00, steps=2.949e+08
2023-07-07 21:31:04,112 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8100, best=0.63, avg=0.62, std=0.00, steps=2.986e+08
2023-07-07 21:31:37,191 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8200, best=0.63, avg=0.62, std=0.00, steps=3.023e+08
2023-07-07 21:32:10,522 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8300, best=0.63, avg=0.62, std=0.00, steps=3.060e+08
2023-07-07 21:32:43,749 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8400, best=0.63, avg=0.62, std=0.00, steps=3.097e+08
2023-07-07 21:33:17,123 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8500, best=0.63, avg=0.62, std=0.00, steps=3.134e+08
2023-07-07 21:33:50,279 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8600, best=0.63, avg=0.62, std=0.00, steps=3.171e+08
2023-07-07 21:34:23,704 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8700, best=0.63, avg=0.62, std=0.00, steps=3.208e+08
2023-07-07 21:34:57,020 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8800, best=0.63, avg=0.63, std=0.00, steps=3.244e+08
2023-07-07 21:35:30,129 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8900, best=0.63, avg=0.62, std=0.00, steps=3.281e+08
2023-07-07 21:36:03,179 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9000, best=0.63, avg=0.63, std=0.00, steps=3.318e+08
2023-07-07 21:36:36,414 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9100, best=0.63, avg=0.63, std=0.00, steps=3.355e+08
2023-07-07 21:37:09,758 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9200, best=0.63, avg=0.63, std=0.00, steps=3.392e+08
2023-07-07 21:37:42,955 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9300, best=0.63, avg=0.63, std=0.00, steps=3.429e+08
2023-07-07 21:38:16,291 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9400, best=0.63, avg=0.63, std=0.00, steps=3.466e+08
2023-07-07 21:38:49,462 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9500, best=0.63, avg=0.63, std=0.00, steps=3.502e+08
2023-07-07 21:39:22,869 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9600, best=0.63, avg=0.63, std=0.00, steps=3.539e+08
2023-07-07 21:39:56,196 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9700, best=0.63, avg=0.63, std=0.00, steps=3.576e+08
2023-07-07 21:40:29,418 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9800, best=0.63, avg=0.63, std=0.00, steps=3.613e+08
2023-07-07 21:41:02,611 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9900, best=0.63, avg=0.63, std=0.00, steps=3.650e+08
2023-07-07 21:41:35,901 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10000, best=0.63, avg=0.63, std=0.00, steps=3.687e+08
2023-07-07 21:42:09,203 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10100, best=0.63, avg=0.63, std=0.00, steps=3.724e+08
2023-07-07 21:42:42,330 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10200, best=0.63, avg=0.63, std=0.00, steps=3.760e+08
2023-07-07 21:43:15,449 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10300, best=0.63, avg=0.63, std=0.00, steps=3.797e+08
2023-07-07 21:43:48,583 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10400, best=0.63, avg=0.63, std=0.00, steps=3.834e+08
2023-07-07 21:44:21,745 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10500, best=0.63, avg=0.63, std=0.00, steps=3.871e+08
2023-07-07 21:44:55,015 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10600, best=0.63, avg=0.63, std=0.00, steps=3.908e+08
2023-07-07 21:45:28,324 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10700, best=0.63, avg=0.63, std=0.00, steps=3.945e+08
2023-07-07 21:46:01,681 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10800, best=0.64, avg=0.63, std=0.00, steps=3.982e+08
2023-07-07 21:46:34,967 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10900, best=0.63, avg=0.63, std=0.00, steps=4.019e+08
2023-07-07 21:47:08,274 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11000, best=0.64, avg=0.63, std=0.00, steps=4.055e+08
2023-07-07 21:47:41,633 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11100, best=0.63, avg=0.63, std=0.00, steps=4.092e+08
2023-07-07 21:48:14,952 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11200, best=0.63, avg=0.63, std=0.00, steps=4.129e+08
2023-07-07 21:48:48,385 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11300, best=0.64, avg=0.63, std=0.00, steps=4.166e+08
2023-07-07 21:49:21,759 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11400, best=0.64, avg=0.63, std=0.00, steps=4.203e+08
2023-07-07 21:49:55,095 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11500, best=0.64, avg=0.63, std=0.00, steps=4.240e+08
2023-07-07 21:50:28,360 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11600, best=0.64, avg=0.63, std=0.00, steps=4.277e+08
2023-07-07 21:51:01,707 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11700, best=0.64, avg=0.63, std=0.00, steps=4.313e+08
2023-07-07 21:51:34,888 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11800, best=0.64, avg=0.63, std=0.00, steps=4.350e+08
2023-07-07 21:52:08,104 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11900, best=0.64, avg=0.63, std=0.00, steps=4.387e+08
2023-07-07 21:52:41,058 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11999, best=0.64, avg=0.63, std=0.00, steps=4.424e+08
2023-07-07 21:52:41,059 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135859
2023-07-07 21:52:41,086 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 21:52:41,119 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 21:53:20,590 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=4.137e+06
2023-07-07 21:53:57,389 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 200, best=0.51, avg=0.50, std=0.00, steps=8.233e+06
2023-07-07 21:54:34,233 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 300, best=0.51, avg=0.50, std=0.00, steps=1.233e+07
2023-07-07 21:55:11,227 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 400, best=0.51, avg=0.50, std=0.00, steps=1.642e+07
2023-07-07 21:55:48,253 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 500, best=0.51, avg=0.50, std=0.00, steps=2.052e+07
2023-07-07 21:56:25,184 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 600, best=0.51, avg=0.50, std=0.00, steps=2.462e+07
2023-07-07 21:57:02,053 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 700, best=0.51, avg=0.50, std=0.00, steps=2.871e+07
2023-07-07 21:57:38,978 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 800, best=0.51, avg=0.50, std=0.00, steps=3.281e+07
2023-07-07 21:58:15,905 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 900, best=0.51, avg=0.50, std=0.00, steps=3.690e+07
2023-07-07 21:58:52,810 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1000, best=0.51, avg=0.50, std=0.00, steps=4.100e+07
2023-07-07 21:59:29,693 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1100, best=0.51, avg=0.50, std=0.00, steps=4.510e+07
2023-07-07 22:00:06,553 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1200, best=0.51, avg=0.50, std=0.00, steps=4.919e+07
2023-07-07 22:00:43,320 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1300, best=0.55, avg=0.54, std=0.00, steps=5.329e+07
2023-07-07 22:01:20,162 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1400, best=0.55, avg=0.55, std=0.00, steps=5.738e+07
2023-07-07 22:01:56,941 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1500, best=0.55, avg=0.55, std=0.00, steps=6.148e+07
2023-07-07 22:02:33,785 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1600, best=0.55, avg=0.55, std=0.00, steps=6.558e+07
2023-07-07 22:03:10,777 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1700, best=0.55, avg=0.55, std=0.00, steps=6.967e+07
2023-07-07 22:03:47,728 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1800, best=0.55, avg=0.55, std=0.00, steps=7.377e+07
2023-07-07 22:04:24,738 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1900, best=0.55, avg=0.55, std=0.00, steps=7.786e+07
2023-07-07 22:05:01,826 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2000, best=0.55, avg=0.55, std=0.00, steps=8.196e+07
2023-07-07 22:05:38,825 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2100, best=0.55, avg=0.55, std=0.00, steps=8.606e+07
2023-07-07 22:06:15,729 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2200, best=0.55, avg=0.55, std=0.00, steps=9.015e+07
2023-07-07 22:06:52,578 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2300, best=0.56, avg=0.55, std=0.00, steps=9.425e+07
2023-07-07 22:07:29,479 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2400, best=0.55, avg=0.55, std=0.00, steps=9.834e+07
2023-07-07 22:08:06,403 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2500, best=0.55, avg=0.55, std=0.00, steps=1.024e+08
2023-07-07 22:08:43,155 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2600, best=0.55, avg=0.55, std=0.00, steps=1.065e+08
2023-07-07 22:09:20,009 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2700, best=0.55, avg=0.55, std=0.00, steps=1.106e+08
2023-07-07 22:09:57,044 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2800, best=0.55, avg=0.55, std=0.00, steps=1.147e+08
2023-07-07 22:10:33,944 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2900, best=0.55, avg=0.55, std=0.00, steps=1.188e+08
2023-07-07 22:11:10,728 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3000, best=0.55, avg=0.55, std=0.00, steps=1.229e+08
2023-07-07 22:11:47,485 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3100, best=0.55, avg=0.55, std=0.00, steps=1.270e+08
2023-07-07 22:12:24,166 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3200, best=0.55, avg=0.55, std=0.00, steps=1.311e+08
2023-07-07 22:13:00,883 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3300, best=0.55, avg=0.55, std=0.00, steps=1.352e+08
2023-07-07 22:13:37,633 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3400, best=0.56, avg=0.55, std=0.00, steps=1.393e+08
2023-07-07 22:14:14,407 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3500, best=0.55, avg=0.55, std=0.00, steps=1.434e+08
2023-07-07 22:14:51,198 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3600, best=0.55, avg=0.55, std=0.00, steps=1.475e+08
2023-07-07 22:15:27,990 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3700, best=0.55, avg=0.55, std=0.00, steps=1.516e+08
2023-07-07 22:16:04,766 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3800, best=0.55, avg=0.55, std=0.00, steps=1.557e+08
2023-07-07 22:16:41,537 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3900, best=0.55, avg=0.55, std=0.00, steps=1.598e+08
2023-07-07 22:17:18,228 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4000, best=0.55, avg=0.55, std=0.00, steps=1.639e+08
2023-07-07 22:17:54,991 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4100, best=0.55, avg=0.55, std=0.00, steps=1.680e+08
2023-07-07 22:18:31,786 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4200, best=0.56, avg=0.55, std=0.00, steps=1.721e+08
2023-07-07 22:19:08,595 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4300, best=0.55, avg=0.55, std=0.00, steps=1.762e+08
2023-07-07 22:19:45,402 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4400, best=0.55, avg=0.55, std=0.00, steps=1.803e+08
2023-07-07 22:20:22,339 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4500, best=0.55, avg=0.55, std=0.00, steps=1.844e+08
2023-07-07 22:20:59,166 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4600, best=0.55, avg=0.55, std=0.00, steps=1.885e+08
2023-07-07 22:21:35,937 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4700, best=0.55, avg=0.55, std=0.00, steps=1.926e+08
2023-07-07 22:22:12,660 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4800, best=0.56, avg=0.56, std=0.00, steps=1.966e+08
2023-07-07 22:22:49,484 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4900, best=0.56, avg=0.56, std=0.00, steps=2.007e+08
2023-07-07 22:23:26,182 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5000, best=0.57, avg=0.56, std=0.00, steps=2.048e+08
2023-07-07 22:24:02,955 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5100, best=0.57, avg=0.57, std=0.00, steps=2.089e+08
2023-07-07 22:24:39,744 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5200, best=0.57, avg=0.57, std=0.00, steps=2.130e+08
2023-07-07 22:25:16,617 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5300, best=0.57, avg=0.57, std=0.00, steps=2.171e+08
2023-07-07 22:25:53,373 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5400, best=0.57, avg=0.57, std=0.00, steps=2.212e+08
2023-07-07 22:26:30,131 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5500, best=0.58, avg=0.57, std=0.00, steps=2.253e+08
2023-07-07 22:27:06,883 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5600, best=0.58, avg=0.57, std=0.00, steps=2.294e+08
2023-07-07 22:27:43,597 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5700, best=0.58, avg=0.57, std=0.00, steps=2.335e+08
2023-07-07 22:28:20,356 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5800, best=0.58, avg=0.57, std=0.00, steps=2.376e+08
2023-07-07 22:28:57,269 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5900, best=0.58, avg=0.57, std=0.00, steps=2.417e+08
2023-07-07 22:29:34,084 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6000, best=0.58, avg=0.58, std=0.00, steps=2.458e+08
2023-07-07 22:30:10,922 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6100, best=0.59, avg=0.58, std=0.00, steps=2.499e+08
2023-07-07 22:30:47,702 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6200, best=0.59, avg=0.58, std=0.00, steps=2.540e+08
2023-07-07 22:31:24,449 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6300, best=0.59, avg=0.58, std=0.00, steps=2.581e+08
2023-07-07 22:32:01,146 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6400, best=0.59, avg=0.58, std=0.00, steps=2.622e+08
2023-07-07 22:32:37,854 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6500, best=0.59, avg=0.59, std=0.00, steps=2.663e+08
2023-07-07 22:33:14,605 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6600, best=0.59, avg=0.59, std=0.00, steps=2.704e+08
2023-07-07 22:33:51,334 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6700, best=0.59, avg=0.59, std=0.00, steps=2.745e+08
2023-07-07 22:34:28,140 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6800, best=0.59, avg=0.59, std=0.00, steps=2.786e+08
2023-07-07 22:35:05,087 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6900, best=0.59, avg=0.59, std=0.00, steps=2.827e+08
2023-07-07 22:35:41,798 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7000, best=0.59, avg=0.59, std=0.00, steps=2.868e+08
2023-07-07 22:36:18,534 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7100, best=0.59, avg=0.59, std=0.00, steps=2.909e+08
2023-07-07 22:36:55,299 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7200, best=0.59, avg=0.59, std=0.00, steps=2.950e+08
2023-07-07 22:37:32,073 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7300, best=0.59, avg=0.59, std=0.00, steps=2.990e+08
2023-07-07 22:38:08,902 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7400, best=0.60, avg=0.59, std=0.00, steps=3.031e+08
2023-07-07 22:38:45,715 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7500, best=0.59, avg=0.59, std=0.00, steps=3.072e+08
2023-07-07 22:39:22,503 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7600, best=0.59, avg=0.59, std=0.00, steps=3.113e+08
2023-07-07 22:39:59,406 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7700, best=0.60, avg=0.59, std=0.00, steps=3.154e+08
2023-07-07 22:40:36,244 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7800, best=0.60, avg=0.59, std=0.00, steps=3.195e+08
2023-07-07 22:41:13,107 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7900, best=0.60, avg=0.59, std=0.00, steps=3.236e+08
2023-07-07 22:41:49,911 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8000, best=0.60, avg=0.59, std=0.00, steps=3.277e+08
2023-07-07 22:42:26,711 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8100, best=0.60, avg=0.59, std=0.00, steps=3.318e+08
2023-07-07 22:43:03,606 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8200, best=0.60, avg=0.59, std=0.00, steps=3.359e+08
2023-07-07 22:43:40,354 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8300, best=0.60, avg=0.60, std=0.00, steps=3.400e+08
2023-07-07 22:44:17,140 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8400, best=0.60, avg=0.60, std=0.00, steps=3.441e+08
2023-07-07 22:44:53,962 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8500, best=0.60, avg=0.60, std=0.00, steps=3.482e+08
2023-07-07 22:45:30,758 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8600, best=0.60, avg=0.60, std=0.00, steps=3.523e+08
2023-07-07 22:46:07,544 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8700, best=0.60, avg=0.60, std=0.00, steps=3.564e+08
2023-07-07 22:46:44,400 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8800, best=0.60, avg=0.60, std=0.00, steps=3.605e+08
2023-07-07 22:47:21,260 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8900, best=0.60, avg=0.60, std=0.00, steps=3.646e+08
2023-07-07 22:47:57,979 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9000, best=0.60, avg=0.60, std=0.00, steps=3.687e+08
2023-07-07 22:48:34,713 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9100, best=0.60, avg=0.60, std=0.00, steps=3.728e+08
2023-07-07 22:49:11,449 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9200, best=0.60, avg=0.60, std=0.00, steps=3.769e+08
2023-07-07 22:49:48,282 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9300, best=0.60, avg=0.60, std=0.00, steps=3.810e+08
2023-07-07 22:50:25,007 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9400, best=0.61, avg=0.60, std=0.00, steps=3.851e+08
2023-07-07 22:51:01,667 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9500, best=0.61, avg=0.60, std=0.00, steps=3.892e+08
2023-07-07 22:51:38,466 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9600, best=0.60, avg=0.60, std=0.00, steps=3.933e+08
2023-07-07 22:52:15,305 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9700, best=0.61, avg=0.60, std=0.00, steps=3.974e+08
2023-07-07 22:52:52,069 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9800, best=0.61, avg=0.60, std=0.00, steps=4.014e+08
2023-07-07 22:53:28,901 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9900, best=0.61, avg=0.60, std=0.00, steps=4.055e+08
2023-07-07 22:54:05,716 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10000, best=0.61, avg=0.60, std=0.00, steps=4.096e+08
2023-07-07 22:54:42,495 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10100, best=0.61, avg=0.60, std=0.00, steps=4.137e+08
2023-07-07 22:55:19,311 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10200, best=0.61, avg=0.60, std=0.00, steps=4.178e+08
2023-07-07 22:55:55,986 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10300, best=0.61, avg=0.60, std=0.00, steps=4.219e+08
2023-07-07 22:56:32,852 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10400, best=0.61, avg=0.60, std=0.00, steps=4.260e+08
2023-07-07 22:57:09,703 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10500, best=0.61, avg=0.60, std=0.00, steps=4.301e+08
2023-07-07 22:57:46,533 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10600, best=0.61, avg=0.60, std=0.00, steps=4.342e+08
2023-07-07 22:58:23,380 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10700, best=0.61, avg=0.60, std=0.00, steps=4.383e+08
2023-07-07 22:59:00,187 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10800, best=0.61, avg=0.60, std=0.00, steps=4.424e+08
2023-07-07 22:59:37,009 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10900, best=0.61, avg=0.60, std=0.00, steps=4.465e+08
2023-07-07 23:00:13,789 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11000, best=0.61, avg=0.60, std=0.00, steps=4.506e+08
2023-07-07 23:00:50,710 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11100, best=0.61, avg=0.60, std=0.00, steps=4.547e+08
2023-07-07 23:01:27,457 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11200, best=0.61, avg=0.60, std=0.00, steps=4.588e+08
2023-07-07 23:02:04,180 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11300, best=0.61, avg=0.60, std=0.00, steps=4.629e+08
2023-07-07 23:02:40,920 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11400, best=0.61, avg=0.60, std=0.00, steps=4.670e+08
2023-07-07 23:03:17,637 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11500, best=0.61, avg=0.61, std=0.00, steps=4.711e+08
2023-07-07 23:03:54,486 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11600, best=0.61, avg=0.60, std=0.00, steps=4.752e+08
2023-07-07 23:04:31,262 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11700, best=0.61, avg=0.60, std=0.00, steps=4.793e+08
2023-07-07 23:05:08,051 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11800, best=0.61, avg=0.60, std=0.00, steps=4.834e+08
2023-07-07 23:05:44,827 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11900, best=0.61, avg=0.61, std=0.00, steps=4.875e+08
2023-07-07 23:06:21,185 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11999, best=0.61, avg=0.61, std=0.00, steps=4.915e+08
2023-07-07 23:06:21,186 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135859
