2023-07-07 13:58:23,031 -        meta learning: [    INFO] - [INFO] checkpoint saved to: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135823
2023-07-07 13:58:23,031 -        meta learning: [    INFO] - [INFO] tensorboard dir set to: ./runs/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135823
2023-07-07 13:58:23,031 -        meta learning: [    INFO] - [ARGS]: Namespace(policy='BatchedGruMetaStdpMLPPolicy', algo='PGPE', task='SeqTask', seq_length=20, latency=24, num_cls=5, feature_dims=14, sigma=0.1, batch_size=512, hidden_dims=[128], pop_size=256, center_lr=0.01, init_std=0.04, decay_std=0.999, limit_std=0.001, std_lr=0.07, terminate_when_unhealthy=False, max_iters=12000, num_tasks=1, seed=36, num_tests=128, eval_epoch=100, eval=False, eval_with_injury=False, resume='', save=False, repeat=1, root_dir='/data/anonymous/meta', tensorboard_dir='./runs', suffix='', output_dir='/data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135823', summary_writer=<torch.utils.tensorboard.writer.SummaryWriter object at 0x7fb238105d00>, tb_prefix='PGPE/SeqTask/BatchedGruMetaStdpMLPPolicy')
2023-07-07 13:58:26,340 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 13:58:26,408 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 13:58:34,236 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 100, best=0.74, avg=0.73, std=0.01, steps=4.137e+05
2023-07-07 13:58:38,197 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 200, best=0.81, avg=0.79, std=0.01, steps=8.233e+05
2023-07-07 13:58:42,140 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 300, best=0.86, avg=0.85, std=0.01, steps=1.233e+06
2023-07-07 13:58:46,048 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 400, best=0.90, avg=0.89, std=0.01, steps=1.642e+06
2023-07-07 13:58:49,973 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 500, best=0.93, avg=0.92, std=0.00, steps=2.052e+06
2023-07-07 13:58:53,891 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 600, best=0.94, avg=0.93, std=0.00, steps=2.462e+06
2023-07-07 13:58:57,821 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 700, best=0.95, avg=0.93, std=0.00, steps=2.871e+06
2023-07-07 13:59:01,802 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 800, best=0.94, avg=0.94, std=0.00, steps=3.281e+06
2023-07-07 13:59:05,769 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 900, best=0.95, avg=0.94, std=0.00, steps=3.690e+06
2023-07-07 13:59:09,769 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1000, best=0.95, avg=0.94, std=0.00, steps=4.100e+06
2023-07-07 13:59:13,713 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1100, best=0.95, avg=0.94, std=0.00, steps=4.510e+06
2023-07-07 13:59:17,638 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1200, best=0.95, avg=0.94, std=0.00, steps=4.919e+06
2023-07-07 13:59:21,566 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1300, best=0.95, avg=0.95, std=0.00, steps=5.329e+06
2023-07-07 13:59:25,522 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1400, best=0.95, avg=0.95, std=0.00, steps=5.738e+06
2023-07-07 13:59:29,546 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1500, best=0.96, avg=0.95, std=0.00, steps=6.148e+06
2023-07-07 13:59:33,526 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1600, best=0.96, avg=0.95, std=0.00, steps=6.558e+06
2023-07-07 13:59:37,541 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1700, best=0.96, avg=0.95, std=0.00, steps=6.967e+06
2023-07-07 13:59:41,504 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1800, best=0.96, avg=0.95, std=0.00, steps=7.377e+06
2023-07-07 13:59:45,450 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1900, best=0.96, avg=0.95, std=0.00, steps=7.786e+06
2023-07-07 13:59:49,398 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2000, best=0.96, avg=0.95, std=0.00, steps=8.196e+06
2023-07-07 13:59:53,347 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2100, best=0.96, avg=0.95, std=0.00, steps=8.606e+06
2023-07-07 13:59:57,280 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2200, best=0.96, avg=0.95, std=0.00, steps=9.015e+06
2023-07-07 14:00:01,234 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2300, best=0.96, avg=0.95, std=0.00, steps=9.425e+06
2023-07-07 14:00:05,155 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2400, best=0.96, avg=0.95, std=0.00, steps=9.834e+06
2023-07-07 14:00:09,082 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2500, best=0.96, avg=0.95, std=0.00, steps=1.024e+07
2023-07-07 14:00:13,011 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2600, best=0.96, avg=0.95, std=0.00, steps=1.065e+07
2023-07-07 14:00:16,966 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2700, best=0.96, avg=0.95, std=0.00, steps=1.106e+07
2023-07-07 14:00:20,894 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2800, best=0.96, avg=0.95, std=0.00, steps=1.147e+07
2023-07-07 14:00:24,825 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2900, best=0.96, avg=0.95, std=0.00, steps=1.188e+07
2023-07-07 14:00:28,757 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3000, best=0.96, avg=0.95, std=0.00, steps=1.229e+07
2023-07-07 14:00:32,682 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3100, best=0.96, avg=0.95, std=0.00, steps=1.270e+07
2023-07-07 14:00:36,616 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3200, best=0.96, avg=0.95, std=0.00, steps=1.311e+07
2023-07-07 14:00:40,565 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3300, best=0.96, avg=0.95, std=0.00, steps=1.352e+07
2023-07-07 14:00:44,507 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3400, best=0.96, avg=0.95, std=0.00, steps=1.393e+07
2023-07-07 14:00:48,438 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3500, best=0.96, avg=0.95, std=0.00, steps=1.434e+07
2023-07-07 14:00:52,385 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3600, best=0.96, avg=0.95, std=0.00, steps=1.475e+07
2023-07-07 14:00:56,330 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3700, best=0.96, avg=0.95, std=0.00, steps=1.516e+07
2023-07-07 14:01:00,270 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3800, best=0.96, avg=0.95, std=0.00, steps=1.557e+07
2023-07-07 14:01:04,224 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3900, best=0.96, avg=0.95, std=0.00, steps=1.598e+07
2023-07-07 14:01:08,168 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4000, best=0.96, avg=0.95, std=0.00, steps=1.639e+07
2023-07-07 14:01:12,120 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4100, best=0.96, avg=0.95, std=0.00, steps=1.680e+07
2023-07-07 14:01:16,060 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4200, best=0.96, avg=0.95, std=0.00, steps=1.721e+07
2023-07-07 14:01:19,996 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4300, best=0.96, avg=0.95, std=0.00, steps=1.762e+07
2023-07-07 14:01:23,929 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4400, best=0.96, avg=0.95, std=0.00, steps=1.803e+07
2023-07-07 14:01:27,882 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4500, best=0.96, avg=0.95, std=0.00, steps=1.844e+07
2023-07-07 14:01:31,818 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4600, best=0.96, avg=0.95, std=0.00, steps=1.885e+07
2023-07-07 14:01:35,778 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4700, best=0.96, avg=0.95, std=0.00, steps=1.926e+07
2023-07-07 14:01:39,705 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4800, best=0.96, avg=0.95, std=0.00, steps=1.966e+07
2023-07-07 14:01:43,640 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4900, best=0.96, avg=0.95, std=0.00, steps=2.007e+07
2023-07-07 14:01:47,579 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5000, best=0.96, avg=0.95, std=0.00, steps=2.048e+07
2023-07-07 14:01:51,517 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5100, best=0.96, avg=0.95, std=0.00, steps=2.089e+07
2023-07-07 14:01:55,435 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5200, best=0.96, avg=0.95, std=0.00, steps=2.130e+07
2023-07-07 14:01:59,354 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5300, best=0.96, avg=0.95, std=0.00, steps=2.171e+07
2023-07-07 14:02:03,295 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5400, best=0.96, avg=0.95, std=0.00, steps=2.212e+07
2023-07-07 14:02:07,228 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5500, best=0.96, avg=0.95, std=0.00, steps=2.253e+07
2023-07-07 14:02:11,165 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5600, best=0.96, avg=0.95, std=0.00, steps=2.294e+07
2023-07-07 14:02:15,122 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5700, best=0.96, avg=0.95, std=0.00, steps=2.335e+07
2023-07-07 14:02:19,081 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5800, best=0.96, avg=0.95, std=0.00, steps=2.376e+07
2023-07-07 14:02:23,016 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5900, best=0.96, avg=0.95, std=0.00, steps=2.417e+07
2023-07-07 14:02:26,967 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6000, best=0.96, avg=0.95, std=0.00, steps=2.458e+07
2023-07-07 14:02:30,900 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6100, best=0.96, avg=0.95, std=0.00, steps=2.499e+07
2023-07-07 14:02:34,832 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6200, best=0.96, avg=0.95, std=0.00, steps=2.540e+07
2023-07-07 14:02:38,766 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6300, best=0.96, avg=0.95, std=0.00, steps=2.581e+07
2023-07-07 14:02:42,710 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6400, best=0.96, avg=0.95, std=0.00, steps=2.622e+07
2023-07-07 14:02:46,665 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6500, best=0.96, avg=0.95, std=0.00, steps=2.663e+07
2023-07-07 14:02:50,606 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6600, best=0.96, avg=0.95, std=0.00, steps=2.704e+07
2023-07-07 14:02:54,533 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6700, best=0.96, avg=0.95, std=0.00, steps=2.745e+07
2023-07-07 14:02:58,469 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6800, best=0.96, avg=0.95, std=0.00, steps=2.786e+07
2023-07-07 14:03:02,424 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6900, best=0.96, avg=0.95, std=0.00, steps=2.827e+07
2023-07-07 14:03:06,377 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7000, best=0.96, avg=0.95, std=0.00, steps=2.868e+07
2023-07-07 14:03:10,325 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7100, best=0.96, avg=0.95, std=0.00, steps=2.909e+07
2023-07-07 14:03:14,260 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7200, best=0.96, avg=0.95, std=0.00, steps=2.950e+07
2023-07-07 14:03:18,201 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7300, best=0.96, avg=0.95, std=0.00, steps=2.990e+07
2023-07-07 14:03:22,158 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7400, best=0.96, avg=0.95, std=0.00, steps=3.031e+07
2023-07-07 14:03:26,105 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7500, best=0.96, avg=0.95, std=0.00, steps=3.072e+07
2023-07-07 14:03:30,045 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7600, best=0.96, avg=0.95, std=0.00, steps=3.113e+07
2023-07-07 14:03:33,985 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7700, best=0.96, avg=0.95, std=0.00, steps=3.154e+07
2023-07-07 14:03:37,929 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7800, best=0.96, avg=0.95, std=0.00, steps=3.195e+07
2023-07-07 14:03:41,880 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7900, best=0.96, avg=0.95, std=0.00, steps=3.236e+07
2023-07-07 14:03:45,819 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8000, best=0.96, avg=0.95, std=0.00, steps=3.277e+07
2023-07-07 14:03:49,780 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8100, best=0.96, avg=0.95, std=0.00, steps=3.318e+07
2023-07-07 14:03:53,702 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8200, best=0.96, avg=0.95, std=0.00, steps=3.359e+07
2023-07-07 14:03:57,637 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8300, best=0.96, avg=0.95, std=0.00, steps=3.400e+07
2023-07-07 14:04:01,569 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8400, best=0.96, avg=0.95, std=0.00, steps=3.441e+07
2023-07-07 14:04:05,495 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8500, best=0.96, avg=0.95, std=0.00, steps=3.482e+07
2023-07-07 14:04:09,429 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8600, best=0.96, avg=0.95, std=0.00, steps=3.523e+07
2023-07-07 14:04:13,366 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8700, best=0.96, avg=0.95, std=0.00, steps=3.564e+07
2023-07-07 14:04:17,289 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8800, best=0.96, avg=0.95, std=0.00, steps=3.605e+07
2023-07-07 14:04:21,218 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8900, best=0.96, avg=0.95, std=0.00, steps=3.646e+07
2023-07-07 14:04:25,142 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9000, best=0.96, avg=0.95, std=0.00, steps=3.687e+07
2023-07-07 14:04:29,075 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9100, best=0.96, avg=0.95, std=0.00, steps=3.728e+07
2023-07-07 14:04:32,999 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9200, best=0.96, avg=0.95, std=0.00, steps=3.769e+07
2023-07-07 14:04:36,933 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9300, best=0.96, avg=0.95, std=0.00, steps=3.810e+07
2023-07-07 14:04:40,870 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9400, best=0.96, avg=0.95, std=0.00, steps=3.851e+07
2023-07-07 14:04:44,812 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9500, best=0.96, avg=0.95, std=0.00, steps=3.892e+07
2023-07-07 14:04:48,751 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9600, best=0.96, avg=0.95, std=0.00, steps=3.933e+07
2023-07-07 14:04:52,669 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9700, best=0.96, avg=0.95, std=0.00, steps=3.974e+07
2023-07-07 14:04:56,615 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9800, best=0.96, avg=0.95, std=0.00, steps=4.014e+07
2023-07-07 14:05:00,553 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9900, best=0.96, avg=0.95, std=0.00, steps=4.055e+07
2023-07-07 14:05:04,471 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10000, best=0.96, avg=0.95, std=0.00, steps=4.096e+07
2023-07-07 14:05:08,376 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10100, best=0.96, avg=0.95, std=0.00, steps=4.137e+07
2023-07-07 14:05:12,306 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10200, best=0.96, avg=0.95, std=0.00, steps=4.178e+07
2023-07-07 14:05:16,249 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10300, best=0.96, avg=0.95, std=0.00, steps=4.219e+07
2023-07-07 14:05:20,195 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10400, best=0.96, avg=0.95, std=0.00, steps=4.260e+07
2023-07-07 14:05:24,145 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10500, best=0.96, avg=0.95, std=0.00, steps=4.301e+07
2023-07-07 14:05:28,099 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10600, best=0.96, avg=0.95, std=0.00, steps=4.342e+07
2023-07-07 14:05:32,047 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10700, best=0.96, avg=0.95, std=0.00, steps=4.383e+07
2023-07-07 14:05:35,991 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10800, best=0.96, avg=0.95, std=0.00, steps=4.424e+07
2023-07-07 14:05:39,918 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10900, best=0.96, avg=0.95, std=0.00, steps=4.465e+07
2023-07-07 14:05:43,850 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11000, best=0.96, avg=0.95, std=0.00, steps=4.506e+07
2023-07-07 14:05:47,793 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11100, best=0.96, avg=0.95, std=0.00, steps=4.547e+07
2023-07-07 14:05:51,720 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11200, best=0.96, avg=0.95, std=0.00, steps=4.588e+07
2023-07-07 14:05:55,651 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11300, best=0.96, avg=0.95, std=0.00, steps=4.629e+07
2023-07-07 14:05:59,579 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11400, best=0.96, avg=0.95, std=0.00, steps=4.670e+07
2023-07-07 14:06:03,511 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11500, best=0.96, avg=0.95, std=0.00, steps=4.711e+07
2023-07-07 14:06:07,447 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11600, best=0.96, avg=0.95, std=0.00, steps=4.752e+07
2023-07-07 14:06:11,379 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11700, best=0.96, avg=0.95, std=0.00, steps=4.793e+07
2023-07-07 14:06:15,305 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11800, best=0.96, avg=0.95, std=0.00, steps=4.834e+07
2023-07-07 14:06:19,243 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11900, best=0.96, avg=0.95, std=0.00, steps=4.875e+07
2023-07-07 14:06:23,139 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11999, best=0.96, avg=0.95, std=0.00, steps=4.915e+07
2023-07-07 14:06:23,139 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135823
2023-07-07 14:06:23,164 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 14:06:23,197 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 14:06:30,957 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 100, best=0.70, avg=0.69, std=0.01, steps=6.205e+05
2023-07-07 14:06:36,699 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 200, best=0.75, avg=0.73, std=0.01, steps=1.235e+06
2023-07-07 14:06:42,443 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 300, best=0.79, avg=0.77, std=0.01, steps=1.849e+06
2023-07-07 14:06:48,164 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 400, best=0.80, avg=0.79, std=0.01, steps=2.464e+06
2023-07-07 14:06:53,873 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 500, best=0.83, avg=0.81, std=0.01, steps=3.078e+06
2023-07-07 14:06:59,596 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 600, best=0.83, avg=0.82, std=0.01, steps=3.693e+06
2023-07-07 14:07:05,317 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 700, best=0.85, avg=0.83, std=0.01, steps=4.307e+06
2023-07-07 14:07:11,042 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 800, best=0.85, avg=0.84, std=0.01, steps=4.921e+06
2023-07-07 14:07:16,776 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 900, best=0.86, avg=0.85, std=0.01, steps=5.536e+06
2023-07-07 14:07:22,526 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1000, best=0.87, avg=0.86, std=0.01, steps=6.150e+06
2023-07-07 14:07:28,271 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1100, best=0.88, avg=0.87, std=0.01, steps=6.765e+06
2023-07-07 14:07:34,008 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1200, best=0.89, avg=0.88, std=0.01, steps=7.379e+06
2023-07-07 14:07:39,715 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1300, best=0.90, avg=0.89, std=0.01, steps=7.993e+06
2023-07-07 14:07:45,451 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1400, best=0.90, avg=0.89, std=0.01, steps=8.608e+06
2023-07-07 14:07:51,180 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1500, best=0.91, avg=0.90, std=0.01, steps=9.222e+06
2023-07-07 14:07:56,900 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1600, best=0.92, avg=0.90, std=0.00, steps=9.837e+06
2023-07-07 14:08:02,634 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1700, best=0.92, avg=0.90, std=0.00, steps=1.045e+07
2023-07-07 14:08:08,359 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1800, best=0.92, avg=0.91, std=0.00, steps=1.107e+07
2023-07-07 14:08:14,088 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1900, best=0.92, avg=0.91, std=0.00, steps=1.168e+07
2023-07-07 14:08:19,816 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2000, best=0.92, avg=0.91, std=0.00, steps=1.229e+07
2023-07-07 14:08:25,535 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2100, best=0.92, avg=0.91, std=0.00, steps=1.291e+07
2023-07-07 14:08:31,256 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2200, best=0.93, avg=0.92, std=0.00, steps=1.352e+07
2023-07-07 14:08:36,997 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2300, best=0.93, avg=0.92, std=0.00, steps=1.414e+07
2023-07-07 14:08:42,745 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2400, best=0.93, avg=0.92, std=0.00, steps=1.475e+07
2023-07-07 14:08:48,462 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2500, best=0.93, avg=0.92, std=0.00, steps=1.537e+07
2023-07-07 14:08:54,175 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2600, best=0.93, avg=0.92, std=0.00, steps=1.598e+07
2023-07-07 14:08:59,888 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2700, best=0.93, avg=0.92, std=0.00, steps=1.659e+07
2023-07-07 14:09:05,618 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2800, best=0.93, avg=0.92, std=0.00, steps=1.721e+07
2023-07-07 14:09:11,340 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2900, best=0.93, avg=0.92, std=0.00, steps=1.782e+07
2023-07-07 14:09:17,075 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3000, best=0.93, avg=0.92, std=0.00, steps=1.844e+07
2023-07-07 14:09:22,806 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3100, best=0.93, avg=0.92, std=0.00, steps=1.905e+07
2023-07-07 14:09:28,534 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3200, best=0.93, avg=0.92, std=0.00, steps=1.967e+07
2023-07-07 14:09:34,271 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3300, best=0.93, avg=0.92, std=0.00, steps=2.028e+07
2023-07-07 14:09:39,991 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3400, best=0.93, avg=0.92, std=0.00, steps=2.090e+07
2023-07-07 14:09:45,714 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3500, best=0.93, avg=0.92, std=0.00, steps=2.151e+07
2023-07-07 14:09:51,442 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3600, best=0.93, avg=0.92, std=0.00, steps=2.212e+07
2023-07-07 14:09:57,182 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3700, best=0.93, avg=0.92, std=0.00, steps=2.274e+07
2023-07-07 14:10:02,932 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3800, best=0.93, avg=0.92, std=0.00, steps=2.335e+07
2023-07-07 14:10:08,690 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3900, best=0.94, avg=0.93, std=0.00, steps=2.397e+07
2023-07-07 14:10:14,441 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4000, best=0.94, avg=0.93, std=0.00, steps=2.458e+07
2023-07-07 14:10:20,178 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4100, best=0.94, avg=0.93, std=0.00, steps=2.520e+07
2023-07-07 14:10:25,908 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4200, best=0.94, avg=0.93, std=0.00, steps=2.581e+07
2023-07-07 14:10:31,645 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4300, best=0.94, avg=0.93, std=0.00, steps=2.643e+07
2023-07-07 14:10:37,372 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4400, best=0.94, avg=0.93, std=0.00, steps=2.704e+07
2023-07-07 14:10:43,101 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4500, best=0.94, avg=0.93, std=0.00, steps=2.765e+07
2023-07-07 14:10:48,844 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4600, best=0.94, avg=0.93, std=0.00, steps=2.827e+07
2023-07-07 14:10:54,570 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4700, best=0.94, avg=0.93, std=0.00, steps=2.888e+07
2023-07-07 14:11:00,293 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4800, best=0.94, avg=0.93, std=0.00, steps=2.950e+07
2023-07-07 14:11:06,019 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4900, best=0.94, avg=0.93, std=0.00, steps=3.011e+07
2023-07-07 14:11:11,758 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5000, best=0.94, avg=0.93, std=0.00, steps=3.073e+07
2023-07-07 14:11:17,502 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5100, best=0.94, avg=0.93, std=0.00, steps=3.134e+07
2023-07-07 14:11:23,258 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5200, best=0.94, avg=0.93, std=0.00, steps=3.195e+07
2023-07-07 14:11:28,987 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5300, best=0.94, avg=0.93, std=0.00, steps=3.257e+07
2023-07-07 14:11:34,708 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5400, best=0.94, avg=0.93, std=0.00, steps=3.318e+07
2023-07-07 14:11:40,443 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5500, best=0.94, avg=0.93, std=0.00, steps=3.380e+07
2023-07-07 14:11:46,166 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5600, best=0.94, avg=0.93, std=0.00, steps=3.441e+07
2023-07-07 14:11:51,906 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5700, best=0.94, avg=0.93, std=0.00, steps=3.503e+07
2023-07-07 14:11:57,617 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5800, best=0.94, avg=0.93, std=0.00, steps=3.564e+07
2023-07-07 14:12:03,343 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5900, best=0.94, avg=0.93, std=0.00, steps=3.626e+07
2023-07-07 14:12:09,060 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6000, best=0.94, avg=0.93, std=0.00, steps=3.687e+07
2023-07-07 14:12:14,768 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6100, best=0.94, avg=0.93, std=0.00, steps=3.748e+07
2023-07-07 14:12:20,482 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6200, best=0.94, avg=0.93, std=0.00, steps=3.810e+07
2023-07-07 14:12:26,199 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6300, best=0.94, avg=0.93, std=0.00, steps=3.871e+07
2023-07-07 14:12:31,909 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6400, best=0.94, avg=0.93, std=0.00, steps=3.933e+07
2023-07-07 14:12:37,639 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6500, best=0.94, avg=0.93, std=0.00, steps=3.994e+07
2023-07-07 14:12:43,365 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6600, best=0.94, avg=0.93, std=0.00, steps=4.056e+07
2023-07-07 14:12:49,097 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6700, best=0.94, avg=0.93, std=0.00, steps=4.117e+07
2023-07-07 14:12:54,835 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6800, best=0.94, avg=0.93, std=0.00, steps=4.179e+07
2023-07-07 14:13:00,566 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6900, best=0.94, avg=0.93, std=0.00, steps=4.240e+07
2023-07-07 14:13:06,307 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7000, best=0.95, avg=0.93, std=0.00, steps=4.301e+07
2023-07-07 14:13:12,070 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7100, best=0.94, avg=0.93, std=0.00, steps=4.363e+07
2023-07-07 14:13:17,799 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7200, best=0.94, avg=0.93, std=0.00, steps=4.424e+07
2023-07-07 14:13:23,521 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7300, best=0.94, avg=0.93, std=0.00, steps=4.486e+07
2023-07-07 14:13:29,258 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7400, best=0.94, avg=0.93, std=0.00, steps=4.547e+07
2023-07-07 14:13:35,021 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7500, best=0.94, avg=0.93, std=0.00, steps=4.609e+07
2023-07-07 14:13:40,769 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7600, best=0.94, avg=0.93, std=0.00, steps=4.670e+07
2023-07-07 14:13:46,525 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7700, best=0.94, avg=0.93, std=0.00, steps=4.731e+07
2023-07-07 14:13:52,258 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7800, best=0.94, avg=0.93, std=0.00, steps=4.793e+07
2023-07-07 14:13:57,997 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7900, best=0.94, avg=0.93, std=0.00, steps=4.854e+07
2023-07-07 14:14:03,718 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8000, best=0.94, avg=0.93, std=0.00, steps=4.916e+07
2023-07-07 14:14:09,442 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8100, best=0.94, avg=0.93, std=0.00, steps=4.977e+07
2023-07-07 14:14:15,181 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8200, best=0.94, avg=0.93, std=0.00, steps=5.039e+07
2023-07-07 14:14:20,914 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8300, best=0.94, avg=0.93, std=0.00, steps=5.100e+07
2023-07-07 14:14:26,623 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8400, best=0.94, avg=0.93, std=0.00, steps=5.162e+07
2023-07-07 14:14:32,349 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8500, best=0.94, avg=0.93, std=0.00, steps=5.223e+07
2023-07-07 14:14:38,072 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8600, best=0.94, avg=0.93, std=0.00, steps=5.284e+07
2023-07-07 14:14:43,788 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8700, best=0.94, avg=0.93, std=0.00, steps=5.346e+07
2023-07-07 14:14:49,513 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8800, best=0.94, avg=0.93, std=0.00, steps=5.407e+07
2023-07-07 14:14:55,260 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8900, best=0.94, avg=0.93, std=0.00, steps=5.469e+07
2023-07-07 14:15:01,000 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9000, best=0.94, avg=0.93, std=0.00, steps=5.530e+07
2023-07-07 14:15:06,742 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9100, best=0.94, avg=0.93, std=0.00, steps=5.592e+07
2023-07-07 14:15:12,482 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9200, best=0.94, avg=0.93, std=0.00, steps=5.653e+07
2023-07-07 14:15:18,213 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9300, best=0.94, avg=0.93, std=0.00, steps=5.715e+07
2023-07-07 14:15:23,938 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9400, best=0.94, avg=0.93, std=0.00, steps=5.776e+07
2023-07-07 14:15:29,678 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9500, best=0.94, avg=0.93, std=0.00, steps=5.837e+07
2023-07-07 14:15:35,416 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9600, best=0.94, avg=0.93, std=0.00, steps=5.899e+07
2023-07-07 14:15:41,167 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9700, best=0.94, avg=0.93, std=0.00, steps=5.960e+07
2023-07-07 14:15:46,901 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9800, best=0.94, avg=0.93, std=0.00, steps=6.022e+07
2023-07-07 14:15:52,625 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9900, best=0.94, avg=0.93, std=0.00, steps=6.083e+07
2023-07-07 14:15:58,344 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10000, best=0.94, avg=0.93, std=0.00, steps=6.145e+07
2023-07-07 14:16:04,071 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10100, best=0.94, avg=0.93, std=0.00, steps=6.206e+07
2023-07-07 14:16:09,831 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10200, best=0.94, avg=0.93, std=0.00, steps=6.267e+07
2023-07-07 14:16:15,552 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10300, best=0.94, avg=0.93, std=0.00, steps=6.329e+07
2023-07-07 14:16:21,287 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10400, best=0.94, avg=0.93, std=0.00, steps=6.390e+07
2023-07-07 14:16:27,019 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10500, best=0.94, avg=0.93, std=0.00, steps=6.452e+07
2023-07-07 14:16:32,738 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10600, best=0.94, avg=0.93, std=0.00, steps=6.513e+07
2023-07-07 14:16:38,479 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10700, best=0.94, avg=0.93, std=0.00, steps=6.575e+07
2023-07-07 14:16:44,198 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10800, best=0.94, avg=0.93, std=0.00, steps=6.636e+07
2023-07-07 14:16:49,933 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10900, best=0.94, avg=0.93, std=0.00, steps=6.698e+07
2023-07-07 14:16:55,670 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11000, best=0.94, avg=0.93, std=0.00, steps=6.759e+07
2023-07-07 14:17:01,413 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11100, best=0.94, avg=0.93, std=0.00, steps=6.820e+07
2023-07-07 14:17:07,159 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11200, best=0.94, avg=0.93, std=0.00, steps=6.882e+07
2023-07-07 14:17:12,882 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11300, best=0.94, avg=0.93, std=0.00, steps=6.943e+07
2023-07-07 14:17:18,617 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11400, best=0.94, avg=0.93, std=0.00, steps=7.005e+07
2023-07-07 14:17:24,339 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11500, best=0.94, avg=0.93, std=0.00, steps=7.066e+07
2023-07-07 14:17:30,075 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11600, best=0.94, avg=0.93, std=0.00, steps=7.128e+07
2023-07-07 14:17:35,814 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11700, best=0.94, avg=0.93, std=0.00, steps=7.189e+07
2023-07-07 14:17:41,569 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11800, best=0.94, avg=0.93, std=0.00, steps=7.251e+07
2023-07-07 14:17:47,321 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11900, best=0.94, avg=0.93, std=0.00, steps=7.312e+07
2023-07-07 14:17:53,003 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11999, best=0.94, avg=0.93, std=0.00, steps=7.373e+07
2023-07-07 14:17:53,004 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135823
2023-07-07 14:17:53,028 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 14:17:53,061 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 14:18:02,627 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 100, best=0.53, avg=0.50, std=0.01, steps=8.274e+05
2023-07-07 14:18:10,147 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 200, best=0.52, avg=0.50, std=0.01, steps=1.647e+06
2023-07-07 14:18:17,676 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 300, best=0.52, avg=0.50, std=0.01, steps=2.466e+06
2023-07-07 14:18:25,219 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 400, best=0.52, avg=0.50, std=0.01, steps=3.285e+06
2023-07-07 14:18:32,778 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 500, best=0.52, avg=0.50, std=0.01, steps=4.104e+06
2023-07-07 14:18:40,328 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 600, best=0.52, avg=0.50, std=0.01, steps=4.923e+06
2023-07-07 14:18:47,886 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 700, best=0.71, avg=0.69, std=0.01, steps=5.743e+06
2023-07-07 14:18:55,444 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 800, best=0.74, avg=0.72, std=0.01, steps=6.562e+06
2023-07-07 14:19:02,990 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 900, best=0.76, avg=0.74, std=0.01, steps=7.381e+06
2023-07-07 14:19:10,509 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1000, best=0.77, avg=0.75, std=0.01, steps=8.200e+06
2023-07-07 14:19:18,043 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1100, best=0.78, avg=0.76, std=0.01, steps=9.019e+06
2023-07-07 14:19:25,573 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1200, best=0.79, avg=0.77, std=0.01, steps=9.839e+06
2023-07-07 14:19:33,105 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1300, best=0.80, avg=0.78, std=0.01, steps=1.066e+07
2023-07-07 14:19:40,639 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1400, best=0.80, avg=0.79, std=0.01, steps=1.148e+07
2023-07-07 14:19:48,181 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1500, best=0.81, avg=0.79, std=0.01, steps=1.230e+07
2023-07-07 14:19:55,708 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1600, best=0.82, avg=0.81, std=0.01, steps=1.312e+07
2023-07-07 14:20:03,268 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1700, best=0.82, avg=0.81, std=0.01, steps=1.393e+07
2023-07-07 14:20:10,809 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1800, best=0.83, avg=0.82, std=0.01, steps=1.475e+07
2023-07-07 14:20:18,327 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1900, best=0.84, avg=0.82, std=0.01, steps=1.557e+07
2023-07-07 14:20:25,868 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2000, best=0.85, avg=0.83, std=0.00, steps=1.639e+07
2023-07-07 14:20:33,398 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2100, best=0.85, avg=0.83, std=0.01, steps=1.721e+07
2023-07-07 14:20:40,958 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2200, best=0.85, avg=0.84, std=0.00, steps=1.803e+07
2023-07-07 14:20:48,502 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2300, best=0.85, avg=0.84, std=0.00, steps=1.885e+07
2023-07-07 14:20:56,037 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2400, best=0.86, avg=0.84, std=0.00, steps=1.967e+07
2023-07-07 14:21:03,590 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2500, best=0.86, avg=0.85, std=0.00, steps=2.049e+07
2023-07-07 14:21:11,190 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2600, best=0.86, avg=0.85, std=0.00, steps=2.131e+07
2023-07-07 14:21:18,751 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2700, best=0.86, avg=0.85, std=0.00, steps=2.213e+07
2023-07-07 14:21:26,300 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2800, best=0.86, avg=0.85, std=0.00, steps=2.295e+07
2023-07-07 14:21:33,823 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2900, best=0.87, avg=0.85, std=0.00, steps=2.376e+07
2023-07-07 14:21:41,357 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3000, best=0.87, avg=0.86, std=0.00, steps=2.458e+07
2023-07-07 14:21:48,912 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3100, best=0.87, avg=0.86, std=0.00, steps=2.540e+07
2023-07-07 14:21:56,448 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3200, best=0.87, avg=0.86, std=0.00, steps=2.622e+07
2023-07-07 14:22:03,997 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3300, best=0.87, avg=0.86, std=0.00, steps=2.704e+07
2023-07-07 14:22:11,560 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3400, best=0.88, avg=0.86, std=0.00, steps=2.786e+07
2023-07-07 14:22:19,125 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3500, best=0.88, avg=0.87, std=0.00, steps=2.868e+07
2023-07-07 14:22:26,668 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3600, best=0.88, avg=0.87, std=0.00, steps=2.950e+07
2023-07-07 14:22:34,207 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3700, best=0.88, avg=0.87, std=0.00, steps=3.032e+07
2023-07-07 14:22:41,755 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3800, best=0.88, avg=0.87, std=0.00, steps=3.114e+07
2023-07-07 14:22:49,289 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3900, best=0.88, avg=0.87, std=0.00, steps=3.196e+07
2023-07-07 14:22:56,846 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4000, best=0.88, avg=0.87, std=0.00, steps=3.278e+07
2023-07-07 14:23:04,410 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4100, best=0.88, avg=0.87, std=0.00, steps=3.360e+07
2023-07-07 14:23:11,952 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4200, best=0.88, avg=0.87, std=0.00, steps=3.441e+07
2023-07-07 14:23:19,513 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4300, best=0.88, avg=0.87, std=0.00, steps=3.523e+07
2023-07-07 14:23:27,065 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4400, best=0.89, avg=0.88, std=0.00, steps=3.605e+07
2023-07-07 14:23:34,633 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4500, best=0.89, avg=0.88, std=0.00, steps=3.687e+07
2023-07-07 14:23:42,200 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4600, best=0.89, avg=0.88, std=0.00, steps=3.769e+07
2023-07-07 14:23:49,767 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4700, best=0.89, avg=0.88, std=0.00, steps=3.851e+07
2023-07-07 14:23:57,317 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4800, best=0.89, avg=0.88, std=0.00, steps=3.933e+07
2023-07-07 14:24:04,864 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4900, best=0.89, avg=0.88, std=0.00, steps=4.015e+07
2023-07-07 14:24:12,431 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5000, best=0.89, avg=0.88, std=0.00, steps=4.097e+07
2023-07-07 14:24:20,000 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5100, best=0.89, avg=0.88, std=0.00, steps=4.179e+07
2023-07-07 14:24:27,553 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5200, best=0.89, avg=0.88, std=0.00, steps=4.261e+07
2023-07-07 14:24:35,120 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5300, best=0.89, avg=0.88, std=0.00, steps=4.343e+07
2023-07-07 14:24:42,685 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5400, best=0.89, avg=0.88, std=0.00, steps=4.424e+07
2023-07-07 14:24:50,257 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5500, best=0.89, avg=0.88, std=0.00, steps=4.506e+07
2023-07-07 14:24:57,866 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5600, best=0.89, avg=0.88, std=0.00, steps=4.588e+07
2023-07-07 14:25:05,425 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5700, best=0.89, avg=0.88, std=0.00, steps=4.670e+07
2023-07-07 14:25:12,975 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5800, best=0.89, avg=0.88, std=0.00, steps=4.752e+07
2023-07-07 14:25:20,526 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5900, best=0.90, avg=0.89, std=0.00, steps=4.834e+07
2023-07-07 14:25:28,095 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6000, best=0.90, avg=0.89, std=0.00, steps=4.916e+07
2023-07-07 14:25:35,661 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6100, best=0.90, avg=0.89, std=0.00, steps=4.998e+07
2023-07-07 14:25:43,215 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6200, best=0.90, avg=0.89, std=0.00, steps=5.080e+07
2023-07-07 14:25:50,771 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6300, best=0.90, avg=0.89, std=0.00, steps=5.162e+07
2023-07-07 14:25:58,322 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6400, best=0.90, avg=0.89, std=0.00, steps=5.244e+07
2023-07-07 14:26:05,885 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6500, best=0.90, avg=0.89, std=0.00, steps=5.326e+07
2023-07-07 14:26:13,424 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6600, best=0.90, avg=0.89, std=0.00, steps=5.408e+07
2023-07-07 14:26:20,983 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6700, best=0.90, avg=0.89, std=0.00, steps=5.489e+07
2023-07-07 14:26:28,515 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6800, best=0.90, avg=0.89, std=0.00, steps=5.571e+07
2023-07-07 14:26:36,062 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6900, best=0.90, avg=0.89, std=0.00, steps=5.653e+07
2023-07-07 14:26:43,623 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7000, best=0.90, avg=0.89, std=0.00, steps=5.735e+07
2023-07-07 14:26:51,177 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7100, best=0.90, avg=0.89, std=0.00, steps=5.817e+07
2023-07-07 14:26:58,730 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7200, best=0.90, avg=0.89, std=0.00, steps=5.899e+07
2023-07-07 14:27:06,279 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7300, best=0.90, avg=0.89, std=0.00, steps=5.981e+07
2023-07-07 14:27:13,835 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7400, best=0.90, avg=0.89, std=0.00, steps=6.063e+07
2023-07-07 14:27:21,380 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7500, best=0.90, avg=0.89, std=0.00, steps=6.145e+07
2023-07-07 14:27:28,935 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7600, best=0.90, avg=0.89, std=0.00, steps=6.227e+07
2023-07-07 14:27:36,484 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7700, best=0.90, avg=0.89, std=0.00, steps=6.309e+07
2023-07-07 14:27:44,035 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7800, best=0.90, avg=0.89, std=0.00, steps=6.391e+07
2023-07-07 14:27:51,582 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7900, best=0.90, avg=0.89, std=0.00, steps=6.472e+07
2023-07-07 14:27:59,107 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8000, best=0.91, avg=0.89, std=0.00, steps=6.554e+07
2023-07-07 14:28:06,640 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8100, best=0.91, avg=0.90, std=0.00, steps=6.636e+07
2023-07-07 14:28:14,187 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8200, best=0.91, avg=0.90, std=0.00, steps=6.718e+07
2023-07-07 14:28:21,729 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8300, best=0.90, avg=0.90, std=0.00, steps=6.800e+07
2023-07-07 14:28:29,255 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8400, best=0.91, avg=0.90, std=0.00, steps=6.882e+07
2023-07-07 14:28:36,807 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8500, best=0.91, avg=0.90, std=0.00, steps=6.964e+07
2023-07-07 14:28:44,340 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8600, best=0.91, avg=0.90, std=0.00, steps=7.046e+07
2023-07-07 14:28:51,889 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8700, best=0.91, avg=0.90, std=0.00, steps=7.128e+07
2023-07-07 14:28:59,421 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8800, best=0.91, avg=0.90, std=0.00, steps=7.210e+07
2023-07-07 14:29:06,960 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8900, best=0.91, avg=0.90, std=0.00, steps=7.292e+07
2023-07-07 14:29:14,503 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9000, best=0.91, avg=0.90, std=0.00, steps=7.374e+07
2023-07-07 14:29:22,056 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9100, best=0.91, avg=0.90, std=0.00, steps=7.456e+07
2023-07-07 14:29:29,599 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9200, best=0.91, avg=0.90, std=0.00, steps=7.537e+07
2023-07-07 14:29:37,154 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9300, best=0.91, avg=0.90, std=0.00, steps=7.619e+07
2023-07-07 14:29:44,692 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9400, best=0.91, avg=0.90, std=0.00, steps=7.701e+07
2023-07-07 14:29:52,231 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9500, best=0.91, avg=0.90, std=0.00, steps=7.783e+07
2023-07-07 14:29:59,747 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9600, best=0.91, avg=0.90, std=0.00, steps=7.865e+07
2023-07-07 14:30:07,278 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9700, best=0.91, avg=0.90, std=0.00, steps=7.947e+07
2023-07-07 14:30:14,810 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9800, best=0.91, avg=0.90, std=0.00, steps=8.029e+07
2023-07-07 14:30:22,333 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9900, best=0.91, avg=0.90, std=0.00, steps=8.111e+07
2023-07-07 14:30:29,858 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10000, best=0.91, avg=0.90, std=0.00, steps=8.193e+07
2023-07-07 14:30:37,390 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10100, best=0.91, avg=0.90, std=0.00, steps=8.275e+07
2023-07-07 14:30:44,920 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10200, best=0.91, avg=0.90, std=0.00, steps=8.357e+07
2023-07-07 14:30:52,467 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10300, best=0.91, avg=0.90, std=0.00, steps=8.439e+07
2023-07-07 14:31:00,019 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10400, best=0.91, avg=0.90, std=0.00, steps=8.520e+07
2023-07-07 14:31:07,603 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10500, best=0.91, avg=0.90, std=0.00, steps=8.602e+07
2023-07-07 14:31:15,160 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10600, best=0.91, avg=0.90, std=0.00, steps=8.684e+07
2023-07-07 14:31:22,693 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10700, best=0.91, avg=0.90, std=0.00, steps=8.766e+07
2023-07-07 14:31:30,247 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10800, best=0.91, avg=0.90, std=0.00, steps=8.848e+07
2023-07-07 14:31:37,790 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10900, best=0.91, avg=0.90, std=0.00, steps=8.930e+07
2023-07-07 14:31:45,340 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11000, best=0.91, avg=0.90, std=0.00, steps=9.012e+07
2023-07-07 14:31:52,882 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11100, best=0.91, avg=0.90, std=0.00, steps=9.094e+07
2023-07-07 14:32:00,436 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11200, best=0.91, avg=0.90, std=0.00, steps=9.176e+07
2023-07-07 14:32:07,982 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11300, best=0.91, avg=0.90, std=0.00, steps=9.258e+07
2023-07-07 14:32:15,526 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11400, best=0.91, avg=0.90, std=0.00, steps=9.340e+07
2023-07-07 14:32:23,064 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11500, best=0.91, avg=0.90, std=0.00, steps=9.422e+07
2023-07-07 14:32:30,599 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11600, best=0.91, avg=0.90, std=0.00, steps=9.504e+07
2023-07-07 14:32:38,125 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11700, best=0.91, avg=0.90, std=0.00, steps=9.585e+07
2023-07-07 14:32:45,660 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11800, best=0.91, avg=0.90, std=0.00, steps=9.667e+07
2023-07-07 14:32:53,188 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11900, best=0.91, avg=0.90, std=0.00, steps=9.749e+07
2023-07-07 14:33:00,662 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11999, best=0.91, avg=0.90, std=0.00, steps=9.830e+07
2023-07-07 14:33:00,662 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135823
2023-07-07 14:33:00,685 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 14:33:00,715 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 14:33:13,918 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 100, best=0.52, avg=0.50, std=0.01, steps=1.241e+06
2023-07-07 14:33:25,079 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 200, best=0.52, avg=0.50, std=0.01, steps=2.470e+06
2023-07-07 14:33:36,240 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 300, best=0.52, avg=0.50, std=0.01, steps=3.699e+06
2023-07-07 14:33:47,404 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 400, best=0.65, avg=0.63, std=0.01, steps=4.927e+06
2023-07-07 14:33:58,594 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 500, best=0.67, avg=0.66, std=0.01, steps=6.156e+06
2023-07-07 14:34:09,798 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 600, best=0.70, avg=0.69, std=0.01, steps=7.385e+06
2023-07-07 14:34:20,954 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 700, best=0.72, avg=0.71, std=0.01, steps=8.614e+06
2023-07-07 14:34:32,101 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 800, best=0.73, avg=0.72, std=0.01, steps=9.843e+06
2023-07-07 14:34:43,268 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 900, best=0.74, avg=0.72, std=0.01, steps=1.107e+07
2023-07-07 14:34:54,419 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1000, best=0.74, avg=0.73, std=0.01, steps=1.230e+07
2023-07-07 14:35:05,584 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1100, best=0.75, avg=0.74, std=0.01, steps=1.353e+07
2023-07-07 14:35:16,742 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1200, best=0.76, avg=0.74, std=0.01, steps=1.476e+07
2023-07-07 14:35:27,904 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1300, best=0.76, avg=0.74, std=0.01, steps=1.599e+07
2023-07-07 14:35:39,065 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1400, best=0.77, avg=0.75, std=0.01, steps=1.722e+07
2023-07-07 14:35:50,235 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1500, best=0.77, avg=0.75, std=0.01, steps=1.844e+07
2023-07-07 14:36:01,404 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1600, best=0.77, avg=0.75, std=0.01, steps=1.967e+07
2023-07-07 14:36:12,550 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1700, best=0.76, avg=0.75, std=0.01, steps=2.090e+07
2023-07-07 14:36:23,701 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1800, best=0.77, avg=0.76, std=0.01, steps=2.213e+07
2023-07-07 14:36:34,860 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1900, best=0.77, avg=0.76, std=0.01, steps=2.336e+07
2023-07-07 14:36:46,037 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2000, best=0.77, avg=0.76, std=0.00, steps=2.459e+07
2023-07-07 14:36:57,180 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2100, best=0.78, avg=0.76, std=0.01, steps=2.582e+07
2023-07-07 14:37:08,323 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2200, best=0.78, avg=0.76, std=0.01, steps=2.705e+07
2023-07-07 14:37:19,477 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2300, best=0.78, avg=0.77, std=0.01, steps=2.827e+07
2023-07-07 14:37:30,612 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2400, best=0.78, avg=0.77, std=0.00, steps=2.950e+07
2023-07-07 14:37:41,774 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2500, best=0.78, avg=0.77, std=0.01, steps=3.073e+07
2023-07-07 14:37:52,950 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2600, best=0.78, avg=0.77, std=0.00, steps=3.196e+07
2023-07-07 14:38:04,120 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2700, best=0.78, avg=0.77, std=0.00, steps=3.319e+07
2023-07-07 14:38:15,280 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2800, best=0.79, avg=0.78, std=0.01, steps=3.442e+07
2023-07-07 14:38:26,434 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2900, best=0.79, avg=0.78, std=0.00, steps=3.565e+07
2023-07-07 14:38:37,608 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3000, best=0.79, avg=0.78, std=0.01, steps=3.688e+07
2023-07-07 14:38:48,786 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3100, best=0.79, avg=0.78, std=0.00, steps=3.811e+07
2023-07-07 14:38:59,948 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3200, best=0.79, avg=0.78, std=0.00, steps=3.933e+07
2023-07-07 14:39:11,127 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3300, best=0.80, avg=0.78, std=0.01, steps=4.056e+07
2023-07-07 14:39:22,308 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3400, best=0.80, avg=0.79, std=0.00, steps=4.179e+07
2023-07-07 14:39:33,479 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3500, best=0.80, avg=0.79, std=0.00, steps=4.302e+07
2023-07-07 14:39:44,624 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3600, best=0.80, avg=0.79, std=0.00, steps=4.425e+07
2023-07-07 14:39:55,780 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3700, best=0.80, avg=0.79, std=0.00, steps=4.548e+07
2023-07-07 14:40:06,914 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3800, best=0.80, avg=0.79, std=0.00, steps=4.671e+07
2023-07-07 14:40:18,051 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3900, best=0.80, avg=0.79, std=0.00, steps=4.794e+07
2023-07-07 14:40:29,202 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4000, best=0.79, avg=0.79, std=0.00, steps=4.916e+07
2023-07-07 14:40:40,367 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4100, best=0.80, avg=0.79, std=0.00, steps=5.039e+07
2023-07-07 14:40:51,561 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4200, best=0.80, avg=0.79, std=0.00, steps=5.162e+07
2023-07-07 14:41:02,719 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4300, best=0.80, avg=0.79, std=0.00, steps=5.285e+07
2023-07-07 14:41:13,872 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4400, best=0.80, avg=0.79, std=0.00, steps=5.408e+07
2023-07-07 14:41:25,038 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4500, best=0.80, avg=0.79, std=0.00, steps=5.531e+07
2023-07-07 14:41:36,178 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4600, best=0.80, avg=0.79, std=0.00, steps=5.654e+07
2023-07-07 14:41:47,340 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4700, best=0.80, avg=0.79, std=0.00, steps=5.777e+07
2023-07-07 14:41:58,493 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4800, best=0.81, avg=0.79, std=0.00, steps=5.899e+07
2023-07-07 14:42:09,650 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4900, best=0.81, avg=0.80, std=0.00, steps=6.022e+07
2023-07-07 14:42:20,808 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5000, best=0.81, avg=0.80, std=0.00, steps=6.145e+07
2023-07-07 14:42:31,972 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5100, best=0.81, avg=0.80, std=0.01, steps=6.268e+07
2023-07-07 14:42:43,140 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5200, best=0.81, avg=0.80, std=0.00, steps=6.391e+07
2023-07-07 14:42:54,292 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5300, best=0.81, avg=0.80, std=0.00, steps=6.514e+07
2023-07-07 14:43:05,451 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5400, best=0.81, avg=0.80, std=0.00, steps=6.637e+07
2023-07-07 14:43:16,614 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5500, best=0.81, avg=0.80, std=0.00, steps=6.760e+07
2023-07-07 14:43:27,756 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5600, best=0.81, avg=0.80, std=0.00, steps=6.883e+07
2023-07-07 14:43:38,938 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5700, best=0.82, avg=0.81, std=0.00, steps=7.005e+07
2023-07-07 14:43:50,131 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5800, best=0.82, avg=0.81, std=0.00, steps=7.128e+07
2023-07-07 14:44:01,277 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5900, best=0.82, avg=0.81, std=0.00, steps=7.251e+07
2023-07-07 14:44:12,457 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6000, best=0.82, avg=0.81, std=0.00, steps=7.374e+07
2023-07-07 14:44:23,635 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6100, best=0.82, avg=0.81, std=0.00, steps=7.497e+07
2023-07-07 14:44:34,815 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6200, best=0.82, avg=0.81, std=0.00, steps=7.620e+07
2023-07-07 14:44:45,993 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6300, best=0.82, avg=0.81, std=0.01, steps=7.743e+07
2023-07-07 14:44:57,155 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6400, best=0.82, avg=0.81, std=0.00, steps=7.866e+07
2023-07-07 14:45:08,324 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6500, best=0.82, avg=0.81, std=0.00, steps=7.988e+07
2023-07-07 14:45:19,499 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6600, best=0.83, avg=0.81, std=0.00, steps=8.111e+07
2023-07-07 14:45:30,653 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6700, best=0.82, avg=0.81, std=0.00, steps=8.234e+07
2023-07-07 14:45:41,827 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6800, best=0.82, avg=0.81, std=0.00, steps=8.357e+07
2023-07-07 14:45:52,997 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6900, best=0.83, avg=0.81, std=0.01, steps=8.480e+07
2023-07-07 14:46:04,148 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7000, best=0.82, avg=0.81, std=0.00, steps=8.603e+07
2023-07-07 14:46:15,295 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7100, best=0.83, avg=0.81, std=0.00, steps=8.726e+07
2023-07-07 14:46:26,448 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7200, best=0.82, avg=0.81, std=0.00, steps=8.849e+07
2023-07-07 14:46:37,594 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7300, best=0.83, avg=0.81, std=0.00, steps=8.971e+07
2023-07-07 14:46:48,768 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7400, best=0.83, avg=0.82, std=0.00, steps=9.094e+07
2023-07-07 14:46:59,928 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7500, best=0.83, avg=0.82, std=0.00, steps=9.217e+07
2023-07-07 14:47:11,087 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7600, best=0.83, avg=0.82, std=0.00, steps=9.340e+07
2023-07-07 14:47:22,246 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7700, best=0.83, avg=0.82, std=0.00, steps=9.463e+07
2023-07-07 14:47:33,405 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7800, best=0.83, avg=0.82, std=0.00, steps=9.586e+07
2023-07-07 14:47:44,570 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7900, best=0.83, avg=0.82, std=0.00, steps=9.709e+07
2023-07-07 14:47:55,727 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8000, best=0.83, avg=0.82, std=0.00, steps=9.832e+07
2023-07-07 14:48:06,863 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8100, best=0.83, avg=0.82, std=0.00, steps=9.955e+07
2023-07-07 14:48:18,003 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8200, best=0.83, avg=0.82, std=0.00, steps=1.008e+08
2023-07-07 14:48:29,142 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8300, best=0.83, avg=0.82, std=0.00, steps=1.020e+08
2023-07-07 14:48:40,298 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8400, best=0.84, avg=0.82, std=0.00, steps=1.032e+08
2023-07-07 14:48:51,440 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8500, best=0.83, avg=0.82, std=0.00, steps=1.045e+08
2023-07-07 14:49:02,591 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8600, best=0.83, avg=0.82, std=0.00, steps=1.057e+08
2023-07-07 14:49:13,739 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8700, best=0.83, avg=0.82, std=0.00, steps=1.069e+08
2023-07-07 14:49:24,895 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8800, best=0.83, avg=0.82, std=0.00, steps=1.081e+08
2023-07-07 14:49:36,034 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8900, best=0.83, avg=0.82, std=0.00, steps=1.094e+08
2023-07-07 14:49:47,174 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9000, best=0.83, avg=0.82, std=0.00, steps=1.106e+08
2023-07-07 14:49:58,323 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9100, best=0.84, avg=0.82, std=0.00, steps=1.118e+08
2023-07-07 14:50:09,485 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9200, best=0.83, avg=0.82, std=0.00, steps=1.131e+08
2023-07-07 14:50:20,640 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9300, best=0.84, avg=0.82, std=0.00, steps=1.143e+08
2023-07-07 14:50:31,801 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9400, best=0.84, avg=0.82, std=0.00, steps=1.155e+08
2023-07-07 14:50:42,979 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9500, best=0.83, avg=0.82, std=0.00, steps=1.167e+08
2023-07-07 14:50:54,202 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9600, best=0.84, avg=0.82, std=0.00, steps=1.180e+08
2023-07-07 14:51:05,376 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9700, best=0.83, avg=0.82, std=0.00, steps=1.192e+08
2023-07-07 14:51:16,541 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9800, best=0.83, avg=0.82, std=0.00, steps=1.204e+08
2023-07-07 14:51:27,716 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9900, best=0.84, avg=0.82, std=0.00, steps=1.217e+08
2023-07-07 14:51:38,886 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10000, best=0.83, avg=0.82, std=0.00, steps=1.229e+08
2023-07-07 14:51:50,056 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10100, best=0.83, avg=0.82, std=0.00, steps=1.241e+08
2023-07-07 14:52:01,204 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10200, best=0.83, avg=0.82, std=0.00, steps=1.253e+08
2023-07-07 14:52:12,399 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10300, best=0.83, avg=0.82, std=0.00, steps=1.266e+08
2023-07-07 14:52:23,597 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10400, best=0.83, avg=0.82, std=0.00, steps=1.278e+08
2023-07-07 14:52:34,780 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10500, best=0.83, avg=0.82, std=0.00, steps=1.290e+08
2023-07-07 14:52:45,950 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10600, best=0.83, avg=0.82, std=0.00, steps=1.303e+08
2023-07-07 14:52:57,101 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10700, best=0.84, avg=0.82, std=0.00, steps=1.315e+08
2023-07-07 14:53:08,256 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10800, best=0.84, avg=0.82, std=0.00, steps=1.327e+08
2023-07-07 14:53:19,413 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10900, best=0.83, avg=0.82, std=0.00, steps=1.340e+08
2023-07-07 14:53:30,597 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11000, best=0.84, avg=0.82, std=0.00, steps=1.352e+08
2023-07-07 14:53:41,760 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11100, best=0.83, avg=0.82, std=0.00, steps=1.364e+08
2023-07-07 14:53:52,907 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11200, best=0.84, avg=0.82, std=0.00, steps=1.376e+08
2023-07-07 14:54:04,084 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11300, best=0.83, avg=0.82, std=0.00, steps=1.389e+08
2023-07-07 14:54:15,256 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11400, best=0.83, avg=0.82, std=0.00, steps=1.401e+08
2023-07-07 14:54:26,419 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11500, best=0.84, avg=0.82, std=0.00, steps=1.413e+08
2023-07-07 14:54:37,592 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11600, best=0.83, avg=0.82, std=0.00, steps=1.426e+08
2023-07-07 14:54:48,779 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11700, best=0.84, avg=0.82, std=0.00, steps=1.438e+08
2023-07-07 14:54:59,970 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11800, best=0.84, avg=0.82, std=0.00, steps=1.450e+08
2023-07-07 14:55:11,121 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11900, best=0.84, avg=0.82, std=0.00, steps=1.462e+08
2023-07-07 14:55:22,174 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11999, best=0.83, avg=0.82, std=0.00, steps=1.475e+08
2023-07-07 14:55:22,175 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135823
2023-07-07 14:55:22,201 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 14:55:22,232 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 14:55:31,775 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 100, best=0.68, avg=0.67, std=0.00, steps=8.274e+05
2023-07-07 14:55:39,300 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 200, best=0.72, avg=0.71, std=0.00, steps=1.647e+06
2023-07-07 14:55:46,823 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 300, best=0.75, avg=0.74, std=0.00, steps=2.466e+06
2023-07-07 14:55:54,370 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 400, best=0.77, avg=0.76, std=0.00, steps=3.285e+06
2023-07-07 14:56:01,930 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 500, best=0.79, avg=0.78, std=0.00, steps=4.104e+06
2023-07-07 14:56:09,478 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 600, best=0.81, avg=0.80, std=0.00, steps=4.923e+06
2023-07-07 14:56:17,010 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 700, best=0.82, avg=0.82, std=0.00, steps=5.743e+06
2023-07-07 14:56:24,523 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 800, best=0.83, avg=0.82, std=0.00, steps=6.562e+06
2023-07-07 14:56:32,047 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 900, best=0.84, avg=0.83, std=0.00, steps=7.381e+06
2023-07-07 14:56:39,586 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1000, best=0.84, avg=0.83, std=0.00, steps=8.200e+06
2023-07-07 14:56:47,119 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1100, best=0.85, avg=0.84, std=0.00, steps=9.019e+06
2023-07-07 14:56:54,655 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1200, best=0.85, avg=0.84, std=0.00, steps=9.839e+06
2023-07-07 14:57:02,196 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1300, best=0.86, avg=0.84, std=0.00, steps=1.066e+07
2023-07-07 14:57:09,743 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1400, best=0.86, avg=0.85, std=0.00, steps=1.148e+07
2023-07-07 14:57:17,301 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1500, best=0.86, avg=0.85, std=0.00, steps=1.230e+07
2023-07-07 14:57:24,848 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1600, best=0.86, avg=0.85, std=0.00, steps=1.312e+07
2023-07-07 14:57:32,381 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1700, best=0.86, avg=0.85, std=0.00, steps=1.393e+07
2023-07-07 14:57:39,929 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1800, best=0.87, avg=0.86, std=0.00, steps=1.475e+07
2023-07-07 14:57:47,467 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1900, best=0.86, avg=0.86, std=0.00, steps=1.557e+07
2023-07-07 14:57:55,009 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2000, best=0.87, avg=0.86, std=0.00, steps=1.639e+07
2023-07-07 14:58:02,568 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2100, best=0.87, avg=0.86, std=0.00, steps=1.721e+07
2023-07-07 14:58:10,121 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2200, best=0.87, avg=0.86, std=0.00, steps=1.803e+07
2023-07-07 14:58:17,667 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2300, best=0.87, avg=0.86, std=0.00, steps=1.885e+07
2023-07-07 14:58:25,207 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2400, best=0.87, avg=0.87, std=0.00, steps=1.967e+07
2023-07-07 14:58:32,747 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2500, best=0.88, avg=0.87, std=0.00, steps=2.049e+07
2023-07-07 14:58:40,310 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2600, best=0.88, avg=0.87, std=0.00, steps=2.131e+07
2023-07-07 14:58:47,861 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2700, best=0.88, avg=0.87, std=0.00, steps=2.213e+07
2023-07-07 14:58:55,401 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2800, best=0.88, avg=0.87, std=0.00, steps=2.295e+07
2023-07-07 14:59:02,941 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2900, best=0.88, avg=0.87, std=0.00, steps=2.376e+07
2023-07-07 14:59:10,506 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3000, best=0.88, avg=0.87, std=0.00, steps=2.458e+07
2023-07-07 14:59:18,049 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3100, best=0.88, avg=0.87, std=0.00, steps=2.540e+07
2023-07-07 14:59:25,602 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3200, best=0.88, avg=0.87, std=0.00, steps=2.622e+07
2023-07-07 14:59:33,152 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3300, best=0.88, avg=0.88, std=0.00, steps=2.704e+07
2023-07-07 14:59:40,698 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3400, best=0.88, avg=0.88, std=0.00, steps=2.786e+07
2023-07-07 14:59:48,250 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3500, best=0.89, avg=0.88, std=0.00, steps=2.868e+07
2023-07-07 14:59:55,819 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3600, best=0.89, avg=0.88, std=0.00, steps=2.950e+07
2023-07-07 15:00:03,372 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3700, best=0.89, avg=0.88, std=0.00, steps=3.032e+07
2023-07-07 15:00:10,916 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3800, best=0.89, avg=0.88, std=0.00, steps=3.114e+07
2023-07-07 15:00:18,460 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3900, best=0.89, avg=0.88, std=0.00, steps=3.196e+07
2023-07-07 15:00:26,005 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4000, best=0.89, avg=0.88, std=0.00, steps=3.278e+07
2023-07-07 15:00:33,538 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4100, best=0.89, avg=0.88, std=0.00, steps=3.360e+07
2023-07-07 15:00:41,071 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4200, best=0.89, avg=0.88, std=0.00, steps=3.441e+07
2023-07-07 15:00:48,625 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4300, best=0.89, avg=0.88, std=0.00, steps=3.523e+07
2023-07-07 15:00:56,167 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4400, best=0.89, avg=0.88, std=0.00, steps=3.605e+07
2023-07-07 15:01:03,714 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4500, best=0.89, avg=0.88, std=0.00, steps=3.687e+07
2023-07-07 15:01:11,255 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4600, best=0.89, avg=0.88, std=0.00, steps=3.769e+07
2023-07-07 15:01:18,792 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4700, best=0.89, avg=0.88, std=0.00, steps=3.851e+07
2023-07-07 15:01:26,345 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4800, best=0.89, avg=0.88, std=0.00, steps=3.933e+07
2023-07-07 15:01:33,888 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4900, best=0.89, avg=0.88, std=0.00, steps=4.015e+07
2023-07-07 15:01:41,425 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5000, best=0.89, avg=0.88, std=0.00, steps=4.097e+07
2023-07-07 15:01:48,974 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5100, best=0.89, avg=0.88, std=0.00, steps=4.179e+07
2023-07-07 15:01:56,523 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5200, best=0.89, avg=0.88, std=0.00, steps=4.261e+07
2023-07-07 15:02:04,072 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5300, best=0.89, avg=0.88, std=0.00, steps=4.343e+07
2023-07-07 15:02:11,609 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5400, best=0.89, avg=0.88, std=0.00, steps=4.424e+07
2023-07-07 15:02:19,158 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5500, best=0.89, avg=0.88, std=0.00, steps=4.506e+07
2023-07-07 15:02:26,717 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5600, best=0.89, avg=0.88, std=0.00, steps=4.588e+07
2023-07-07 15:02:34,268 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5700, best=0.89, avg=0.88, std=0.00, steps=4.670e+07
2023-07-07 15:02:41,834 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5800, best=0.89, avg=0.88, std=0.00, steps=4.752e+07
2023-07-07 15:02:49,386 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5900, best=0.89, avg=0.88, std=0.00, steps=4.834e+07
2023-07-07 15:02:56,930 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6000, best=0.89, avg=0.88, std=0.00, steps=4.916e+07
2023-07-07 15:03:04,499 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6100, best=0.89, avg=0.88, std=0.00, steps=4.998e+07
2023-07-07 15:03:12,047 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6200, best=0.89, avg=0.88, std=0.00, steps=5.080e+07
2023-07-07 15:03:19,592 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6300, best=0.89, avg=0.88, std=0.00, steps=5.162e+07
2023-07-07 15:03:27,135 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6400, best=0.89, avg=0.88, std=0.00, steps=5.244e+07
2023-07-07 15:03:34,694 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6500, best=0.89, avg=0.88, std=0.00, steps=5.326e+07
2023-07-07 15:03:42,249 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6600, best=0.89, avg=0.88, std=0.00, steps=5.408e+07
2023-07-07 15:03:49,797 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6700, best=0.90, avg=0.88, std=0.00, steps=5.489e+07
2023-07-07 15:03:57,360 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6800, best=0.89, avg=0.88, std=0.00, steps=5.571e+07
2023-07-07 15:04:04,897 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6900, best=0.89, avg=0.88, std=0.00, steps=5.653e+07
2023-07-07 15:04:12,425 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7000, best=0.89, avg=0.88, std=0.00, steps=5.735e+07
2023-07-07 15:04:19,975 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7100, best=0.89, avg=0.88, std=0.00, steps=5.817e+07
2023-07-07 15:04:27,518 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7200, best=0.89, avg=0.89, std=0.00, steps=5.899e+07
2023-07-07 15:04:35,065 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7300, best=0.89, avg=0.89, std=0.00, steps=5.981e+07
2023-07-07 15:04:42,624 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7400, best=0.89, avg=0.89, std=0.00, steps=6.063e+07
2023-07-07 15:04:50,184 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7500, best=0.89, avg=0.89, std=0.00, steps=6.145e+07
2023-07-07 15:04:57,725 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7600, best=0.89, avg=0.89, std=0.00, steps=6.227e+07
2023-07-07 15:05:05,280 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7700, best=0.89, avg=0.89, std=0.00, steps=6.309e+07
2023-07-07 15:05:12,818 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7800, best=0.90, avg=0.90, std=0.00, steps=6.391e+07
2023-07-07 15:05:20,341 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7900, best=0.91, avg=0.91, std=0.00, steps=6.472e+07
2023-07-07 15:05:27,870 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8000, best=0.92, avg=0.91, std=0.00, steps=6.554e+07
2023-07-07 15:05:35,397 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8100, best=0.92, avg=0.91, std=0.00, steps=6.636e+07
2023-07-07 15:05:42,930 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8200, best=0.92, avg=0.92, std=0.00, steps=6.718e+07
2023-07-07 15:05:50,461 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8300, best=0.92, avg=0.92, std=0.00, steps=6.800e+07
2023-07-07 15:05:58,005 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8400, best=0.92, avg=0.92, std=0.00, steps=6.882e+07
2023-07-07 15:06:05,566 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8500, best=0.92, avg=0.92, std=0.00, steps=6.964e+07
2023-07-07 15:06:13,129 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8600, best=0.92, avg=0.92, std=0.00, steps=7.046e+07
2023-07-07 15:06:20,673 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8700, best=0.92, avg=0.92, std=0.00, steps=7.128e+07
2023-07-07 15:06:28,199 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8800, best=0.93, avg=0.92, std=0.00, steps=7.210e+07
2023-07-07 15:06:35,727 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8900, best=0.93, avg=0.92, std=0.00, steps=7.292e+07
2023-07-07 15:06:43,284 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9000, best=0.93, avg=0.93, std=0.00, steps=7.374e+07
2023-07-07 15:06:50,826 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9100, best=0.94, avg=0.93, std=0.00, steps=7.456e+07
2023-07-07 15:06:58,357 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9200, best=0.94, avg=0.93, std=0.00, steps=7.537e+07
2023-07-07 15:07:05,920 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9300, best=0.94, avg=0.93, std=0.00, steps=7.619e+07
2023-07-07 15:07:13,470 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9400, best=0.94, avg=0.93, std=0.00, steps=7.701e+07
2023-07-07 15:07:21,009 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9500, best=0.94, avg=0.93, std=0.00, steps=7.783e+07
2023-07-07 15:07:28,569 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9600, best=0.94, avg=0.93, std=0.00, steps=7.865e+07
2023-07-07 15:07:36,106 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9700, best=0.94, avg=0.93, std=0.00, steps=7.947e+07
2023-07-07 15:07:43,654 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9800, best=0.94, avg=0.93, std=0.00, steps=8.029e+07
2023-07-07 15:07:51,184 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9900, best=0.94, avg=0.93, std=0.00, steps=8.111e+07
2023-07-07 15:07:58,713 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10000, best=0.94, avg=0.93, std=0.00, steps=8.193e+07
2023-07-07 15:08:06,262 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10100, best=0.94, avg=0.93, std=0.00, steps=8.275e+07
2023-07-07 15:08:13,808 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10200, best=0.94, avg=0.94, std=0.00, steps=8.357e+07
2023-07-07 15:08:21,349 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10300, best=0.94, avg=0.94, std=0.00, steps=8.439e+07
2023-07-07 15:08:28,887 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10400, best=0.94, avg=0.94, std=0.00, steps=8.520e+07
2023-07-07 15:08:36,428 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10500, best=0.94, avg=0.94, std=0.00, steps=8.602e+07
2023-07-07 15:08:43,979 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10600, best=0.95, avg=0.94, std=0.00, steps=8.684e+07
2023-07-07 15:08:51,538 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10700, best=0.95, avg=0.94, std=0.00, steps=8.766e+07
2023-07-07 15:08:59,090 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10800, best=0.95, avg=0.94, std=0.00, steps=8.848e+07
2023-07-07 15:09:06,642 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10900, best=0.95, avg=0.94, std=0.00, steps=8.930e+07
2023-07-07 15:09:14,208 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11000, best=0.95, avg=0.94, std=0.00, steps=9.012e+07
2023-07-07 15:09:21,761 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11100, best=0.95, avg=0.94, std=0.00, steps=9.094e+07
2023-07-07 15:09:29,325 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11200, best=0.95, avg=0.94, std=0.00, steps=9.176e+07
2023-07-07 15:09:36,878 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11300, best=0.95, avg=0.94, std=0.00, steps=9.258e+07
2023-07-07 15:09:44,453 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11400, best=0.95, avg=0.94, std=0.00, steps=9.340e+07
2023-07-07 15:09:52,004 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11500, best=0.95, avg=0.94, std=0.00, steps=9.422e+07
2023-07-07 15:09:59,573 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11600, best=0.95, avg=0.95, std=0.00, steps=9.504e+07
2023-07-07 15:10:07,135 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11700, best=0.95, avg=0.95, std=0.00, steps=9.585e+07
2023-07-07 15:10:14,693 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11800, best=0.95, avg=0.95, std=0.00, steps=9.667e+07
2023-07-07 15:10:22,237 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11900, best=0.95, avg=0.95, std=0.00, steps=9.749e+07
2023-07-07 15:10:29,718 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11999, best=0.96, avg=0.95, std=0.00, steps=9.830e+07
2023-07-07 15:10:29,719 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135823
2023-07-07 15:10:29,744 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 15:10:29,776 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 15:10:41,397 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 100, best=0.51, avg=0.50, std=0.01, steps=1.034e+06
2023-07-07 15:10:50,746 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 200, best=0.51, avg=0.50, std=0.01, steps=2.058e+06
2023-07-07 15:11:00,100 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 300, best=0.52, avg=0.50, std=0.01, steps=3.082e+06
2023-07-07 15:11:09,479 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 400, best=0.51, avg=0.50, std=0.01, steps=4.106e+06
2023-07-07 15:11:18,842 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 500, best=0.51, avg=0.50, std=0.01, steps=5.130e+06
2023-07-07 15:11:28,212 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 600, best=0.51, avg=0.50, std=0.01, steps=6.154e+06
2023-07-07 15:11:37,567 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 700, best=0.51, avg=0.50, std=0.01, steps=7.178e+06
2023-07-07 15:11:46,930 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 800, best=0.51, avg=0.50, std=0.01, steps=8.202e+06
2023-07-07 15:11:56,275 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 900, best=0.52, avg=0.50, std=0.01, steps=9.226e+06
2023-07-07 15:12:05,625 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1000, best=0.51, avg=0.50, std=0.01, steps=1.025e+07
2023-07-07 15:12:14,973 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1100, best=0.52, avg=0.50, std=0.01, steps=1.127e+07
2023-07-07 15:12:24,313 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1200, best=0.52, avg=0.50, std=0.01, steps=1.230e+07
2023-07-07 15:12:33,675 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1300, best=0.51, avg=0.50, std=0.01, steps=1.332e+07
2023-07-07 15:12:43,067 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1400, best=0.51, avg=0.50, std=0.01, steps=1.435e+07
2023-07-07 15:12:52,383 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1500, best=0.51, avg=0.50, std=0.01, steps=1.537e+07
2023-07-07 15:13:01,707 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1600, best=0.51, avg=0.50, std=0.00, steps=1.639e+07
2023-07-07 15:13:11,048 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1700, best=0.51, avg=0.50, std=0.01, steps=1.742e+07
2023-07-07 15:13:20,382 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1800, best=0.52, avg=0.50, std=0.01, steps=1.844e+07
2023-07-07 15:13:29,735 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1900, best=0.52, avg=0.50, std=0.01, steps=1.947e+07
2023-07-07 15:13:39,091 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2000, best=0.51, avg=0.50, std=0.01, steps=2.049e+07
2023-07-07 15:13:48,455 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2100, best=0.51, avg=0.50, std=0.01, steps=2.151e+07
2023-07-07 15:13:57,796 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2200, best=0.51, avg=0.50, std=0.00, steps=2.254e+07
2023-07-07 15:14:07,124 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2300, best=0.52, avg=0.50, std=0.01, steps=2.356e+07
2023-07-07 15:14:16,487 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2400, best=0.52, avg=0.50, std=0.01, steps=2.459e+07
2023-07-07 15:14:25,838 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2500, best=0.51, avg=0.50, std=0.01, steps=2.561e+07
2023-07-07 15:14:35,216 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2600, best=0.51, avg=0.50, std=0.01, steps=2.663e+07
2023-07-07 15:14:44,563 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2700, best=0.52, avg=0.50, std=0.01, steps=2.766e+07
2023-07-07 15:14:53,910 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2800, best=0.52, avg=0.50, std=0.01, steps=2.868e+07
2023-07-07 15:15:03,234 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2900, best=0.51, avg=0.50, std=0.01, steps=2.971e+07
2023-07-07 15:15:12,581 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3000, best=0.52, avg=0.50, std=0.01, steps=3.073e+07
2023-07-07 15:15:21,927 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3100, best=0.51, avg=0.50, std=0.01, steps=3.175e+07
2023-07-07 15:15:31,289 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3200, best=0.52, avg=0.50, std=0.01, steps=3.278e+07
2023-07-07 15:15:40,637 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3300, best=0.51, avg=0.50, std=0.01, steps=3.380e+07
2023-07-07 15:15:50,005 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3400, best=0.51, avg=0.50, std=0.01, steps=3.483e+07
2023-07-07 15:15:59,379 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3500, best=0.51, avg=0.50, std=0.01, steps=3.585e+07
2023-07-07 15:16:08,740 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3600, best=0.52, avg=0.50, std=0.01, steps=3.687e+07
2023-07-07 15:16:18,088 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3700, best=0.51, avg=0.50, std=0.01, steps=3.790e+07
2023-07-07 15:16:27,441 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3800, best=0.51, avg=0.50, std=0.01, steps=3.892e+07
2023-07-07 15:16:36,794 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3900, best=0.51, avg=0.50, std=0.01, steps=3.995e+07
2023-07-07 15:16:46,143 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4000, best=0.52, avg=0.50, std=0.01, steps=4.097e+07
2023-07-07 15:16:55,503 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4100, best=0.51, avg=0.50, std=0.01, steps=4.199e+07
2023-07-07 15:17:04,866 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4200, best=0.51, avg=0.50, std=0.00, steps=4.302e+07
2023-07-07 15:17:14,219 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4300, best=0.52, avg=0.50, std=0.01, steps=4.404e+07
2023-07-07 15:17:23,571 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4400, best=0.52, avg=0.50, std=0.01, steps=4.507e+07
2023-07-07 15:17:32,915 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4500, best=0.52, avg=0.50, std=0.01, steps=4.609e+07
2023-07-07 15:17:42,243 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4600, best=0.51, avg=0.50, std=0.01, steps=4.711e+07
2023-07-07 15:17:51,609 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4700, best=0.51, avg=0.50, std=0.01, steps=4.814e+07
2023-07-07 15:18:00,957 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4800, best=0.52, avg=0.50, std=0.01, steps=4.916e+07
2023-07-07 15:18:10,277 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4900, best=0.52, avg=0.50, std=0.01, steps=5.019e+07
2023-07-07 15:18:19,607 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5000, best=0.51, avg=0.50, std=0.01, steps=5.121e+07
2023-07-07 15:18:28,950 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5100, best=0.52, avg=0.50, std=0.01, steps=5.223e+07
2023-07-07 15:18:38,300 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5200, best=0.51, avg=0.50, std=0.01, steps=5.326e+07
2023-07-07 15:18:47,637 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5300, best=0.51, avg=0.50, std=0.01, steps=5.428e+07
2023-07-07 15:18:56,997 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5400, best=0.51, avg=0.50, std=0.01, steps=5.531e+07
2023-07-07 15:19:06,339 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5500, best=0.51, avg=0.50, std=0.01, steps=5.633e+07
2023-07-07 15:19:15,680 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5600, best=0.52, avg=0.50, std=0.01, steps=5.735e+07
2023-07-07 15:19:25,036 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5700, best=0.51, avg=0.50, std=0.01, steps=5.838e+07
2023-07-07 15:19:34,397 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5800, best=0.51, avg=0.50, std=0.01, steps=5.940e+07
2023-07-07 15:19:43,776 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5900, best=0.51, avg=0.50, std=0.01, steps=6.043e+07
2023-07-07 15:19:53,119 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6000, best=0.51, avg=0.50, std=0.01, steps=6.145e+07
2023-07-07 15:20:02,477 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6100, best=0.52, avg=0.50, std=0.01, steps=6.247e+07
2023-07-07 15:20:11,876 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6200, best=0.51, avg=0.50, std=0.01, steps=6.350e+07
2023-07-07 15:20:21,240 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6300, best=0.51, avg=0.50, std=0.01, steps=6.452e+07
2023-07-07 15:20:30,580 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6400, best=0.51, avg=0.50, std=0.00, steps=6.555e+07
2023-07-07 15:20:39,920 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6500, best=0.51, avg=0.50, std=0.01, steps=6.657e+07
2023-07-07 15:20:49,267 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6600, best=0.51, avg=0.50, std=0.01, steps=6.759e+07
2023-07-07 15:20:58,633 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6700, best=0.52, avg=0.50, std=0.01, steps=6.862e+07
2023-07-07 15:21:08,019 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6800, best=0.51, avg=0.50, std=0.01, steps=6.964e+07
2023-07-07 15:21:17,380 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6900, best=0.51, avg=0.50, std=0.01, steps=7.067e+07
2023-07-07 15:21:26,737 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7000, best=0.51, avg=0.50, std=0.01, steps=7.169e+07
2023-07-07 15:21:36,077 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7100, best=0.52, avg=0.50, std=0.01, steps=7.271e+07
2023-07-07 15:21:45,427 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7200, best=0.51, avg=0.50, std=0.01, steps=7.374e+07
2023-07-07 15:21:54,783 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7300, best=0.51, avg=0.50, std=0.01, steps=7.476e+07
2023-07-07 15:22:04,132 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7400, best=0.51, avg=0.50, std=0.01, steps=7.579e+07
2023-07-07 15:22:13,481 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7500, best=0.52, avg=0.50, std=0.01, steps=7.681e+07
2023-07-07 15:22:22,870 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7600, best=0.51, avg=0.50, std=0.01, steps=7.783e+07
2023-07-07 15:22:32,214 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7700, best=0.52, avg=0.50, std=0.01, steps=7.886e+07
2023-07-07 15:22:41,577 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7800, best=0.51, avg=0.50, std=0.01, steps=7.988e+07
2023-07-07 15:22:50,941 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7900, best=0.51, avg=0.50, std=0.01, steps=8.091e+07
2023-07-07 15:23:00,294 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8000, best=0.52, avg=0.50, std=0.01, steps=8.193e+07
2023-07-07 15:23:09,654 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8100, best=0.52, avg=0.50, std=0.01, steps=8.295e+07
2023-07-07 15:23:19,005 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8200, best=0.51, avg=0.50, std=0.01, steps=8.398e+07
2023-07-07 15:23:28,363 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8300, best=0.51, avg=0.50, std=0.01, steps=8.500e+07
2023-07-07 15:23:37,707 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8400, best=0.52, avg=0.50, std=0.01, steps=8.603e+07
2023-07-07 15:23:47,047 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8500, best=0.51, avg=0.50, std=0.01, steps=8.705e+07
2023-07-07 15:23:56,415 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8600, best=0.51, avg=0.50, std=0.01, steps=8.807e+07
2023-07-07 15:24:05,789 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8700, best=0.51, avg=0.50, std=0.01, steps=8.910e+07
2023-07-07 15:24:15,149 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8800, best=0.51, avg=0.50, std=0.01, steps=9.012e+07
2023-07-07 15:24:24,503 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8900, best=0.51, avg=0.50, std=0.01, steps=9.115e+07
2023-07-07 15:24:33,861 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9000, best=0.52, avg=0.50, std=0.01, steps=9.217e+07
2023-07-07 15:24:43,209 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9100, best=0.52, avg=0.50, std=0.01, steps=9.319e+07
2023-07-07 15:24:52,573 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9200, best=0.51, avg=0.50, std=0.01, steps=9.422e+07
2023-07-07 15:25:01,918 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9300, best=0.51, avg=0.50, std=0.01, steps=9.524e+07
2023-07-07 15:25:11,279 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9400, best=0.51, avg=0.50, std=0.01, steps=9.627e+07
2023-07-07 15:25:20,621 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9500, best=0.52, avg=0.50, std=0.01, steps=9.729e+07
2023-07-07 15:25:29,967 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9600, best=0.51, avg=0.50, std=0.00, steps=9.831e+07
2023-07-07 15:25:39,328 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9700, best=0.51, avg=0.50, std=0.01, steps=9.934e+07
2023-07-07 15:25:48,677 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9800, best=0.51, avg=0.50, std=0.01, steps=1.004e+08
2023-07-07 15:25:58,026 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9900, best=0.66, avg=0.65, std=0.00, steps=1.014e+08
2023-07-07 15:26:07,381 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10000, best=0.68, avg=0.68, std=0.00, steps=1.024e+08
2023-07-07 15:26:16,736 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10100, best=0.70, avg=0.69, std=0.00, steps=1.034e+08
2023-07-07 15:26:26,095 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10200, best=0.71, avg=0.70, std=0.00, steps=1.045e+08
2023-07-07 15:26:35,437 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10300, best=0.72, avg=0.70, std=0.00, steps=1.055e+08
2023-07-07 15:26:44,788 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10400, best=0.72, avg=0.71, std=0.00, steps=1.065e+08
2023-07-07 15:26:54,131 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10500, best=0.73, avg=0.72, std=0.00, steps=1.075e+08
2023-07-07 15:27:03,464 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10600, best=0.73, avg=0.72, std=0.00, steps=1.086e+08
2023-07-07 15:27:12,808 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10700, best=0.74, avg=0.73, std=0.00, steps=1.096e+08
2023-07-07 15:27:22,157 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10800, best=0.75, avg=0.73, std=0.00, steps=1.106e+08
2023-07-07 15:27:31,507 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10900, best=0.75, avg=0.74, std=0.00, steps=1.116e+08
2023-07-07 15:27:40,859 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11000, best=0.75, avg=0.74, std=0.00, steps=1.127e+08
2023-07-07 15:27:50,192 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11100, best=0.76, avg=0.75, std=0.00, steps=1.137e+08
2023-07-07 15:27:59,538 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11200, best=0.76, avg=0.75, std=0.00, steps=1.147e+08
2023-07-07 15:28:08,893 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11300, best=0.77, avg=0.75, std=0.00, steps=1.157e+08
2023-07-07 15:28:18,226 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11400, best=0.77, avg=0.76, std=0.00, steps=1.167e+08
2023-07-07 15:28:27,575 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11500, best=0.77, avg=0.76, std=0.00, steps=1.178e+08
2023-07-07 15:28:36,926 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11600, best=0.77, avg=0.76, std=0.00, steps=1.188e+08
2023-07-07 15:28:46,266 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11700, best=0.78, avg=0.77, std=0.00, steps=1.198e+08
2023-07-07 15:28:55,625 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11800, best=0.78, avg=0.77, std=0.00, steps=1.208e+08
2023-07-07 15:29:04,967 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11900, best=0.79, avg=0.77, std=0.00, steps=1.219e+08
2023-07-07 15:29:14,215 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11999, best=0.79, avg=0.78, std=0.00, steps=1.229e+08
2023-07-07 15:29:14,216 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135823
2023-07-07 15:29:14,240 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 15:29:14,279 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 15:29:27,610 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 100, best=0.51, avg=0.50, std=0.01, steps=1.241e+06
2023-07-07 15:29:38,767 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 200, best=0.66, avg=0.65, std=0.00, steps=2.470e+06
2023-07-07 15:29:49,944 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 300, best=0.68, avg=0.67, std=0.00, steps=3.699e+06
2023-07-07 15:30:01,110 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 400, best=0.69, avg=0.68, std=0.00, steps=4.927e+06
2023-07-07 15:30:12,304 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 500, best=0.70, avg=0.69, std=0.00, steps=6.156e+06
2023-07-07 15:30:23,509 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 600, best=0.72, avg=0.70, std=0.00, steps=7.385e+06
2023-07-07 15:30:34,665 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 700, best=0.72, avg=0.71, std=0.00, steps=8.614e+06
2023-07-07 15:30:45,839 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 800, best=0.72, avg=0.72, std=0.00, steps=9.843e+06
2023-07-07 15:30:57,016 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 900, best=0.74, avg=0.72, std=0.00, steps=1.107e+07
2023-07-07 15:31:08,183 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1000, best=0.74, avg=0.73, std=0.00, steps=1.230e+07
2023-07-07 15:31:19,356 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1100, best=0.74, avg=0.73, std=0.00, steps=1.353e+07
2023-07-07 15:31:30,509 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1200, best=0.75, avg=0.74, std=0.00, steps=1.476e+07
2023-07-07 15:31:41,673 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1300, best=0.75, avg=0.74, std=0.00, steps=1.599e+07
2023-07-07 15:31:52,816 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1400, best=0.75, avg=0.74, std=0.00, steps=1.722e+07
2023-07-07 15:32:03,971 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1500, best=0.76, avg=0.75, std=0.00, steps=1.844e+07
2023-07-07 15:32:15,143 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1600, best=0.76, avg=0.75, std=0.00, steps=1.967e+07
2023-07-07 15:32:26,326 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1700, best=0.76, avg=0.75, std=0.00, steps=2.090e+07
2023-07-07 15:32:37,469 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1800, best=0.76, avg=0.75, std=0.00, steps=2.213e+07
2023-07-07 15:32:48,632 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1900, best=0.76, avg=0.75, std=0.00, steps=2.336e+07
2023-07-07 15:32:59,793 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2000, best=0.76, avg=0.76, std=0.00, steps=2.459e+07
2023-07-07 15:33:10,992 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2100, best=0.77, avg=0.76, std=0.00, steps=2.582e+07
2023-07-07 15:33:22,169 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2200, best=0.77, avg=0.76, std=0.00, steps=2.705e+07
2023-07-07 15:33:33,372 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2300, best=0.77, avg=0.76, std=0.00, steps=2.827e+07
2023-07-07 15:33:44,548 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2400, best=0.78, avg=0.76, std=0.00, steps=2.950e+07
2023-07-07 15:33:55,703 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2500, best=0.78, avg=0.77, std=0.00, steps=3.073e+07
2023-07-07 15:34:06,863 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2600, best=0.77, avg=0.77, std=0.00, steps=3.196e+07
2023-07-07 15:34:18,029 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2700, best=0.78, avg=0.77, std=0.00, steps=3.319e+07
2023-07-07 15:34:29,194 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2800, best=0.78, avg=0.77, std=0.00, steps=3.442e+07
2023-07-07 15:34:40,376 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2900, best=0.78, avg=0.77, std=0.00, steps=3.565e+07
2023-07-07 15:34:51,526 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3000, best=0.79, avg=0.77, std=0.00, steps=3.688e+07
2023-07-07 15:35:02,684 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3100, best=0.79, avg=0.77, std=0.00, steps=3.811e+07
2023-07-07 15:35:13,858 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3200, best=0.78, avg=0.78, std=0.00, steps=3.933e+07
2023-07-07 15:35:25,003 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3300, best=0.79, avg=0.78, std=0.00, steps=4.056e+07
2023-07-07 15:35:36,156 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3400, best=0.79, avg=0.78, std=0.00, steps=4.179e+07
2023-07-07 15:35:47,322 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3500, best=0.79, avg=0.78, std=0.00, steps=4.302e+07
2023-07-07 15:35:58,497 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3600, best=0.79, avg=0.78, std=0.00, steps=4.425e+07
2023-07-07 15:36:09,648 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3700, best=0.79, avg=0.78, std=0.00, steps=4.548e+07
2023-07-07 15:36:20,822 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3800, best=0.79, avg=0.78, std=0.00, steps=4.671e+07
2023-07-07 15:36:31,968 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3900, best=0.79, avg=0.78, std=0.00, steps=4.794e+07
2023-07-07 15:36:43,113 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4000, best=0.79, avg=0.78, std=0.00, steps=4.916e+07
2023-07-07 15:36:54,324 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4100, best=0.79, avg=0.78, std=0.00, steps=5.039e+07
2023-07-07 15:37:05,480 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4200, best=0.79, avg=0.78, std=0.00, steps=5.162e+07
2023-07-07 15:37:16,633 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4300, best=0.79, avg=0.78, std=0.00, steps=5.285e+07
2023-07-07 15:37:27,784 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4400, best=0.80, avg=0.79, std=0.00, steps=5.408e+07
2023-07-07 15:37:38,943 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4500, best=0.80, avg=0.79, std=0.00, steps=5.531e+07
2023-07-07 15:37:50,116 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4600, best=0.80, avg=0.79, std=0.00, steps=5.654e+07
2023-07-07 15:38:01,292 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4700, best=0.80, avg=0.79, std=0.00, steps=5.777e+07
2023-07-07 15:38:12,488 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4800, best=0.80, avg=0.79, std=0.00, steps=5.899e+07
2023-07-07 15:38:23,653 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4900, best=0.80, avg=0.79, std=0.00, steps=6.022e+07
2023-07-07 15:38:34,808 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5000, best=0.80, avg=0.79, std=0.00, steps=6.145e+07
2023-07-07 15:38:45,980 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5100, best=0.80, avg=0.79, std=0.00, steps=6.268e+07
2023-07-07 15:38:57,144 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5200, best=0.80, avg=0.79, std=0.00, steps=6.391e+07
2023-07-07 15:39:08,312 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5300, best=0.80, avg=0.79, std=0.00, steps=6.514e+07
2023-07-07 15:39:19,474 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5400, best=0.80, avg=0.79, std=0.00, steps=6.637e+07
2023-07-07 15:39:30,636 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5500, best=0.80, avg=0.79, std=0.00, steps=6.760e+07
2023-07-07 15:39:41,811 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5600, best=0.80, avg=0.79, std=0.00, steps=6.883e+07
2023-07-07 15:39:52,985 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5700, best=0.80, avg=0.79, std=0.00, steps=7.005e+07
2023-07-07 15:40:04,175 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5800, best=0.80, avg=0.79, std=0.00, steps=7.128e+07
2023-07-07 15:40:15,346 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5900, best=0.80, avg=0.80, std=0.00, steps=7.251e+07
2023-07-07 15:40:26,511 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6000, best=0.80, avg=0.80, std=0.00, steps=7.374e+07
2023-07-07 15:40:37,695 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6100, best=0.81, avg=0.80, std=0.00, steps=7.497e+07
2023-07-07 15:40:48,884 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6200, best=0.81, avg=0.80, std=0.00, steps=7.620e+07
2023-07-07 15:41:00,050 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6300, best=0.81, avg=0.80, std=0.00, steps=7.743e+07
2023-07-07 15:41:11,218 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6400, best=0.81, avg=0.80, std=0.00, steps=7.866e+07
2023-07-07 15:41:22,385 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6500, best=0.81, avg=0.80, std=0.00, steps=7.988e+07
2023-07-07 15:41:33,581 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6600, best=0.81, avg=0.80, std=0.00, steps=8.111e+07
2023-07-07 15:41:44,748 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6700, best=0.81, avg=0.80, std=0.00, steps=8.234e+07
2023-07-07 15:41:55,909 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6800, best=0.81, avg=0.80, std=0.00, steps=8.357e+07
2023-07-07 15:42:07,067 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6900, best=0.81, avg=0.80, std=0.00, steps=8.480e+07
2023-07-07 15:42:18,233 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7000, best=0.81, avg=0.80, std=0.00, steps=8.603e+07
2023-07-07 15:42:29,397 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7100, best=0.81, avg=0.80, std=0.00, steps=8.726e+07
2023-07-07 15:42:40,570 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7200, best=0.81, avg=0.81, std=0.00, steps=8.849e+07
2023-07-07 15:42:51,735 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7300, best=0.82, avg=0.81, std=0.00, steps=8.971e+07
2023-07-07 15:43:02,899 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7400, best=0.81, avg=0.81, std=0.00, steps=9.094e+07
2023-07-07 15:43:14,053 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7500, best=0.81, avg=0.81, std=0.00, steps=9.217e+07
2023-07-07 15:43:25,200 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7600, best=0.81, avg=0.81, std=0.00, steps=9.340e+07
2023-07-07 15:43:36,359 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7700, best=0.82, avg=0.81, std=0.00, steps=9.463e+07
2023-07-07 15:43:47,505 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7800, best=0.82, avg=0.81, std=0.00, steps=9.586e+07
2023-07-07 15:43:58,653 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7900, best=0.82, avg=0.81, std=0.00, steps=9.709e+07
2023-07-07 15:44:09,829 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8000, best=0.82, avg=0.81, std=0.00, steps=9.832e+07
2023-07-07 15:44:20,999 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8100, best=0.82, avg=0.81, std=0.00, steps=9.955e+07
2023-07-07 15:44:32,185 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8200, best=0.82, avg=0.81, std=0.00, steps=1.008e+08
2023-07-07 15:44:43,357 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8300, best=0.82, avg=0.81, std=0.00, steps=1.020e+08
2023-07-07 15:44:54,533 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8400, best=0.82, avg=0.81, std=0.00, steps=1.032e+08
2023-07-07 15:45:05,715 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8500, best=0.82, avg=0.81, std=0.00, steps=1.045e+08
2023-07-07 15:45:16,878 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8600, best=0.82, avg=0.81, std=0.00, steps=1.057e+08
2023-07-07 15:45:28,051 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8700, best=0.82, avg=0.81, std=0.00, steps=1.069e+08
2023-07-07 15:45:39,233 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8800, best=0.82, avg=0.81, std=0.00, steps=1.081e+08
2023-07-07 15:45:50,412 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8900, best=0.82, avg=0.81, std=0.00, steps=1.094e+08
2023-07-07 15:46:01,554 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9000, best=0.82, avg=0.81, std=0.00, steps=1.106e+08
2023-07-07 15:46:12,721 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9100, best=0.82, avg=0.82, std=0.00, steps=1.118e+08
2023-07-07 15:46:23,879 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9200, best=0.82, avg=0.82, std=0.00, steps=1.131e+08
2023-07-07 15:46:35,015 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9300, best=0.82, avg=0.82, std=0.00, steps=1.143e+08
2023-07-07 15:46:46,160 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9400, best=0.82, avg=0.82, std=0.00, steps=1.155e+08
2023-07-07 15:46:57,322 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9500, best=0.82, avg=0.82, std=0.00, steps=1.167e+08
2023-07-07 15:47:08,475 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9600, best=0.82, avg=0.82, std=0.00, steps=1.180e+08
2023-07-07 15:47:19,627 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9700, best=0.83, avg=0.82, std=0.00, steps=1.192e+08
2023-07-07 15:47:30,786 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9800, best=0.83, avg=0.82, std=0.00, steps=1.204e+08
2023-07-07 15:47:41,930 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9900, best=0.82, avg=0.82, std=0.00, steps=1.217e+08
2023-07-07 15:47:53,092 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10000, best=0.82, avg=0.82, std=0.00, steps=1.229e+08
2023-07-07 15:48:04,259 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10100, best=0.83, avg=0.82, std=0.00, steps=1.241e+08
2023-07-07 15:48:15,419 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10200, best=0.83, avg=0.82, std=0.00, steps=1.253e+08
2023-07-07 15:48:26,579 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10300, best=0.83, avg=0.82, std=0.00, steps=1.266e+08
2023-07-07 15:48:37,729 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10400, best=0.83, avg=0.82, std=0.00, steps=1.278e+08
2023-07-07 15:48:48,908 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10500, best=0.83, avg=0.82, std=0.00, steps=1.290e+08
2023-07-07 15:49:00,069 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10600, best=0.83, avg=0.82, std=0.00, steps=1.303e+08
2023-07-07 15:49:11,228 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10700, best=0.83, avg=0.82, std=0.00, steps=1.315e+08
2023-07-07 15:49:22,378 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10800, best=0.83, avg=0.82, std=0.00, steps=1.327e+08
2023-07-07 15:49:33,592 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10900, best=0.83, avg=0.82, std=0.00, steps=1.340e+08
2023-07-07 15:49:44,753 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11000, best=0.83, avg=0.82, std=0.00, steps=1.352e+08
2023-07-07 15:49:55,911 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11100, best=0.83, avg=0.82, std=0.00, steps=1.364e+08
2023-07-07 15:50:07,053 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11200, best=0.83, avg=0.82, std=0.00, steps=1.376e+08
2023-07-07 15:50:18,243 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11300, best=0.83, avg=0.82, std=0.00, steps=1.389e+08
2023-07-07 15:50:29,423 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11400, best=0.83, avg=0.83, std=0.00, steps=1.401e+08
2023-07-07 15:50:40,589 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11500, best=0.83, avg=0.83, std=0.00, steps=1.413e+08
2023-07-07 15:50:51,772 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11600, best=0.83, avg=0.83, std=0.00, steps=1.426e+08
2023-07-07 15:51:02,936 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11700, best=0.83, avg=0.83, std=0.00, steps=1.438e+08
2023-07-07 15:51:14,110 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11800, best=0.84, avg=0.83, std=0.00, steps=1.450e+08
2023-07-07 15:51:25,294 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11900, best=0.84, avg=0.83, std=0.00, steps=1.462e+08
2023-07-07 15:51:36,340 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11999, best=0.84, avg=0.83, std=0.00, steps=1.475e+08
2023-07-07 15:51:36,340 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135823
2023-07-07 15:51:36,365 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 15:51:36,397 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 15:51:53,285 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 100, best=0.51, avg=0.50, std=0.01, steps=1.655e+06
2023-07-07 15:52:08,089 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 200, best=0.51, avg=0.50, std=0.01, steps=3.293e+06
2023-07-07 15:52:22,915 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 300, best=0.60, avg=0.60, std=0.00, steps=4.932e+06
2023-07-07 15:52:37,713 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 400, best=0.64, avg=0.63, std=0.00, steps=6.570e+06
2023-07-07 15:52:52,530 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 500, best=0.66, avg=0.65, std=0.00, steps=8.208e+06
2023-07-07 15:53:07,338 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 600, best=0.66, avg=0.65, std=0.00, steps=9.847e+06
2023-07-07 15:53:22,150 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 700, best=0.67, avg=0.66, std=0.00, steps=1.149e+07
2023-07-07 15:53:36,957 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 800, best=0.67, avg=0.66, std=0.00, steps=1.312e+07
2023-07-07 15:53:51,767 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 900, best=0.68, avg=0.67, std=0.00, steps=1.476e+07
2023-07-07 15:54:06,573 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1000, best=0.68, avg=0.67, std=0.00, steps=1.640e+07
2023-07-07 15:54:21,391 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1100, best=0.68, avg=0.67, std=0.00, steps=1.804e+07
2023-07-07 15:54:36,207 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1200, best=0.68, avg=0.67, std=0.00, steps=1.968e+07
2023-07-07 15:54:51,028 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1300, best=0.68, avg=0.67, std=0.00, steps=2.132e+07
2023-07-07 15:55:05,811 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1400, best=0.69, avg=0.67, std=0.00, steps=2.295e+07
2023-07-07 15:55:20,617 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1500, best=0.68, avg=0.68, std=0.00, steps=2.459e+07
2023-07-07 15:55:35,440 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1600, best=0.69, avg=0.68, std=0.00, steps=2.623e+07
2023-07-07 15:55:50,252 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1700, best=0.69, avg=0.68, std=0.00, steps=2.787e+07
2023-07-07 15:56:05,052 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1800, best=0.69, avg=0.68, std=0.00, steps=2.951e+07
2023-07-07 15:56:19,877 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1900, best=0.69, avg=0.68, std=0.00, steps=3.115e+07
2023-07-07 15:56:34,703 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2000, best=0.69, avg=0.68, std=0.00, steps=3.278e+07
2023-07-07 15:56:49,518 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2100, best=0.69, avg=0.68, std=0.00, steps=3.442e+07
2023-07-07 15:57:04,329 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2200, best=0.70, avg=0.68, std=0.00, steps=3.606e+07
2023-07-07 15:57:19,130 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2300, best=0.69, avg=0.68, std=0.00, steps=3.770e+07
2023-07-07 15:57:33,935 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2400, best=0.69, avg=0.68, std=0.00, steps=3.934e+07
2023-07-07 15:57:48,738 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2500, best=0.69, avg=0.69, std=0.00, steps=4.098e+07
2023-07-07 15:58:03,569 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2600, best=0.69, avg=0.68, std=0.00, steps=4.261e+07
2023-07-07 15:58:18,366 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2700, best=0.70, avg=0.69, std=0.00, steps=4.425e+07
2023-07-07 15:58:33,184 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2800, best=0.70, avg=0.69, std=0.00, steps=4.589e+07
2023-07-07 15:58:47,991 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2900, best=0.70, avg=0.69, std=0.00, steps=4.753e+07
2023-07-07 15:59:02,804 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3000, best=0.70, avg=0.69, std=0.00, steps=4.917e+07
2023-07-07 15:59:17,598 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3100, best=0.70, avg=0.69, std=0.00, steps=5.081e+07
2023-07-07 15:59:32,399 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3200, best=0.70, avg=0.69, std=0.00, steps=5.245e+07
2023-07-07 15:59:47,215 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3300, best=0.70, avg=0.69, std=0.00, steps=5.408e+07
2023-07-07 16:00:02,014 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3400, best=0.70, avg=0.69, std=0.00, steps=5.572e+07
2023-07-07 16:00:16,793 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3500, best=0.70, avg=0.69, std=0.00, steps=5.736e+07
2023-07-07 16:00:31,576 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3600, best=0.70, avg=0.69, std=0.00, steps=5.900e+07
2023-07-07 16:00:46,373 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3700, best=0.70, avg=0.69, std=0.00, steps=6.064e+07
2023-07-07 16:01:01,162 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3800, best=0.70, avg=0.69, std=0.00, steps=6.228e+07
2023-07-07 16:01:15,937 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3900, best=0.70, avg=0.69, std=0.00, steps=6.391e+07
2023-07-07 16:01:30,705 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4000, best=0.70, avg=0.69, std=0.00, steps=6.555e+07
2023-07-07 16:01:45,492 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4100, best=0.70, avg=0.69, std=0.00, steps=6.719e+07
2023-07-07 16:02:00,287 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4200, best=0.70, avg=0.69, std=0.00, steps=6.883e+07
2023-07-07 16:02:15,076 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4300, best=0.70, avg=0.69, std=0.00, steps=7.047e+07
2023-07-07 16:02:29,882 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4400, best=0.70, avg=0.69, std=0.00, steps=7.211e+07
2023-07-07 16:02:44,670 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4500, best=0.71, avg=0.69, std=0.00, steps=7.374e+07
2023-07-07 16:02:59,483 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4600, best=0.70, avg=0.69, std=0.00, steps=7.538e+07
2023-07-07 16:03:14,273 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4700, best=0.70, avg=0.69, std=0.00, steps=7.702e+07
2023-07-07 16:03:29,055 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4800, best=0.71, avg=0.69, std=0.00, steps=7.866e+07
2023-07-07 16:03:43,860 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4900, best=0.70, avg=0.69, std=0.00, steps=8.030e+07
2023-07-07 16:03:58,679 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5000, best=0.70, avg=0.69, std=0.00, steps=8.194e+07
2023-07-07 16:04:13,477 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5100, best=0.70, avg=0.69, std=0.00, steps=8.357e+07
2023-07-07 16:04:28,275 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5200, best=0.70, avg=0.69, std=0.00, steps=8.521e+07
2023-07-07 16:04:43,050 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5300, best=0.70, avg=0.69, std=0.00, steps=8.685e+07
2023-07-07 16:04:57,819 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5400, best=0.71, avg=0.70, std=0.00, steps=8.849e+07
2023-07-07 16:05:12,599 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5500, best=0.70, avg=0.70, std=0.00, steps=9.013e+07
2023-07-07 16:05:27,402 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5600, best=0.71, avg=0.70, std=0.00, steps=9.177e+07
2023-07-07 16:05:42,192 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5700, best=0.70, avg=0.70, std=0.00, steps=9.341e+07
2023-07-07 16:05:57,022 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5800, best=0.70, avg=0.70, std=0.00, steps=9.504e+07
2023-07-07 16:06:11,830 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5900, best=0.70, avg=0.70, std=0.00, steps=9.668e+07
2023-07-07 16:06:26,609 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6000, best=0.71, avg=0.70, std=0.00, steps=9.832e+07
2023-07-07 16:06:41,379 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6100, best=0.71, avg=0.70, std=0.00, steps=9.996e+07
2023-07-07 16:06:56,168 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6200, best=0.71, avg=0.70, std=0.00, steps=1.016e+08
2023-07-07 16:07:10,971 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6300, best=0.71, avg=0.70, std=0.00, steps=1.032e+08
2023-07-07 16:07:25,744 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6400, best=0.71, avg=0.70, std=0.00, steps=1.049e+08
2023-07-07 16:07:40,547 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6500, best=0.71, avg=0.70, std=0.00, steps=1.065e+08
2023-07-07 16:07:55,337 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6600, best=0.71, avg=0.70, std=0.00, steps=1.082e+08
2023-07-07 16:08:10,108 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6700, best=0.71, avg=0.70, std=0.00, steps=1.098e+08
2023-07-07 16:08:24,879 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6800, best=0.71, avg=0.70, std=0.00, steps=1.114e+08
2023-07-07 16:08:39,644 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6900, best=0.71, avg=0.70, std=0.00, steps=1.131e+08
2023-07-07 16:08:54,433 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7000, best=0.71, avg=0.70, std=0.00, steps=1.147e+08
2023-07-07 16:09:09,230 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7100, best=0.71, avg=0.70, std=0.00, steps=1.163e+08
2023-07-07 16:09:24,018 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7200, best=0.71, avg=0.70, std=0.00, steps=1.180e+08
2023-07-07 16:09:38,813 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7300, best=0.71, avg=0.70, std=0.00, steps=1.196e+08
2023-07-07 16:09:53,597 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7400, best=0.71, avg=0.70, std=0.00, steps=1.213e+08
2023-07-07 16:10:08,385 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7500, best=0.71, avg=0.70, std=0.00, steps=1.229e+08
2023-07-07 16:10:23,174 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7600, best=0.71, avg=0.70, std=0.00, steps=1.245e+08
2023-07-07 16:10:37,956 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7700, best=0.71, avg=0.70, std=0.00, steps=1.262e+08
2023-07-07 16:10:52,742 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7800, best=0.72, avg=0.70, std=0.00, steps=1.278e+08
2023-07-07 16:11:07,517 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7900, best=0.71, avg=0.70, std=0.00, steps=1.294e+08
2023-07-07 16:11:22,284 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8000, best=0.71, avg=0.70, std=0.00, steps=1.311e+08
2023-07-07 16:11:37,075 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8100, best=0.72, avg=0.70, std=0.00, steps=1.327e+08
2023-07-07 16:11:51,870 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8200, best=0.71, avg=0.70, std=0.00, steps=1.344e+08
2023-07-07 16:12:06,712 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8300, best=0.71, avg=0.70, std=0.00, steps=1.360e+08
2023-07-07 16:12:21,535 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8400, best=0.71, avg=0.70, std=0.00, steps=1.376e+08
2023-07-07 16:12:36,324 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8500, best=0.71, avg=0.70, std=0.00, steps=1.393e+08
2023-07-07 16:12:51,119 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8600, best=0.71, avg=0.70, std=0.00, steps=1.409e+08
2023-07-07 16:13:05,915 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8700, best=0.71, avg=0.70, std=0.00, steps=1.426e+08
2023-07-07 16:13:20,711 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8800, best=0.71, avg=0.70, std=0.00, steps=1.442e+08
2023-07-07 16:13:35,509 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8900, best=0.71, avg=0.70, std=0.00, steps=1.458e+08
2023-07-07 16:13:50,293 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9000, best=0.71, avg=0.70, std=0.00, steps=1.475e+08
2023-07-07 16:14:05,072 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9100, best=0.71, avg=0.70, std=0.00, steps=1.491e+08
2023-07-07 16:14:19,872 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9200, best=0.71, avg=0.70, std=0.00, steps=1.507e+08
2023-07-07 16:14:34,653 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9300, best=0.72, avg=0.70, std=0.00, steps=1.524e+08
2023-07-07 16:14:49,450 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9400, best=0.71, avg=0.70, std=0.00, steps=1.540e+08
2023-07-07 16:15:04,234 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9500, best=0.71, avg=0.70, std=0.00, steps=1.557e+08
2023-07-07 16:15:19,020 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9600, best=0.71, avg=0.70, std=0.00, steps=1.573e+08
2023-07-07 16:15:33,822 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9700, best=0.71, avg=0.70, std=0.00, steps=1.589e+08
2023-07-07 16:15:48,620 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9800, best=0.71, avg=0.70, std=0.00, steps=1.606e+08
2023-07-07 16:16:03,401 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9900, best=0.71, avg=0.70, std=0.00, steps=1.622e+08
2023-07-07 16:16:18,184 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10000, best=0.72, avg=0.70, std=0.00, steps=1.639e+08
2023-07-07 16:16:32,968 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10100, best=0.71, avg=0.70, std=0.00, steps=1.655e+08
2023-07-07 16:16:47,760 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10200, best=0.72, avg=0.70, std=0.00, steps=1.671e+08
2023-07-07 16:17:02,540 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10300, best=0.71, avg=0.70, std=0.00, steps=1.688e+08
2023-07-07 16:17:17,321 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10400, best=0.71, avg=0.70, std=0.00, steps=1.704e+08
2023-07-07 16:17:32,106 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10500, best=0.71, avg=0.70, std=0.00, steps=1.720e+08
2023-07-07 16:17:46,879 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10600, best=0.71, avg=0.70, std=0.00, steps=1.737e+08
2023-07-07 16:18:01,700 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10700, best=0.71, avg=0.71, std=0.00, steps=1.753e+08
2023-07-07 16:18:16,480 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10800, best=0.72, avg=0.71, std=0.00, steps=1.770e+08
2023-07-07 16:18:31,255 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10900, best=0.71, avg=0.71, std=0.00, steps=1.786e+08
2023-07-07 16:18:46,071 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11000, best=0.72, avg=0.71, std=0.00, steps=1.802e+08
2023-07-07 16:19:00,861 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11100, best=0.72, avg=0.71, std=0.00, steps=1.819e+08
2023-07-07 16:19:15,643 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11200, best=0.72, avg=0.71, std=0.00, steps=1.835e+08
2023-07-07 16:19:30,441 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11300, best=0.72, avg=0.71, std=0.00, steps=1.852e+08
2023-07-07 16:19:45,229 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11400, best=0.71, avg=0.71, std=0.00, steps=1.868e+08
2023-07-07 16:20:00,034 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11500, best=0.72, avg=0.71, std=0.00, steps=1.884e+08
2023-07-07 16:20:14,831 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11600, best=0.71, avg=0.71, std=0.00, steps=1.901e+08
2023-07-07 16:20:29,623 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11700, best=0.71, avg=0.71, std=0.00, steps=1.917e+08
2023-07-07 16:20:44,422 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11800, best=0.71, avg=0.71, std=0.00, steps=1.933e+08
2023-07-07 16:20:59,203 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11900, best=0.72, avg=0.71, std=0.00, steps=1.950e+08
2023-07-07 16:21:13,883 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11999, best=0.71, avg=0.71, std=0.00, steps=1.966e+08
2023-07-07 16:21:13,884 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135823
2023-07-07 16:21:13,907 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 16:21:13,938 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 16:21:30,837 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=1.655e+06
2023-07-07 16:21:45,672 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 200, best=0.60, avg=0.59, std=0.00, steps=3.293e+06
2023-07-07 16:22:00,484 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 300, best=0.64, avg=0.63, std=0.00, steps=4.932e+06
2023-07-07 16:22:15,297 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 400, best=0.66, avg=0.65, std=0.00, steps=6.570e+06
2023-07-07 16:22:30,092 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 500, best=0.67, avg=0.66, std=0.00, steps=8.208e+06
2023-07-07 16:22:44,959 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 600, best=0.68, avg=0.67, std=0.00, steps=9.847e+06
2023-07-07 16:22:59,733 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 700, best=0.69, avg=0.68, std=0.00, steps=1.149e+07
2023-07-07 16:23:14,534 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 800, best=0.69, avg=0.69, std=0.00, steps=1.312e+07
2023-07-07 16:23:29,325 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 900, best=0.70, avg=0.69, std=0.00, steps=1.476e+07
2023-07-07 16:23:44,139 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1000, best=0.71, avg=0.70, std=0.00, steps=1.640e+07
2023-07-07 16:23:59,056 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1100, best=0.71, avg=0.70, std=0.00, steps=1.804e+07
2023-07-07 16:24:13,927 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1200, best=0.71, avg=0.70, std=0.00, steps=1.968e+07
2023-07-07 16:24:28,716 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1300, best=0.71, avg=0.71, std=0.00, steps=2.132e+07
2023-07-07 16:24:43,510 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1400, best=0.72, avg=0.71, std=0.00, steps=2.295e+07
2023-07-07 16:24:58,405 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1500, best=0.72, avg=0.71, std=0.00, steps=2.459e+07
2023-07-07 16:25:13,243 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1600, best=0.72, avg=0.71, std=0.00, steps=2.623e+07
2023-07-07 16:25:28,064 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1700, best=0.72, avg=0.72, std=0.00, steps=2.787e+07
2023-07-07 16:25:42,975 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1800, best=0.73, avg=0.72, std=0.00, steps=2.951e+07
2023-07-07 16:25:57,804 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1900, best=0.73, avg=0.72, std=0.00, steps=3.115e+07
2023-07-07 16:26:12,614 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2000, best=0.73, avg=0.72, std=0.00, steps=3.278e+07
2023-07-07 16:26:27,400 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2100, best=0.73, avg=0.72, std=0.00, steps=3.442e+07
2023-07-07 16:26:42,253 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2200, best=0.73, avg=0.73, std=0.00, steps=3.606e+07
2023-07-07 16:26:57,151 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2300, best=0.74, avg=0.73, std=0.00, steps=3.770e+07
2023-07-07 16:27:11,954 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2400, best=0.74, avg=0.73, std=0.00, steps=3.934e+07
2023-07-07 16:27:26,749 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2500, best=0.74, avg=0.73, std=0.00, steps=4.098e+07
2023-07-07 16:27:41,560 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2600, best=0.74, avg=0.73, std=0.00, steps=4.261e+07
2023-07-07 16:27:56,352 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2700, best=0.74, avg=0.73, std=0.00, steps=4.425e+07
2023-07-07 16:28:11,174 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2800, best=0.74, avg=0.74, std=0.00, steps=4.589e+07
2023-07-07 16:28:25,983 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2900, best=0.75, avg=0.74, std=0.00, steps=4.753e+07
2023-07-07 16:28:40,808 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3000, best=0.75, avg=0.74, std=0.00, steps=4.917e+07
2023-07-07 16:28:55,631 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3100, best=0.75, avg=0.74, std=0.00, steps=5.081e+07
2023-07-07 16:29:10,437 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3200, best=0.75, avg=0.74, std=0.00, steps=5.245e+07
2023-07-07 16:29:25,238 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3300, best=0.75, avg=0.74, std=0.00, steps=5.408e+07
2023-07-07 16:29:40,069 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3400, best=0.75, avg=0.74, std=0.00, steps=5.572e+07
2023-07-07 16:29:54,902 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3500, best=0.75, avg=0.74, std=0.00, steps=5.736e+07
2023-07-07 16:30:09,714 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3600, best=0.76, avg=0.75, std=0.00, steps=5.900e+07
2023-07-07 16:30:24,531 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3700, best=0.76, avg=0.75, std=0.00, steps=6.064e+07
2023-07-07 16:30:39,352 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3800, best=0.76, avg=0.75, std=0.00, steps=6.228e+07
2023-07-07 16:30:54,176 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3900, best=0.76, avg=0.75, std=0.00, steps=6.391e+07
2023-07-07 16:31:08,987 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4000, best=0.76, avg=0.75, std=0.00, steps=6.555e+07
2023-07-07 16:31:23,785 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4100, best=0.76, avg=0.75, std=0.00, steps=6.719e+07
2023-07-07 16:31:38,576 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4200, best=0.76, avg=0.75, std=0.00, steps=6.883e+07
2023-07-07 16:31:53,378 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4300, best=0.76, avg=0.75, std=0.00, steps=7.047e+07
2023-07-07 16:32:08,177 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4400, best=0.76, avg=0.75, std=0.00, steps=7.211e+07
2023-07-07 16:32:22,974 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4500, best=0.76, avg=0.76, std=0.00, steps=7.374e+07
2023-07-07 16:32:37,788 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4600, best=0.76, avg=0.76, std=0.00, steps=7.538e+07
2023-07-07 16:32:52,615 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4700, best=0.76, avg=0.76, std=0.00, steps=7.702e+07
2023-07-07 16:33:07,439 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4800, best=0.76, avg=0.76, std=0.00, steps=7.866e+07
2023-07-07 16:33:22,254 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4900, best=0.77, avg=0.76, std=0.00, steps=8.030e+07
2023-07-07 16:33:37,103 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5000, best=0.77, avg=0.76, std=0.00, steps=8.194e+07
2023-07-07 16:33:51,984 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5100, best=0.77, avg=0.76, std=0.00, steps=8.357e+07
2023-07-07 16:34:06,821 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5200, best=0.77, avg=0.76, std=0.00, steps=8.521e+07
2023-07-07 16:34:21,621 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5300, best=0.77, avg=0.76, std=0.00, steps=8.685e+07
2023-07-07 16:34:36,434 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5400, best=0.77, avg=0.76, std=0.00, steps=8.849e+07
2023-07-07 16:34:51,246 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5500, best=0.77, avg=0.77, std=0.00, steps=9.013e+07
2023-07-07 16:35:06,061 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5600, best=0.77, avg=0.77, std=0.00, steps=9.177e+07
2023-07-07 16:35:20,894 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5700, best=0.77, avg=0.77, std=0.00, steps=9.341e+07
2023-07-07 16:35:35,693 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5800, best=0.78, avg=0.77, std=0.00, steps=9.504e+07
2023-07-07 16:35:50,539 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5900, best=0.78, avg=0.77, std=0.00, steps=9.668e+07
2023-07-07 16:36:05,403 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6000, best=0.78, avg=0.77, std=0.00, steps=9.832e+07
2023-07-07 16:36:20,223 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6100, best=0.78, avg=0.77, std=0.00, steps=9.996e+07
2023-07-07 16:36:35,044 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6200, best=0.78, avg=0.77, std=0.00, steps=1.016e+08
2023-07-07 16:36:49,849 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6300, best=0.78, avg=0.77, std=0.00, steps=1.032e+08
2023-07-07 16:37:04,673 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6400, best=0.78, avg=0.77, std=0.00, steps=1.049e+08
2023-07-07 16:37:19,503 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6500, best=0.78, avg=0.78, std=0.00, steps=1.065e+08
2023-07-07 16:37:34,428 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6600, best=0.78, avg=0.77, std=0.00, steps=1.082e+08
2023-07-07 16:37:49,240 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6700, best=0.78, avg=0.78, std=0.00, steps=1.098e+08
2023-07-07 16:38:04,060 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6800, best=0.78, avg=0.78, std=0.00, steps=1.114e+08
2023-07-07 16:38:18,875 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6900, best=0.78, avg=0.78, std=0.00, steps=1.131e+08
2023-07-07 16:38:33,679 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7000, best=0.79, avg=0.78, std=0.00, steps=1.147e+08
2023-07-07 16:38:48,470 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7100, best=0.79, avg=0.78, std=0.00, steps=1.163e+08
2023-07-07 16:39:03,374 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7200, best=0.79, avg=0.78, std=0.00, steps=1.180e+08
2023-07-07 16:39:18,262 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7300, best=0.79, avg=0.78, std=0.00, steps=1.196e+08
2023-07-07 16:39:33,096 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7400, best=0.79, avg=0.78, std=0.00, steps=1.213e+08
2023-07-07 16:39:47,896 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7500, best=0.79, avg=0.78, std=0.00, steps=1.229e+08
2023-07-07 16:40:02,704 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7600, best=0.79, avg=0.78, std=0.00, steps=1.245e+08
2023-07-07 16:40:17,500 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7700, best=0.79, avg=0.78, std=0.00, steps=1.262e+08
2023-07-07 16:40:32,320 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7800, best=0.79, avg=0.78, std=0.00, steps=1.278e+08
2023-07-07 16:40:47,120 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7900, best=0.79, avg=0.78, std=0.00, steps=1.294e+08
2023-07-07 16:41:01,936 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8000, best=0.79, avg=0.79, std=0.00, steps=1.311e+08
2023-07-07 16:41:16,735 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8100, best=0.79, avg=0.79, std=0.00, steps=1.327e+08
2023-07-07 16:41:31,520 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8200, best=0.79, avg=0.79, std=0.00, steps=1.344e+08
2023-07-07 16:41:46,310 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8300, best=0.80, avg=0.79, std=0.00, steps=1.360e+08
2023-07-07 16:42:01,111 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8400, best=0.79, avg=0.79, std=0.00, steps=1.376e+08
2023-07-07 16:42:15,906 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8500, best=0.80, avg=0.79, std=0.00, steps=1.393e+08
2023-07-07 16:42:30,719 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8600, best=0.80, avg=0.79, std=0.00, steps=1.409e+08
2023-07-07 16:42:45,600 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8700, best=0.80, avg=0.79, std=0.00, steps=1.426e+08
2023-07-07 16:43:00,425 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8800, best=0.80, avg=0.79, std=0.00, steps=1.442e+08
2023-07-07 16:43:15,209 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8900, best=0.80, avg=0.79, std=0.00, steps=1.458e+08
2023-07-07 16:43:29,977 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9000, best=0.80, avg=0.79, std=0.00, steps=1.475e+08
2023-07-07 16:43:44,761 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9100, best=0.80, avg=0.79, std=0.00, steps=1.491e+08
2023-07-07 16:43:59,661 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9200, best=0.80, avg=0.79, std=0.00, steps=1.507e+08
2023-07-07 16:44:14,493 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9300, best=0.80, avg=0.79, std=0.00, steps=1.524e+08
2023-07-07 16:44:29,286 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9400, best=0.80, avg=0.79, std=0.00, steps=1.540e+08
2023-07-07 16:44:44,089 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9500, best=0.80, avg=0.79, std=0.00, steps=1.557e+08
2023-07-07 16:44:58,891 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9600, best=0.80, avg=0.79, std=0.00, steps=1.573e+08
2023-07-07 16:45:13,719 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9700, best=0.80, avg=0.79, std=0.00, steps=1.589e+08
2023-07-07 16:45:28,627 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9800, best=0.80, avg=0.79, std=0.00, steps=1.606e+08
2023-07-07 16:45:43,458 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9900, best=0.80, avg=0.80, std=0.00, steps=1.622e+08
2023-07-07 16:45:58,250 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10000, best=0.80, avg=0.80, std=0.00, steps=1.639e+08
2023-07-07 16:46:13,050 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10100, best=0.80, avg=0.80, std=0.00, steps=1.655e+08
2023-07-07 16:46:27,827 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10200, best=0.80, avg=0.80, std=0.00, steps=1.671e+08
2023-07-07 16:46:42,625 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10300, best=0.80, avg=0.80, std=0.00, steps=1.688e+08
2023-07-07 16:46:57,567 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10400, best=0.81, avg=0.80, std=0.00, steps=1.704e+08
2023-07-07 16:47:12,375 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10500, best=0.81, avg=0.80, std=0.00, steps=1.720e+08
2023-07-07 16:47:27,170 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10600, best=0.80, avg=0.80, std=0.00, steps=1.737e+08
2023-07-07 16:47:42,065 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10700, best=0.80, avg=0.80, std=0.00, steps=1.753e+08
2023-07-07 16:47:56,879 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10800, best=0.80, avg=0.80, std=0.00, steps=1.770e+08
2023-07-07 16:48:11,686 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10900, best=0.81, avg=0.80, std=0.00, steps=1.786e+08
2023-07-07 16:48:26,493 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11000, best=0.81, avg=0.80, std=0.00, steps=1.802e+08
2023-07-07 16:48:41,320 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11100, best=0.81, avg=0.80, std=0.00, steps=1.819e+08
2023-07-07 16:48:56,113 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11200, best=0.81, avg=0.80, std=0.00, steps=1.835e+08
2023-07-07 16:49:10,889 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11300, best=0.81, avg=0.80, std=0.00, steps=1.852e+08
2023-07-07 16:49:25,693 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11400, best=0.81, avg=0.80, std=0.00, steps=1.868e+08
2023-07-07 16:49:40,476 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11500, best=0.81, avg=0.80, std=0.00, steps=1.884e+08
2023-07-07 16:49:55,334 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11600, best=0.81, avg=0.80, std=0.00, steps=1.901e+08
2023-07-07 16:50:10,138 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11700, best=0.81, avg=0.80, std=0.00, steps=1.917e+08
2023-07-07 16:50:24,926 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11800, best=0.81, avg=0.80, std=0.00, steps=1.933e+08
2023-07-07 16:50:39,729 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11900, best=0.81, avg=0.80, std=0.00, steps=1.950e+08
2023-07-07 16:50:54,389 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11999, best=0.81, avg=0.80, std=0.00, steps=1.966e+08
2023-07-07 16:50:54,390 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135823
2023-07-07 16:50:54,413 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 16:50:54,444 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 16:51:13,142 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=1.862e+06
2023-07-07 16:51:29,730 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 200, best=0.58, avg=0.57, std=0.00, steps=3.705e+06
2023-07-07 16:51:46,323 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 300, best=0.59, avg=0.59, std=0.00, steps=5.548e+06
2023-07-07 16:52:02,916 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 400, best=0.63, avg=0.62, std=0.00, steps=7.391e+06
2023-07-07 16:52:19,541 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 500, best=0.64, avg=0.63, std=0.00, steps=9.234e+06
2023-07-07 16:52:36,162 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 600, best=0.65, avg=0.64, std=0.00, steps=1.108e+07
2023-07-07 16:52:52,779 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 700, best=0.66, avg=0.65, std=0.00, steps=1.292e+07
2023-07-07 16:53:09,424 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 800, best=0.67, avg=0.66, std=0.00, steps=1.476e+07
2023-07-07 16:53:26,056 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 900, best=0.67, avg=0.66, std=0.00, steps=1.661e+07
2023-07-07 16:53:42,695 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1000, best=0.67, avg=0.66, std=0.00, steps=1.845e+07
2023-07-07 16:53:59,323 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1100, best=0.67, avg=0.66, std=0.00, steps=2.029e+07
2023-07-07 16:54:15,953 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1200, best=0.67, avg=0.67, std=0.00, steps=2.214e+07
2023-07-07 16:54:32,571 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1300, best=0.68, avg=0.67, std=0.00, steps=2.398e+07
2023-07-07 16:54:49,179 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1400, best=0.68, avg=0.67, std=0.00, steps=2.582e+07
2023-07-07 16:55:05,817 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1500, best=0.68, avg=0.67, std=0.00, steps=2.767e+07
2023-07-07 16:55:22,442 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1600, best=0.68, avg=0.67, std=0.00, steps=2.951e+07
2023-07-07 16:55:39,066 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1700, best=0.68, avg=0.67, std=0.00, steps=3.135e+07
2023-07-07 16:55:55,679 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1800, best=0.68, avg=0.68, std=0.00, steps=3.320e+07
2023-07-07 16:56:12,309 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1900, best=0.68, avg=0.68, std=0.00, steps=3.504e+07
2023-07-07 16:56:28,911 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2000, best=0.69, avg=0.68, std=0.00, steps=3.688e+07
2023-07-07 16:56:45,520 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2100, best=0.69, avg=0.68, std=0.00, steps=3.873e+07
2023-07-07 16:57:02,137 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2200, best=0.69, avg=0.68, std=0.00, steps=4.057e+07
2023-07-07 16:57:18,761 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2300, best=0.69, avg=0.68, std=0.00, steps=4.241e+07
2023-07-07 16:57:35,352 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2400, best=0.69, avg=0.68, std=0.00, steps=4.426e+07
2023-07-07 16:57:51,972 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2500, best=0.69, avg=0.68, std=0.00, steps=4.610e+07
2023-07-07 16:58:08,578 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2600, best=0.69, avg=0.68, std=0.00, steps=4.794e+07
2023-07-07 16:58:25,197 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2700, best=0.69, avg=0.68, std=0.00, steps=4.978e+07
2023-07-07 16:58:41,807 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2800, best=0.69, avg=0.69, std=0.00, steps=5.163e+07
2023-07-07 16:58:58,410 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2900, best=0.69, avg=0.68, std=0.00, steps=5.347e+07
2023-07-07 16:59:15,023 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3000, best=0.69, avg=0.69, std=0.00, steps=5.531e+07
2023-07-07 16:59:31,624 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3100, best=0.69, avg=0.69, std=0.00, steps=5.716e+07
2023-07-07 16:59:48,253 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3200, best=0.69, avg=0.69, std=0.00, steps=5.900e+07
2023-07-07 17:00:04,862 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3300, best=0.70, avg=0.69, std=0.00, steps=6.084e+07
2023-07-07 17:00:21,463 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3400, best=0.70, avg=0.69, std=0.00, steps=6.269e+07
2023-07-07 17:00:38,102 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3500, best=0.70, avg=0.69, std=0.00, steps=6.453e+07
2023-07-07 17:00:54,728 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3600, best=0.70, avg=0.69, std=0.00, steps=6.637e+07
2023-07-07 17:01:11,335 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3700, best=0.70, avg=0.69, std=0.00, steps=6.822e+07
2023-07-07 17:01:27,942 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3800, best=0.70, avg=0.69, std=0.00, steps=7.006e+07
2023-07-07 17:01:44,552 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3900, best=0.70, avg=0.69, std=0.00, steps=7.190e+07
2023-07-07 17:02:01,169 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4000, best=0.70, avg=0.69, std=0.00, steps=7.375e+07
2023-07-07 17:02:17,780 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4100, best=0.70, avg=0.70, std=0.00, steps=7.559e+07
2023-07-07 17:02:34,379 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4200, best=0.70, avg=0.70, std=0.00, steps=7.743e+07
2023-07-07 17:02:50,974 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4300, best=0.70, avg=0.70, std=0.00, steps=7.928e+07
2023-07-07 17:03:07,563 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4400, best=0.70, avg=0.70, std=0.00, steps=8.112e+07
2023-07-07 17:03:24,159 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4500, best=0.71, avg=0.70, std=0.00, steps=8.296e+07
2023-07-07 17:03:40,755 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4600, best=0.70, avg=0.70, std=0.00, steps=8.481e+07
2023-07-07 17:03:57,348 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4700, best=0.71, avg=0.70, std=0.00, steps=8.665e+07
2023-07-07 17:04:13,969 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4800, best=0.71, avg=0.70, std=0.00, steps=8.849e+07
2023-07-07 17:04:30,578 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4900, best=0.71, avg=0.70, std=0.00, steps=9.034e+07
2023-07-07 17:04:47,176 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5000, best=0.71, avg=0.70, std=0.00, steps=9.218e+07
2023-07-07 17:05:03,774 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5100, best=0.71, avg=0.70, std=0.00, steps=9.402e+07
2023-07-07 17:05:20,365 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5200, best=0.71, avg=0.70, std=0.00, steps=9.586e+07
2023-07-07 17:05:36,968 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5300, best=0.71, avg=0.70, std=0.00, steps=9.771e+07
2023-07-07 17:05:53,567 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5400, best=0.71, avg=0.70, std=0.00, steps=9.955e+07
2023-07-07 17:06:10,167 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5500, best=0.71, avg=0.70, std=0.00, steps=1.014e+08
2023-07-07 17:06:26,772 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5600, best=0.71, avg=0.70, std=0.00, steps=1.032e+08
2023-07-07 17:06:43,412 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5700, best=0.71, avg=0.71, std=0.00, steps=1.051e+08
2023-07-07 17:07:00,036 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5800, best=0.71, avg=0.71, std=0.00, steps=1.069e+08
2023-07-07 17:07:16,649 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5900, best=0.71, avg=0.71, std=0.00, steps=1.088e+08
2023-07-07 17:07:33,265 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6000, best=0.71, avg=0.71, std=0.00, steps=1.106e+08
2023-07-07 17:07:49,879 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6100, best=0.71, avg=0.71, std=0.00, steps=1.125e+08
2023-07-07 17:08:06,634 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6200, best=0.71, avg=0.71, std=0.00, steps=1.143e+08
2023-07-07 17:08:23,310 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6300, best=0.71, avg=0.71, std=0.00, steps=1.161e+08
2023-07-07 17:08:39,930 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6400, best=0.72, avg=0.71, std=0.00, steps=1.180e+08
2023-07-07 17:08:56,658 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6500, best=0.72, avg=0.71, std=0.00, steps=1.198e+08
2023-07-07 17:09:13,306 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6600, best=0.72, avg=0.71, std=0.00, steps=1.217e+08
2023-07-07 17:09:29,926 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6700, best=0.72, avg=0.71, std=0.00, steps=1.235e+08
2023-07-07 17:09:46,540 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6800, best=0.72, avg=0.71, std=0.00, steps=1.254e+08
2023-07-07 17:10:03,123 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6900, best=0.72, avg=0.71, std=0.00, steps=1.272e+08
2023-07-07 17:10:19,735 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7000, best=0.72, avg=0.71, std=0.00, steps=1.290e+08
2023-07-07 17:10:36,364 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7100, best=0.72, avg=0.71, std=0.00, steps=1.309e+08
2023-07-07 17:10:52,998 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7200, best=0.72, avg=0.71, std=0.00, steps=1.327e+08
2023-07-07 17:11:09,622 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7300, best=0.72, avg=0.71, std=0.00, steps=1.346e+08
2023-07-07 17:11:26,222 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7400, best=0.72, avg=0.71, std=0.00, steps=1.364e+08
2023-07-07 17:11:42,814 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7500, best=0.72, avg=0.71, std=0.00, steps=1.383e+08
2023-07-07 17:11:59,402 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7600, best=0.72, avg=0.71, std=0.00, steps=1.401e+08
2023-07-07 17:12:16,000 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7700, best=0.72, avg=0.71, std=0.00, steps=1.419e+08
2023-07-07 17:12:32,601 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7800, best=0.72, avg=0.71, std=0.00, steps=1.438e+08
2023-07-07 17:12:49,191 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7900, best=0.72, avg=0.71, std=0.00, steps=1.456e+08
2023-07-07 17:13:05,795 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8000, best=0.72, avg=0.72, std=0.00, steps=1.475e+08
2023-07-07 17:13:22,390 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8100, best=0.72, avg=0.72, std=0.00, steps=1.493e+08
2023-07-07 17:13:38,984 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8200, best=0.72, avg=0.72, std=0.00, steps=1.512e+08
2023-07-07 17:13:55,566 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8300, best=0.72, avg=0.72, std=0.00, steps=1.530e+08
2023-07-07 17:14:12,182 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8400, best=0.72, avg=0.72, std=0.00, steps=1.548e+08
2023-07-07 17:14:28,916 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8500, best=0.72, avg=0.72, std=0.00, steps=1.567e+08
2023-07-07 17:14:45,567 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8600, best=0.72, avg=0.72, std=0.00, steps=1.585e+08
2023-07-07 17:15:02,167 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8700, best=0.73, avg=0.72, std=0.00, steps=1.604e+08
2023-07-07 17:15:18,765 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8800, best=0.72, avg=0.72, std=0.00, steps=1.622e+08
2023-07-07 17:15:35,380 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8900, best=0.73, avg=0.72, std=0.00, steps=1.641e+08
2023-07-07 17:15:51,995 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9000, best=0.73, avg=0.72, std=0.00, steps=1.659e+08
2023-07-07 17:16:08,594 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9100, best=0.73, avg=0.72, std=0.00, steps=1.677e+08
2023-07-07 17:16:25,201 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9200, best=0.73, avg=0.72, std=0.00, steps=1.696e+08
2023-07-07 17:16:41,903 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9300, best=0.73, avg=0.72, std=0.00, steps=1.714e+08
2023-07-07 17:16:58,537 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9400, best=0.73, avg=0.72, std=0.00, steps=1.733e+08
2023-07-07 17:17:15,167 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9500, best=0.73, avg=0.72, std=0.00, steps=1.751e+08
2023-07-07 17:17:31,813 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9600, best=0.73, avg=0.72, std=0.00, steps=1.770e+08
2023-07-07 17:17:48,444 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9700, best=0.73, avg=0.72, std=0.00, steps=1.788e+08
2023-07-07 17:18:05,052 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9800, best=0.73, avg=0.72, std=0.00, steps=1.807e+08
2023-07-07 17:18:21,670 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9900, best=0.73, avg=0.72, std=0.00, steps=1.825e+08
2023-07-07 17:18:38,293 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10000, best=0.73, avg=0.72, std=0.00, steps=1.843e+08
2023-07-07 17:18:54,943 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10100, best=0.73, avg=0.72, std=0.00, steps=1.862e+08
2023-07-07 17:19:11,591 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10200, best=0.73, avg=0.72, std=0.00, steps=1.880e+08
2023-07-07 17:19:28,278 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10300, best=0.73, avg=0.72, std=0.00, steps=1.899e+08
2023-07-07 17:19:45,029 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10400, best=0.73, avg=0.72, std=0.00, steps=1.917e+08
2023-07-07 17:20:01,635 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10500, best=0.73, avg=0.73, std=0.00, steps=1.936e+08
2023-07-07 17:20:18,264 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10600, best=0.73, avg=0.73, std=0.00, steps=1.954e+08
2023-07-07 17:20:34,961 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10700, best=0.73, avg=0.73, std=0.00, steps=1.972e+08
2023-07-07 17:20:51,599 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10800, best=0.74, avg=0.73, std=0.00, steps=1.991e+08
2023-07-07 17:21:08,213 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10900, best=0.73, avg=0.73, std=0.00, steps=2.009e+08
2023-07-07 17:21:24,809 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11000, best=0.73, avg=0.73, std=0.00, steps=2.028e+08
2023-07-07 17:21:41,423 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11100, best=0.73, avg=0.73, std=0.00, steps=2.046e+08
2023-07-07 17:21:58,199 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11200, best=0.73, avg=0.73, std=0.00, steps=2.065e+08
2023-07-07 17:22:14,810 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11300, best=0.74, avg=0.73, std=0.00, steps=2.083e+08
2023-07-07 17:22:31,406 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11400, best=0.73, avg=0.73, std=0.00, steps=2.101e+08
2023-07-07 17:22:48,022 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11500, best=0.73, avg=0.73, std=0.00, steps=2.120e+08
2023-07-07 17:23:04,632 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11600, best=0.73, avg=0.73, std=0.00, steps=2.138e+08
2023-07-07 17:23:21,243 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11700, best=0.74, avg=0.73, std=0.00, steps=2.157e+08
2023-07-07 17:23:37,959 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11800, best=0.74, avg=0.73, std=0.00, steps=2.175e+08
2023-07-07 17:23:54,612 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11900, best=0.74, avg=0.73, std=0.00, steps=2.194e+08
2023-07-07 17:24:11,080 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11999, best=0.74, avg=0.73, std=0.00, steps=2.212e+08
2023-07-07 17:24:11,081 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135823
2023-07-07 17:24:11,106 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 17:24:11,138 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 17:24:31,722 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=2.068e+06
2023-07-07 17:24:50,257 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 200, best=0.51, avg=0.50, std=0.00, steps=4.116e+06
2023-07-07 17:25:08,686 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 300, best=0.53, avg=0.52, std=0.00, steps=6.164e+06
2023-07-07 17:25:27,119 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 400, best=0.58, avg=0.58, std=0.00, steps=8.212e+06
2023-07-07 17:25:45,549 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 500, best=0.61, avg=0.60, std=0.00, steps=1.026e+07
2023-07-07 17:26:04,000 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 600, best=0.61, avg=0.61, std=0.00, steps=1.231e+07
2023-07-07 17:26:22,412 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 700, best=0.62, avg=0.62, std=0.00, steps=1.436e+07
2023-07-07 17:26:40,832 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 800, best=0.63, avg=0.62, std=0.00, steps=1.640e+07
2023-07-07 17:26:59,263 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 900, best=0.64, avg=0.63, std=0.00, steps=1.845e+07
2023-07-07 17:27:17,701 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1000, best=0.64, avg=0.63, std=0.00, steps=2.050e+07
2023-07-07 17:27:36,131 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1100, best=0.64, avg=0.64, std=0.00, steps=2.255e+07
2023-07-07 17:27:54,547 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1200, best=0.65, avg=0.64, std=0.00, steps=2.460e+07
2023-07-07 17:28:12,995 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1300, best=0.66, avg=0.65, std=0.00, steps=2.664e+07
2023-07-07 17:28:31,411 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1400, best=0.66, avg=0.65, std=0.00, steps=2.869e+07
2023-07-07 17:28:49,813 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1500, best=0.66, avg=0.65, std=0.00, steps=3.074e+07
2023-07-07 17:29:08,209 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1600, best=0.66, avg=0.66, std=0.00, steps=3.279e+07
2023-07-07 17:29:26,617 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1700, best=0.66, avg=0.66, std=0.00, steps=3.484e+07
2023-07-07 17:29:45,038 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1800, best=0.67, avg=0.66, std=0.00, steps=3.688e+07
2023-07-07 17:30:03,449 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1900, best=0.67, avg=0.66, std=0.00, steps=3.893e+07
2023-07-07 17:30:21,869 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2000, best=0.67, avg=0.67, std=0.00, steps=4.098e+07
2023-07-07 17:30:40,294 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2100, best=0.67, avg=0.67, std=0.00, steps=4.303e+07
2023-07-07 17:30:58,699 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2200, best=0.68, avg=0.67, std=0.00, steps=4.508e+07
2023-07-07 17:31:17,128 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2300, best=0.68, avg=0.67, std=0.00, steps=4.712e+07
2023-07-07 17:31:35,532 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2400, best=0.68, avg=0.67, std=0.00, steps=4.917e+07
2023-07-07 17:31:53,938 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2500, best=0.68, avg=0.67, std=0.00, steps=5.122e+07
2023-07-07 17:32:12,351 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2600, best=0.68, avg=0.68, std=0.00, steps=5.327e+07
2023-07-07 17:32:30,761 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2700, best=0.68, avg=0.68, std=0.00, steps=5.532e+07
2023-07-07 17:32:49,170 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2800, best=0.69, avg=0.68, std=0.00, steps=5.736e+07
2023-07-07 17:33:07,611 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2900, best=0.69, avg=0.68, std=0.00, steps=5.941e+07
2023-07-07 17:33:26,009 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3000, best=0.69, avg=0.68, std=0.00, steps=6.146e+07
2023-07-07 17:33:44,418 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3100, best=0.69, avg=0.68, std=0.00, steps=6.351e+07
2023-07-07 17:34:02,832 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3200, best=0.69, avg=0.68, std=0.00, steps=6.556e+07
2023-07-07 17:34:21,243 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3300, best=0.69, avg=0.68, std=0.00, steps=6.760e+07
2023-07-07 17:34:39,651 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3400, best=0.69, avg=0.69, std=0.00, steps=6.965e+07
2023-07-07 17:34:58,072 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3500, best=0.69, avg=0.69, std=0.00, steps=7.170e+07
2023-07-07 17:35:16,494 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3600, best=0.70, avg=0.69, std=0.00, steps=7.375e+07
2023-07-07 17:35:34,917 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3700, best=0.70, avg=0.69, std=0.00, steps=7.580e+07
2023-07-07 17:35:53,350 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3800, best=0.70, avg=0.69, std=0.00, steps=7.784e+07
2023-07-07 17:36:11,803 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3900, best=0.70, avg=0.69, std=0.00, steps=7.989e+07
2023-07-07 17:36:30,248 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4000, best=0.70, avg=0.69, std=0.00, steps=8.194e+07
2023-07-07 17:36:48,654 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4100, best=0.70, avg=0.69, std=0.00, steps=8.399e+07
2023-07-07 17:37:07,072 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4200, best=0.70, avg=0.69, std=0.00, steps=8.604e+07
2023-07-07 17:37:25,493 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4300, best=0.70, avg=0.69, std=0.00, steps=8.808e+07
2023-07-07 17:37:43,907 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4400, best=0.70, avg=0.69, std=0.00, steps=9.013e+07
2023-07-07 17:38:02,321 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4500, best=0.70, avg=0.69, std=0.00, steps=9.218e+07
2023-07-07 17:38:20,713 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4600, best=0.70, avg=0.70, std=0.00, steps=9.423e+07
2023-07-07 17:38:39,134 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4700, best=0.70, avg=0.70, std=0.00, steps=9.628e+07
2023-07-07 17:38:57,565 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4800, best=0.70, avg=0.70, std=0.00, steps=9.832e+07
2023-07-07 17:39:15,971 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4900, best=0.70, avg=0.70, std=0.00, steps=1.004e+08
2023-07-07 17:39:34,388 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5000, best=0.70, avg=0.70, std=0.00, steps=1.024e+08
2023-07-07 17:39:52,820 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5100, best=0.71, avg=0.70, std=0.00, steps=1.045e+08
2023-07-07 17:40:11,243 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5200, best=0.71, avg=0.70, std=0.00, steps=1.065e+08
2023-07-07 17:40:29,682 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5300, best=0.70, avg=0.70, std=0.00, steps=1.086e+08
2023-07-07 17:40:48,126 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5400, best=0.71, avg=0.70, std=0.00, steps=1.106e+08
2023-07-07 17:41:06,543 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5500, best=0.71, avg=0.70, std=0.00, steps=1.127e+08
2023-07-07 17:41:24,957 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5600, best=0.71, avg=0.70, std=0.00, steps=1.147e+08
2023-07-07 17:41:43,355 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5700, best=0.71, avg=0.70, std=0.00, steps=1.168e+08
2023-07-07 17:42:01,760 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5800, best=0.71, avg=0.70, std=0.00, steps=1.188e+08
2023-07-07 17:42:20,164 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5900, best=0.71, avg=0.70, std=0.00, steps=1.209e+08
2023-07-07 17:42:38,596 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6000, best=0.71, avg=0.70, std=0.00, steps=1.229e+08
2023-07-07 17:42:57,010 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6100, best=0.71, avg=0.70, std=0.00, steps=1.249e+08
2023-07-07 17:43:15,409 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6200, best=0.71, avg=0.70, std=0.00, steps=1.270e+08
2023-07-07 17:43:33,815 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6300, best=0.72, avg=0.70, std=0.00, steps=1.290e+08
2023-07-07 17:43:52,232 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6400, best=0.71, avg=0.70, std=0.00, steps=1.311e+08
2023-07-07 17:44:10,662 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6500, best=0.71, avg=0.71, std=0.00, steps=1.331e+08
2023-07-07 17:44:29,095 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6600, best=0.71, avg=0.70, std=0.00, steps=1.352e+08
2023-07-07 17:44:47,515 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6700, best=0.71, avg=0.71, std=0.00, steps=1.372e+08
2023-07-07 17:45:05,935 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6800, best=0.71, avg=0.71, std=0.00, steps=1.393e+08
2023-07-07 17:45:24,362 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6900, best=0.71, avg=0.71, std=0.00, steps=1.413e+08
2023-07-07 17:45:42,783 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7000, best=0.71, avg=0.71, std=0.00, steps=1.434e+08
2023-07-07 17:46:01,210 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7100, best=0.71, avg=0.71, std=0.00, steps=1.454e+08
2023-07-07 17:46:19,637 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7200, best=0.71, avg=0.71, std=0.00, steps=1.475e+08
2023-07-07 17:46:38,041 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7300, best=0.71, avg=0.71, std=0.00, steps=1.495e+08
2023-07-07 17:46:56,458 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7400, best=0.72, avg=0.71, std=0.00, steps=1.516e+08
2023-07-07 17:47:14,888 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7500, best=0.72, avg=0.71, std=0.00, steps=1.536e+08
2023-07-07 17:47:33,299 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7600, best=0.72, avg=0.71, std=0.00, steps=1.557e+08
2023-07-07 17:47:51,727 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7700, best=0.72, avg=0.71, std=0.00, steps=1.577e+08
2023-07-07 17:48:10,142 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7800, best=0.72, avg=0.71, std=0.00, steps=1.598e+08
2023-07-07 17:48:28,569 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7900, best=0.72, avg=0.71, std=0.00, steps=1.618e+08
2023-07-07 17:48:46,986 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8000, best=0.72, avg=0.71, std=0.00, steps=1.639e+08
2023-07-07 17:49:05,395 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8100, best=0.72, avg=0.71, std=0.00, steps=1.659e+08
2023-07-07 17:49:23,823 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8200, best=0.72, avg=0.71, std=0.00, steps=1.680e+08
2023-07-07 17:49:42,228 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8300, best=0.72, avg=0.71, std=0.00, steps=1.700e+08
2023-07-07 17:50:00,649 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8400, best=0.72, avg=0.71, std=0.00, steps=1.721e+08
2023-07-07 17:50:19,072 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8500, best=0.72, avg=0.71, std=0.00, steps=1.741e+08
2023-07-07 17:50:37,509 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8600, best=0.72, avg=0.71, std=0.00, steps=1.761e+08
2023-07-07 17:50:55,928 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8700, best=0.72, avg=0.71, std=0.00, steps=1.782e+08
2023-07-07 17:51:14,337 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8800, best=0.72, avg=0.71, std=0.00, steps=1.802e+08
2023-07-07 17:51:32,772 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8900, best=0.72, avg=0.71, std=0.00, steps=1.823e+08
2023-07-07 17:51:51,171 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9000, best=0.72, avg=0.71, std=0.00, steps=1.843e+08
2023-07-07 17:52:09,578 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9100, best=0.72, avg=0.71, std=0.00, steps=1.864e+08
2023-07-07 17:52:28,008 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9200, best=0.72, avg=0.71, std=0.00, steps=1.884e+08
2023-07-07 17:52:46,402 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9300, best=0.72, avg=0.71, std=0.00, steps=1.905e+08
2023-07-07 17:53:04,817 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9400, best=0.72, avg=0.71, std=0.00, steps=1.925e+08
2023-07-07 17:53:23,222 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9500, best=0.72, avg=0.71, std=0.00, steps=1.946e+08
2023-07-07 17:53:41,635 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9600, best=0.72, avg=0.71, std=0.00, steps=1.966e+08
2023-07-07 17:54:00,051 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9700, best=0.72, avg=0.71, std=0.00, steps=1.987e+08
2023-07-07 17:54:18,459 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9800, best=0.72, avg=0.71, std=0.00, steps=2.007e+08
2023-07-07 17:54:36,853 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9900, best=0.72, avg=0.71, std=0.00, steps=2.028e+08
2023-07-07 17:54:55,260 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10000, best=0.72, avg=0.71, std=0.00, steps=2.048e+08
2023-07-07 17:55:13,681 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10100, best=0.72, avg=0.72, std=0.00, steps=2.069e+08
2023-07-07 17:55:32,100 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10200, best=0.72, avg=0.72, std=0.00, steps=2.089e+08
2023-07-07 17:55:50,520 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10300, best=0.72, avg=0.72, std=0.00, steps=2.110e+08
2023-07-07 17:56:08,924 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10400, best=0.72, avg=0.72, std=0.00, steps=2.130e+08
2023-07-07 17:56:27,318 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10500, best=0.72, avg=0.72, std=0.00, steps=2.151e+08
2023-07-07 17:56:45,719 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10600, best=0.72, avg=0.72, std=0.00, steps=2.171e+08
2023-07-07 17:57:04,130 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10700, best=0.72, avg=0.72, std=0.00, steps=2.192e+08
2023-07-07 17:57:22,568 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10800, best=0.73, avg=0.72, std=0.00, steps=2.212e+08
2023-07-07 17:57:40,966 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10900, best=0.72, avg=0.72, std=0.00, steps=2.233e+08
2023-07-07 17:57:59,383 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11000, best=0.73, avg=0.72, std=0.00, steps=2.253e+08
2023-07-07 17:58:17,807 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11100, best=0.72, avg=0.72, std=0.00, steps=2.273e+08
2023-07-07 17:58:36,206 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11200, best=0.73, avg=0.72, std=0.00, steps=2.294e+08
2023-07-07 17:58:54,634 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11300, best=0.73, avg=0.72, std=0.00, steps=2.314e+08
2023-07-07 17:59:13,032 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11400, best=0.73, avg=0.72, std=0.00, steps=2.335e+08
2023-07-07 17:59:31,451 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11500, best=0.73, avg=0.72, std=0.00, steps=2.355e+08
2023-07-07 17:59:49,865 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11600, best=0.72, avg=0.72, std=0.00, steps=2.376e+08
2023-07-07 18:00:08,280 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11700, best=0.73, avg=0.72, std=0.00, steps=2.396e+08
2023-07-07 18:00:26,681 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11800, best=0.73, avg=0.72, std=0.00, steps=2.417e+08
2023-07-07 18:00:45,091 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11900, best=0.73, avg=0.72, std=0.00, steps=2.437e+08
2023-07-07 18:01:03,330 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11999, best=0.73, avg=0.72, std=0.00, steps=2.458e+08
2023-07-07 18:01:03,330 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135823
2023-07-07 18:01:03,354 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 18:01:03,385 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 18:01:27,607 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=2.482e+06
2023-07-07 18:01:49,655 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 200, best=0.53, avg=0.52, std=0.00, steps=4.940e+06
2023-07-07 18:02:11,708 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 300, best=0.58, avg=0.57, std=0.00, steps=7.397e+06
2023-07-07 18:02:33,772 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 400, best=0.58, avg=0.57, std=0.00, steps=9.855e+06
2023-07-07 18:02:55,808 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 500, best=0.58, avg=0.57, std=0.00, steps=1.231e+07
2023-07-07 18:03:17,849 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 600, best=0.58, avg=0.57, std=0.00, steps=1.477e+07
2023-07-07 18:03:39,924 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 700, best=0.58, avg=0.57, std=0.00, steps=1.723e+07
2023-07-07 18:04:01,980 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 800, best=0.58, avg=0.57, std=0.00, steps=1.969e+07
2023-07-07 18:04:24,029 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 900, best=0.58, avg=0.57, std=0.00, steps=2.214e+07
2023-07-07 18:04:46,103 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1000, best=0.59, avg=0.58, std=0.00, steps=2.460e+07
2023-07-07 18:05:08,149 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1100, best=0.60, avg=0.59, std=0.00, steps=2.706e+07
2023-07-07 18:05:30,194 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1200, best=0.60, avg=0.59, std=0.00, steps=2.952e+07
2023-07-07 18:05:52,270 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1300, best=0.61, avg=0.60, std=0.00, steps=3.197e+07
2023-07-07 18:06:14,339 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1400, best=0.61, avg=0.60, std=0.00, steps=3.443e+07
2023-07-07 18:06:36,421 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1500, best=0.61, avg=0.60, std=0.00, steps=3.689e+07
2023-07-07 18:06:58,484 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1600, best=0.61, avg=0.60, std=0.00, steps=3.935e+07
2023-07-07 18:07:20,567 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1700, best=0.61, avg=0.61, std=0.00, steps=4.180e+07
2023-07-07 18:07:42,675 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1800, best=0.61, avg=0.61, std=0.00, steps=4.426e+07
2023-07-07 18:08:04,765 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1900, best=0.62, avg=0.62, std=0.00, steps=4.672e+07
2023-07-07 18:08:26,815 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2000, best=0.63, avg=0.63, std=0.00, steps=4.918e+07
2023-07-07 18:08:48,875 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2100, best=0.64, avg=0.63, std=0.00, steps=5.163e+07
2023-07-07 18:09:10,932 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2200, best=0.64, avg=0.63, std=0.00, steps=5.409e+07
2023-07-07 18:09:33,000 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2300, best=0.64, avg=0.63, std=0.00, steps=5.655e+07
2023-07-07 18:09:55,044 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2400, best=0.64, avg=0.64, std=0.00, steps=5.901e+07
2023-07-07 18:10:17,101 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2500, best=0.65, avg=0.64, std=0.00, steps=6.146e+07
2023-07-07 18:10:39,154 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2600, best=0.65, avg=0.64, std=0.00, steps=6.392e+07
2023-07-07 18:11:01,241 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2700, best=0.65, avg=0.64, std=0.00, steps=6.638e+07
2023-07-07 18:11:23,286 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2800, best=0.65, avg=0.65, std=0.00, steps=6.884e+07
2023-07-07 18:11:45,316 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2900, best=0.66, avg=0.65, std=0.00, steps=7.129e+07
2023-07-07 18:12:07,349 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3000, best=0.65, avg=0.65, std=0.00, steps=7.375e+07
2023-07-07 18:12:29,407 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3100, best=0.66, avg=0.65, std=0.00, steps=7.621e+07
2023-07-07 18:12:51,446 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3200, best=0.66, avg=0.65, std=0.00, steps=7.867e+07
2023-07-07 18:13:13,495 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3300, best=0.66, avg=0.65, std=0.00, steps=8.113e+07
2023-07-07 18:13:35,556 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3400, best=0.66, avg=0.65, std=0.00, steps=8.358e+07
2023-07-07 18:13:57,651 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3500, best=0.66, avg=0.65, std=0.00, steps=8.604e+07
2023-07-07 18:14:19,714 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3600, best=0.66, avg=0.65, std=0.00, steps=8.850e+07
2023-07-07 18:14:41,760 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3700, best=0.66, avg=0.65, std=0.00, steps=9.096e+07
2023-07-07 18:15:03,801 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3800, best=0.66, avg=0.66, std=0.00, steps=9.341e+07
2023-07-07 18:15:25,843 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3900, best=0.66, avg=0.66, std=0.00, steps=9.587e+07
2023-07-07 18:15:47,884 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4000, best=0.66, avg=0.66, std=0.00, steps=9.833e+07
2023-07-07 18:16:09,938 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4100, best=0.66, avg=0.66, std=0.00, steps=1.008e+08
2023-07-07 18:16:31,986 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4200, best=0.66, avg=0.66, std=0.00, steps=1.032e+08
2023-07-07 18:16:54,037 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4300, best=0.67, avg=0.66, std=0.00, steps=1.057e+08
2023-07-07 18:17:16,103 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4400, best=0.67, avg=0.66, std=0.00, steps=1.082e+08
2023-07-07 18:17:38,165 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4500, best=0.67, avg=0.66, std=0.00, steps=1.106e+08
2023-07-07 18:18:00,210 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4600, best=0.67, avg=0.66, std=0.00, steps=1.131e+08
2023-07-07 18:18:22,259 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4700, best=0.67, avg=0.66, std=0.00, steps=1.155e+08
2023-07-07 18:18:44,369 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4800, best=0.67, avg=0.66, std=0.00, steps=1.180e+08
2023-07-07 18:19:06,413 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4900, best=0.67, avg=0.66, std=0.00, steps=1.204e+08
2023-07-07 18:19:28,474 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5000, best=0.67, avg=0.66, std=0.00, steps=1.229e+08
2023-07-07 18:19:50,525 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5100, best=0.67, avg=0.66, std=0.00, steps=1.254e+08
2023-07-07 18:20:12,565 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5200, best=0.67, avg=0.66, std=0.00, steps=1.278e+08
2023-07-07 18:20:34,609 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5300, best=0.67, avg=0.67, std=0.00, steps=1.303e+08
2023-07-07 18:20:56,664 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5400, best=0.67, avg=0.67, std=0.00, steps=1.327e+08
2023-07-07 18:21:18,719 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5500, best=0.67, avg=0.67, std=0.00, steps=1.352e+08
2023-07-07 18:21:40,780 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5600, best=0.67, avg=0.67, std=0.00, steps=1.377e+08
2023-07-07 18:22:02,846 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5700, best=0.67, avg=0.67, std=0.00, steps=1.401e+08
2023-07-07 18:22:24,925 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5800, best=0.67, avg=0.67, std=0.00, steps=1.426e+08
2023-07-07 18:22:46,984 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5900, best=0.68, avg=0.67, std=0.00, steps=1.450e+08
2023-07-07 18:23:09,036 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6000, best=0.68, avg=0.67, std=0.00, steps=1.475e+08
2023-07-07 18:23:31,100 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6100, best=0.68, avg=0.67, std=0.00, steps=1.499e+08
2023-07-07 18:23:53,194 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6200, best=0.68, avg=0.67, std=0.00, steps=1.524e+08
2023-07-07 18:24:15,240 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6300, best=0.68, avg=0.67, std=0.00, steps=1.549e+08
2023-07-07 18:24:37,263 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6400, best=0.68, avg=0.67, std=0.00, steps=1.573e+08
2023-07-07 18:24:59,300 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6500, best=0.68, avg=0.67, std=0.00, steps=1.598e+08
2023-07-07 18:25:21,352 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6600, best=0.68, avg=0.67, std=0.00, steps=1.622e+08
2023-07-07 18:25:43,406 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6700, best=0.68, avg=0.67, std=0.00, steps=1.647e+08
2023-07-07 18:26:05,477 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6800, best=0.68, avg=0.67, std=0.00, steps=1.671e+08
2023-07-07 18:26:27,534 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6900, best=0.68, avg=0.67, std=0.00, steps=1.696e+08
2023-07-07 18:26:49,594 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7000, best=0.68, avg=0.67, std=0.00, steps=1.721e+08
2023-07-07 18:27:11,681 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7100, best=0.68, avg=0.67, std=0.00, steps=1.745e+08
2023-07-07 18:27:33,758 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7200, best=0.68, avg=0.67, std=0.00, steps=1.770e+08
2023-07-07 18:27:55,864 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7300, best=0.68, avg=0.67, std=0.00, steps=1.794e+08
2023-07-07 18:28:17,921 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7400, best=0.68, avg=0.67, std=0.00, steps=1.819e+08
2023-07-07 18:28:39,988 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7500, best=0.68, avg=0.68, std=0.00, steps=1.843e+08
2023-07-07 18:29:02,036 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7600, best=0.68, avg=0.68, std=0.00, steps=1.868e+08
2023-07-07 18:29:24,083 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7700, best=0.68, avg=0.68, std=0.00, steps=1.893e+08
2023-07-07 18:29:46,122 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7800, best=0.68, avg=0.68, std=0.00, steps=1.917e+08
2023-07-07 18:30:08,167 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7900, best=0.68, avg=0.68, std=0.00, steps=1.942e+08
2023-07-07 18:30:30,219 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8000, best=0.68, avg=0.68, std=0.00, steps=1.966e+08
2023-07-07 18:30:52,262 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8100, best=0.68, avg=0.68, std=0.00, steps=1.991e+08
2023-07-07 18:31:14,296 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8200, best=0.68, avg=0.68, std=0.00, steps=2.015e+08
2023-07-07 18:31:36,345 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8300, best=0.69, avg=0.68, std=0.00, steps=2.040e+08
2023-07-07 18:31:58,405 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8400, best=0.69, avg=0.68, std=0.00, steps=2.065e+08
2023-07-07 18:32:20,470 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8500, best=0.69, avg=0.68, std=0.00, steps=2.089e+08
2023-07-07 18:32:42,522 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8600, best=0.69, avg=0.68, std=0.00, steps=2.114e+08
2023-07-07 18:33:04,593 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8700, best=0.69, avg=0.68, std=0.00, steps=2.138e+08
2023-07-07 18:33:26,637 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8800, best=0.69, avg=0.68, std=0.00, steps=2.163e+08
2023-07-07 18:33:48,683 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8900, best=0.69, avg=0.68, std=0.00, steps=2.188e+08
2023-07-07 18:34:10,729 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9000, best=0.69, avg=0.68, std=0.00, steps=2.212e+08
2023-07-07 18:34:32,770 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9100, best=0.69, avg=0.68, std=0.00, steps=2.237e+08
2023-07-07 18:34:54,818 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9200, best=0.69, avg=0.68, std=0.00, steps=2.261e+08
2023-07-07 18:35:16,854 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9300, best=0.69, avg=0.68, std=0.00, steps=2.286e+08
2023-07-07 18:35:38,902 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9400, best=0.69, avg=0.68, std=0.00, steps=2.310e+08
2023-07-07 18:36:00,941 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9500, best=0.69, avg=0.68, std=0.00, steps=2.335e+08
2023-07-07 18:36:23,000 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9600, best=0.69, avg=0.68, std=0.00, steps=2.360e+08
2023-07-07 18:36:45,045 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9700, best=0.69, avg=0.68, std=0.00, steps=2.384e+08
2023-07-07 18:37:07,083 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9800, best=0.69, avg=0.68, std=0.00, steps=2.409e+08
2023-07-07 18:37:29,143 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9900, best=0.69, avg=0.68, std=0.00, steps=2.433e+08
2023-07-07 18:37:51,185 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10000, best=0.69, avg=0.68, std=0.00, steps=2.458e+08
2023-07-07 18:38:13,240 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10100, best=0.69, avg=0.68, std=0.00, steps=2.482e+08
2023-07-07 18:38:35,275 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10200, best=0.69, avg=0.68, std=0.00, steps=2.507e+08
2023-07-07 18:38:57,327 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10300, best=0.69, avg=0.68, std=0.00, steps=2.532e+08
2023-07-07 18:39:19,390 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10400, best=0.69, avg=0.69, std=0.00, steps=2.556e+08
2023-07-07 18:39:41,481 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10500, best=0.70, avg=0.69, std=0.00, steps=2.581e+08
2023-07-07 18:40:03,515 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10600, best=0.69, avg=0.69, std=0.00, steps=2.605e+08
2023-07-07 18:40:25,546 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10700, best=0.69, avg=0.69, std=0.00, steps=2.630e+08
2023-07-07 18:40:47,585 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10800, best=0.69, avg=0.69, std=0.00, steps=2.654e+08
2023-07-07 18:41:09,625 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10900, best=0.69, avg=0.69, std=0.00, steps=2.679e+08
2023-07-07 18:41:31,690 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11000, best=0.69, avg=0.69, std=0.00, steps=2.704e+08
2023-07-07 18:41:53,734 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11100, best=0.70, avg=0.69, std=0.00, steps=2.728e+08
2023-07-07 18:42:15,792 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11200, best=0.70, avg=0.69, std=0.00, steps=2.753e+08
2023-07-07 18:42:37,855 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11300, best=0.70, avg=0.69, std=0.00, steps=2.777e+08
2023-07-07 18:42:59,922 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11400, best=0.69, avg=0.69, std=0.00, steps=2.802e+08
2023-07-07 18:43:21,976 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11500, best=0.70, avg=0.69, std=0.00, steps=2.826e+08
2023-07-07 18:43:44,019 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11600, best=0.70, avg=0.69, std=0.00, steps=2.851e+08
2023-07-07 18:44:06,056 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11700, best=0.70, avg=0.69, std=0.00, steps=2.876e+08
2023-07-07 18:44:28,122 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11800, best=0.70, avg=0.69, std=0.00, steps=2.900e+08
2023-07-07 18:44:50,173 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11900, best=0.70, avg=0.69, std=0.00, steps=2.925e+08
2023-07-07 18:45:11,991 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11999, best=0.70, avg=0.69, std=0.00, steps=2.949e+08
2023-07-07 18:45:11,992 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135823
2023-07-07 18:45:12,015 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 18:45:12,046 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 18:45:43,888 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 100, best=0.50, avg=0.50, std=0.00, steps=3.310e+06
2023-07-07 18:46:13,198 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 200, best=0.51, avg=0.50, std=0.00, steps=6.586e+06
2023-07-07 18:46:42,520 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 300, best=0.51, avg=0.50, std=0.00, steps=9.863e+06
2023-07-07 18:47:11,823 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 400, best=0.56, avg=0.56, std=0.00, steps=1.314e+07
2023-07-07 18:47:41,109 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 500, best=0.58, avg=0.57, std=0.00, steps=1.642e+07
2023-07-07 18:48:10,396 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 600, best=0.58, avg=0.58, std=0.00, steps=1.969e+07
2023-07-07 18:48:39,667 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 700, best=0.59, avg=0.59, std=0.00, steps=2.297e+07
2023-07-07 18:49:08,934 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 800, best=0.60, avg=0.59, std=0.00, steps=2.625e+07
2023-07-07 18:49:38,224 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 900, best=0.61, avg=0.60, std=0.00, steps=2.952e+07
2023-07-07 18:50:07,537 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1000, best=0.61, avg=0.60, std=0.00, steps=3.280e+07
2023-07-07 18:50:36,870 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1100, best=0.61, avg=0.61, std=0.00, steps=3.608e+07
2023-07-07 18:51:06,182 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1200, best=0.61, avg=0.61, std=0.00, steps=3.935e+07
2023-07-07 18:51:35,468 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1300, best=0.62, avg=0.61, std=0.00, steps=4.263e+07
2023-07-07 18:52:04,760 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1400, best=0.62, avg=0.61, std=0.00, steps=4.591e+07
2023-07-07 18:52:34,045 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1500, best=0.62, avg=0.62, std=0.00, steps=4.918e+07
2023-07-07 18:53:03,330 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1600, best=0.62, avg=0.62, std=0.00, steps=5.246e+07
2023-07-07 18:53:32,644 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1700, best=0.62, avg=0.62, std=0.00, steps=5.574e+07
2023-07-07 18:54:01,961 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1800, best=0.63, avg=0.62, std=0.00, steps=5.902e+07
2023-07-07 18:54:31,329 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1900, best=0.63, avg=0.62, std=0.00, steps=6.229e+07
2023-07-07 18:55:00,637 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2000, best=0.63, avg=0.62, std=0.00, steps=6.557e+07
2023-07-07 18:55:29,944 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2100, best=0.63, avg=0.63, std=0.00, steps=6.885e+07
2023-07-07 18:55:59,252 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2200, best=0.63, avg=0.63, std=0.00, steps=7.212e+07
2023-07-07 18:56:28,557 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2300, best=0.63, avg=0.63, std=0.00, steps=7.540e+07
2023-07-07 18:56:57,855 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2400, best=0.63, avg=0.63, std=0.00, steps=7.868e+07
2023-07-07 18:57:27,161 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2500, best=0.63, avg=0.63, std=0.00, steps=8.195e+07
2023-07-07 18:57:56,489 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2600, best=0.64, avg=0.63, std=0.00, steps=8.523e+07
2023-07-07 18:58:25,820 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2700, best=0.64, avg=0.63, std=0.00, steps=8.851e+07
2023-07-07 18:58:55,116 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2800, best=0.64, avg=0.63, std=0.00, steps=9.178e+07
2023-07-07 18:59:24,452 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2900, best=0.64, avg=0.63, std=0.00, steps=9.506e+07
2023-07-07 18:59:53,749 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3000, best=0.64, avg=0.63, std=0.00, steps=9.834e+07
2023-07-07 19:00:23,051 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3100, best=0.64, avg=0.63, std=0.00, steps=1.016e+08
2023-07-07 19:00:52,351 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3200, best=0.64, avg=0.64, std=0.00, steps=1.049e+08
2023-07-07 19:01:21,684 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3300, best=0.64, avg=0.64, std=0.00, steps=1.082e+08
2023-07-07 19:01:50,988 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3400, best=0.64, avg=0.64, std=0.00, steps=1.114e+08
2023-07-07 19:02:20,321 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3500, best=0.64, avg=0.64, std=0.00, steps=1.147e+08
2023-07-07 19:02:49,641 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3600, best=0.64, avg=0.64, std=0.00, steps=1.180e+08
2023-07-07 19:03:18,934 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3700, best=0.64, avg=0.64, std=0.00, steps=1.213e+08
2023-07-07 19:03:48,220 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3800, best=0.64, avg=0.64, std=0.00, steps=1.246e+08
2023-07-07 19:04:17,545 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3900, best=0.65, avg=0.64, std=0.00, steps=1.278e+08
2023-07-07 19:04:46,844 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4000, best=0.65, avg=0.64, std=0.00, steps=1.311e+08
2023-07-07 19:05:16,144 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4100, best=0.65, avg=0.64, std=0.00, steps=1.344e+08
2023-07-07 19:05:45,456 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4200, best=0.65, avg=0.64, std=0.00, steps=1.377e+08
2023-07-07 19:06:14,753 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4300, best=0.65, avg=0.64, std=0.00, steps=1.409e+08
2023-07-07 19:06:44,052 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4400, best=0.65, avg=0.64, std=0.00, steps=1.442e+08
2023-07-07 19:07:13,348 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4500, best=0.65, avg=0.64, std=0.00, steps=1.475e+08
2023-07-07 19:07:42,655 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4600, best=0.65, avg=0.64, std=0.00, steps=1.508e+08
2023-07-07 19:08:11,937 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4700, best=0.65, avg=0.64, std=0.00, steps=1.540e+08
2023-07-07 19:08:41,223 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4800, best=0.65, avg=0.64, std=0.00, steps=1.573e+08
2023-07-07 19:09:10,509 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4900, best=0.65, avg=0.64, std=0.00, steps=1.606e+08
2023-07-07 19:09:39,797 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5000, best=0.65, avg=0.64, std=0.00, steps=1.639e+08
2023-07-07 19:10:09,128 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5100, best=0.65, avg=0.65, std=0.00, steps=1.671e+08
2023-07-07 19:10:38,439 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5200, best=0.65, avg=0.65, std=0.00, steps=1.704e+08
2023-07-07 19:11:07,771 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5300, best=0.65, avg=0.65, std=0.00, steps=1.737e+08
2023-07-07 19:11:37,068 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5400, best=0.65, avg=0.65, std=0.00, steps=1.770e+08
2023-07-07 19:12:06,365 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5500, best=0.65, avg=0.65, std=0.00, steps=1.803e+08
2023-07-07 19:12:35,699 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5600, best=0.65, avg=0.65, std=0.00, steps=1.835e+08
2023-07-07 19:13:05,000 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5700, best=0.65, avg=0.65, std=0.00, steps=1.868e+08
2023-07-07 19:13:34,299 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5800, best=0.65, avg=0.65, std=0.00, steps=1.901e+08
2023-07-07 19:14:03,607 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5900, best=0.65, avg=0.65, std=0.00, steps=1.934e+08
2023-07-07 19:14:32,927 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6000, best=0.66, avg=0.65, std=0.00, steps=1.966e+08
2023-07-07 19:15:02,261 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6100, best=0.65, avg=0.65, std=0.00, steps=1.999e+08
2023-07-07 19:15:31,558 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6200, best=0.65, avg=0.65, std=0.00, steps=2.032e+08
2023-07-07 19:16:00,847 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6300, best=0.66, avg=0.65, std=0.00, steps=2.065e+08
2023-07-07 19:16:30,140 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6400, best=0.66, avg=0.65, std=0.00, steps=2.097e+08
2023-07-07 19:16:59,439 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6500, best=0.66, avg=0.65, std=0.00, steps=2.130e+08
2023-07-07 19:17:28,747 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6600, best=0.65, avg=0.65, std=0.00, steps=2.163e+08
2023-07-07 19:17:58,035 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6700, best=0.65, avg=0.65, std=0.00, steps=2.196e+08
2023-07-07 19:18:27,315 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6800, best=0.66, avg=0.65, std=0.00, steps=2.229e+08
2023-07-07 19:18:56,594 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6900, best=0.66, avg=0.65, std=0.00, steps=2.261e+08
2023-07-07 19:19:25,862 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7000, best=0.66, avg=0.65, std=0.00, steps=2.294e+08
2023-07-07 19:19:55,149 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7100, best=0.66, avg=0.65, std=0.00, steps=2.327e+08
2023-07-07 19:20:24,440 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7200, best=0.66, avg=0.65, std=0.00, steps=2.360e+08
2023-07-07 19:20:53,719 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7300, best=0.66, avg=0.65, std=0.00, steps=2.392e+08
2023-07-07 19:21:23,012 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7400, best=0.66, avg=0.65, std=0.00, steps=2.425e+08
2023-07-07 19:21:52,315 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7500, best=0.66, avg=0.65, std=0.00, steps=2.458e+08
2023-07-07 19:22:21,620 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7600, best=0.66, avg=0.65, std=0.00, steps=2.491e+08
2023-07-07 19:22:50,906 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7700, best=0.66, avg=0.65, std=0.00, steps=2.523e+08
2023-07-07 19:23:20,192 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7800, best=0.66, avg=0.65, std=0.00, steps=2.556e+08
2023-07-07 19:23:49,484 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7900, best=0.66, avg=0.66, std=0.00, steps=2.589e+08
2023-07-07 19:24:18,770 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8000, best=0.66, avg=0.66, std=0.00, steps=2.622e+08
2023-07-07 19:24:48,070 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8100, best=0.66, avg=0.66, std=0.00, steps=2.655e+08
2023-07-07 19:25:17,374 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8200, best=0.66, avg=0.66, std=0.00, steps=2.687e+08
2023-07-07 19:25:46,682 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8300, best=0.66, avg=0.66, std=0.00, steps=2.720e+08
2023-07-07 19:26:16,012 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8400, best=0.66, avg=0.66, std=0.00, steps=2.753e+08
2023-07-07 19:26:45,331 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8500, best=0.66, avg=0.66, std=0.00, steps=2.786e+08
2023-07-07 19:27:14,595 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8600, best=0.66, avg=0.66, std=0.00, steps=2.818e+08
2023-07-07 19:27:43,888 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8700, best=0.66, avg=0.66, std=0.00, steps=2.851e+08
2023-07-07 19:28:13,177 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8800, best=0.66, avg=0.66, std=0.00, steps=2.884e+08
2023-07-07 19:28:42,487 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8900, best=0.66, avg=0.66, std=0.00, steps=2.917e+08
2023-07-07 19:29:11,795 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9000, best=0.66, avg=0.66, std=0.00, steps=2.949e+08
2023-07-07 19:29:41,090 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9100, best=0.66, avg=0.66, std=0.00, steps=2.982e+08
2023-07-07 19:30:10,354 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9200, best=0.66, avg=0.66, std=0.00, steps=3.015e+08
2023-07-07 19:30:39,658 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9300, best=0.66, avg=0.66, std=0.00, steps=3.048e+08
2023-07-07 19:31:08,951 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9400, best=0.67, avg=0.66, std=0.00, steps=3.081e+08
2023-07-07 19:31:38,242 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9500, best=0.66, avg=0.66, std=0.00, steps=3.113e+08
2023-07-07 19:32:07,555 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9600, best=0.66, avg=0.66, std=0.00, steps=3.146e+08
2023-07-07 19:32:36,861 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9700, best=0.66, avg=0.66, std=0.00, steps=3.179e+08
2023-07-07 19:33:06,167 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9800, best=0.66, avg=0.66, std=0.00, steps=3.212e+08
2023-07-07 19:33:35,490 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9900, best=0.66, avg=0.66, std=0.00, steps=3.244e+08
2023-07-07 19:34:04,790 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10000, best=0.67, avg=0.66, std=0.00, steps=3.277e+08
2023-07-07 19:34:34,059 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10100, best=0.66, avg=0.66, std=0.00, steps=3.310e+08
2023-07-07 19:35:03,366 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10200, best=0.67, avg=0.66, std=0.00, steps=3.343e+08
2023-07-07 19:35:32,674 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10300, best=0.66, avg=0.66, std=0.00, steps=3.375e+08
2023-07-07 19:36:01,983 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10400, best=0.66, avg=0.66, std=0.00, steps=3.408e+08
2023-07-07 19:36:31,288 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10500, best=0.66, avg=0.66, std=0.00, steps=3.441e+08
2023-07-07 19:37:00,596 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10600, best=0.67, avg=0.66, std=0.00, steps=3.474e+08
2023-07-07 19:37:29,904 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10700, best=0.67, avg=0.66, std=0.00, steps=3.507e+08
2023-07-07 19:37:59,233 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10800, best=0.67, avg=0.66, std=0.00, steps=3.539e+08
2023-07-07 19:38:28,530 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10900, best=0.67, avg=0.66, std=0.00, steps=3.572e+08
2023-07-07 19:38:57,797 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11000, best=0.67, avg=0.66, std=0.00, steps=3.605e+08
2023-07-07 19:39:27,101 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11100, best=0.67, avg=0.66, std=0.00, steps=3.638e+08
2023-07-07 19:39:56,388 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11200, best=0.67, avg=0.66, std=0.00, steps=3.670e+08
2023-07-07 19:40:25,685 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11300, best=0.67, avg=0.66, std=0.00, steps=3.703e+08
2023-07-07 19:40:55,000 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11400, best=0.67, avg=0.66, std=0.00, steps=3.736e+08
2023-07-07 19:41:24,308 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11500, best=0.67, avg=0.66, std=0.00, steps=3.769e+08
2023-07-07 19:41:53,582 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11600, best=0.67, avg=0.66, std=0.00, steps=3.801e+08
2023-07-07 19:42:22,857 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11700, best=0.67, avg=0.66, std=0.00, steps=3.834e+08
2023-07-07 19:42:52,171 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11800, best=0.67, avg=0.66, std=0.00, steps=3.867e+08
2023-07-07 19:43:21,483 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11900, best=0.67, avg=0.66, std=0.00, steps=3.900e+08
2023-07-07 19:43:50,478 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11999, best=0.67, avg=0.66, std=0.00, steps=3.932e+08
2023-07-07 19:43:50,479 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135823
2023-07-07 19:43:50,503 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 19:43:50,538 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 19:44:24,001 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=3.516e+06
2023-07-07 19:44:55,116 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 200, best=0.51, avg=0.50, std=0.00, steps=6.998e+06
2023-07-07 19:45:26,233 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 300, best=0.51, avg=0.50, std=0.00, steps=1.048e+07
2023-07-07 19:45:57,339 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 400, best=0.51, avg=0.50, std=0.00, steps=1.396e+07
2023-07-07 19:46:28,442 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 500, best=0.51, avg=0.50, std=0.00, steps=1.744e+07
2023-07-07 19:46:59,521 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 600, best=0.51, avg=0.50, std=0.00, steps=2.092e+07
2023-07-07 19:47:30,600 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 700, best=0.51, avg=0.50, std=0.00, steps=2.441e+07
2023-07-07 19:48:01,750 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 800, best=0.51, avg=0.50, std=0.00, steps=2.789e+07
2023-07-07 19:48:32,878 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 900, best=0.51, avg=0.50, std=0.00, steps=3.137e+07
2023-07-07 19:49:03,970 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1000, best=0.51, avg=0.50, std=0.00, steps=3.485e+07
2023-07-07 19:49:35,088 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1100, best=0.51, avg=0.50, std=0.00, steps=3.833e+07
2023-07-07 19:50:06,154 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1200, best=0.51, avg=0.50, std=0.00, steps=4.181e+07
2023-07-07 19:50:37,277 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1300, best=0.51, avg=0.50, std=0.00, steps=4.530e+07
2023-07-07 19:51:08,402 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1400, best=0.51, avg=0.50, std=0.00, steps=4.878e+07
2023-07-07 19:51:39,529 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1500, best=0.51, avg=0.50, std=0.00, steps=5.226e+07
2023-07-07 19:52:10,620 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1600, best=0.51, avg=0.50, std=0.00, steps=5.574e+07
2023-07-07 19:52:41,696 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1700, best=0.51, avg=0.50, std=0.00, steps=5.922e+07
2023-07-07 19:53:12,809 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1800, best=0.51, avg=0.50, std=0.00, steps=6.270e+07
2023-07-07 19:53:43,894 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1900, best=0.51, avg=0.50, std=0.00, steps=6.619e+07
2023-07-07 19:54:15,014 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2000, best=0.51, avg=0.50, std=0.00, steps=6.967e+07
2023-07-07 19:54:46,095 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2100, best=0.51, avg=0.50, std=0.00, steps=7.315e+07
2023-07-07 19:55:17,171 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2200, best=0.51, avg=0.50, std=0.00, steps=7.663e+07
2023-07-07 19:55:48,302 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2300, best=0.51, avg=0.50, std=0.00, steps=8.011e+07
2023-07-07 19:56:19,429 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2400, best=0.51, avg=0.50, std=0.00, steps=8.359e+07
2023-07-07 19:56:50,523 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2500, best=0.51, avg=0.50, std=0.00, steps=8.707e+07
2023-07-07 19:57:21,637 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2600, best=0.51, avg=0.50, std=0.00, steps=9.056e+07
2023-07-07 19:57:52,736 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2700, best=0.51, avg=0.50, std=0.00, steps=9.404e+07
2023-07-07 19:58:23,819 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2800, best=0.51, avg=0.50, std=0.00, steps=9.752e+07
2023-07-07 19:58:54,934 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2900, best=0.51, avg=0.50, std=0.00, steps=1.010e+08
2023-07-07 19:59:26,028 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3000, best=0.51, avg=0.50, std=0.00, steps=1.045e+08
2023-07-07 19:59:57,102 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3100, best=0.51, avg=0.50, std=0.00, steps=1.080e+08
2023-07-07 20:00:28,193 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3200, best=0.51, avg=0.50, std=0.00, steps=1.114e+08
2023-07-07 20:00:59,305 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3300, best=0.51, avg=0.50, std=0.00, steps=1.149e+08
2023-07-07 20:01:30,394 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3400, best=0.51, avg=0.50, std=0.00, steps=1.184e+08
2023-07-07 20:02:01,493 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3500, best=0.51, avg=0.50, std=0.00, steps=1.219e+08
2023-07-07 20:02:32,628 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3600, best=0.51, avg=0.50, std=0.00, steps=1.254e+08
2023-07-07 20:03:03,798 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3700, best=0.51, avg=0.50, std=0.00, steps=1.289e+08
2023-07-07 20:03:34,887 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3800, best=0.51, avg=0.50, std=0.00, steps=1.323e+08
2023-07-07 20:04:05,995 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3900, best=0.51, avg=0.50, std=0.00, steps=1.358e+08
2023-07-07 20:04:37,107 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4000, best=0.51, avg=0.50, std=0.00, steps=1.393e+08
2023-07-07 20:05:08,224 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4100, best=0.51, avg=0.50, std=0.00, steps=1.428e+08
2023-07-07 20:05:39,320 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4200, best=0.51, avg=0.50, std=0.00, steps=1.463e+08
2023-07-07 20:06:10,422 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4300, best=0.51, avg=0.50, std=0.00, steps=1.497e+08
2023-07-07 20:06:41,504 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4400, best=0.51, avg=0.50, std=0.00, steps=1.532e+08
2023-07-07 20:07:12,599 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4500, best=0.51, avg=0.50, std=0.00, steps=1.567e+08
2023-07-07 20:07:43,705 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4600, best=0.51, avg=0.50, std=0.00, steps=1.602e+08
2023-07-07 20:08:14,825 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4700, best=0.51, avg=0.50, std=0.00, steps=1.637e+08
2023-07-07 20:08:45,928 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4800, best=0.51, avg=0.50, std=0.00, steps=1.672e+08
2023-07-07 20:09:17,001 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4900, best=0.51, avg=0.50, std=0.00, steps=1.706e+08
2023-07-07 20:09:48,102 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5000, best=0.51, avg=0.50, std=0.00, steps=1.741e+08
2023-07-07 20:10:19,193 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5100, best=0.51, avg=0.50, std=0.00, steps=1.776e+08
2023-07-07 20:10:50,296 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5200, best=0.51, avg=0.50, std=0.00, steps=1.811e+08
2023-07-07 20:11:21,403 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5300, best=0.51, avg=0.50, std=0.00, steps=1.846e+08
2023-07-07 20:11:52,520 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5400, best=0.51, avg=0.50, std=0.00, steps=1.880e+08
2023-07-07 20:12:23,605 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5500, best=0.51, avg=0.50, std=0.00, steps=1.915e+08
2023-07-07 20:12:54,707 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5600, best=0.51, avg=0.50, std=0.00, steps=1.950e+08
2023-07-07 20:13:25,834 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5700, best=0.51, avg=0.50, std=0.00, steps=1.985e+08
2023-07-07 20:13:56,972 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5800, best=0.51, avg=0.50, std=0.00, steps=2.020e+08
2023-07-07 20:14:28,070 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5900, best=0.51, avg=0.50, std=0.00, steps=2.054e+08
2023-07-07 20:14:59,170 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6000, best=0.51, avg=0.50, std=0.00, steps=2.089e+08
2023-07-07 20:15:30,322 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6100, best=0.51, avg=0.50, std=0.00, steps=2.124e+08
2023-07-07 20:16:01,408 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6200, best=0.51, avg=0.50, std=0.00, steps=2.159e+08
2023-07-07 20:16:32,501 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6300, best=0.51, avg=0.50, std=0.00, steps=2.194e+08
2023-07-07 20:17:03,617 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6400, best=0.51, avg=0.50, std=0.00, steps=2.229e+08
2023-07-07 20:17:34,729 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6500, best=0.51, avg=0.50, std=0.00, steps=2.263e+08
2023-07-07 20:18:05,841 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6600, best=0.51, avg=0.50, std=0.00, steps=2.298e+08
2023-07-07 20:18:36,963 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6700, best=0.51, avg=0.50, std=0.00, steps=2.333e+08
2023-07-07 20:19:08,056 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6800, best=0.51, avg=0.50, std=0.00, steps=2.368e+08
2023-07-07 20:19:39,144 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6900, best=0.51, avg=0.50, std=0.00, steps=2.403e+08
2023-07-07 20:20:10,254 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7000, best=0.51, avg=0.50, std=0.00, steps=2.437e+08
2023-07-07 20:20:41,342 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7100, best=0.51, avg=0.50, std=0.00, steps=2.472e+08
2023-07-07 20:21:12,450 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7200, best=0.51, avg=0.50, std=0.00, steps=2.507e+08
2023-07-07 20:21:43,556 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7300, best=0.51, avg=0.50, std=0.00, steps=2.542e+08
2023-07-07 20:22:14,630 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7400, best=0.51, avg=0.50, std=0.00, steps=2.577e+08
2023-07-07 20:22:45,731 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7500, best=0.51, avg=0.50, std=0.00, steps=2.612e+08
2023-07-07 20:23:16,881 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7600, best=0.51, avg=0.50, std=0.00, steps=2.646e+08
2023-07-07 20:23:47,941 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7700, best=0.51, avg=0.50, std=0.00, steps=2.681e+08
2023-07-07 20:24:19,054 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7800, best=0.51, avg=0.50, std=0.00, steps=2.716e+08
2023-07-07 20:24:50,174 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7900, best=0.51, avg=0.50, std=0.00, steps=2.751e+08
2023-07-07 20:25:21,281 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8000, best=0.51, avg=0.50, std=0.00, steps=2.786e+08
2023-07-07 20:25:52,379 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8100, best=0.51, avg=0.50, std=0.00, steps=2.820e+08
2023-07-07 20:26:23,446 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8200, best=0.51, avg=0.50, std=0.00, steps=2.855e+08
2023-07-07 20:26:54,560 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8300, best=0.51, avg=0.50, std=0.00, steps=2.890e+08
2023-07-07 20:27:25,672 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8400, best=0.51, avg=0.50, std=0.00, steps=2.925e+08
2023-07-07 20:27:56,778 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8500, best=0.51, avg=0.50, std=0.00, steps=2.960e+08
2023-07-07 20:28:27,887 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8600, best=0.51, avg=0.50, std=0.00, steps=2.995e+08
2023-07-07 20:28:58,993 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8700, best=0.51, avg=0.50, std=0.00, steps=3.029e+08
2023-07-07 20:29:30,097 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8800, best=0.51, avg=0.50, std=0.00, steps=3.064e+08
2023-07-07 20:30:01,320 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8900, best=0.51, avg=0.50, std=0.00, steps=3.099e+08
2023-07-07 20:30:32,470 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9000, best=0.51, avg=0.50, std=0.00, steps=3.134e+08
2023-07-07 20:31:03,604 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9100, best=0.51, avg=0.50, std=0.00, steps=3.169e+08
2023-07-07 20:31:34,716 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9200, best=0.51, avg=0.50, std=0.00, steps=3.203e+08
2023-07-07 20:32:05,830 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9300, best=0.51, avg=0.50, std=0.00, steps=3.238e+08
2023-07-07 20:32:36,946 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9400, best=0.51, avg=0.50, std=0.00, steps=3.273e+08
2023-07-07 20:33:08,074 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9500, best=0.52, avg=0.51, std=0.00, steps=3.308e+08
2023-07-07 20:33:39,157 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9600, best=0.56, avg=0.56, std=0.00, steps=3.343e+08
2023-07-07 20:34:10,243 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9700, best=0.57, avg=0.57, std=0.00, steps=3.378e+08
2023-07-07 20:34:41,338 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9800, best=0.58, avg=0.57, std=0.00, steps=3.412e+08
2023-07-07 20:35:12,443 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9900, best=0.58, avg=0.58, std=0.00, steps=3.447e+08
2023-07-07 20:35:43,544 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10000, best=0.58, avg=0.58, std=0.00, steps=3.482e+08
2023-07-07 20:36:14,666 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10100, best=0.59, avg=0.58, std=0.00, steps=3.517e+08
2023-07-07 20:36:45,767 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10200, best=0.59, avg=0.59, std=0.00, steps=3.552e+08
2023-07-07 20:37:16,870 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10300, best=0.59, avg=0.59, std=0.00, steps=3.586e+08
2023-07-07 20:37:47,972 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10400, best=0.59, avg=0.59, std=0.00, steps=3.621e+08
2023-07-07 20:38:19,119 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10500, best=0.60, avg=0.59, std=0.00, steps=3.656e+08
2023-07-07 20:38:50,249 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10600, best=0.60, avg=0.59, std=0.00, steps=3.691e+08
2023-07-07 20:39:21,385 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10700, best=0.60, avg=0.60, std=0.00, steps=3.726e+08
2023-07-07 20:39:52,532 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10800, best=0.60, avg=0.60, std=0.00, steps=3.760e+08
2023-07-07 20:40:23,671 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10900, best=0.60, avg=0.60, std=0.00, steps=3.795e+08
2023-07-07 20:40:54,797 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11000, best=0.61, avg=0.60, std=0.00, steps=3.830e+08
2023-07-07 20:41:25,932 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11100, best=0.61, avg=0.60, std=0.00, steps=3.865e+08
2023-07-07 20:41:57,063 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11200, best=0.61, avg=0.60, std=0.00, steps=3.900e+08
2023-07-07 20:42:28,194 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11300, best=0.61, avg=0.61, std=0.00, steps=3.935e+08
2023-07-07 20:42:59,332 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11400, best=0.61, avg=0.61, std=0.00, steps=3.969e+08
2023-07-07 20:43:30,464 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11500, best=0.61, avg=0.61, std=0.00, steps=4.004e+08
2023-07-07 20:44:01,564 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11600, best=0.61, avg=0.61, std=0.00, steps=4.039e+08
2023-07-07 20:44:32,662 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11700, best=0.62, avg=0.61, std=0.00, steps=4.074e+08
2023-07-07 20:45:03,802 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11800, best=0.61, avg=0.61, std=0.00, steps=4.109e+08
2023-07-07 20:45:34,915 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11900, best=0.62, avg=0.61, std=0.00, steps=4.143e+08
2023-07-07 20:46:05,726 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11999, best=0.62, avg=0.61, std=0.00, steps=4.178e+08
2023-07-07 20:46:05,727 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135823
2023-07-07 20:46:05,751 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 20:46:05,784 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 20:46:41,052 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=3.723e+06
2023-07-07 20:47:14,031 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 200, best=0.51, avg=0.50, std=0.00, steps=7.410e+06
2023-07-07 20:47:47,117 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 300, best=0.51, avg=0.50, std=0.00, steps=1.110e+07
2023-07-07 20:48:20,351 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 400, best=0.51, avg=0.50, std=0.00, steps=1.478e+07
2023-07-07 20:48:53,382 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 500, best=0.51, avg=0.50, std=0.00, steps=1.847e+07
2023-07-07 20:49:26,565 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 600, best=0.51, avg=0.50, std=0.00, steps=2.216e+07
2023-07-07 20:49:59,706 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 700, best=0.51, avg=0.50, std=0.00, steps=2.584e+07
2023-07-07 20:50:32,948 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 800, best=0.51, avg=0.50, std=0.00, steps=2.953e+07
2023-07-07 20:51:06,090 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 900, best=0.51, avg=0.50, std=0.00, steps=3.321e+07
2023-07-07 20:51:39,530 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1000, best=0.51, avg=0.50, std=0.00, steps=3.690e+07
2023-07-07 20:52:12,817 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1100, best=0.51, avg=0.50, std=0.00, steps=4.059e+07
2023-07-07 20:52:46,056 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1200, best=0.53, avg=0.52, std=0.00, steps=4.427e+07
2023-07-07 20:53:19,307 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1300, best=0.56, avg=0.56, std=0.00, steps=4.796e+07
2023-07-07 20:53:52,475 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1400, best=0.57, avg=0.57, std=0.00, steps=5.165e+07
2023-07-07 20:54:25,766 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1500, best=0.58, avg=0.57, std=0.00, steps=5.533e+07
2023-07-07 20:54:58,955 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1600, best=0.58, avg=0.57, std=0.00, steps=5.902e+07
2023-07-07 20:55:32,216 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1700, best=0.58, avg=0.58, std=0.00, steps=6.271e+07
2023-07-07 20:56:05,554 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1800, best=0.58, avg=0.58, std=0.00, steps=6.639e+07
2023-07-07 20:56:38,787 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1900, best=0.58, avg=0.58, std=0.00, steps=7.008e+07
2023-07-07 20:57:11,971 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2000, best=0.59, avg=0.58, std=0.00, steps=7.376e+07
2023-07-07 20:57:45,118 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2100, best=0.59, avg=0.58, std=0.00, steps=7.745e+07
2023-07-07 20:58:18,243 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2200, best=0.59, avg=0.58, std=0.00, steps=8.114e+07
2023-07-07 20:58:51,429 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2300, best=0.59, avg=0.58, std=0.00, steps=8.482e+07
2023-07-07 20:59:24,614 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2400, best=0.59, avg=0.59, std=0.00, steps=8.851e+07
2023-07-07 20:59:57,901 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2500, best=0.59, avg=0.59, std=0.00, steps=9.220e+07
2023-07-07 21:00:31,183 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2600, best=0.60, avg=0.59, std=0.00, steps=9.588e+07
2023-07-07 21:01:04,537 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2700, best=0.60, avg=0.59, std=0.00, steps=9.957e+07
2023-07-07 21:01:37,723 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2800, best=0.60, avg=0.60, std=0.00, steps=1.033e+08
2023-07-07 21:02:10,880 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2900, best=0.60, avg=0.60, std=0.00, steps=1.069e+08
2023-07-07 21:02:44,016 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3000, best=0.61, avg=0.60, std=0.00, steps=1.106e+08
2023-07-07 21:03:17,273 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3100, best=0.61, avg=0.60, std=0.00, steps=1.143e+08
2023-07-07 21:03:50,510 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3200, best=0.61, avg=0.60, std=0.00, steps=1.180e+08
2023-07-07 21:04:23,710 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3300, best=0.61, avg=0.60, std=0.00, steps=1.217e+08
2023-07-07 21:04:56,988 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3400, best=0.61, avg=0.61, std=0.00, steps=1.254e+08
2023-07-07 21:05:30,161 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3500, best=0.61, avg=0.61, std=0.00, steps=1.291e+08
2023-07-07 21:06:03,314 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3600, best=0.61, avg=0.61, std=0.00, steps=1.327e+08
2023-07-07 21:06:36,577 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3700, best=0.62, avg=0.61, std=0.00, steps=1.364e+08
2023-07-07 21:07:09,801 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3800, best=0.61, avg=0.61, std=0.00, steps=1.401e+08
2023-07-07 21:07:42,939 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3900, best=0.62, avg=0.61, std=0.00, steps=1.438e+08
2023-07-07 21:08:16,233 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4000, best=0.62, avg=0.61, std=0.00, steps=1.475e+08
2023-07-07 21:08:49,429 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4100, best=0.62, avg=0.61, std=0.00, steps=1.512e+08
2023-07-07 21:09:22,749 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4200, best=0.62, avg=0.61, std=0.00, steps=1.549e+08
2023-07-07 21:09:56,182 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4300, best=0.62, avg=0.61, std=0.00, steps=1.586e+08
2023-07-07 21:10:29,471 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4400, best=0.62, avg=0.62, std=0.00, steps=1.622e+08
2023-07-07 21:11:02,675 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4500, best=0.62, avg=0.62, std=0.00, steps=1.659e+08
2023-07-07 21:11:35,923 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4600, best=0.62, avg=0.62, std=0.00, steps=1.696e+08
2023-07-07 21:12:09,110 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4700, best=0.62, avg=0.62, std=0.00, steps=1.733e+08
2023-07-07 21:12:42,449 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4800, best=0.62, avg=0.62, std=0.00, steps=1.770e+08
2023-07-07 21:13:15,650 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4900, best=0.62, avg=0.62, std=0.00, steps=1.807e+08
2023-07-07 21:13:48,817 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5000, best=0.62, avg=0.62, std=0.00, steps=1.844e+08
2023-07-07 21:14:21,995 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5100, best=0.63, avg=0.62, std=0.00, steps=1.880e+08
2023-07-07 21:14:55,277 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5200, best=0.62, avg=0.62, std=0.00, steps=1.917e+08
2023-07-07 21:15:28,434 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5300, best=0.63, avg=0.62, std=0.00, steps=1.954e+08
2023-07-07 21:16:01,643 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5400, best=0.63, avg=0.62, std=0.00, steps=1.991e+08
2023-07-07 21:16:34,855 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5500, best=0.63, avg=0.62, std=0.00, steps=2.028e+08
2023-07-07 21:17:07,990 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5600, best=0.63, avg=0.62, std=0.00, steps=2.065e+08
2023-07-07 21:17:41,187 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5700, best=0.63, avg=0.62, std=0.00, steps=2.102e+08
2023-07-07 21:18:14,347 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5800, best=0.63, avg=0.62, std=0.00, steps=2.138e+08
2023-07-07 21:18:47,482 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5900, best=0.63, avg=0.62, std=0.00, steps=2.175e+08
2023-07-07 21:19:20,687 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6000, best=0.63, avg=0.62, std=0.00, steps=2.212e+08
2023-07-07 21:19:53,960 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6100, best=0.63, avg=0.62, std=0.00, steps=2.249e+08
2023-07-07 21:20:27,121 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6200, best=0.63, avg=0.62, std=0.00, steps=2.286e+08
2023-07-07 21:21:00,351 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6300, best=0.63, avg=0.62, std=0.00, steps=2.323e+08
2023-07-07 21:21:33,564 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6400, best=0.63, avg=0.62, std=0.00, steps=2.360e+08
2023-07-07 21:22:06,833 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6500, best=0.63, avg=0.63, std=0.00, steps=2.397e+08
2023-07-07 21:22:40,055 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6600, best=0.63, avg=0.63, std=0.00, steps=2.433e+08
2023-07-07 21:23:13,375 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6700, best=0.63, avg=0.63, std=0.00, steps=2.470e+08
2023-07-07 21:23:46,617 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6800, best=0.63, avg=0.63, std=0.00, steps=2.507e+08
2023-07-07 21:24:19,918 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6900, best=0.63, avg=0.63, std=0.00, steps=2.544e+08
2023-07-07 21:24:53,189 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7000, best=0.63, avg=0.63, std=0.00, steps=2.581e+08
2023-07-07 21:25:26,463 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7100, best=0.63, avg=0.63, std=0.00, steps=2.618e+08
2023-07-07 21:25:59,831 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7200, best=0.63, avg=0.63, std=0.00, steps=2.655e+08
2023-07-07 21:26:33,222 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7300, best=0.63, avg=0.63, std=0.00, steps=2.691e+08
2023-07-07 21:27:06,407 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7400, best=0.63, avg=0.63, std=0.00, steps=2.728e+08
2023-07-07 21:27:39,522 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7500, best=0.63, avg=0.63, std=0.00, steps=2.765e+08
2023-07-07 21:28:12,623 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7600, best=0.63, avg=0.63, std=0.00, steps=2.802e+08
2023-07-07 21:28:45,785 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7700, best=0.63, avg=0.63, std=0.00, steps=2.839e+08
2023-07-07 21:29:19,110 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7800, best=0.64, avg=0.63, std=0.00, steps=2.876e+08
2023-07-07 21:29:52,359 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7900, best=0.64, avg=0.63, std=0.00, steps=2.913e+08
2023-07-07 21:30:25,737 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8000, best=0.64, avg=0.63, std=0.00, steps=2.949e+08
2023-07-07 21:30:59,016 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8100, best=0.64, avg=0.63, std=0.00, steps=2.986e+08
2023-07-07 21:31:32,157 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8200, best=0.64, avg=0.63, std=0.00, steps=3.023e+08
2023-07-07 21:32:05,591 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8300, best=0.64, avg=0.63, std=0.00, steps=3.060e+08
2023-07-07 21:32:38,897 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8400, best=0.64, avg=0.63, std=0.00, steps=3.097e+08
2023-07-07 21:33:12,346 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8500, best=0.64, avg=0.63, std=0.00, steps=3.134e+08
2023-07-07 21:33:45,611 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8600, best=0.64, avg=0.63, std=0.00, steps=3.171e+08
2023-07-07 21:34:19,105 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8700, best=0.64, avg=0.63, std=0.00, steps=3.208e+08
2023-07-07 21:34:52,568 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8800, best=0.64, avg=0.63, std=0.00, steps=3.244e+08
2023-07-07 21:35:25,778 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8900, best=0.64, avg=0.63, std=0.00, steps=3.281e+08
2023-07-07 21:35:58,948 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9000, best=0.64, avg=0.63, std=0.00, steps=3.318e+08
2023-07-07 21:36:32,252 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9100, best=0.64, avg=0.63, std=0.00, steps=3.355e+08
2023-07-07 21:37:05,691 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9200, best=0.64, avg=0.63, std=0.00, steps=3.392e+08
2023-07-07 21:37:39,022 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9300, best=0.64, avg=0.63, std=0.00, steps=3.429e+08
2023-07-07 21:38:12,426 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9400, best=0.64, avg=0.64, std=0.00, steps=3.466e+08
2023-07-07 21:38:45,700 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9500, best=0.64, avg=0.64, std=0.00, steps=3.502e+08
2023-07-07 21:39:19,152 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9600, best=0.64, avg=0.64, std=0.00, steps=3.539e+08
2023-07-07 21:39:52,564 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9700, best=0.64, avg=0.64, std=0.00, steps=3.576e+08
2023-07-07 21:40:25,905 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9800, best=0.64, avg=0.64, std=0.00, steps=3.613e+08
2023-07-07 21:40:59,192 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9900, best=0.64, avg=0.64, std=0.00, steps=3.650e+08
2023-07-07 21:41:32,600 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10000, best=0.64, avg=0.64, std=0.00, steps=3.687e+08
2023-07-07 21:42:05,999 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10100, best=0.64, avg=0.64, std=0.00, steps=3.724e+08
2023-07-07 21:42:39,222 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10200, best=0.64, avg=0.64, std=0.00, steps=3.760e+08
2023-07-07 21:43:12,426 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10300, best=0.64, avg=0.64, std=0.00, steps=3.797e+08
2023-07-07 21:43:45,671 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10400, best=0.64, avg=0.64, std=0.00, steps=3.834e+08
2023-07-07 21:44:18,904 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10500, best=0.64, avg=0.64, std=0.00, steps=3.871e+08
2023-07-07 21:44:52,259 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10600, best=0.64, avg=0.64, std=0.00, steps=3.908e+08
2023-07-07 21:45:25,672 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10700, best=0.64, avg=0.64, std=0.00, steps=3.945e+08
2023-07-07 21:45:59,121 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10800, best=0.64, avg=0.64, std=0.00, steps=3.982e+08
2023-07-07 21:46:32,496 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10900, best=0.64, avg=0.64, std=0.00, steps=4.019e+08
2023-07-07 21:47:05,901 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11000, best=0.64, avg=0.64, std=0.00, steps=4.055e+08
2023-07-07 21:47:39,349 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11100, best=0.64, avg=0.64, std=0.00, steps=4.092e+08
2023-07-07 21:48:12,744 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11200, best=0.64, avg=0.64, std=0.00, steps=4.129e+08
2023-07-07 21:48:46,275 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11300, best=0.64, avg=0.64, std=0.00, steps=4.166e+08
2023-07-07 21:49:19,718 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11400, best=0.64, avg=0.64, std=0.00, steps=4.203e+08
2023-07-07 21:49:53,128 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11500, best=0.64, avg=0.64, std=0.00, steps=4.240e+08
2023-07-07 21:50:26,505 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11600, best=0.65, avg=0.64, std=0.00, steps=4.277e+08
2023-07-07 21:50:59,932 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11700, best=0.65, avg=0.64, std=0.00, steps=4.313e+08
2023-07-07 21:51:33,229 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11800, best=0.65, avg=0.64, std=0.00, steps=4.350e+08
2023-07-07 21:52:06,547 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11900, best=0.64, avg=0.64, std=0.00, steps=4.387e+08
2023-07-07 21:52:39,583 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11999, best=0.65, avg=0.64, std=0.00, steps=4.424e+08
2023-07-07 21:52:39,584 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135823
2023-07-07 21:52:39,609 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 21:52:39,641 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 21:53:19,080 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=4.137e+06
2023-07-07 21:53:55,979 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 200, best=0.51, avg=0.50, std=0.00, steps=8.233e+06
2023-07-07 21:54:32,892 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 300, best=0.51, avg=0.50, std=0.00, steps=1.233e+07
2023-07-07 21:55:09,969 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 400, best=0.51, avg=0.50, std=0.00, steps=1.642e+07
2023-07-07 21:55:47,080 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 500, best=0.51, avg=0.50, std=0.00, steps=2.052e+07
2023-07-07 21:56:24,094 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 600, best=0.51, avg=0.50, std=0.00, steps=2.462e+07
2023-07-07 21:57:01,065 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 700, best=0.51, avg=0.50, std=0.00, steps=2.871e+07
2023-07-07 21:57:38,084 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 800, best=0.51, avg=0.50, std=0.00, steps=3.281e+07
2023-07-07 21:58:15,133 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 900, best=0.51, avg=0.50, std=0.00, steps=3.690e+07
2023-07-07 21:58:52,149 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1000, best=0.51, avg=0.50, std=0.00, steps=4.100e+07
2023-07-07 21:59:29,112 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1100, best=0.51, avg=0.50, std=0.00, steps=4.510e+07
2023-07-07 22:00:06,087 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1200, best=0.51, avg=0.50, std=0.00, steps=4.919e+07
2023-07-07 22:00:42,973 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1300, best=0.51, avg=0.50, std=0.00, steps=5.329e+07
2023-07-07 22:01:19,937 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1400, best=0.51, avg=0.50, std=0.00, steps=5.738e+07
2023-07-07 22:01:56,833 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1500, best=0.51, avg=0.50, std=0.00, steps=6.148e+07
2023-07-07 22:02:33,764 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1600, best=0.51, avg=0.50, std=0.00, steps=6.558e+07
2023-07-07 22:03:10,868 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1700, best=0.51, avg=0.50, std=0.00, steps=6.967e+07
2023-07-07 22:03:47,910 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1800, best=0.51, avg=0.50, std=0.00, steps=7.377e+07
2023-07-07 22:04:25,006 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1900, best=0.51, avg=0.50, std=0.00, steps=7.786e+07
2023-07-07 22:05:02,159 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2000, best=0.51, avg=0.50, std=0.00, steps=8.196e+07
2023-07-07 22:05:39,254 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2100, best=0.51, avg=0.50, std=0.00, steps=8.606e+07
2023-07-07 22:06:16,265 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2200, best=0.51, avg=0.50, std=0.00, steps=9.015e+07
2023-07-07 22:06:53,240 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2300, best=0.51, avg=0.50, std=0.00, steps=9.425e+07
2023-07-07 22:07:30,252 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2400, best=0.51, avg=0.50, std=0.00, steps=9.834e+07
2023-07-07 22:08:07,250 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2500, best=0.51, avg=0.50, std=0.00, steps=1.024e+08
2023-07-07 22:08:44,097 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2600, best=0.51, avg=0.50, std=0.00, steps=1.065e+08
2023-07-07 22:09:21,041 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2700, best=0.51, avg=0.50, std=0.00, steps=1.106e+08
2023-07-07 22:09:58,153 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2800, best=0.51, avg=0.50, std=0.00, steps=1.147e+08
2023-07-07 22:10:35,089 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2900, best=0.51, avg=0.50, std=0.00, steps=1.188e+08
2023-07-07 22:11:11,958 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3000, best=0.51, avg=0.50, std=0.00, steps=1.229e+08
2023-07-07 22:11:48,775 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3100, best=0.51, avg=0.50, std=0.00, steps=1.270e+08
2023-07-07 22:12:25,552 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3200, best=0.51, avg=0.50, std=0.00, steps=1.311e+08
2023-07-07 22:13:02,361 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3300, best=0.51, avg=0.50, std=0.00, steps=1.352e+08
2023-07-07 22:13:39,189 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3400, best=0.51, avg=0.50, std=0.00, steps=1.393e+08
2023-07-07 22:14:16,053 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3500, best=0.51, avg=0.50, std=0.00, steps=1.434e+08
2023-07-07 22:14:52,903 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3600, best=0.51, avg=0.50, std=0.00, steps=1.475e+08
2023-07-07 22:15:29,773 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3700, best=0.51, avg=0.50, std=0.00, steps=1.516e+08
2023-07-07 22:16:06,622 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3800, best=0.51, avg=0.50, std=0.00, steps=1.557e+08
2023-07-07 22:16:43,460 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3900, best=0.51, avg=0.50, std=0.00, steps=1.598e+08
2023-07-07 22:17:20,237 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4000, best=0.51, avg=0.50, std=0.00, steps=1.639e+08
2023-07-07 22:17:57,099 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4100, best=0.51, avg=0.50, std=0.00, steps=1.680e+08
2023-07-07 22:18:33,962 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4200, best=0.51, avg=0.50, std=0.00, steps=1.721e+08
2023-07-07 22:19:10,856 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4300, best=0.51, avg=0.50, std=0.00, steps=1.762e+08
2023-07-07 22:19:47,733 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4400, best=0.51, avg=0.50, std=0.00, steps=1.803e+08
2023-07-07 22:20:24,747 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4500, best=0.51, avg=0.50, std=0.00, steps=1.844e+08
2023-07-07 22:21:01,640 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4600, best=0.51, avg=0.50, std=0.00, steps=1.885e+08
2023-07-07 22:21:38,521 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4700, best=0.51, avg=0.50, std=0.00, steps=1.926e+08
2023-07-07 22:22:15,347 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4800, best=0.51, avg=0.50, std=0.00, steps=1.966e+08
2023-07-07 22:22:52,264 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4900, best=0.51, avg=0.50, std=0.00, steps=2.007e+08
2023-07-07 22:23:29,052 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5000, best=0.51, avg=0.50, std=0.00, steps=2.048e+08
2023-07-07 22:24:05,895 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5100, best=0.51, avg=0.50, std=0.00, steps=2.089e+08
2023-07-07 22:24:42,735 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5200, best=0.51, avg=0.50, std=0.00, steps=2.130e+08
2023-07-07 22:25:19,685 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5300, best=0.51, avg=0.50, std=0.00, steps=2.171e+08
2023-07-07 22:25:56,501 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5400, best=0.51, avg=0.50, std=0.00, steps=2.212e+08
2023-07-07 22:26:33,317 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5500, best=0.51, avg=0.50, std=0.00, steps=2.253e+08
2023-07-07 22:27:10,143 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5600, best=0.50, avg=0.50, std=0.00, steps=2.294e+08
2023-07-07 22:27:46,942 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5700, best=0.51, avg=0.50, std=0.00, steps=2.335e+08
2023-07-07 22:28:23,804 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5800, best=0.51, avg=0.50, std=0.00, steps=2.376e+08
2023-07-07 22:29:00,784 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5900, best=0.51, avg=0.50, std=0.00, steps=2.417e+08
2023-07-07 22:29:37,690 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6000, best=0.51, avg=0.50, std=0.00, steps=2.458e+08
2023-07-07 22:30:14,600 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6100, best=0.51, avg=0.50, std=0.00, steps=2.499e+08
2023-07-07 22:30:51,475 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6200, best=0.51, avg=0.50, std=0.00, steps=2.540e+08
2023-07-07 22:31:28,301 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6300, best=0.51, avg=0.50, std=0.00, steps=2.581e+08
2023-07-07 22:32:05,066 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6400, best=0.51, avg=0.50, std=0.00, steps=2.622e+08
2023-07-07 22:32:41,871 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6500, best=0.51, avg=0.50, std=0.00, steps=2.663e+08
2023-07-07 22:33:18,677 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6600, best=0.51, avg=0.50, std=0.00, steps=2.704e+08
2023-07-07 22:33:55,514 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6700, best=0.51, avg=0.50, std=0.00, steps=2.745e+08
2023-07-07 22:34:32,397 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6800, best=0.51, avg=0.50, std=0.00, steps=2.786e+08
2023-07-07 22:35:09,372 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6900, best=0.51, avg=0.50, std=0.00, steps=2.827e+08
2023-07-07 22:35:46,149 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7000, best=0.51, avg=0.50, std=0.00, steps=2.868e+08
2023-07-07 22:36:22,959 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7100, best=0.51, avg=0.50, std=0.00, steps=2.909e+08
2023-07-07 22:36:59,815 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7200, best=0.51, avg=0.50, std=0.00, steps=2.950e+08
2023-07-07 22:37:36,620 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7300, best=0.51, avg=0.50, std=0.00, steps=2.990e+08
2023-07-07 22:38:13,464 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7400, best=0.51, avg=0.50, std=0.00, steps=3.031e+08
2023-07-07 22:38:50,365 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7500, best=0.51, avg=0.50, std=0.00, steps=3.072e+08
2023-07-07 22:39:27,191 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7600, best=0.56, avg=0.56, std=0.00, steps=3.113e+08
2023-07-07 22:40:04,175 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7700, best=0.57, avg=0.56, std=0.00, steps=3.154e+08
2023-07-07 22:40:41,089 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7800, best=0.57, avg=0.56, std=0.00, steps=3.195e+08
2023-07-07 22:41:17,995 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7900, best=0.57, avg=0.56, std=0.00, steps=3.236e+08
2023-07-07 22:41:54,907 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8000, best=0.57, avg=0.56, std=0.00, steps=3.277e+08
2023-07-07 22:42:31,765 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8100, best=0.57, avg=0.57, std=0.00, steps=3.318e+08
2023-07-07 22:43:08,696 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8200, best=0.57, avg=0.57, std=0.00, steps=3.359e+08
2023-07-07 22:43:45,525 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8300, best=0.57, avg=0.57, std=0.00, steps=3.400e+08
2023-07-07 22:44:22,375 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8400, best=0.57, avg=0.57, std=0.00, steps=3.441e+08
2023-07-07 22:44:59,266 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8500, best=0.57, avg=0.57, std=0.00, steps=3.482e+08
2023-07-07 22:45:36,136 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8600, best=0.57, avg=0.57, std=0.00, steps=3.523e+08
2023-07-07 22:46:13,024 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8700, best=0.58, avg=0.57, std=0.00, steps=3.564e+08
2023-07-07 22:46:49,941 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8800, best=0.58, avg=0.57, std=0.00, steps=3.605e+08
2023-07-07 22:47:26,875 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8900, best=0.58, avg=0.57, std=0.00, steps=3.646e+08
2023-07-07 22:48:03,660 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9000, best=0.58, avg=0.57, std=0.00, steps=3.687e+08
2023-07-07 22:48:40,462 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9100, best=0.57, avg=0.57, std=0.00, steps=3.728e+08
2023-07-07 22:49:17,312 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9200, best=0.58, avg=0.57, std=0.00, steps=3.769e+08
2023-07-07 22:49:54,240 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9300, best=0.58, avg=0.57, std=0.00, steps=3.810e+08
2023-07-07 22:50:31,028 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9400, best=0.57, avg=0.57, std=0.00, steps=3.851e+08
2023-07-07 22:51:07,785 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9500, best=0.58, avg=0.57, std=0.00, steps=3.892e+08
2023-07-07 22:51:44,712 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9600, best=0.58, avg=0.57, std=0.00, steps=3.933e+08
2023-07-07 22:52:21,620 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9700, best=0.58, avg=0.57, std=0.00, steps=3.974e+08
2023-07-07 22:52:58,458 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9800, best=0.58, avg=0.57, std=0.00, steps=4.014e+08
2023-07-07 22:53:35,387 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9900, best=0.58, avg=0.57, std=0.00, steps=4.055e+08
2023-07-07 22:54:12,281 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10000, best=0.58, avg=0.57, std=0.00, steps=4.096e+08
2023-07-07 22:54:49,108 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10100, best=0.58, avg=0.57, std=0.00, steps=4.137e+08
2023-07-07 22:55:25,978 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10200, best=0.58, avg=0.57, std=0.00, steps=4.178e+08
2023-07-07 22:56:02,738 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10300, best=0.57, avg=0.57, std=0.00, steps=4.219e+08
2023-07-07 22:56:39,717 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10400, best=0.58, avg=0.57, std=0.00, steps=4.260e+08
2023-07-07 22:57:16,607 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10500, best=0.58, avg=0.57, std=0.00, steps=4.301e+08
2023-07-07 22:57:53,570 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10600, best=0.58, avg=0.57, std=0.00, steps=4.342e+08
2023-07-07 22:58:30,423 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10700, best=0.57, avg=0.57, std=0.00, steps=4.383e+08
2023-07-07 22:59:07,339 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10800, best=0.58, avg=0.57, std=0.00, steps=4.424e+08
2023-07-07 22:59:44,234 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10900, best=0.58, avg=0.57, std=0.00, steps=4.465e+08
2023-07-07 23:00:21,100 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11000, best=0.58, avg=0.57, std=0.00, steps=4.506e+08
2023-07-07 23:00:57,999 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11100, best=0.58, avg=0.57, std=0.00, steps=4.547e+08
2023-07-07 23:01:34,819 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11200, best=0.58, avg=0.57, std=0.00, steps=4.588e+08
2023-07-07 23:02:11,637 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11300, best=0.58, avg=0.57, std=0.00, steps=4.629e+08
2023-07-07 23:02:48,460 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11400, best=0.58, avg=0.57, std=0.00, steps=4.670e+08
2023-07-07 23:03:25,237 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11500, best=0.58, avg=0.57, std=0.00, steps=4.711e+08
2023-07-07 23:04:02,162 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11600, best=0.58, avg=0.57, std=0.00, steps=4.752e+08
2023-07-07 23:04:39,057 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11700, best=0.58, avg=0.57, std=0.00, steps=4.793e+08
2023-07-07 23:05:15,893 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11800, best=0.58, avg=0.57, std=0.00, steps=4.834e+08
2023-07-07 23:05:52,739 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11900, best=0.58, avg=0.57, std=0.00, steps=4.875e+08
2023-07-07 23:06:29,216 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11999, best=0.58, avg=0.57, std=0.00, steps=4.915e+08
2023-07-07 23:06:29,217 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135823
