2023-07-07 13:59:06,958 -        meta learning: [    INFO] - [INFO] checkpoint saved to: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135906
2023-07-07 13:59:06,958 -        meta learning: [    INFO] - [INFO] tensorboard dir set to: ./runs/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135906
2023-07-07 13:59:06,958 -        meta learning: [    INFO] - [ARGS]: Namespace(policy='BatchedGruMetaStdpMLPPolicy', algo='PGPE', task='SeqTask', seq_length=20, latency=24, num_cls=5, feature_dims=14, sigma=0.1, batch_size=512, hidden_dims=[128], pop_size=256, center_lr=0.01, init_std=0.04, decay_std=0.999, limit_std=0.001, std_lr=0.07, terminate_when_unhealthy=False, max_iters=12000, num_tasks=1, seed=50, num_tests=128, eval_epoch=100, eval=False, eval_with_injury=False, resume='', save=False, repeat=1, root_dir='/data/anonymous/meta', tensorboard_dir='./runs', suffix='', output_dir='/data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135906', summary_writer=<torch.utils.tensorboard.writer.SummaryWriter object at 0x7f133c03dd00>, tb_prefix='PGPE/SeqTask/BatchedGruMetaStdpMLPPolicy')
2023-07-07 13:59:10,137 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 13:59:10,202 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 13:59:18,162 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 100, best=0.71, avg=0.70, std=0.01, steps=4.137e+05
2023-07-07 13:59:22,151 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 200, best=0.79, avg=0.78, std=0.01, steps=8.233e+05
2023-07-07 13:59:26,149 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 300, best=0.86, avg=0.84, std=0.01, steps=1.233e+06
2023-07-07 13:59:30,187 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 400, best=0.89, avg=0.88, std=0.01, steps=1.642e+06
2023-07-07 13:59:34,202 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 500, best=0.91, avg=0.90, std=0.00, steps=2.052e+06
2023-07-07 13:59:38,240 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 600, best=0.94, avg=0.92, std=0.00, steps=2.462e+06
2023-07-07 13:59:42,251 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 700, best=0.96, avg=0.95, std=0.00, steps=2.871e+06
2023-07-07 13:59:46,242 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 800, best=0.96, avg=0.95, std=0.00, steps=3.281e+06
2023-07-07 13:59:50,215 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 900, best=0.97, avg=0.97, std=0.00, steps=3.690e+06
2023-07-07 13:59:54,170 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1000, best=0.98, avg=0.97, std=0.00, steps=4.100e+06
2023-07-07 13:59:58,095 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1100, best=0.99, avg=0.98, std=0.00, steps=4.510e+06
2023-07-07 14:00:02,032 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1200, best=0.99, avg=0.99, std=0.00, steps=4.919e+06
2023-07-07 14:00:05,981 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1300, best=1.00, avg=1.00, std=0.00, steps=5.329e+06
2023-07-07 14:00:09,933 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1400, best=1.00, avg=1.00, std=0.00, steps=5.738e+06
2023-07-07 14:00:13,869 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1500, best=1.00, avg=1.00, std=0.00, steps=6.148e+06
2023-07-07 14:00:17,813 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1600, best=1.00, avg=1.00, std=0.00, steps=6.558e+06
2023-07-07 14:00:21,756 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1700, best=1.00, avg=1.00, std=0.00, steps=6.967e+06
2023-07-07 14:00:25,705 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1800, best=1.00, avg=1.00, std=0.00, steps=7.377e+06
2023-07-07 14:00:29,635 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 1900, best=1.00, avg=1.00, std=0.00, steps=7.786e+06
2023-07-07 14:00:33,563 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2000, best=1.00, avg=1.00, std=0.00, steps=8.196e+06
2023-07-07 14:00:37,503 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2100, best=1.00, avg=1.00, std=0.00, steps=8.606e+06
2023-07-07 14:00:41,462 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2200, best=1.00, avg=1.00, std=0.00, steps=9.015e+06
2023-07-07 14:00:45,423 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2300, best=1.00, avg=1.00, std=0.00, steps=9.425e+06
2023-07-07 14:00:49,378 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2400, best=1.00, avg=1.00, std=0.00, steps=9.834e+06
2023-07-07 14:00:53,339 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2500, best=1.00, avg=1.00, std=0.00, steps=1.024e+07
2023-07-07 14:00:57,298 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2600, best=1.00, avg=1.00, std=0.00, steps=1.065e+07
2023-07-07 14:01:01,248 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2700, best=1.00, avg=1.00, std=0.00, steps=1.106e+07
2023-07-07 14:01:05,182 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2800, best=1.00, avg=1.00, std=0.00, steps=1.147e+07
2023-07-07 14:01:09,130 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 2900, best=1.00, avg=1.00, std=0.00, steps=1.188e+07
2023-07-07 14:01:13,074 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3000, best=1.00, avg=1.00, std=0.00, steps=1.229e+07
2023-07-07 14:01:17,013 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3100, best=1.00, avg=1.00, std=0.00, steps=1.270e+07
2023-07-07 14:01:20,962 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3200, best=1.00, avg=1.00, std=0.00, steps=1.311e+07
2023-07-07 14:01:24,911 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3300, best=1.00, avg=1.00, std=0.00, steps=1.352e+07
2023-07-07 14:01:28,850 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3400, best=1.00, avg=1.00, std=0.00, steps=1.393e+07
2023-07-07 14:01:32,824 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3500, best=1.00, avg=1.00, std=0.00, steps=1.434e+07
2023-07-07 14:01:36,792 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3600, best=1.00, avg=1.00, std=0.00, steps=1.475e+07
2023-07-07 14:01:40,755 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3700, best=1.00, avg=1.00, std=0.00, steps=1.516e+07
2023-07-07 14:01:44,715 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3800, best=1.00, avg=1.00, std=0.00, steps=1.557e+07
2023-07-07 14:01:48,705 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 3900, best=1.00, avg=1.00, std=0.00, steps=1.598e+07
2023-07-07 14:01:52,689 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4000, best=1.00, avg=1.00, std=0.00, steps=1.639e+07
2023-07-07 14:01:56,653 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4100, best=1.00, avg=1.00, std=0.00, steps=1.680e+07
2023-07-07 14:02:00,591 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4200, best=1.00, avg=1.00, std=0.00, steps=1.721e+07
2023-07-07 14:02:04,528 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4300, best=1.00, avg=1.00, std=0.00, steps=1.762e+07
2023-07-07 14:02:08,470 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4400, best=1.00, avg=1.00, std=0.00, steps=1.803e+07
2023-07-07 14:02:12,416 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4500, best=1.00, avg=1.00, std=0.00, steps=1.844e+07
2023-07-07 14:02:16,354 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4600, best=1.00, avg=1.00, std=0.00, steps=1.885e+07
2023-07-07 14:02:20,306 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4700, best=1.00, avg=1.00, std=0.00, steps=1.926e+07
2023-07-07 14:02:24,273 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4800, best=1.00, avg=1.00, std=0.00, steps=1.966e+07
2023-07-07 14:02:28,242 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 4900, best=1.00, avg=1.00, std=0.00, steps=2.007e+07
2023-07-07 14:02:32,185 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5000, best=1.00, avg=1.00, std=0.00, steps=2.048e+07
2023-07-07 14:02:36,135 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5100, best=1.00, avg=1.00, std=0.00, steps=2.089e+07
2023-07-07 14:02:40,092 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5200, best=1.00, avg=1.00, std=0.00, steps=2.130e+07
2023-07-07 14:02:44,043 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5300, best=1.00, avg=1.00, std=0.00, steps=2.171e+07
2023-07-07 14:02:48,018 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5400, best=1.00, avg=1.00, std=0.00, steps=2.212e+07
2023-07-07 14:02:51,974 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5500, best=1.00, avg=1.00, std=0.00, steps=2.253e+07
2023-07-07 14:02:55,912 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5600, best=1.00, avg=1.00, std=0.00, steps=2.294e+07
2023-07-07 14:02:59,847 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5700, best=1.00, avg=1.00, std=0.00, steps=2.335e+07
2023-07-07 14:03:03,774 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5800, best=1.00, avg=1.00, std=0.00, steps=2.376e+07
2023-07-07 14:03:07,702 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 5900, best=1.00, avg=1.00, std=0.00, steps=2.417e+07
2023-07-07 14:03:11,657 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6000, best=1.00, avg=1.00, std=0.00, steps=2.458e+07
2023-07-07 14:03:15,610 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6100, best=1.00, avg=1.00, std=0.00, steps=2.499e+07
2023-07-07 14:03:19,573 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6200, best=1.00, avg=1.00, std=0.00, steps=2.540e+07
2023-07-07 14:03:23,528 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6300, best=1.00, avg=1.00, std=0.00, steps=2.581e+07
2023-07-07 14:03:27,480 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6400, best=1.00, avg=1.00, std=0.00, steps=2.622e+07
2023-07-07 14:03:31,463 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6500, best=1.00, avg=1.00, std=0.00, steps=2.663e+07
2023-07-07 14:03:35,420 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6600, best=1.00, avg=1.00, std=0.00, steps=2.704e+07
2023-07-07 14:03:39,376 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6700, best=1.00, avg=1.00, std=0.00, steps=2.745e+07
2023-07-07 14:03:43,319 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6800, best=1.00, avg=1.00, std=0.00, steps=2.786e+07
2023-07-07 14:03:47,267 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 6900, best=1.00, avg=1.00, std=0.00, steps=2.827e+07
2023-07-07 14:03:51,247 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7000, best=1.00, avg=1.00, std=0.00, steps=2.868e+07
2023-07-07 14:03:55,189 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7100, best=1.00, avg=1.00, std=0.00, steps=2.909e+07
2023-07-07 14:03:59,130 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7200, best=1.00, avg=1.00, std=0.00, steps=2.950e+07
2023-07-07 14:04:03,080 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7300, best=1.00, avg=1.00, std=0.00, steps=2.990e+07
2023-07-07 14:04:07,035 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7400, best=1.00, avg=1.00, std=0.00, steps=3.031e+07
2023-07-07 14:04:10,983 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7500, best=1.00, avg=1.00, std=0.00, steps=3.072e+07
2023-07-07 14:04:14,925 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7600, best=1.00, avg=1.00, std=0.00, steps=3.113e+07
2023-07-07 14:04:18,853 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7700, best=1.00, avg=1.00, std=0.00, steps=3.154e+07
2023-07-07 14:04:22,776 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7800, best=1.00, avg=1.00, std=0.00, steps=3.195e+07
2023-07-07 14:04:26,733 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 7900, best=1.00, avg=1.00, std=0.00, steps=3.236e+07
2023-07-07 14:04:30,701 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8000, best=1.00, avg=1.00, std=0.00, steps=3.277e+07
2023-07-07 14:04:34,662 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8100, best=1.00, avg=1.00, std=0.00, steps=3.318e+07
2023-07-07 14:04:38,598 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8200, best=1.00, avg=1.00, std=0.00, steps=3.359e+07
2023-07-07 14:04:42,563 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8300, best=1.00, avg=1.00, std=0.00, steps=3.400e+07
2023-07-07 14:04:46,508 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8400, best=1.00, avg=1.00, std=0.00, steps=3.441e+07
2023-07-07 14:04:50,443 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8500, best=1.00, avg=1.00, std=0.00, steps=3.482e+07
2023-07-07 14:04:54,419 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8600, best=1.00, avg=1.00, std=0.00, steps=3.523e+07
2023-07-07 14:04:58,402 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8700, best=1.00, avg=1.00, std=0.00, steps=3.564e+07
2023-07-07 14:05:02,357 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8800, best=1.00, avg=1.00, std=0.00, steps=3.605e+07
2023-07-07 14:05:06,301 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 8900, best=1.00, avg=1.00, std=0.00, steps=3.646e+07
2023-07-07 14:05:10,240 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9000, best=1.00, avg=1.00, std=0.00, steps=3.687e+07
2023-07-07 14:05:14,176 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9100, best=1.00, avg=1.00, std=0.00, steps=3.728e+07
2023-07-07 14:05:18,122 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9200, best=1.00, avg=1.00, std=0.00, steps=3.769e+07
2023-07-07 14:05:22,056 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9300, best=1.00, avg=1.00, std=0.00, steps=3.810e+07
2023-07-07 14:05:25,987 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9400, best=1.00, avg=1.00, std=0.00, steps=3.851e+07
2023-07-07 14:05:29,934 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9500, best=1.00, avg=1.00, std=0.00, steps=3.892e+07
2023-07-07 14:05:33,874 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9600, best=1.00, avg=1.00, std=0.00, steps=3.933e+07
2023-07-07 14:05:37,820 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9700, best=1.00, avg=1.00, std=0.00, steps=3.974e+07
2023-07-07 14:05:41,797 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9800, best=1.00, avg=1.00, std=0.00, steps=4.014e+07
2023-07-07 14:05:45,747 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 9900, best=1.00, avg=1.00, std=0.00, steps=4.055e+07
2023-07-07 14:05:49,682 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10000, best=1.00, avg=1.00, std=0.00, steps=4.096e+07
2023-07-07 14:05:53,606 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10100, best=1.00, avg=1.00, std=0.00, steps=4.137e+07
2023-07-07 14:05:57,538 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10200, best=1.00, avg=1.00, std=0.00, steps=4.178e+07
2023-07-07 14:06:01,467 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10300, best=1.00, avg=1.00, std=0.00, steps=4.219e+07
2023-07-07 14:06:05,394 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10400, best=1.00, avg=1.00, std=0.00, steps=4.260e+07
2023-07-07 14:06:09,319 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10500, best=1.00, avg=1.00, std=0.00, steps=4.301e+07
2023-07-07 14:06:13,249 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10600, best=1.00, avg=1.00, std=0.00, steps=4.342e+07
2023-07-07 14:06:17,179 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10700, best=1.00, avg=1.00, std=0.00, steps=4.383e+07
2023-07-07 14:06:21,104 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10800, best=1.00, avg=1.00, std=0.00, steps=4.424e+07
2023-07-07 14:06:25,043 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 10900, best=1.00, avg=1.00, std=0.00, steps=4.465e+07
2023-07-07 14:06:28,987 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11000, best=1.00, avg=1.00, std=0.00, steps=4.506e+07
2023-07-07 14:06:32,952 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11100, best=1.00, avg=1.00, std=0.00, steps=4.547e+07
2023-07-07 14:06:36,899 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11200, best=1.00, avg=1.00, std=0.00, steps=4.588e+07
2023-07-07 14:06:40,829 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11300, best=1.00, avg=1.00, std=0.00, steps=4.629e+07
2023-07-07 14:06:44,755 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11400, best=1.00, avg=1.00, std=0.00, steps=4.670e+07
2023-07-07 14:06:48,688 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11500, best=1.00, avg=1.00, std=0.00, steps=4.711e+07
2023-07-07 14:06:52,628 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11600, best=1.00, avg=1.00, std=0.00, steps=4.752e+07
2023-07-07 14:06:56,550 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11700, best=1.00, avg=1.00, std=0.00, steps=4.793e+07
2023-07-07 14:07:00,489 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11800, best=1.00, avg=1.00, std=0.00, steps=4.834e+07
2023-07-07 14:07:04,401 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11900, best=1.00, avg=1.00, std=0.00, steps=4.875e+07
2023-07-07 14:07:08,278 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 0, 0, [Train]: 11999, best=1.00, avg=1.00, std=0.00, steps=4.915e+07
2023-07-07 14:07:08,279 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135906
2023-07-07 14:07:08,304 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 14:07:08,335 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 14:07:16,299 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 100, best=0.65, avg=0.64, std=0.00, steps=6.205e+05
2023-07-07 14:07:22,077 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 200, best=0.71, avg=0.70, std=0.01, steps=1.235e+06
2023-07-07 14:07:27,854 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 300, best=0.74, avg=0.73, std=0.01, steps=1.849e+06
2023-07-07 14:07:33,610 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 400, best=0.76, avg=0.75, std=0.01, steps=2.464e+06
2023-07-07 14:07:39,362 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 500, best=0.78, avg=0.77, std=0.01, steps=3.078e+06
2023-07-07 14:07:45,124 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 600, best=0.80, avg=0.78, std=0.01, steps=3.693e+06
2023-07-07 14:07:50,894 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 700, best=0.81, avg=0.79, std=0.01, steps=4.307e+06
2023-07-07 14:07:56,665 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 800, best=0.83, avg=0.81, std=0.01, steps=4.921e+06
2023-07-07 14:08:02,450 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 900, best=0.84, avg=0.83, std=0.01, steps=5.536e+06
2023-07-07 14:08:08,206 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1000, best=0.85, avg=0.84, std=0.00, steps=6.150e+06
2023-07-07 14:08:13,942 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1100, best=0.86, avg=0.85, std=0.01, steps=6.765e+06
2023-07-07 14:08:19,718 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1200, best=0.87, avg=0.86, std=0.01, steps=7.379e+06
2023-07-07 14:08:25,494 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1300, best=0.88, avg=0.87, std=0.00, steps=7.993e+06
2023-07-07 14:08:31,244 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1400, best=0.88, avg=0.87, std=0.00, steps=8.608e+06
2023-07-07 14:08:36,992 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1500, best=0.88, avg=0.87, std=0.00, steps=9.222e+06
2023-07-07 14:08:42,743 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1600, best=0.89, avg=0.88, std=0.00, steps=9.837e+06
2023-07-07 14:08:48,473 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1700, best=0.89, avg=0.88, std=0.00, steps=1.045e+07
2023-07-07 14:08:54,226 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1800, best=0.89, avg=0.88, std=0.00, steps=1.107e+07
2023-07-07 14:08:59,971 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 1900, best=0.89, avg=0.88, std=0.00, steps=1.168e+07
2023-07-07 14:09:05,736 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2000, best=0.89, avg=0.88, std=0.00, steps=1.229e+07
2023-07-07 14:09:11,492 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2100, best=0.89, avg=0.88, std=0.00, steps=1.291e+07
2023-07-07 14:09:17,291 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2200, best=0.89, avg=0.89, std=0.00, steps=1.352e+07
2023-07-07 14:09:23,052 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2300, best=0.90, avg=0.88, std=0.00, steps=1.414e+07
2023-07-07 14:09:28,785 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2400, best=0.90, avg=0.89, std=0.00, steps=1.475e+07
2023-07-07 14:09:34,547 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2500, best=0.90, avg=0.89, std=0.00, steps=1.537e+07
2023-07-07 14:09:40,305 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2600, best=0.90, avg=0.89, std=0.00, steps=1.598e+07
2023-07-07 14:09:46,049 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2700, best=0.90, avg=0.89, std=0.00, steps=1.659e+07
2023-07-07 14:09:51,812 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2800, best=0.90, avg=0.89, std=0.00, steps=1.721e+07
2023-07-07 14:09:57,584 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 2900, best=0.90, avg=0.89, std=0.00, steps=1.782e+07
2023-07-07 14:10:03,344 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3000, best=0.90, avg=0.89, std=0.00, steps=1.844e+07
2023-07-07 14:10:09,131 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3100, best=0.90, avg=0.89, std=0.00, steps=1.905e+07
2023-07-07 14:10:14,899 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3200, best=0.90, avg=0.89, std=0.00, steps=1.967e+07
2023-07-07 14:10:20,661 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3300, best=0.90, avg=0.89, std=0.00, steps=2.028e+07
2023-07-07 14:10:26,430 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3400, best=0.90, avg=0.89, std=0.00, steps=2.090e+07
2023-07-07 14:10:32,178 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3500, best=0.90, avg=0.89, std=0.00, steps=2.151e+07
2023-07-07 14:10:37,928 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3600, best=0.90, avg=0.90, std=0.00, steps=2.212e+07
2023-07-07 14:10:43,696 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3700, best=0.91, avg=0.90, std=0.00, steps=2.274e+07
2023-07-07 14:10:49,460 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3800, best=0.91, avg=0.90, std=0.00, steps=2.335e+07
2023-07-07 14:10:55,203 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 3900, best=0.91, avg=0.90, std=0.00, steps=2.397e+07
2023-07-07 14:11:00,964 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4000, best=0.91, avg=0.90, std=0.00, steps=2.458e+07
2023-07-07 14:11:06,723 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4100, best=0.91, avg=0.90, std=0.00, steps=2.520e+07
2023-07-07 14:11:12,476 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4200, best=0.91, avg=0.90, std=0.00, steps=2.581e+07
2023-07-07 14:11:18,229 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4300, best=0.91, avg=0.90, std=0.00, steps=2.643e+07
2023-07-07 14:11:24,005 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4400, best=0.92, avg=0.90, std=0.00, steps=2.704e+07
2023-07-07 14:11:29,735 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4500, best=0.91, avg=0.90, std=0.00, steps=2.765e+07
2023-07-07 14:11:35,474 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4600, best=0.92, avg=0.90, std=0.00, steps=2.827e+07
2023-07-07 14:11:41,213 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4700, best=0.91, avg=0.90, std=0.00, steps=2.888e+07
2023-07-07 14:11:46,957 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4800, best=0.92, avg=0.90, std=0.00, steps=2.950e+07
2023-07-07 14:11:52,708 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 4900, best=0.92, avg=0.90, std=0.00, steps=3.011e+07
2023-07-07 14:11:58,450 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5000, best=0.91, avg=0.90, std=0.00, steps=3.073e+07
2023-07-07 14:12:04,180 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5100, best=0.92, avg=0.90, std=0.00, steps=3.134e+07
2023-07-07 14:12:09,921 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5200, best=0.91, avg=0.90, std=0.00, steps=3.195e+07
2023-07-07 14:12:15,650 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5300, best=0.91, avg=0.90, std=0.00, steps=3.257e+07
2023-07-07 14:12:21,393 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5400, best=0.92, avg=0.90, std=0.00, steps=3.318e+07
2023-07-07 14:12:27,133 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5500, best=0.91, avg=0.90, std=0.00, steps=3.380e+07
2023-07-07 14:12:32,865 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5600, best=0.92, avg=0.90, std=0.00, steps=3.441e+07
2023-07-07 14:12:38,623 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5700, best=0.91, avg=0.90, std=0.00, steps=3.503e+07
2023-07-07 14:12:44,363 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5800, best=0.91, avg=0.90, std=0.00, steps=3.564e+07
2023-07-07 14:12:50,093 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 5900, best=0.91, avg=0.90, std=0.00, steps=3.626e+07
2023-07-07 14:12:55,825 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6000, best=0.92, avg=0.90, std=0.00, steps=3.687e+07
2023-07-07 14:13:01,558 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6100, best=0.91, avg=0.90, std=0.00, steps=3.748e+07
2023-07-07 14:13:07,313 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6200, best=0.91, avg=0.90, std=0.00, steps=3.810e+07
2023-07-07 14:13:13,085 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6300, best=0.91, avg=0.90, std=0.00, steps=3.871e+07
2023-07-07 14:13:18,877 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6400, best=0.91, avg=0.90, std=0.00, steps=3.933e+07
2023-07-07 14:13:24,643 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6500, best=0.91, avg=0.90, std=0.00, steps=3.994e+07
2023-07-07 14:13:30,401 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6600, best=0.91, avg=0.90, std=0.00, steps=4.056e+07
2023-07-07 14:13:36,187 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6700, best=0.92, avg=0.90, std=0.00, steps=4.117e+07
2023-07-07 14:13:41,937 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6800, best=0.91, avg=0.90, std=0.00, steps=4.179e+07
2023-07-07 14:13:47,726 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 6900, best=0.91, avg=0.90, std=0.00, steps=4.240e+07
2023-07-07 14:13:53,500 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7000, best=0.92, avg=0.90, std=0.00, steps=4.301e+07
2023-07-07 14:13:59,263 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7100, best=0.92, avg=0.90, std=0.00, steps=4.363e+07
2023-07-07 14:14:05,002 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7200, best=0.91, avg=0.90, std=0.00, steps=4.424e+07
2023-07-07 14:14:10,758 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7300, best=0.92, avg=0.90, std=0.00, steps=4.486e+07
2023-07-07 14:14:16,495 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7400, best=0.92, avg=0.90, std=0.00, steps=4.547e+07
2023-07-07 14:14:22,264 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7500, best=0.91, avg=0.90, std=0.00, steps=4.609e+07
2023-07-07 14:14:27,999 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7600, best=0.91, avg=0.90, std=0.00, steps=4.670e+07
2023-07-07 14:14:33,738 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7700, best=0.91, avg=0.90, std=0.00, steps=4.731e+07
2023-07-07 14:14:39,479 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7800, best=0.91, avg=0.90, std=0.00, steps=4.793e+07
2023-07-07 14:14:45,220 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 7900, best=0.91, avg=0.90, std=0.00, steps=4.854e+07
2023-07-07 14:14:50,965 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8000, best=0.91, avg=0.90, std=0.00, steps=4.916e+07
2023-07-07 14:14:56,718 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8100, best=0.91, avg=0.90, std=0.00, steps=4.977e+07
2023-07-07 14:15:02,462 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8200, best=0.91, avg=0.90, std=0.00, steps=5.039e+07
2023-07-07 14:15:08,215 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8300, best=0.91, avg=0.90, std=0.00, steps=5.100e+07
2023-07-07 14:15:13,967 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8400, best=0.91, avg=0.90, std=0.00, steps=5.162e+07
2023-07-07 14:15:19,716 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8500, best=0.91, avg=0.90, std=0.00, steps=5.223e+07
2023-07-07 14:15:25,463 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8600, best=0.91, avg=0.90, std=0.00, steps=5.284e+07
2023-07-07 14:15:31,229 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8700, best=0.91, avg=0.90, std=0.00, steps=5.346e+07
2023-07-07 14:15:36,983 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8800, best=0.91, avg=0.90, std=0.00, steps=5.407e+07
2023-07-07 14:15:42,746 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 8900, best=0.91, avg=0.90, std=0.00, steps=5.469e+07
2023-07-07 14:15:48,501 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9000, best=0.91, avg=0.90, std=0.00, steps=5.530e+07
2023-07-07 14:15:54,253 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9100, best=0.91, avg=0.90, std=0.00, steps=5.592e+07
2023-07-07 14:16:00,025 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9200, best=0.91, avg=0.90, std=0.00, steps=5.653e+07
2023-07-07 14:16:05,815 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9300, best=0.91, avg=0.90, std=0.00, steps=5.715e+07
2023-07-07 14:16:11,608 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9400, best=0.91, avg=0.90, std=0.00, steps=5.776e+07
2023-07-07 14:16:17,392 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9500, best=0.91, avg=0.90, std=0.00, steps=5.837e+07
2023-07-07 14:16:23,198 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9600, best=0.91, avg=0.90, std=0.00, steps=5.899e+07
2023-07-07 14:16:28,978 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9700, best=0.92, avg=0.90, std=0.00, steps=5.960e+07
2023-07-07 14:16:34,756 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9800, best=0.91, avg=0.90, std=0.00, steps=6.022e+07
2023-07-07 14:16:40,530 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 9900, best=0.91, avg=0.90, std=0.00, steps=6.083e+07
2023-07-07 14:16:46,288 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10000, best=0.91, avg=0.90, std=0.00, steps=6.145e+07
2023-07-07 14:16:52,033 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10100, best=0.91, avg=0.90, std=0.00, steps=6.206e+07
2023-07-07 14:16:57,784 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10200, best=0.92, avg=0.90, std=0.00, steps=6.267e+07
2023-07-07 14:17:03,527 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10300, best=0.91, avg=0.90, std=0.00, steps=6.329e+07
2023-07-07 14:17:09,284 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10400, best=0.91, avg=0.90, std=0.00, steps=6.390e+07
2023-07-07 14:17:15,031 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10500, best=0.91, avg=0.90, std=0.00, steps=6.452e+07
2023-07-07 14:17:20,763 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10600, best=0.91, avg=0.90, std=0.00, steps=6.513e+07
2023-07-07 14:17:26,508 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10700, best=0.92, avg=0.90, std=0.00, steps=6.575e+07
2023-07-07 14:17:32,251 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10800, best=0.92, avg=0.90, std=0.00, steps=6.636e+07
2023-07-07 14:17:37,999 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 10900, best=0.92, avg=0.90, std=0.00, steps=6.698e+07
2023-07-07 14:17:43,782 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11000, best=0.91, avg=0.90, std=0.00, steps=6.759e+07
2023-07-07 14:17:49,532 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11100, best=0.92, avg=0.90, std=0.00, steps=6.820e+07
2023-07-07 14:17:55,282 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11200, best=0.91, avg=0.90, std=0.00, steps=6.882e+07
2023-07-07 14:18:01,017 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11300, best=0.91, avg=0.90, std=0.00, steps=6.943e+07
2023-07-07 14:18:06,749 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11400, best=0.92, avg=0.90, std=0.00, steps=7.005e+07
2023-07-07 14:18:12,482 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11500, best=0.91, avg=0.90, std=0.00, steps=7.066e+07
2023-07-07 14:18:18,220 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11600, best=0.91, avg=0.90, std=0.00, steps=7.128e+07
2023-07-07 14:18:23,984 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11700, best=0.91, avg=0.90, std=0.00, steps=7.189e+07
2023-07-07 14:18:29,762 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11800, best=0.91, avg=0.90, std=0.00, steps=7.251e+07
2023-07-07 14:18:35,536 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11900, best=0.91, avg=0.90, std=0.00, steps=7.312e+07
2023-07-07 14:18:41,226 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 8, 0, [Train]: 11999, best=0.91, avg=0.90, std=0.00, steps=7.373e+07
2023-07-07 14:18:41,227 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135906
2023-07-07 14:18:41,253 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 14:18:41,285 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 14:18:51,021 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 100, best=0.52, avg=0.50, std=0.01, steps=8.274e+05
2023-07-07 14:18:58,586 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 200, best=0.71, avg=0.69, std=0.00, steps=1.647e+06
2023-07-07 14:19:06,149 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 300, best=0.71, avg=0.70, std=0.01, steps=2.466e+06
2023-07-07 14:19:13,698 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 400, best=0.73, avg=0.71, std=0.01, steps=3.285e+06
2023-07-07 14:19:21,245 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 500, best=0.73, avg=0.72, std=0.01, steps=4.104e+06
2023-07-07 14:19:28,798 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 600, best=0.74, avg=0.72, std=0.01, steps=4.923e+06
2023-07-07 14:19:36,348 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 700, best=0.76, avg=0.75, std=0.01, steps=5.743e+06
2023-07-07 14:19:43,922 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 800, best=0.77, avg=0.75, std=0.01, steps=6.562e+06
2023-07-07 14:19:51,488 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 900, best=0.78, avg=0.77, std=0.01, steps=7.381e+06
2023-07-07 14:19:59,085 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1000, best=0.79, avg=0.78, std=0.01, steps=8.200e+06
2023-07-07 14:20:06,642 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1100, best=0.81, avg=0.79, std=0.01, steps=9.019e+06
2023-07-07 14:20:14,197 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1200, best=0.81, avg=0.80, std=0.00, steps=9.839e+06
2023-07-07 14:20:21,759 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1300, best=0.82, avg=0.81, std=0.01, steps=1.066e+07
2023-07-07 14:20:29,324 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1400, best=0.82, avg=0.81, std=0.01, steps=1.148e+07
2023-07-07 14:20:36,911 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1500, best=0.83, avg=0.82, std=0.01, steps=1.230e+07
2023-07-07 14:20:44,477 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1600, best=0.84, avg=0.82, std=0.01, steps=1.312e+07
2023-07-07 14:20:52,032 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1700, best=0.85, avg=0.83, std=0.01, steps=1.393e+07
2023-07-07 14:20:59,625 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1800, best=0.85, avg=0.84, std=0.00, steps=1.475e+07
2023-07-07 14:21:07,198 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 1900, best=0.85, avg=0.84, std=0.00, steps=1.557e+07
2023-07-07 14:21:14,852 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2000, best=0.85, avg=0.84, std=0.00, steps=1.639e+07
2023-07-07 14:21:22,415 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2100, best=0.85, avg=0.84, std=0.00, steps=1.721e+07
2023-07-07 14:21:29,962 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2200, best=0.86, avg=0.84, std=0.00, steps=1.803e+07
2023-07-07 14:21:37,541 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2300, best=0.86, avg=0.84, std=0.00, steps=1.885e+07
2023-07-07 14:21:45,107 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2400, best=0.86, avg=0.85, std=0.00, steps=1.967e+07
2023-07-07 14:21:52,657 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2500, best=0.86, avg=0.85, std=0.00, steps=2.049e+07
2023-07-07 14:22:00,221 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2600, best=0.86, avg=0.85, std=0.00, steps=2.131e+07
2023-07-07 14:22:07,777 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2700, best=0.86, avg=0.85, std=0.00, steps=2.213e+07
2023-07-07 14:22:15,350 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2800, best=0.86, avg=0.85, std=0.00, steps=2.295e+07
2023-07-07 14:22:22,918 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 2900, best=0.86, avg=0.85, std=0.00, steps=2.376e+07
2023-07-07 14:22:30,490 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3000, best=0.86, avg=0.85, std=0.00, steps=2.458e+07
2023-07-07 14:22:38,044 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3100, best=0.86, avg=0.85, std=0.00, steps=2.540e+07
2023-07-07 14:22:45,608 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3200, best=0.86, avg=0.85, std=0.00, steps=2.622e+07
2023-07-07 14:22:53,183 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3300, best=0.86, avg=0.85, std=0.00, steps=2.704e+07
2023-07-07 14:23:00,753 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3400, best=0.86, avg=0.85, std=0.00, steps=2.786e+07
2023-07-07 14:23:08,319 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3500, best=0.86, avg=0.85, std=0.00, steps=2.868e+07
2023-07-07 14:23:15,888 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3600, best=0.86, avg=0.85, std=0.00, steps=2.950e+07
2023-07-07 14:23:23,459 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3700, best=0.86, avg=0.85, std=0.00, steps=3.032e+07
2023-07-07 14:23:31,040 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3800, best=0.86, avg=0.85, std=0.00, steps=3.114e+07
2023-07-07 14:23:38,608 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 3900, best=0.86, avg=0.85, std=0.00, steps=3.196e+07
2023-07-07 14:23:46,169 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4000, best=0.86, avg=0.85, std=0.00, steps=3.278e+07
2023-07-07 14:23:53,732 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4100, best=0.87, avg=0.85, std=0.00, steps=3.360e+07
2023-07-07 14:24:01,283 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4200, best=0.86, avg=0.85, std=0.00, steps=3.441e+07
2023-07-07 14:24:08,836 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4300, best=0.86, avg=0.85, std=0.00, steps=3.523e+07
2023-07-07 14:24:16,412 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4400, best=0.86, avg=0.85, std=0.00, steps=3.605e+07
2023-07-07 14:24:23,991 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4500, best=0.87, avg=0.86, std=0.00, steps=3.687e+07
2023-07-07 14:24:31,548 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4600, best=0.86, avg=0.85, std=0.00, steps=3.769e+07
2023-07-07 14:24:39,120 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4700, best=0.87, avg=0.86, std=0.00, steps=3.851e+07
2023-07-07 14:24:46,686 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4800, best=0.87, avg=0.86, std=0.00, steps=3.933e+07
2023-07-07 14:24:54,277 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 4900, best=0.87, avg=0.86, std=0.00, steps=4.015e+07
2023-07-07 14:25:01,843 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5000, best=0.87, avg=0.86, std=0.00, steps=4.097e+07
2023-07-07 14:25:09,410 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5100, best=0.87, avg=0.86, std=0.00, steps=4.179e+07
2023-07-07 14:25:16,986 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5200, best=0.87, avg=0.86, std=0.00, steps=4.261e+07
2023-07-07 14:25:24,556 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5300, best=0.87, avg=0.86, std=0.00, steps=4.343e+07
2023-07-07 14:25:32,137 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5400, best=0.87, avg=0.86, std=0.00, steps=4.424e+07
2023-07-07 14:25:39,707 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5500, best=0.87, avg=0.86, std=0.00, steps=4.506e+07
2023-07-07 14:25:47,293 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5600, best=0.87, avg=0.86, std=0.00, steps=4.588e+07
2023-07-07 14:25:54,857 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5700, best=0.87, avg=0.86, std=0.00, steps=4.670e+07
2023-07-07 14:26:02,447 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5800, best=0.87, avg=0.86, std=0.00, steps=4.752e+07
2023-07-07 14:26:10,004 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 5900, best=0.87, avg=0.86, std=0.00, steps=4.834e+07
2023-07-07 14:26:17,565 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6000, best=0.87, avg=0.86, std=0.00, steps=4.916e+07
2023-07-07 14:26:25,122 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6100, best=0.87, avg=0.86, std=0.00, steps=4.998e+07
2023-07-07 14:26:32,670 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6200, best=0.87, avg=0.86, std=0.00, steps=5.080e+07
2023-07-07 14:26:40,229 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6300, best=0.87, avg=0.86, std=0.00, steps=5.162e+07
2023-07-07 14:26:47,786 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6400, best=0.87, avg=0.86, std=0.00, steps=5.244e+07
2023-07-07 14:26:55,356 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6500, best=0.87, avg=0.86, std=0.00, steps=5.326e+07
2023-07-07 14:27:02,919 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6600, best=0.87, avg=0.86, std=0.00, steps=5.408e+07
2023-07-07 14:27:10,464 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6700, best=0.87, avg=0.86, std=0.00, steps=5.489e+07
2023-07-07 14:27:18,009 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6800, best=0.87, avg=0.86, std=0.00, steps=5.571e+07
2023-07-07 14:27:25,538 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 6900, best=0.87, avg=0.86, std=0.00, steps=5.653e+07
2023-07-07 14:27:33,089 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7000, best=0.87, avg=0.86, std=0.00, steps=5.735e+07
2023-07-07 14:27:40,635 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7100, best=0.87, avg=0.86, std=0.00, steps=5.817e+07
2023-07-07 14:27:48,172 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7200, best=0.87, avg=0.86, std=0.00, steps=5.899e+07
2023-07-07 14:27:55,721 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7300, best=0.87, avg=0.86, std=0.00, steps=5.981e+07
2023-07-07 14:28:03,265 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7400, best=0.87, avg=0.86, std=0.00, steps=6.063e+07
2023-07-07 14:28:10,799 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7500, best=0.87, avg=0.86, std=0.00, steps=6.145e+07
2023-07-07 14:28:18,376 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7600, best=0.87, avg=0.86, std=0.00, steps=6.227e+07
2023-07-07 14:28:25,930 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7700, best=0.87, avg=0.86, std=0.00, steps=6.309e+07
2023-07-07 14:28:33,490 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7800, best=0.87, avg=0.86, std=0.00, steps=6.391e+07
2023-07-07 14:28:41,052 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 7900, best=0.88, avg=0.86, std=0.00, steps=6.472e+07
2023-07-07 14:28:48,617 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8000, best=0.87, avg=0.86, std=0.00, steps=6.554e+07
2023-07-07 14:28:56,189 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8100, best=0.87, avg=0.86, std=0.00, steps=6.636e+07
2023-07-07 14:29:03,764 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8200, best=0.88, avg=0.86, std=0.00, steps=6.718e+07
2023-07-07 14:29:11,308 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8300, best=0.88, avg=0.86, std=0.00, steps=6.800e+07
2023-07-07 14:29:18,873 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8400, best=0.88, avg=0.87, std=0.00, steps=6.882e+07
2023-07-07 14:29:26,453 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8500, best=0.88, avg=0.86, std=0.00, steps=6.964e+07
2023-07-07 14:29:34,024 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8600, best=0.88, avg=0.86, std=0.00, steps=7.046e+07
2023-07-07 14:29:41,589 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8700, best=0.87, avg=0.87, std=0.00, steps=7.128e+07
2023-07-07 14:29:49,174 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8800, best=0.88, avg=0.87, std=0.00, steps=7.210e+07
2023-07-07 14:29:56,735 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 8900, best=0.88, avg=0.87, std=0.00, steps=7.292e+07
2023-07-07 14:30:04,292 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9000, best=0.88, avg=0.87, std=0.00, steps=7.374e+07
2023-07-07 14:30:11,846 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9100, best=0.87, avg=0.87, std=0.00, steps=7.456e+07
2023-07-07 14:30:19,392 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9200, best=0.88, avg=0.87, std=0.00, steps=7.537e+07
2023-07-07 14:30:26,941 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9300, best=0.88, avg=0.87, std=0.00, steps=7.619e+07
2023-07-07 14:30:34,496 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9400, best=0.88, avg=0.87, std=0.00, steps=7.701e+07
2023-07-07 14:30:42,050 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9500, best=0.87, avg=0.87, std=0.00, steps=7.783e+07
2023-07-07 14:30:49,627 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9600, best=0.87, avg=0.87, std=0.00, steps=7.865e+07
2023-07-07 14:30:57,204 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9700, best=0.87, avg=0.87, std=0.00, steps=7.947e+07
2023-07-07 14:31:04,800 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9800, best=0.88, avg=0.87, std=0.00, steps=8.029e+07
2023-07-07 14:31:12,384 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 9900, best=0.88, avg=0.87, std=0.00, steps=8.111e+07
2023-07-07 14:31:19,944 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10000, best=0.88, avg=0.87, std=0.00, steps=8.193e+07
2023-07-07 14:31:27,519 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10100, best=0.88, avg=0.87, std=0.00, steps=8.275e+07
2023-07-07 14:31:35,082 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10200, best=0.88, avg=0.87, std=0.00, steps=8.357e+07
2023-07-07 14:31:42,649 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10300, best=0.88, avg=0.87, std=0.00, steps=8.439e+07
2023-07-07 14:31:50,204 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10400, best=0.87, avg=0.87, std=0.00, steps=8.520e+07
2023-07-07 14:31:57,778 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10500, best=0.88, avg=0.87, std=0.00, steps=8.602e+07
2023-07-07 14:32:05,335 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10600, best=0.88, avg=0.87, std=0.00, steps=8.684e+07
2023-07-07 14:32:12,914 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10700, best=0.88, avg=0.87, std=0.00, steps=8.766e+07
2023-07-07 14:32:20,481 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10800, best=0.88, avg=0.87, std=0.00, steps=8.848e+07
2023-07-07 14:32:28,057 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 10900, best=0.88, avg=0.87, std=0.00, steps=8.930e+07
2023-07-07 14:32:35,604 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11000, best=0.88, avg=0.87, std=0.00, steps=9.012e+07
2023-07-07 14:32:43,160 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11100, best=0.88, avg=0.87, std=0.00, steps=9.094e+07
2023-07-07 14:32:50,711 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11200, best=0.88, avg=0.87, std=0.00, steps=9.176e+07
2023-07-07 14:32:58,277 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11300, best=0.88, avg=0.87, std=0.00, steps=9.258e+07
2023-07-07 14:33:05,839 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11400, best=0.88, avg=0.87, std=0.00, steps=9.340e+07
2023-07-07 14:33:13,406 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11500, best=0.88, avg=0.87, std=0.00, steps=9.422e+07
2023-07-07 14:33:20,966 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11600, best=0.88, avg=0.87, std=0.00, steps=9.504e+07
2023-07-07 14:33:28,535 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11700, best=0.88, avg=0.87, std=0.00, steps=9.585e+07
2023-07-07 14:33:36,092 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11800, best=0.88, avg=0.87, std=0.00, steps=9.667e+07
2023-07-07 14:33:43,646 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11900, best=0.87, avg=0.87, std=0.00, steps=9.749e+07
2023-07-07 14:33:51,140 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 16, 0, [Train]: 11999, best=0.88, avg=0.87, std=0.00, steps=9.830e+07
2023-07-07 14:33:51,141 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135906
2023-07-07 14:33:51,170 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 14:33:51,206 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 14:34:04,630 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 100, best=0.52, avg=0.50, std=0.01, steps=1.241e+06
2023-07-07 14:34:15,831 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 200, best=0.52, avg=0.50, std=0.01, steps=2.470e+06
2023-07-07 14:34:27,001 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 300, best=0.65, avg=0.64, std=0.01, steps=3.699e+06
2023-07-07 14:34:38,197 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 400, best=0.67, avg=0.66, std=0.01, steps=4.927e+06
2023-07-07 14:34:49,396 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 500, best=0.69, avg=0.67, std=0.01, steps=6.156e+06
2023-07-07 14:35:00,590 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 600, best=0.70, avg=0.69, std=0.01, steps=7.385e+06
2023-07-07 14:35:11,795 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 700, best=0.73, avg=0.71, std=0.01, steps=8.614e+06
2023-07-07 14:35:22,983 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 800, best=0.74, avg=0.73, std=0.01, steps=9.843e+06
2023-07-07 14:35:34,151 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 900, best=0.74, avg=0.72, std=0.01, steps=1.107e+07
2023-07-07 14:35:45,346 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1000, best=0.74, avg=0.73, std=0.01, steps=1.230e+07
2023-07-07 14:35:56,558 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1100, best=0.75, avg=0.74, std=0.01, steps=1.353e+07
2023-07-07 14:36:07,727 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1200, best=0.76, avg=0.74, std=0.01, steps=1.476e+07
2023-07-07 14:36:18,888 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1300, best=0.76, avg=0.74, std=0.01, steps=1.599e+07
2023-07-07 14:36:30,066 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1400, best=0.76, avg=0.75, std=0.01, steps=1.722e+07
2023-07-07 14:36:41,249 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1500, best=0.76, avg=0.75, std=0.01, steps=1.844e+07
2023-07-07 14:36:52,442 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1600, best=0.77, avg=0.75, std=0.01, steps=1.967e+07
2023-07-07 14:37:03,623 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1700, best=0.77, avg=0.76, std=0.01, steps=2.090e+07
2023-07-07 14:37:14,809 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1800, best=0.77, avg=0.76, std=0.00, steps=2.213e+07
2023-07-07 14:37:25,991 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 1900, best=0.77, avg=0.76, std=0.01, steps=2.336e+07
2023-07-07 14:37:37,166 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2000, best=0.77, avg=0.76, std=0.00, steps=2.459e+07
2023-07-07 14:37:48,386 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2100, best=0.78, avg=0.76, std=0.01, steps=2.582e+07
2023-07-07 14:37:59,607 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2200, best=0.78, avg=0.77, std=0.00, steps=2.705e+07
2023-07-07 14:38:10,794 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2300, best=0.78, avg=0.77, std=0.01, steps=2.827e+07
2023-07-07 14:38:21,965 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2400, best=0.78, avg=0.77, std=0.01, steps=2.950e+07
2023-07-07 14:38:33,168 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2500, best=0.78, avg=0.77, std=0.01, steps=3.073e+07
2023-07-07 14:38:44,365 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2600, best=0.78, avg=0.77, std=0.01, steps=3.196e+07
2023-07-07 14:38:55,571 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2700, best=0.78, avg=0.77, std=0.00, steps=3.319e+07
2023-07-07 14:39:06,768 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2800, best=0.78, avg=0.77, std=0.00, steps=3.442e+07
2023-07-07 14:39:17,975 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 2900, best=0.78, avg=0.77, std=0.01, steps=3.565e+07
2023-07-07 14:39:29,176 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3000, best=0.78, avg=0.77, std=0.00, steps=3.688e+07
2023-07-07 14:39:40,366 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3100, best=0.79, avg=0.77, std=0.01, steps=3.811e+07
2023-07-07 14:39:51,553 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3200, best=0.78, avg=0.77, std=0.01, steps=3.933e+07
2023-07-07 14:40:02,730 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3300, best=0.79, avg=0.77, std=0.01, steps=4.056e+07
2023-07-07 14:40:13,903 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3400, best=0.79, avg=0.77, std=0.01, steps=4.179e+07
2023-07-07 14:40:25,069 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3500, best=0.79, avg=0.77, std=0.00, steps=4.302e+07
2023-07-07 14:40:36,246 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3600, best=0.79, avg=0.78, std=0.00, steps=4.425e+07
2023-07-07 14:40:47,469 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3700, best=0.79, avg=0.78, std=0.00, steps=4.548e+07
2023-07-07 14:40:58,688 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3800, best=0.79, avg=0.78, std=0.00, steps=4.671e+07
2023-07-07 14:41:09,871 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 3900, best=0.80, avg=0.78, std=0.00, steps=4.794e+07
2023-07-07 14:41:21,037 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4000, best=0.79, avg=0.78, std=0.00, steps=4.916e+07
2023-07-07 14:41:32,198 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4100, best=0.79, avg=0.78, std=0.00, steps=5.039e+07
2023-07-07 14:41:43,377 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4200, best=0.79, avg=0.78, std=0.00, steps=5.162e+07
2023-07-07 14:41:54,538 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4300, best=0.79, avg=0.78, std=0.00, steps=5.285e+07
2023-07-07 14:42:05,700 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4400, best=0.79, avg=0.78, std=0.00, steps=5.408e+07
2023-07-07 14:42:16,895 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4500, best=0.80, avg=0.78, std=0.00, steps=5.531e+07
2023-07-07 14:42:28,078 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4600, best=0.79, avg=0.78, std=0.00, steps=5.654e+07
2023-07-07 14:42:39,287 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4700, best=0.79, avg=0.78, std=0.00, steps=5.777e+07
2023-07-07 14:42:50,500 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4800, best=0.80, avg=0.79, std=0.00, steps=5.899e+07
2023-07-07 14:43:01,694 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 4900, best=0.80, avg=0.79, std=0.00, steps=6.022e+07
2023-07-07 14:43:12,906 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5000, best=0.80, avg=0.78, std=0.00, steps=6.145e+07
2023-07-07 14:43:24,085 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5100, best=0.80, avg=0.79, std=0.00, steps=6.268e+07
2023-07-07 14:43:35,278 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5200, best=0.80, avg=0.79, std=0.00, steps=6.391e+07
2023-07-07 14:43:46,526 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5300, best=0.80, avg=0.79, std=0.00, steps=6.514e+07
2023-07-07 14:43:57,717 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5400, best=0.80, avg=0.79, std=0.00, steps=6.637e+07
2023-07-07 14:44:08,912 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5500, best=0.80, avg=0.79, std=0.00, steps=6.760e+07
2023-07-07 14:44:20,122 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5600, best=0.80, avg=0.79, std=0.00, steps=6.883e+07
2023-07-07 14:44:31,305 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5700, best=0.80, avg=0.79, std=0.00, steps=7.005e+07
2023-07-07 14:44:42,495 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5800, best=0.80, avg=0.79, std=0.00, steps=7.128e+07
2023-07-07 14:44:53,689 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 5900, best=0.80, avg=0.79, std=0.00, steps=7.251e+07
2023-07-07 14:45:04,877 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6000, best=0.80, avg=0.79, std=0.00, steps=7.374e+07
2023-07-07 14:45:16,090 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6100, best=0.80, avg=0.79, std=0.00, steps=7.497e+07
2023-07-07 14:45:27,293 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6200, best=0.80, avg=0.79, std=0.00, steps=7.620e+07
2023-07-07 14:45:38,485 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6300, best=0.80, avg=0.79, std=0.00, steps=7.743e+07
2023-07-07 14:45:49,686 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6400, best=0.80, avg=0.79, std=0.00, steps=7.866e+07
2023-07-07 14:46:00,878 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6500, best=0.80, avg=0.79, std=0.00, steps=7.988e+07
2023-07-07 14:46:12,071 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6600, best=0.80, avg=0.79, std=0.00, steps=8.111e+07
2023-07-07 14:46:23,248 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6700, best=0.80, avg=0.79, std=0.00, steps=8.234e+07
2023-07-07 14:46:34,443 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6800, best=0.80, avg=0.79, std=0.00, steps=8.357e+07
2023-07-07 14:46:45,652 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 6900, best=0.80, avg=0.79, std=0.00, steps=8.480e+07
2023-07-07 14:46:56,844 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7000, best=0.80, avg=0.79, std=0.00, steps=8.603e+07
2023-07-07 14:47:08,027 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7100, best=0.80, avg=0.79, std=0.00, steps=8.726e+07
2023-07-07 14:47:19,214 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7200, best=0.80, avg=0.79, std=0.00, steps=8.849e+07
2023-07-07 14:47:30,395 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7300, best=0.80, avg=0.79, std=0.00, steps=8.971e+07
2023-07-07 14:47:41,583 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7400, best=0.80, avg=0.79, std=0.00, steps=9.094e+07
2023-07-07 14:47:52,778 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7500, best=0.80, avg=0.79, std=0.00, steps=9.217e+07
2023-07-07 14:48:03,963 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7600, best=0.80, avg=0.79, std=0.00, steps=9.340e+07
2023-07-07 14:48:15,132 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7700, best=0.80, avg=0.79, std=0.00, steps=9.463e+07
2023-07-07 14:48:26,325 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7800, best=0.80, avg=0.79, std=0.00, steps=9.586e+07
2023-07-07 14:48:37,500 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 7900, best=0.80, avg=0.79, std=0.00, steps=9.709e+07
2023-07-07 14:48:48,678 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8000, best=0.80, avg=0.79, std=0.00, steps=9.832e+07
2023-07-07 14:48:59,852 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8100, best=0.80, avg=0.79, std=0.00, steps=9.955e+07
2023-07-07 14:49:11,041 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8200, best=0.80, avg=0.79, std=0.00, steps=1.008e+08
2023-07-07 14:49:22,225 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8300, best=0.80, avg=0.79, std=0.00, steps=1.020e+08
2023-07-07 14:49:33,398 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8400, best=0.80, avg=0.79, std=0.00, steps=1.032e+08
2023-07-07 14:49:44,589 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8500, best=0.80, avg=0.79, std=0.00, steps=1.045e+08
2023-07-07 14:49:55,772 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8600, best=0.80, avg=0.79, std=0.00, steps=1.057e+08
2023-07-07 14:50:06,993 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8700, best=0.80, avg=0.79, std=0.00, steps=1.069e+08
2023-07-07 14:50:18,166 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8800, best=0.80, avg=0.79, std=0.00, steps=1.081e+08
2023-07-07 14:50:29,378 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 8900, best=0.80, avg=0.79, std=0.00, steps=1.094e+08
2023-07-07 14:50:40,581 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9000, best=0.80, avg=0.79, std=0.00, steps=1.106e+08
2023-07-07 14:50:51,808 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9100, best=0.80, avg=0.79, std=0.00, steps=1.118e+08
2023-07-07 14:51:03,018 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9200, best=0.80, avg=0.79, std=0.00, steps=1.131e+08
2023-07-07 14:51:14,202 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9300, best=0.80, avg=0.79, std=0.00, steps=1.143e+08
2023-07-07 14:51:25,417 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9400, best=0.80, avg=0.79, std=0.00, steps=1.155e+08
2023-07-07 14:51:36,617 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9500, best=0.80, avg=0.79, std=0.00, steps=1.167e+08
2023-07-07 14:51:47,789 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9600, best=0.80, avg=0.79, std=0.00, steps=1.180e+08
2023-07-07 14:51:58,947 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9700, best=0.80, avg=0.79, std=0.00, steps=1.192e+08
2023-07-07 14:52:10,153 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9800, best=0.80, avg=0.79, std=0.00, steps=1.204e+08
2023-07-07 14:52:21,345 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 9900, best=0.80, avg=0.79, std=0.00, steps=1.217e+08
2023-07-07 14:52:32,552 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10000, best=0.80, avg=0.79, std=0.00, steps=1.229e+08
2023-07-07 14:52:43,731 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10100, best=0.80, avg=0.79, std=0.00, steps=1.241e+08
2023-07-07 14:52:54,915 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10200, best=0.80, avg=0.79, std=0.00, steps=1.253e+08
2023-07-07 14:53:06,092 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10300, best=0.80, avg=0.79, std=0.00, steps=1.266e+08
2023-07-07 14:53:17,270 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10400, best=0.80, avg=0.79, std=0.00, steps=1.278e+08
2023-07-07 14:53:28,453 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10500, best=0.80, avg=0.79, std=0.00, steps=1.290e+08
2023-07-07 14:53:39,653 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10600, best=0.80, avg=0.79, std=0.00, steps=1.303e+08
2023-07-07 14:53:50,845 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10700, best=0.80, avg=0.79, std=0.00, steps=1.315e+08
2023-07-07 14:54:02,044 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10800, best=0.81, avg=0.79, std=0.00, steps=1.327e+08
2023-07-07 14:54:13,283 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 10900, best=0.80, avg=0.79, std=0.00, steps=1.340e+08
2023-07-07 14:54:24,488 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11000, best=0.80, avg=0.79, std=0.00, steps=1.352e+08
2023-07-07 14:54:35,687 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11100, best=0.80, avg=0.79, std=0.00, steps=1.364e+08
2023-07-07 14:54:46,880 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11200, best=0.81, avg=0.79, std=0.00, steps=1.376e+08
2023-07-07 14:54:58,087 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11300, best=0.80, avg=0.79, std=0.00, steps=1.389e+08
2023-07-07 14:55:09,271 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11400, best=0.80, avg=0.79, std=0.00, steps=1.401e+08
2023-07-07 14:55:20,463 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11500, best=0.80, avg=0.79, std=0.00, steps=1.413e+08
2023-07-07 14:55:31,641 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11600, best=0.80, avg=0.79, std=0.00, steps=1.426e+08
2023-07-07 14:55:42,826 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11700, best=0.80, avg=0.79, std=0.00, steps=1.438e+08
2023-07-07 14:55:54,019 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11800, best=0.80, avg=0.79, std=0.00, steps=1.450e+08
2023-07-07 14:56:05,241 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11900, best=0.80, avg=0.79, std=0.00, steps=1.462e+08
2023-07-07 14:56:16,317 -        meta learning: [    INFO] - [Len Lat Rep]: 8, 32, 0, [Train]: 11999, best=0.80, avg=0.79, std=0.00, steps=1.475e+08
2023-07-07 14:56:16,317 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135906
2023-07-07 14:56:16,343 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 14:56:16,376 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 14:56:26,133 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 100, best=0.64, avg=0.63, std=0.00, steps=8.274e+05
2023-07-07 14:56:33,705 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 200, best=0.72, avg=0.71, std=0.00, steps=1.647e+06
2023-07-07 14:56:41,291 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 300, best=0.74, avg=0.73, std=0.00, steps=2.466e+06
2023-07-07 14:56:48,859 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 400, best=0.76, avg=0.75, std=0.00, steps=3.285e+06
2023-07-07 14:56:56,439 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 500, best=0.78, avg=0.77, std=0.00, steps=4.104e+06
2023-07-07 14:57:04,036 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 600, best=0.79, avg=0.78, std=0.00, steps=4.923e+06
2023-07-07 14:57:11,619 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 700, best=0.81, avg=0.80, std=0.00, steps=5.743e+06
2023-07-07 14:57:19,210 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 800, best=0.82, avg=0.81, std=0.00, steps=6.562e+06
2023-07-07 14:57:26,791 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 900, best=0.83, avg=0.83, std=0.00, steps=7.381e+06
2023-07-07 14:57:34,382 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1000, best=0.85, avg=0.84, std=0.00, steps=8.200e+06
2023-07-07 14:57:41,966 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1100, best=0.85, avg=0.84, std=0.00, steps=9.019e+06
2023-07-07 14:57:49,554 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1200, best=0.85, avg=0.84, std=0.00, steps=9.839e+06
2023-07-07 14:57:57,134 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1300, best=0.86, avg=0.85, std=0.00, steps=1.066e+07
2023-07-07 14:58:04,722 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1400, best=0.86, avg=0.85, std=0.00, steps=1.148e+07
2023-07-07 14:58:12,329 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1500, best=0.87, avg=0.85, std=0.00, steps=1.230e+07
2023-07-07 14:58:19,909 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1600, best=0.87, avg=0.86, std=0.00, steps=1.312e+07
2023-07-07 14:58:27,483 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1700, best=0.87, avg=0.86, std=0.00, steps=1.393e+07
2023-07-07 14:58:35,067 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1800, best=0.87, avg=0.86, std=0.00, steps=1.475e+07
2023-07-07 14:58:42,647 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 1900, best=0.87, avg=0.86, std=0.00, steps=1.557e+07
2023-07-07 14:58:50,236 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2000, best=0.88, avg=0.87, std=0.00, steps=1.639e+07
2023-07-07 14:58:57,811 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2100, best=0.88, avg=0.87, std=0.00, steps=1.721e+07
2023-07-07 14:59:05,391 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2200, best=0.88, avg=0.87, std=0.00, steps=1.803e+07
2023-07-07 14:59:12,961 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2300, best=0.88, avg=0.87, std=0.00, steps=1.885e+07
2023-07-07 14:59:20,533 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2400, best=0.88, avg=0.87, std=0.00, steps=1.967e+07
2023-07-07 14:59:28,104 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2500, best=0.88, avg=0.87, std=0.00, steps=2.049e+07
2023-07-07 14:59:35,684 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2600, best=0.89, avg=0.88, std=0.00, steps=2.131e+07
2023-07-07 14:59:43,252 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2700, best=0.89, avg=0.88, std=0.00, steps=2.213e+07
2023-07-07 14:59:50,825 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2800, best=0.89, avg=0.88, std=0.00, steps=2.295e+07
2023-07-07 14:59:58,404 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 2900, best=0.89, avg=0.88, std=0.00, steps=2.376e+07
2023-07-07 15:00:05,992 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3000, best=0.89, avg=0.88, std=0.00, steps=2.458e+07
2023-07-07 15:00:13,575 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3100, best=0.89, avg=0.88, std=0.00, steps=2.540e+07
2023-07-07 15:00:21,162 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3200, best=0.89, avg=0.88, std=0.00, steps=2.622e+07
2023-07-07 15:00:28,735 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3300, best=0.89, avg=0.88, std=0.00, steps=2.704e+07
2023-07-07 15:00:36,308 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3400, best=0.89, avg=0.88, std=0.00, steps=2.786e+07
2023-07-07 15:00:43,881 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3500, best=0.89, avg=0.88, std=0.00, steps=2.868e+07
2023-07-07 15:00:51,467 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3600, best=0.89, avg=0.88, std=0.00, steps=2.950e+07
2023-07-07 15:00:59,059 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3700, best=0.89, avg=0.88, std=0.00, steps=3.032e+07
2023-07-07 15:01:06,657 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3800, best=0.89, avg=0.88, std=0.00, steps=3.114e+07
2023-07-07 15:01:14,244 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 3900, best=0.89, avg=0.88, std=0.00, steps=3.196e+07
2023-07-07 15:01:21,842 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4000, best=0.89, avg=0.89, std=0.00, steps=3.278e+07
2023-07-07 15:01:29,435 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4100, best=0.90, avg=0.89, std=0.00, steps=3.360e+07
2023-07-07 15:01:37,009 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4200, best=0.91, avg=0.90, std=0.00, steps=3.441e+07
2023-07-07 15:01:44,584 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4300, best=0.91, avg=0.90, std=0.00, steps=3.523e+07
2023-07-07 15:01:52,172 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4400, best=0.91, avg=0.91, std=0.00, steps=3.605e+07
2023-07-07 15:01:59,744 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4500, best=0.91, avg=0.91, std=0.00, steps=3.687e+07
2023-07-07 15:02:07,323 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4600, best=0.92, avg=0.91, std=0.00, steps=3.769e+07
2023-07-07 15:02:14,905 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4700, best=0.92, avg=0.91, std=0.00, steps=3.851e+07
2023-07-07 15:02:22,489 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4800, best=0.92, avg=0.91, std=0.00, steps=3.933e+07
2023-07-07 15:02:30,061 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 4900, best=0.92, avg=0.91, std=0.00, steps=4.015e+07
2023-07-07 15:02:37,641 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5000, best=0.92, avg=0.91, std=0.00, steps=4.097e+07
2023-07-07 15:02:45,218 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5100, best=0.92, avg=0.91, std=0.00, steps=4.179e+07
2023-07-07 15:02:52,788 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5200, best=0.92, avg=0.91, std=0.00, steps=4.261e+07
2023-07-07 15:03:00,361 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5300, best=0.92, avg=0.91, std=0.00, steps=4.343e+07
2023-07-07 15:03:07,924 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5400, best=0.92, avg=0.91, std=0.00, steps=4.424e+07
2023-07-07 15:03:15,484 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5500, best=0.92, avg=0.91, std=0.00, steps=4.506e+07
2023-07-07 15:03:23,051 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5600, best=0.92, avg=0.91, std=0.00, steps=4.588e+07
2023-07-07 15:03:30,636 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5700, best=0.92, avg=0.91, std=0.00, steps=4.670e+07
2023-07-07 15:03:38,231 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5800, best=0.92, avg=0.91, std=0.00, steps=4.752e+07
2023-07-07 15:03:45,793 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 5900, best=0.92, avg=0.91, std=0.00, steps=4.834e+07
2023-07-07 15:03:53,369 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6000, best=0.92, avg=0.92, std=0.00, steps=4.916e+07
2023-07-07 15:04:00,937 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6100, best=0.92, avg=0.92, std=0.00, steps=4.998e+07
2023-07-07 15:04:08,479 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6200, best=0.92, avg=0.92, std=0.00, steps=5.080e+07
2023-07-07 15:04:16,039 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6300, best=0.92, avg=0.92, std=0.00, steps=5.162e+07
2023-07-07 15:04:23,598 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6400, best=0.92, avg=0.92, std=0.00, steps=5.244e+07
2023-07-07 15:04:31,147 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6500, best=0.92, avg=0.92, std=0.00, steps=5.326e+07
2023-07-07 15:04:38,702 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6600, best=0.92, avg=0.92, std=0.00, steps=5.408e+07
2023-07-07 15:04:46,254 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6700, best=0.92, avg=0.92, std=0.00, steps=5.489e+07
2023-07-07 15:04:53,831 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6800, best=0.92, avg=0.92, std=0.00, steps=5.571e+07
2023-07-07 15:05:01,433 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 6900, best=0.93, avg=0.92, std=0.00, steps=5.653e+07
2023-07-07 15:05:09,000 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7000, best=0.92, avg=0.92, std=0.00, steps=5.735e+07
2023-07-07 15:05:16,581 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7100, best=0.93, avg=0.92, std=0.00, steps=5.817e+07
2023-07-07 15:05:24,150 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7200, best=0.93, avg=0.92, std=0.00, steps=5.899e+07
2023-07-07 15:05:31,733 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7300, best=0.93, avg=0.92, std=0.00, steps=5.981e+07
2023-07-07 15:05:39,299 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7400, best=0.93, avg=0.92, std=0.00, steps=6.063e+07
2023-07-07 15:05:46,859 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7500, best=0.93, avg=0.92, std=0.00, steps=6.145e+07
2023-07-07 15:05:54,434 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7600, best=0.93, avg=0.93, std=0.00, steps=6.227e+07
2023-07-07 15:06:02,030 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7700, best=0.93, avg=0.93, std=0.00, steps=6.309e+07
2023-07-07 15:06:09,614 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7800, best=0.93, avg=0.93, std=0.00, steps=6.391e+07
2023-07-07 15:06:17,203 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 7900, best=0.93, avg=0.93, std=0.00, steps=6.472e+07
2023-07-07 15:06:24,762 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8000, best=0.94, avg=0.93, std=0.00, steps=6.554e+07
2023-07-07 15:06:32,336 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8100, best=0.94, avg=0.93, std=0.00, steps=6.636e+07
2023-07-07 15:06:39,914 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8200, best=0.94, avg=0.94, std=0.00, steps=6.718e+07
2023-07-07 15:06:47,501 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8300, best=0.95, avg=0.94, std=0.00, steps=6.800e+07
2023-07-07 15:06:55,077 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8400, best=0.95, avg=0.95, std=0.00, steps=6.882e+07
2023-07-07 15:07:02,657 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8500, best=0.96, avg=0.95, std=0.00, steps=6.964e+07
2023-07-07 15:07:10,247 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8600, best=0.96, avg=0.95, std=0.00, steps=7.046e+07
2023-07-07 15:07:17,826 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8700, best=0.96, avg=0.95, std=0.00, steps=7.128e+07
2023-07-07 15:07:25,419 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8800, best=0.96, avg=0.95, std=0.00, steps=7.210e+07
2023-07-07 15:07:33,006 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 8900, best=0.96, avg=0.96, std=0.00, steps=7.292e+07
2023-07-07 15:07:40,591 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9000, best=0.96, avg=0.96, std=0.00, steps=7.374e+07
2023-07-07 15:07:48,171 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9100, best=0.96, avg=0.96, std=0.00, steps=7.456e+07
2023-07-07 15:07:55,752 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9200, best=0.96, avg=0.96, std=0.00, steps=7.537e+07
2023-07-07 15:08:03,334 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9300, best=0.96, avg=0.96, std=0.00, steps=7.619e+07
2023-07-07 15:08:10,902 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9400, best=0.96, avg=0.96, std=0.00, steps=7.701e+07
2023-07-07 15:08:18,469 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9500, best=0.96, avg=0.96, std=0.00, steps=7.783e+07
2023-07-07 15:08:26,061 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9600, best=0.97, avg=0.96, std=0.00, steps=7.865e+07
2023-07-07 15:08:33,637 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9700, best=0.96, avg=0.96, std=0.00, steps=7.947e+07
2023-07-07 15:08:41,219 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9800, best=0.96, avg=0.96, std=0.00, steps=8.029e+07
2023-07-07 15:08:48,797 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 9900, best=0.96, avg=0.96, std=0.00, steps=8.111e+07
2023-07-07 15:08:56,357 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10000, best=0.96, avg=0.96, std=0.00, steps=8.193e+07
2023-07-07 15:09:03,924 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10100, best=0.96, avg=0.96, std=0.00, steps=8.275e+07
2023-07-07 15:09:11,501 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10200, best=0.96, avg=0.96, std=0.00, steps=8.357e+07
2023-07-07 15:09:19,059 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10300, best=0.96, avg=0.96, std=0.00, steps=8.439e+07
2023-07-07 15:09:26,642 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10400, best=0.96, avg=0.96, std=0.00, steps=8.520e+07
2023-07-07 15:09:34,225 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10500, best=0.96, avg=0.96, std=0.00, steps=8.602e+07
2023-07-07 15:09:41,787 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10600, best=0.96, avg=0.96, std=0.00, steps=8.684e+07
2023-07-07 15:09:49,371 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10700, best=0.96, avg=0.96, std=0.00, steps=8.766e+07
2023-07-07 15:09:56,930 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10800, best=0.96, avg=0.96, std=0.00, steps=8.848e+07
2023-07-07 15:10:04,496 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 10900, best=0.96, avg=0.96, std=0.00, steps=8.930e+07
2023-07-07 15:10:12,077 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11000, best=0.96, avg=0.96, std=0.00, steps=9.012e+07
2023-07-07 15:10:19,643 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11100, best=0.96, avg=0.96, std=0.00, steps=9.094e+07
2023-07-07 15:10:27,208 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11200, best=0.96, avg=0.96, std=0.00, steps=9.176e+07
2023-07-07 15:10:34,786 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11300, best=0.96, avg=0.96, std=0.00, steps=9.258e+07
2023-07-07 15:10:42,355 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11400, best=0.96, avg=0.96, std=0.00, steps=9.340e+07
2023-07-07 15:10:49,912 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11500, best=0.96, avg=0.96, std=0.00, steps=9.422e+07
2023-07-07 15:10:57,486 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11600, best=0.96, avg=0.96, std=0.00, steps=9.504e+07
2023-07-07 15:11:05,086 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11700, best=0.96, avg=0.96, std=0.00, steps=9.585e+07
2023-07-07 15:11:12,672 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11800, best=0.96, avg=0.96, std=0.00, steps=9.667e+07
2023-07-07 15:11:20,272 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11900, best=0.96, avg=0.96, std=0.00, steps=9.749e+07
2023-07-07 15:11:27,776 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 0, 0, [Train]: 11999, best=0.96, avg=0.96, std=0.00, steps=9.830e+07
2023-07-07 15:11:27,776 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135906
2023-07-07 15:11:27,801 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 15:11:27,834 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 15:11:39,612 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 100, best=0.51, avg=0.50, std=0.01, steps=1.034e+06
2023-07-07 15:11:48,978 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 200, best=0.51, avg=0.50, std=0.01, steps=2.058e+06
2023-07-07 15:11:58,365 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 300, best=0.51, avg=0.50, std=0.01, steps=3.082e+06
2023-07-07 15:12:07,765 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 400, best=0.52, avg=0.50, std=0.01, steps=4.106e+06
2023-07-07 15:12:17,156 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 500, best=0.51, avg=0.50, std=0.01, steps=5.130e+06
2023-07-07 15:12:26,545 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 600, best=0.51, avg=0.50, std=0.00, steps=6.154e+06
2023-07-07 15:12:35,958 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 700, best=0.52, avg=0.50, std=0.01, steps=7.178e+06
2023-07-07 15:12:45,383 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 800, best=0.51, avg=0.50, std=0.01, steps=8.202e+06
2023-07-07 15:12:54,778 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 900, best=0.51, avg=0.50, std=0.01, steps=9.226e+06
2023-07-07 15:13:04,182 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1000, best=0.51, avg=0.50, std=0.00, steps=1.025e+07
2023-07-07 15:13:13,557 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1100, best=0.51, avg=0.50, std=0.01, steps=1.127e+07
2023-07-07 15:13:22,940 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1200, best=0.51, avg=0.50, std=0.01, steps=1.230e+07
2023-07-07 15:13:32,348 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1300, best=0.52, avg=0.50, std=0.01, steps=1.332e+07
2023-07-07 15:13:41,746 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1400, best=0.51, avg=0.50, std=0.01, steps=1.435e+07
2023-07-07 15:13:51,154 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1500, best=0.51, avg=0.50, std=0.01, steps=1.537e+07
2023-07-07 15:14:00,564 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1600, best=0.51, avg=0.50, std=0.01, steps=1.639e+07
2023-07-07 15:14:09,962 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1700, best=0.52, avg=0.50, std=0.01, steps=1.742e+07
2023-07-07 15:14:19,358 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1800, best=0.51, avg=0.50, std=0.01, steps=1.844e+07
2023-07-07 15:14:28,746 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 1900, best=0.51, avg=0.50, std=0.01, steps=1.947e+07
2023-07-07 15:14:38,136 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2000, best=0.51, avg=0.50, std=0.01, steps=2.049e+07
2023-07-07 15:14:47,519 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2100, best=0.52, avg=0.50, std=0.01, steps=2.151e+07
2023-07-07 15:14:56,895 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2200, best=0.52, avg=0.50, std=0.01, steps=2.254e+07
2023-07-07 15:15:06,259 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2300, best=0.51, avg=0.50, std=0.01, steps=2.356e+07
2023-07-07 15:15:15,638 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2400, best=0.52, avg=0.50, std=0.01, steps=2.459e+07
2023-07-07 15:15:25,013 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2500, best=0.51, avg=0.50, std=0.01, steps=2.561e+07
2023-07-07 15:15:34,448 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2600, best=0.51, avg=0.50, std=0.01, steps=2.663e+07
2023-07-07 15:15:43,869 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2700, best=0.51, avg=0.50, std=0.01, steps=2.766e+07
2023-07-07 15:15:53,293 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2800, best=0.51, avg=0.50, std=0.01, steps=2.868e+07
2023-07-07 15:16:02,697 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 2900, best=0.51, avg=0.50, std=0.01, steps=2.971e+07
2023-07-07 15:16:12,079 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3000, best=0.51, avg=0.50, std=0.01, steps=3.073e+07
2023-07-07 15:16:21,457 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3100, best=0.51, avg=0.50, std=0.00, steps=3.175e+07
2023-07-07 15:16:30,840 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3200, best=0.51, avg=0.50, std=0.01, steps=3.278e+07
2023-07-07 15:16:40,253 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3300, best=0.52, avg=0.50, std=0.01, steps=3.380e+07
2023-07-07 15:16:49,656 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3400, best=0.51, avg=0.50, std=0.01, steps=3.483e+07
2023-07-07 15:16:59,068 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3500, best=0.52, avg=0.50, std=0.01, steps=3.585e+07
2023-07-07 15:17:08,467 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3600, best=0.52, avg=0.50, std=0.01, steps=3.687e+07
2023-07-07 15:17:17,854 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3700, best=0.51, avg=0.50, std=0.01, steps=3.790e+07
2023-07-07 15:17:27,254 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3800, best=0.51, avg=0.50, std=0.01, steps=3.892e+07
2023-07-07 15:17:36,654 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 3900, best=0.52, avg=0.50, std=0.01, steps=3.995e+07
2023-07-07 15:17:46,040 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4000, best=0.51, avg=0.50, std=0.01, steps=4.097e+07
2023-07-07 15:17:55,414 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4100, best=0.51, avg=0.50, std=0.01, steps=4.199e+07
2023-07-07 15:18:04,771 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4200, best=0.51, avg=0.50, std=0.01, steps=4.302e+07
2023-07-07 15:18:14,146 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4300, best=0.51, avg=0.50, std=0.01, steps=4.404e+07
2023-07-07 15:18:23,518 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4400, best=0.51, avg=0.50, std=0.01, steps=4.507e+07
2023-07-07 15:18:32,916 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4500, best=0.52, avg=0.50, std=0.01, steps=4.609e+07
2023-07-07 15:18:42,285 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4600, best=0.51, avg=0.50, std=0.01, steps=4.711e+07
2023-07-07 15:18:51,657 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4700, best=0.51, avg=0.50, std=0.01, steps=4.814e+07
2023-07-07 15:19:01,054 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4800, best=0.51, avg=0.50, std=0.01, steps=4.916e+07
2023-07-07 15:19:10,444 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 4900, best=0.51, avg=0.50, std=0.01, steps=5.019e+07
2023-07-07 15:19:19,834 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5000, best=0.52, avg=0.50, std=0.01, steps=5.121e+07
2023-07-07 15:19:29,240 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5100, best=0.51, avg=0.50, std=0.01, steps=5.223e+07
2023-07-07 15:19:38,645 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5200, best=0.51, avg=0.50, std=0.01, steps=5.326e+07
2023-07-07 15:19:48,048 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5300, best=0.52, avg=0.50, std=0.01, steps=5.428e+07
2023-07-07 15:19:57,434 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5400, best=0.52, avg=0.50, std=0.01, steps=5.531e+07
2023-07-07 15:20:06,863 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5500, best=0.51, avg=0.50, std=0.01, steps=5.633e+07
2023-07-07 15:20:16,277 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5600, best=0.52, avg=0.50, std=0.01, steps=5.735e+07
2023-07-07 15:20:25,677 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5700, best=0.52, avg=0.50, std=0.01, steps=5.838e+07
2023-07-07 15:20:35,054 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5800, best=0.52, avg=0.50, std=0.01, steps=5.940e+07
2023-07-07 15:20:44,423 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 5900, best=0.52, avg=0.50, std=0.00, steps=6.043e+07
2023-07-07 15:20:53,787 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6000, best=0.51, avg=0.50, std=0.01, steps=6.145e+07
2023-07-07 15:21:03,184 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6100, best=0.51, avg=0.50, std=0.01, steps=6.247e+07
2023-07-07 15:21:12,594 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6200, best=0.51, avg=0.50, std=0.01, steps=6.350e+07
2023-07-07 15:21:22,003 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6300, best=0.51, avg=0.50, std=0.01, steps=6.452e+07
2023-07-07 15:21:31,381 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6400, best=0.52, avg=0.50, std=0.01, steps=6.555e+07
2023-07-07 15:21:40,757 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6500, best=0.52, avg=0.50, std=0.01, steps=6.657e+07
2023-07-07 15:21:50,183 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6600, best=0.52, avg=0.50, std=0.01, steps=6.759e+07
2023-07-07 15:21:59,586 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6700, best=0.51, avg=0.50, std=0.00, steps=6.862e+07
2023-07-07 15:22:08,971 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6800, best=0.51, avg=0.50, std=0.00, steps=6.964e+07
2023-07-07 15:22:18,405 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 6900, best=0.51, avg=0.50, std=0.01, steps=7.067e+07
2023-07-07 15:22:27,791 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7000, best=0.52, avg=0.50, std=0.01, steps=7.169e+07
2023-07-07 15:22:37,180 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7100, best=0.52, avg=0.50, std=0.01, steps=7.271e+07
2023-07-07 15:22:46,592 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7200, best=0.52, avg=0.50, std=0.01, steps=7.374e+07
2023-07-07 15:22:55,996 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7300, best=0.51, avg=0.50, std=0.01, steps=7.476e+07
2023-07-07 15:23:05,390 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7400, best=0.51, avg=0.50, std=0.01, steps=7.579e+07
2023-07-07 15:23:14,765 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7500, best=0.52, avg=0.50, std=0.01, steps=7.681e+07
2023-07-07 15:23:24,148 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7600, best=0.52, avg=0.50, std=0.01, steps=7.783e+07
2023-07-07 15:23:33,518 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7700, best=0.51, avg=0.50, std=0.01, steps=7.886e+07
2023-07-07 15:23:42,874 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7800, best=0.51, avg=0.50, std=0.01, steps=7.988e+07
2023-07-07 15:23:52,258 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 7900, best=0.52, avg=0.50, std=0.01, steps=8.091e+07
2023-07-07 15:24:01,644 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8000, best=0.51, avg=0.50, std=0.01, steps=8.193e+07
2023-07-07 15:24:11,029 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8100, best=0.51, avg=0.50, std=0.01, steps=8.295e+07
2023-07-07 15:24:20,410 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8200, best=0.52, avg=0.50, std=0.01, steps=8.398e+07
2023-07-07 15:24:29,819 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8300, best=0.51, avg=0.50, std=0.01, steps=8.500e+07
2023-07-07 15:24:39,193 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8400, best=0.51, avg=0.50, std=0.01, steps=8.603e+07
2023-07-07 15:24:48,572 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8500, best=0.51, avg=0.50, std=0.01, steps=8.705e+07
2023-07-07 15:24:57,940 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8600, best=0.51, avg=0.50, std=0.01, steps=8.807e+07
2023-07-07 15:25:07,333 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8700, best=0.51, avg=0.50, std=0.01, steps=8.910e+07
2023-07-07 15:25:16,698 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8800, best=0.51, avg=0.50, std=0.01, steps=9.012e+07
2023-07-07 15:25:26,061 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 8900, best=0.52, avg=0.50, std=0.01, steps=9.115e+07
2023-07-07 15:25:35,435 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9000, best=0.52, avg=0.50, std=0.01, steps=9.217e+07
2023-07-07 15:25:44,821 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9100, best=0.51, avg=0.50, std=0.01, steps=9.319e+07
2023-07-07 15:25:54,188 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9200, best=0.52, avg=0.50, std=0.01, steps=9.422e+07
2023-07-07 15:26:03,553 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9300, best=0.51, avg=0.50, std=0.01, steps=9.524e+07
2023-07-07 15:26:12,914 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9400, best=0.52, avg=0.50, std=0.01, steps=9.627e+07
2023-07-07 15:26:22,312 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9500, best=0.52, avg=0.50, std=0.01, steps=9.729e+07
2023-07-07 15:26:31,682 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9600, best=0.52, avg=0.50, std=0.01, steps=9.831e+07
2023-07-07 15:26:41,065 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9700, best=0.51, avg=0.50, std=0.01, steps=9.934e+07
2023-07-07 15:26:50,449 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9800, best=0.51, avg=0.50, std=0.01, steps=1.004e+08
2023-07-07 15:26:59,824 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 9900, best=0.54, avg=0.53, std=0.01, steps=1.014e+08
2023-07-07 15:27:09,198 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10000, best=0.66, avg=0.65, std=0.00, steps=1.024e+08
2023-07-07 15:27:18,582 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10100, best=0.68, avg=0.67, std=0.00, steps=1.034e+08
2023-07-07 15:27:27,984 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10200, best=0.70, avg=0.69, std=0.00, steps=1.045e+08
2023-07-07 15:27:37,378 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10300, best=0.71, avg=0.70, std=0.00, steps=1.055e+08
2023-07-07 15:27:46,768 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10400, best=0.72, avg=0.71, std=0.00, steps=1.065e+08
2023-07-07 15:27:56,161 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10500, best=0.73, avg=0.72, std=0.00, steps=1.075e+08
2023-07-07 15:28:05,543 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10600, best=0.74, avg=0.73, std=0.00, steps=1.086e+08
2023-07-07 15:28:14,929 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10700, best=0.74, avg=0.73, std=0.00, steps=1.096e+08
2023-07-07 15:28:24,321 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10800, best=0.75, avg=0.74, std=0.00, steps=1.106e+08
2023-07-07 15:28:33,745 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 10900, best=0.76, avg=0.75, std=0.00, steps=1.116e+08
2023-07-07 15:28:43,145 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11000, best=0.76, avg=0.75, std=0.00, steps=1.127e+08
2023-07-07 15:28:52,520 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11100, best=0.77, avg=0.76, std=0.00, steps=1.137e+08
2023-07-07 15:29:01,888 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11200, best=0.77, avg=0.76, std=0.00, steps=1.147e+08
2023-07-07 15:29:11,270 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11300, best=0.77, avg=0.76, std=0.00, steps=1.157e+08
2023-07-07 15:29:20,658 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11400, best=0.77, avg=0.77, std=0.00, steps=1.167e+08
2023-07-07 15:29:30,042 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11500, best=0.78, avg=0.77, std=0.00, steps=1.178e+08
2023-07-07 15:29:39,406 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11600, best=0.78, avg=0.77, std=0.00, steps=1.188e+08
2023-07-07 15:29:48,790 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11700, best=0.78, avg=0.77, std=0.00, steps=1.198e+08
2023-07-07 15:29:58,187 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11800, best=0.79, avg=0.78, std=0.00, steps=1.208e+08
2023-07-07 15:30:07,568 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11900, best=0.79, avg=0.78, std=0.00, steps=1.219e+08
2023-07-07 15:30:16,884 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 8, 0, [Train]: 11999, best=0.79, avg=0.78, std=0.00, steps=1.229e+08
2023-07-07 15:30:16,884 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135906
2023-07-07 15:30:16,910 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 15:30:16,946 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 15:30:30,446 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 100, best=0.58, avg=0.57, std=0.01, steps=1.241e+06
2023-07-07 15:30:41,625 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 200, best=0.63, avg=0.63, std=0.00, steps=2.470e+06
2023-07-07 15:30:52,827 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 300, best=0.66, avg=0.65, std=0.00, steps=3.699e+06
2023-07-07 15:31:04,005 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 400, best=0.67, avg=0.66, std=0.00, steps=4.927e+06
2023-07-07 15:31:15,223 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 500, best=0.69, avg=0.68, std=0.00, steps=6.156e+06
2023-07-07 15:31:26,422 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 600, best=0.71, avg=0.70, std=0.00, steps=7.385e+06
2023-07-07 15:31:37,606 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 700, best=0.72, avg=0.71, std=0.00, steps=8.614e+06
2023-07-07 15:31:48,805 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 800, best=0.73, avg=0.72, std=0.00, steps=9.843e+06
2023-07-07 15:32:00,002 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 900, best=0.74, avg=0.73, std=0.00, steps=1.107e+07
2023-07-07 15:32:11,211 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1000, best=0.75, avg=0.73, std=0.00, steps=1.230e+07
2023-07-07 15:32:22,397 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1100, best=0.76, avg=0.74, std=0.00, steps=1.353e+07
2023-07-07 15:32:33,601 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1200, best=0.75, avg=0.74, std=0.00, steps=1.476e+07
2023-07-07 15:32:44,799 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1300, best=0.76, avg=0.75, std=0.00, steps=1.599e+07
2023-07-07 15:32:55,994 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1400, best=0.76, avg=0.75, std=0.00, steps=1.722e+07
2023-07-07 15:33:07,198 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1500, best=0.76, avg=0.75, std=0.00, steps=1.844e+07
2023-07-07 15:33:18,397 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1600, best=0.77, avg=0.76, std=0.00, steps=1.967e+07
2023-07-07 15:33:29,626 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1700, best=0.77, avg=0.76, std=0.00, steps=2.090e+07
2023-07-07 15:33:40,847 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1800, best=0.77, avg=0.76, std=0.00, steps=2.213e+07
2023-07-07 15:33:52,038 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 1900, best=0.77, avg=0.76, std=0.00, steps=2.336e+07
2023-07-07 15:34:03,229 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2000, best=0.78, avg=0.77, std=0.00, steps=2.459e+07
2023-07-07 15:34:14,414 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2100, best=0.79, avg=0.77, std=0.00, steps=2.582e+07
2023-07-07 15:34:25,602 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2200, best=0.78, avg=0.77, std=0.00, steps=2.705e+07
2023-07-07 15:34:36,802 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2300, best=0.79, avg=0.78, std=0.00, steps=2.827e+07
2023-07-07 15:34:47,982 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2400, best=0.79, avg=0.78, std=0.00, steps=2.950e+07
2023-07-07 15:34:59,172 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2500, best=0.79, avg=0.78, std=0.00, steps=3.073e+07
2023-07-07 15:35:10,386 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2600, best=0.79, avg=0.78, std=0.00, steps=3.196e+07
2023-07-07 15:35:21,566 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2700, best=0.80, avg=0.78, std=0.00, steps=3.319e+07
2023-07-07 15:35:32,755 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2800, best=0.79, avg=0.78, std=0.00, steps=3.442e+07
2023-07-07 15:35:43,947 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 2900, best=0.80, avg=0.79, std=0.00, steps=3.565e+07
2023-07-07 15:35:55,174 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3000, best=0.80, avg=0.79, std=0.00, steps=3.688e+07
2023-07-07 15:36:06,358 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3100, best=0.80, avg=0.79, std=0.00, steps=3.811e+07
2023-07-07 15:36:17,550 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3200, best=0.80, avg=0.79, std=0.00, steps=3.933e+07
2023-07-07 15:36:28,734 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3300, best=0.80, avg=0.79, std=0.00, steps=4.056e+07
2023-07-07 15:36:39,907 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3400, best=0.80, avg=0.79, std=0.00, steps=4.179e+07
2023-07-07 15:36:51,140 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3500, best=0.80, avg=0.80, std=0.00, steps=4.302e+07
2023-07-07 15:37:02,339 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3600, best=0.80, avg=0.80, std=0.00, steps=4.425e+07
2023-07-07 15:37:13,536 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3700, best=0.81, avg=0.80, std=0.00, steps=4.548e+07
2023-07-07 15:37:24,704 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3800, best=0.81, avg=0.80, std=0.00, steps=4.671e+07
2023-07-07 15:37:35,878 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 3900, best=0.81, avg=0.80, std=0.00, steps=4.794e+07
2023-07-07 15:37:47,072 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4000, best=0.81, avg=0.80, std=0.00, steps=4.916e+07
2023-07-07 15:37:58,270 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4100, best=0.81, avg=0.80, std=0.00, steps=5.039e+07
2023-07-07 15:38:09,488 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4200, best=0.81, avg=0.80, std=0.00, steps=5.162e+07
2023-07-07 15:38:20,695 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4300, best=0.81, avg=0.80, std=0.00, steps=5.285e+07
2023-07-07 15:38:31,877 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4400, best=0.81, avg=0.80, std=0.00, steps=5.408e+07
2023-07-07 15:38:43,079 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4500, best=0.81, avg=0.80, std=0.00, steps=5.531e+07
2023-07-07 15:38:54,255 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4600, best=0.81, avg=0.80, std=0.00, steps=5.654e+07
2023-07-07 15:39:05,463 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4700, best=0.81, avg=0.80, std=0.00, steps=5.777e+07
2023-07-07 15:39:16,673 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4800, best=0.81, avg=0.80, std=0.00, steps=5.899e+07
2023-07-07 15:39:27,867 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 4900, best=0.81, avg=0.80, std=0.00, steps=6.022e+07
2023-07-07 15:39:39,069 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5000, best=0.81, avg=0.80, std=0.00, steps=6.145e+07
2023-07-07 15:39:50,250 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5100, best=0.81, avg=0.80, std=0.00, steps=6.268e+07
2023-07-07 15:40:01,454 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5200, best=0.81, avg=0.80, std=0.00, steps=6.391e+07
2023-07-07 15:40:12,627 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5300, best=0.81, avg=0.80, std=0.00, steps=6.514e+07
2023-07-07 15:40:23,786 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5400, best=0.81, avg=0.81, std=0.00, steps=6.637e+07
2023-07-07 15:40:34,975 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5500, best=0.81, avg=0.81, std=0.00, steps=6.760e+07
2023-07-07 15:40:46,209 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5600, best=0.81, avg=0.81, std=0.00, steps=6.883e+07
2023-07-07 15:40:57,410 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5700, best=0.82, avg=0.81, std=0.00, steps=7.005e+07
2023-07-07 15:41:08,598 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5800, best=0.81, avg=0.81, std=0.00, steps=7.128e+07
2023-07-07 15:41:19,815 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 5900, best=0.81, avg=0.81, std=0.00, steps=7.251e+07
2023-07-07 15:41:31,023 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6000, best=0.82, avg=0.81, std=0.00, steps=7.374e+07
2023-07-07 15:41:42,271 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6100, best=0.82, avg=0.81, std=0.00, steps=7.497e+07
2023-07-07 15:41:53,495 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6200, best=0.82, avg=0.81, std=0.00, steps=7.620e+07
2023-07-07 15:42:04,676 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6300, best=0.82, avg=0.81, std=0.00, steps=7.743e+07
2023-07-07 15:42:15,873 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6400, best=0.82, avg=0.81, std=0.00, steps=7.866e+07
2023-07-07 15:42:27,063 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6500, best=0.82, avg=0.81, std=0.00, steps=7.988e+07
2023-07-07 15:42:38,257 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6600, best=0.82, avg=0.81, std=0.00, steps=8.111e+07
2023-07-07 15:42:49,460 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6700, best=0.82, avg=0.81, std=0.00, steps=8.234e+07
2023-07-07 15:43:00,666 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6800, best=0.82, avg=0.81, std=0.00, steps=8.357e+07
2023-07-07 15:43:11,870 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 6900, best=0.82, avg=0.81, std=0.00, steps=8.480e+07
2023-07-07 15:43:23,054 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7000, best=0.82, avg=0.81, std=0.00, steps=8.603e+07
2023-07-07 15:43:34,251 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7100, best=0.82, avg=0.81, std=0.00, steps=8.726e+07
2023-07-07 15:43:45,445 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7200, best=0.82, avg=0.81, std=0.00, steps=8.849e+07
2023-07-07 15:43:56,633 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7300, best=0.82, avg=0.81, std=0.00, steps=8.971e+07
2023-07-07 15:44:07,846 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7400, best=0.82, avg=0.81, std=0.00, steps=9.094e+07
2023-07-07 15:44:19,048 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7500, best=0.82, avg=0.81, std=0.00, steps=9.217e+07
2023-07-07 15:44:30,272 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7600, best=0.82, avg=0.81, std=0.00, steps=9.340e+07
2023-07-07 15:44:41,499 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7700, best=0.82, avg=0.81, std=0.00, steps=9.463e+07
2023-07-07 15:44:52,713 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7800, best=0.82, avg=0.81, std=0.00, steps=9.586e+07
2023-07-07 15:45:03,910 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 7900, best=0.82, avg=0.81, std=0.00, steps=9.709e+07
2023-07-07 15:45:15,095 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8000, best=0.82, avg=0.81, std=0.00, steps=9.832e+07
2023-07-07 15:45:26,289 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8100, best=0.82, avg=0.81, std=0.00, steps=9.955e+07
2023-07-07 15:45:37,503 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8200, best=0.82, avg=0.81, std=0.00, steps=1.008e+08
2023-07-07 15:45:48,713 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8300, best=0.82, avg=0.81, std=0.00, steps=1.020e+08
2023-07-07 15:45:59,910 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8400, best=0.82, avg=0.81, std=0.00, steps=1.032e+08
2023-07-07 15:46:11,146 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8500, best=0.82, avg=0.81, std=0.00, steps=1.045e+08
2023-07-07 15:46:22,398 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8600, best=0.82, avg=0.81, std=0.00, steps=1.057e+08
2023-07-07 15:46:33,603 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8700, best=0.82, avg=0.81, std=0.00, steps=1.069e+08
2023-07-07 15:46:44,815 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8800, best=0.82, avg=0.81, std=0.00, steps=1.081e+08
2023-07-07 15:46:56,031 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 8900, best=0.82, avg=0.81, std=0.00, steps=1.094e+08
2023-07-07 15:47:07,218 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9000, best=0.82, avg=0.81, std=0.00, steps=1.106e+08
2023-07-07 15:47:18,419 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9100, best=0.82, avg=0.81, std=0.00, steps=1.118e+08
2023-07-07 15:47:29,619 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9200, best=0.82, avg=0.81, std=0.00, steps=1.131e+08
2023-07-07 15:47:40,814 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9300, best=0.82, avg=0.81, std=0.00, steps=1.143e+08
2023-07-07 15:47:52,020 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9400, best=0.82, avg=0.81, std=0.00, steps=1.155e+08
2023-07-07 15:48:03,218 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9500, best=0.82, avg=0.81, std=0.00, steps=1.167e+08
2023-07-07 15:48:14,404 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9600, best=0.82, avg=0.81, std=0.00, steps=1.180e+08
2023-07-07 15:48:25,601 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9700, best=0.82, avg=0.81, std=0.00, steps=1.192e+08
2023-07-07 15:48:36,795 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9800, best=0.82, avg=0.81, std=0.00, steps=1.204e+08
2023-07-07 15:48:48,027 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 9900, best=0.82, avg=0.81, std=0.00, steps=1.217e+08
2023-07-07 15:48:59,216 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10000, best=0.82, avg=0.81, std=0.00, steps=1.229e+08
2023-07-07 15:49:10,403 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10100, best=0.82, avg=0.81, std=0.00, steps=1.241e+08
2023-07-07 15:49:21,583 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10200, best=0.83, avg=0.82, std=0.00, steps=1.253e+08
2023-07-07 15:49:32,816 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10300, best=0.83, avg=0.82, std=0.00, steps=1.266e+08
2023-07-07 15:49:44,019 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10400, best=0.82, avg=0.81, std=0.00, steps=1.278e+08
2023-07-07 15:49:55,216 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10500, best=0.82, avg=0.82, std=0.00, steps=1.290e+08
2023-07-07 15:50:06,390 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10600, best=0.82, avg=0.82, std=0.00, steps=1.303e+08
2023-07-07 15:50:17,597 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10700, best=0.82, avg=0.82, std=0.00, steps=1.315e+08
2023-07-07 15:50:28,799 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10800, best=0.82, avg=0.81, std=0.00, steps=1.327e+08
2023-07-07 15:50:40,037 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 10900, best=0.82, avg=0.82, std=0.00, steps=1.340e+08
2023-07-07 15:50:51,252 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11000, best=0.82, avg=0.82, std=0.00, steps=1.352e+08
2023-07-07 15:51:02,460 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11100, best=0.83, avg=0.82, std=0.00, steps=1.364e+08
2023-07-07 15:51:13,672 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11200, best=0.82, avg=0.82, std=0.00, steps=1.376e+08
2023-07-07 15:51:24,897 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11300, best=0.82, avg=0.82, std=0.00, steps=1.389e+08
2023-07-07 15:51:36,104 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11400, best=0.83, avg=0.82, std=0.00, steps=1.401e+08
2023-07-07 15:51:47,304 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11500, best=0.83, avg=0.82, std=0.00, steps=1.413e+08
2023-07-07 15:51:58,522 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11600, best=0.82, avg=0.82, std=0.00, steps=1.426e+08
2023-07-07 15:52:09,731 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11700, best=0.82, avg=0.82, std=0.00, steps=1.438e+08
2023-07-07 15:52:20,948 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11800, best=0.82, avg=0.82, std=0.00, steps=1.450e+08
2023-07-07 15:52:32,164 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11900, best=0.82, avg=0.82, std=0.00, steps=1.462e+08
2023-07-07 15:52:43,258 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 16, 0, [Train]: 11999, best=0.82, avg=0.82, std=0.00, steps=1.475e+08
2023-07-07 15:52:43,259 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135906
2023-07-07 15:52:43,285 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 15:52:43,318 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 15:53:00,430 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 100, best=0.51, avg=0.50, std=0.01, steps=1.655e+06
2023-07-07 15:53:15,244 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 200, best=0.52, avg=0.50, std=0.01, steps=3.293e+06
2023-07-07 15:53:30,092 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 300, best=0.52, avg=0.50, std=0.01, steps=4.932e+06
2023-07-07 15:53:44,965 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 400, best=0.51, avg=0.50, std=0.00, steps=6.570e+06
2023-07-07 15:53:59,812 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 500, best=0.52, avg=0.51, std=0.01, steps=8.208e+06
2023-07-07 15:54:14,673 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 600, best=0.65, avg=0.64, std=0.00, steps=9.847e+06
2023-07-07 15:54:29,515 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 700, best=0.66, avg=0.65, std=0.00, steps=1.149e+07
2023-07-07 15:54:44,342 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 800, best=0.67, avg=0.66, std=0.00, steps=1.312e+07
2023-07-07 15:54:59,187 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 900, best=0.68, avg=0.67, std=0.00, steps=1.476e+07
2023-07-07 15:55:14,028 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1000, best=0.68, avg=0.67, std=0.00, steps=1.640e+07
2023-07-07 15:55:28,883 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1100, best=0.68, avg=0.67, std=0.00, steps=1.804e+07
2023-07-07 15:55:43,729 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1200, best=0.68, avg=0.67, std=0.00, steps=1.968e+07
2023-07-07 15:55:58,553 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1300, best=0.68, avg=0.67, std=0.00, steps=2.132e+07
2023-07-07 15:56:13,389 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1400, best=0.69, avg=0.68, std=0.00, steps=2.295e+07
2023-07-07 15:56:28,227 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1500, best=0.69, avg=0.68, std=0.00, steps=2.459e+07
2023-07-07 15:56:43,096 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1600, best=0.69, avg=0.68, std=0.00, steps=2.623e+07
2023-07-07 15:56:57,946 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1700, best=0.69, avg=0.68, std=0.00, steps=2.787e+07
2023-07-07 15:57:12,776 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1800, best=0.69, avg=0.68, std=0.00, steps=2.951e+07
2023-07-07 15:57:27,598 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 1900, best=0.69, avg=0.68, std=0.00, steps=3.115e+07
2023-07-07 15:57:42,467 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2000, best=0.69, avg=0.68, std=0.00, steps=3.278e+07
2023-07-07 15:57:57,290 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2100, best=0.70, avg=0.69, std=0.00, steps=3.442e+07
2023-07-07 15:58:12,130 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2200, best=0.70, avg=0.69, std=0.00, steps=3.606e+07
2023-07-07 15:58:26,966 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2300, best=0.70, avg=0.69, std=0.00, steps=3.770e+07
2023-07-07 15:58:41,819 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2400, best=0.70, avg=0.69, std=0.00, steps=3.934e+07
2023-07-07 15:58:56,646 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2500, best=0.70, avg=0.69, std=0.00, steps=4.098e+07
2023-07-07 15:59:11,484 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2600, best=0.70, avg=0.69, std=0.00, steps=4.261e+07
2023-07-07 15:59:26,301 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2700, best=0.72, avg=0.71, std=0.00, steps=4.425e+07
2023-07-07 15:59:41,137 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2800, best=0.72, avg=0.71, std=0.00, steps=4.589e+07
2023-07-07 15:59:55,964 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 2900, best=0.72, avg=0.71, std=0.00, steps=4.753e+07
2023-07-07 16:00:10,791 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3000, best=0.73, avg=0.71, std=0.00, steps=4.917e+07
2023-07-07 16:00:25,618 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3100, best=0.73, avg=0.72, std=0.00, steps=5.081e+07
2023-07-07 16:00:40,449 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3200, best=0.74, avg=0.72, std=0.00, steps=5.245e+07
2023-07-07 16:00:55,282 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3300, best=0.74, avg=0.73, std=0.00, steps=5.408e+07
2023-07-07 16:01:10,113 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3400, best=0.74, avg=0.73, std=0.00, steps=5.572e+07
2023-07-07 16:01:24,915 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3500, best=0.74, avg=0.73, std=0.00, steps=5.736e+07
2023-07-07 16:01:39,722 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3600, best=0.75, avg=0.73, std=0.00, steps=5.900e+07
2023-07-07 16:01:54,545 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3700, best=0.74, avg=0.73, std=0.00, steps=6.064e+07
2023-07-07 16:02:09,358 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3800, best=0.75, avg=0.74, std=0.00, steps=6.228e+07
2023-07-07 16:02:24,171 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 3900, best=0.74, avg=0.74, std=0.00, steps=6.391e+07
2023-07-07 16:02:39,006 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4000, best=0.75, avg=0.74, std=0.00, steps=6.555e+07
2023-07-07 16:02:53,847 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4100, best=0.75, avg=0.74, std=0.00, steps=6.719e+07
2023-07-07 16:03:08,679 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4200, best=0.76, avg=0.74, std=0.00, steps=6.883e+07
2023-07-07 16:03:23,489 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4300, best=0.76, avg=0.75, std=0.00, steps=7.047e+07
2023-07-07 16:03:38,334 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4400, best=0.76, avg=0.75, std=0.00, steps=7.211e+07
2023-07-07 16:03:53,212 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4500, best=0.76, avg=0.75, std=0.00, steps=7.374e+07
2023-07-07 16:04:08,072 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4600, best=0.76, avg=0.75, std=0.00, steps=7.538e+07
2023-07-07 16:04:22,899 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4700, best=0.76, avg=0.75, std=0.00, steps=7.702e+07
2023-07-07 16:04:37,731 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4800, best=0.76, avg=0.75, std=0.00, steps=7.866e+07
2023-07-07 16:04:52,552 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 4900, best=0.76, avg=0.75, std=0.00, steps=8.030e+07
2023-07-07 16:05:07,390 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5000, best=0.77, avg=0.76, std=0.00, steps=8.194e+07
2023-07-07 16:05:22,221 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5100, best=0.77, avg=0.76, std=0.00, steps=8.357e+07
2023-07-07 16:05:37,048 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5200, best=0.77, avg=0.76, std=0.00, steps=8.521e+07
2023-07-07 16:05:51,873 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5300, best=0.77, avg=0.76, std=0.00, steps=8.685e+07
2023-07-07 16:06:06,709 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5400, best=0.77, avg=0.76, std=0.00, steps=8.849e+07
2023-07-07 16:06:21,528 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5500, best=0.77, avg=0.76, std=0.00, steps=9.013e+07
2023-07-07 16:06:36,340 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5600, best=0.77, avg=0.76, std=0.00, steps=9.177e+07
2023-07-07 16:06:51,145 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5700, best=0.78, avg=0.76, std=0.00, steps=9.341e+07
2023-07-07 16:07:05,963 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5800, best=0.78, avg=0.77, std=0.00, steps=9.504e+07
2023-07-07 16:07:20,832 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 5900, best=0.78, avg=0.77, std=0.00, steps=9.668e+07
2023-07-07 16:07:35,689 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6000, best=0.78, avg=0.77, std=0.00, steps=9.832e+07
2023-07-07 16:07:50,581 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6100, best=0.78, avg=0.77, std=0.00, steps=9.996e+07
2023-07-07 16:08:05,418 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6200, best=0.78, avg=0.77, std=0.00, steps=1.016e+08
2023-07-07 16:08:20,233 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6300, best=0.78, avg=0.77, std=0.00, steps=1.032e+08
2023-07-07 16:08:35,047 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6400, best=0.78, avg=0.77, std=0.00, steps=1.049e+08
2023-07-07 16:08:49,857 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6500, best=0.78, avg=0.77, std=0.00, steps=1.065e+08
2023-07-07 16:09:04,677 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6600, best=0.79, avg=0.78, std=0.00, steps=1.082e+08
2023-07-07 16:09:19,516 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6700, best=0.78, avg=0.78, std=0.00, steps=1.098e+08
2023-07-07 16:09:34,343 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6800, best=0.79, avg=0.78, std=0.00, steps=1.114e+08
2023-07-07 16:09:49,175 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 6900, best=0.79, avg=0.78, std=0.00, steps=1.131e+08
2023-07-07 16:10:04,003 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7000, best=0.79, avg=0.78, std=0.00, steps=1.147e+08
2023-07-07 16:10:18,829 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7100, best=0.79, avg=0.78, std=0.00, steps=1.163e+08
2023-07-07 16:10:33,656 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7200, best=0.79, avg=0.78, std=0.00, steps=1.180e+08
2023-07-07 16:10:48,480 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7300, best=0.79, avg=0.78, std=0.00, steps=1.196e+08
2023-07-07 16:11:03,273 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7400, best=0.79, avg=0.78, std=0.00, steps=1.213e+08
2023-07-07 16:11:18,065 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7500, best=0.79, avg=0.78, std=0.00, steps=1.229e+08
2023-07-07 16:11:32,875 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7600, best=0.80, avg=0.78, std=0.00, steps=1.245e+08
2023-07-07 16:11:47,690 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7700, best=0.79, avg=0.78, std=0.00, steps=1.262e+08
2023-07-07 16:12:02,520 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7800, best=0.79, avg=0.78, std=0.00, steps=1.278e+08
2023-07-07 16:12:17,411 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 7900, best=0.79, avg=0.78, std=0.00, steps=1.294e+08
2023-07-07 16:12:32,233 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8000, best=0.80, avg=0.79, std=0.00, steps=1.311e+08
2023-07-07 16:12:47,063 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8100, best=0.80, avg=0.79, std=0.00, steps=1.327e+08
2023-07-07 16:13:01,906 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8200, best=0.80, avg=0.79, std=0.00, steps=1.344e+08
2023-07-07 16:13:16,757 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8300, best=0.80, avg=0.79, std=0.00, steps=1.360e+08
2023-07-07 16:13:31,588 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8400, best=0.80, avg=0.79, std=0.00, steps=1.376e+08
2023-07-07 16:13:46,414 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8500, best=0.80, avg=0.79, std=0.00, steps=1.393e+08
2023-07-07 16:14:01,229 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8600, best=0.80, avg=0.79, std=0.00, steps=1.409e+08
2023-07-07 16:14:16,058 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8700, best=0.80, avg=0.79, std=0.00, steps=1.426e+08
2023-07-07 16:14:30,870 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8800, best=0.80, avg=0.79, std=0.00, steps=1.442e+08
2023-07-07 16:14:45,680 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 8900, best=0.80, avg=0.79, std=0.00, steps=1.458e+08
2023-07-07 16:15:00,509 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9000, best=0.80, avg=0.79, std=0.00, steps=1.475e+08
2023-07-07 16:15:15,336 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9100, best=0.80, avg=0.79, std=0.00, steps=1.491e+08
2023-07-07 16:15:30,167 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9200, best=0.80, avg=0.79, std=0.00, steps=1.507e+08
2023-07-07 16:15:45,018 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9300, best=0.80, avg=0.79, std=0.00, steps=1.524e+08
2023-07-07 16:15:59,816 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9400, best=0.80, avg=0.79, std=0.00, steps=1.540e+08
2023-07-07 16:16:14,624 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9500, best=0.80, avg=0.79, std=0.00, steps=1.557e+08
2023-07-07 16:16:29,446 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9600, best=0.80, avg=0.79, std=0.00, steps=1.573e+08
2023-07-07 16:16:44,294 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9700, best=0.80, avg=0.79, std=0.00, steps=1.589e+08
2023-07-07 16:16:59,141 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9800, best=0.80, avg=0.79, std=0.00, steps=1.606e+08
2023-07-07 16:17:13,998 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 9900, best=0.81, avg=0.79, std=0.00, steps=1.622e+08
2023-07-07 16:17:28,821 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10000, best=0.80, avg=0.79, std=0.00, steps=1.639e+08
2023-07-07 16:17:43,641 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10100, best=0.81, avg=0.80, std=0.00, steps=1.655e+08
2023-07-07 16:17:58,503 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10200, best=0.81, avg=0.80, std=0.00, steps=1.671e+08
2023-07-07 16:18:13,354 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10300, best=0.81, avg=0.80, std=0.00, steps=1.688e+08
2023-07-07 16:18:28,174 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10400, best=0.81, avg=0.80, std=0.00, steps=1.704e+08
2023-07-07 16:18:43,019 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10500, best=0.80, avg=0.80, std=0.00, steps=1.720e+08
2023-07-07 16:18:57,852 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10600, best=0.81, avg=0.80, std=0.00, steps=1.737e+08
2023-07-07 16:19:12,667 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10700, best=0.81, avg=0.80, std=0.00, steps=1.753e+08
2023-07-07 16:19:27,478 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10800, best=0.81, avg=0.80, std=0.00, steps=1.770e+08
2023-07-07 16:19:42,314 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 10900, best=0.81, avg=0.80, std=0.00, steps=1.786e+08
2023-07-07 16:19:57,179 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11000, best=0.81, avg=0.80, std=0.00, steps=1.802e+08
2023-07-07 16:20:12,008 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11100, best=0.81, avg=0.80, std=0.00, steps=1.819e+08
2023-07-07 16:20:26,816 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11200, best=0.81, avg=0.80, std=0.00, steps=1.835e+08
2023-07-07 16:20:41,634 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11300, best=0.81, avg=0.80, std=0.00, steps=1.852e+08
2023-07-07 16:20:56,452 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11400, best=0.81, avg=0.80, std=0.00, steps=1.868e+08
2023-07-07 16:21:11,306 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11500, best=0.81, avg=0.80, std=0.00, steps=1.884e+08
2023-07-07 16:21:26,138 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11600, best=0.81, avg=0.80, std=0.00, steps=1.901e+08
2023-07-07 16:21:40,968 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11700, best=0.81, avg=0.80, std=0.00, steps=1.917e+08
2023-07-07 16:21:55,809 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11800, best=0.81, avg=0.80, std=0.00, steps=1.933e+08
2023-07-07 16:22:10,660 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11900, best=0.81, avg=0.80, std=0.00, steps=1.950e+08
2023-07-07 16:22:25,384 -        meta learning: [    INFO] - [Len Lat Rep]: 16, 32, 0, [Train]: 11999, best=0.81, avg=0.80, std=0.00, steps=1.966e+08
2023-07-07 16:22:25,385 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135906
2023-07-07 16:22:25,410 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 16:22:25,444 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 16:22:42,636 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=1.655e+06
2023-07-07 16:22:57,481 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 200, best=0.60, avg=0.60, std=0.00, steps=3.293e+06
2023-07-07 16:23:12,360 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 300, best=0.65, avg=0.64, std=0.00, steps=4.932e+06
2023-07-07 16:23:27,231 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 400, best=0.67, avg=0.66, std=0.00, steps=6.570e+06
2023-07-07 16:23:42,093 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 500, best=0.68, avg=0.67, std=0.00, steps=8.208e+06
2023-07-07 16:23:57,064 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 600, best=0.69, avg=0.68, std=0.00, steps=9.847e+06
2023-07-07 16:24:11,979 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 700, best=0.70, avg=0.69, std=0.00, steps=1.149e+07
2023-07-07 16:24:26,820 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 800, best=0.70, avg=0.69, std=0.00, steps=1.312e+07
2023-07-07 16:24:41,669 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 900, best=0.71, avg=0.70, std=0.00, steps=1.476e+07
2023-07-07 16:24:56,640 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1000, best=0.71, avg=0.70, std=0.00, steps=1.640e+07
2023-07-07 16:25:11,527 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1100, best=0.72, avg=0.71, std=0.00, steps=1.804e+07
2023-07-07 16:25:26,408 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1200, best=0.72, avg=0.71, std=0.00, steps=1.968e+07
2023-07-07 16:25:41,350 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1300, best=0.72, avg=0.71, std=0.00, steps=2.132e+07
2023-07-07 16:25:56,249 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1400, best=0.72, avg=0.72, std=0.00, steps=2.295e+07
2023-07-07 16:26:11,121 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1500, best=0.73, avg=0.72, std=0.00, steps=2.459e+07
2023-07-07 16:26:25,978 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1600, best=0.73, avg=0.72, std=0.00, steps=2.623e+07
2023-07-07 16:26:40,832 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1700, best=0.73, avg=0.72, std=0.00, steps=2.787e+07
2023-07-07 16:26:55,794 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1800, best=0.73, avg=0.73, std=0.00, steps=2.951e+07
2023-07-07 16:27:10,668 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 1900, best=0.74, avg=0.73, std=0.00, steps=3.115e+07
2023-07-07 16:27:25,507 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2000, best=0.74, avg=0.73, std=0.00, steps=3.278e+07
2023-07-07 16:27:40,355 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2100, best=0.74, avg=0.73, std=0.00, steps=3.442e+07
2023-07-07 16:27:55,172 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2200, best=0.75, avg=0.73, std=0.00, steps=3.606e+07
2023-07-07 16:28:10,022 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2300, best=0.74, avg=0.74, std=0.00, steps=3.770e+07
2023-07-07 16:28:24,876 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2400, best=0.75, avg=0.74, std=0.00, steps=3.934e+07
2023-07-07 16:28:39,719 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2500, best=0.75, avg=0.74, std=0.00, steps=4.098e+07
2023-07-07 16:28:54,553 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2600, best=0.75, avg=0.74, std=0.00, steps=4.261e+07
2023-07-07 16:29:09,406 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2700, best=0.75, avg=0.74, std=0.00, steps=4.425e+07
2023-07-07 16:29:24,258 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2800, best=0.75, avg=0.74, std=0.00, steps=4.589e+07
2023-07-07 16:29:39,110 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 2900, best=0.75, avg=0.74, std=0.00, steps=4.753e+07
2023-07-07 16:29:53,980 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3000, best=0.75, avg=0.74, std=0.00, steps=4.917e+07
2023-07-07 16:30:08,820 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3100, best=0.75, avg=0.75, std=0.00, steps=5.081e+07
2023-07-07 16:30:23,674 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3200, best=0.76, avg=0.75, std=0.00, steps=5.245e+07
2023-07-07 16:30:38,535 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3300, best=0.75, avg=0.75, std=0.00, steps=5.408e+07
2023-07-07 16:30:53,393 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3400, best=0.75, avg=0.75, std=0.00, steps=5.572e+07
2023-07-07 16:31:08,235 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3500, best=0.76, avg=0.75, std=0.00, steps=5.736e+07
2023-07-07 16:31:23,081 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3600, best=0.76, avg=0.75, std=0.00, steps=5.900e+07
2023-07-07 16:31:37,932 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3700, best=0.76, avg=0.75, std=0.00, steps=6.064e+07
2023-07-07 16:31:52,777 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3800, best=0.76, avg=0.75, std=0.00, steps=6.228e+07
2023-07-07 16:32:07,626 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 3900, best=0.76, avg=0.75, std=0.00, steps=6.391e+07
2023-07-07 16:32:22,488 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4000, best=0.76, avg=0.76, std=0.00, steps=6.555e+07
2023-07-07 16:32:37,329 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4100, best=0.76, avg=0.76, std=0.00, steps=6.719e+07
2023-07-07 16:32:52,180 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4200, best=0.76, avg=0.76, std=0.00, steps=6.883e+07
2023-07-07 16:33:07,020 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4300, best=0.77, avg=0.76, std=0.00, steps=7.047e+07
2023-07-07 16:33:21,864 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4400, best=0.77, avg=0.76, std=0.00, steps=7.211e+07
2023-07-07 16:33:36,749 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4500, best=0.77, avg=0.76, std=0.00, steps=7.374e+07
2023-07-07 16:33:51,669 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4600, best=0.77, avg=0.76, std=0.00, steps=7.538e+07
2023-07-07 16:34:06,534 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4700, best=0.77, avg=0.76, std=0.00, steps=7.702e+07
2023-07-07 16:34:21,366 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4800, best=0.77, avg=0.76, std=0.00, steps=7.866e+07
2023-07-07 16:34:36,229 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 4900, best=0.77, avg=0.76, std=0.00, steps=8.030e+07
2023-07-07 16:34:51,078 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5000, best=0.77, avg=0.76, std=0.00, steps=8.194e+07
2023-07-07 16:35:05,918 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5100, best=0.77, avg=0.76, std=0.00, steps=8.357e+07
2023-07-07 16:35:20,784 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5200, best=0.77, avg=0.76, std=0.00, steps=8.521e+07
2023-07-07 16:35:35,634 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5300, best=0.77, avg=0.76, std=0.00, steps=8.685e+07
2023-07-07 16:35:50,504 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5400, best=0.77, avg=0.77, std=0.00, steps=8.849e+07
2023-07-07 16:36:05,405 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5500, best=0.77, avg=0.77, std=0.00, steps=9.013e+07
2023-07-07 16:36:20,271 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5600, best=0.78, avg=0.77, std=0.00, steps=9.177e+07
2023-07-07 16:36:35,110 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5700, best=0.78, avg=0.77, std=0.00, steps=9.341e+07
2023-07-07 16:36:49,941 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5800, best=0.78, avg=0.77, std=0.00, steps=9.504e+07
2023-07-07 16:37:04,788 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 5900, best=0.78, avg=0.77, std=0.00, steps=9.668e+07
2023-07-07 16:37:19,665 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6000, best=0.78, avg=0.77, std=0.00, steps=9.832e+07
2023-07-07 16:37:34,633 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6100, best=0.78, avg=0.77, std=0.00, steps=9.996e+07
2023-07-07 16:37:49,465 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6200, best=0.78, avg=0.77, std=0.00, steps=1.016e+08
2023-07-07 16:38:04,301 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6300, best=0.78, avg=0.77, std=0.00, steps=1.032e+08
2023-07-07 16:38:19,131 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6400, best=0.78, avg=0.77, std=0.00, steps=1.049e+08
2023-07-07 16:38:33,957 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6500, best=0.78, avg=0.77, std=0.00, steps=1.065e+08
2023-07-07 16:38:48,779 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6600, best=0.78, avg=0.77, std=0.00, steps=1.082e+08
2023-07-07 16:39:03,726 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6700, best=0.78, avg=0.77, std=0.00, steps=1.098e+08
2023-07-07 16:39:18,649 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6800, best=0.78, avg=0.77, std=0.00, steps=1.114e+08
2023-07-07 16:39:33,537 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 6900, best=0.78, avg=0.77, std=0.00, steps=1.131e+08
2023-07-07 16:39:48,374 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7000, best=0.78, avg=0.78, std=0.00, steps=1.147e+08
2023-07-07 16:40:03,208 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7100, best=0.78, avg=0.78, std=0.00, steps=1.163e+08
2023-07-07 16:40:18,036 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7200, best=0.78, avg=0.78, std=0.00, steps=1.180e+08
2023-07-07 16:40:32,917 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7300, best=0.79, avg=0.78, std=0.00, steps=1.196e+08
2023-07-07 16:40:47,751 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7400, best=0.78, avg=0.78, std=0.00, steps=1.213e+08
2023-07-07 16:41:02,610 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7500, best=0.79, avg=0.78, std=0.00, steps=1.229e+08
2023-07-07 16:41:17,468 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7600, best=0.79, avg=0.78, std=0.00, steps=1.245e+08
2023-07-07 16:41:32,312 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7700, best=0.79, avg=0.78, std=0.00, steps=1.262e+08
2023-07-07 16:41:47,157 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7800, best=0.79, avg=0.78, std=0.00, steps=1.278e+08
2023-07-07 16:42:01,984 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 7900, best=0.79, avg=0.78, std=0.00, steps=1.294e+08
2023-07-07 16:42:16,815 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8000, best=0.79, avg=0.78, std=0.00, steps=1.311e+08
2023-07-07 16:42:31,667 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8100, best=0.79, avg=0.78, std=0.00, steps=1.327e+08
2023-07-07 16:42:46,595 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8200, best=0.79, avg=0.78, std=0.00, steps=1.344e+08
2023-07-07 16:43:01,450 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8300, best=0.79, avg=0.78, std=0.00, steps=1.360e+08
2023-07-07 16:43:16,263 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8400, best=0.79, avg=0.78, std=0.00, steps=1.376e+08
2023-07-07 16:43:31,071 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8500, best=0.79, avg=0.78, std=0.00, steps=1.393e+08
2023-07-07 16:43:45,876 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8600, best=0.79, avg=0.78, std=0.00, steps=1.409e+08
2023-07-07 16:44:00,847 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8700, best=0.79, avg=0.78, std=0.00, steps=1.426e+08
2023-07-07 16:44:15,678 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8800, best=0.79, avg=0.78, std=0.00, steps=1.442e+08
2023-07-07 16:44:30,498 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 8900, best=0.79, avg=0.78, std=0.00, steps=1.458e+08
2023-07-07 16:44:45,352 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9000, best=0.79, avg=0.78, std=0.00, steps=1.475e+08
2023-07-07 16:45:00,189 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9100, best=0.79, avg=0.79, std=0.00, steps=1.491e+08
2023-07-07 16:45:15,078 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9200, best=0.79, avg=0.79, std=0.00, steps=1.507e+08
2023-07-07 16:45:30,003 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9300, best=0.79, avg=0.79, std=0.00, steps=1.524e+08
2023-07-07 16:45:44,839 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9400, best=0.79, avg=0.79, std=0.00, steps=1.540e+08
2023-07-07 16:45:59,680 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9500, best=0.79, avg=0.79, std=0.00, steps=1.557e+08
2023-07-07 16:46:14,530 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9600, best=0.79, avg=0.79, std=0.00, steps=1.573e+08
2023-07-07 16:46:29,378 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9700, best=0.79, avg=0.79, std=0.00, steps=1.589e+08
2023-07-07 16:46:44,214 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9800, best=0.79, avg=0.79, std=0.00, steps=1.606e+08
2023-07-07 16:46:59,180 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 9900, best=0.79, avg=0.79, std=0.00, steps=1.622e+08
2023-07-07 16:47:14,030 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10000, best=0.79, avg=0.79, std=0.00, steps=1.639e+08
2023-07-07 16:47:28,856 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10100, best=0.80, avg=0.79, std=0.00, steps=1.655e+08
2023-07-07 16:47:43,779 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10200, best=0.79, avg=0.79, std=0.00, steps=1.671e+08
2023-07-07 16:47:58,615 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10300, best=0.79, avg=0.79, std=0.00, steps=1.688e+08
2023-07-07 16:48:13,430 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10400, best=0.80, avg=0.79, std=0.00, steps=1.704e+08
2023-07-07 16:48:28,268 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10500, best=0.79, avg=0.79, std=0.00, steps=1.720e+08
2023-07-07 16:48:43,128 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10600, best=0.79, avg=0.79, std=0.00, steps=1.737e+08
2023-07-07 16:48:57,954 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10700, best=0.80, avg=0.79, std=0.00, steps=1.753e+08
2023-07-07 16:49:12,771 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10800, best=0.80, avg=0.79, std=0.00, steps=1.770e+08
2023-07-07 16:49:27,593 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 10900, best=0.80, avg=0.79, std=0.00, steps=1.786e+08
2023-07-07 16:49:42,403 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11000, best=0.80, avg=0.79, std=0.00, steps=1.802e+08
2023-07-07 16:49:57,303 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11100, best=0.80, avg=0.79, std=0.00, steps=1.819e+08
2023-07-07 16:50:12,155 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11200, best=0.80, avg=0.79, std=0.00, steps=1.835e+08
2023-07-07 16:50:27,002 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11300, best=0.80, avg=0.79, std=0.00, steps=1.852e+08
2023-07-07 16:50:41,846 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11400, best=0.80, avg=0.79, std=0.00, steps=1.868e+08
2023-07-07 16:50:56,695 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11500, best=0.80, avg=0.79, std=0.00, steps=1.884e+08
2023-07-07 16:51:11,514 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11600, best=0.80, avg=0.79, std=0.00, steps=1.901e+08
2023-07-07 16:51:26,345 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11700, best=0.80, avg=0.79, std=0.00, steps=1.917e+08
2023-07-07 16:51:41,177 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11800, best=0.80, avg=0.79, std=0.00, steps=1.933e+08
2023-07-07 16:51:56,005 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11900, best=0.80, avg=0.79, std=0.00, steps=1.950e+08
2023-07-07 16:52:10,699 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 0, 0, [Train]: 11999, best=0.80, avg=0.79, std=0.00, steps=1.966e+08
2023-07-07 16:52:10,700 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135906
2023-07-07 16:52:10,727 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 16:52:10,759 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 16:52:29,716 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=1.862e+06
2023-07-07 16:52:46,368 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 200, best=0.57, avg=0.56, std=0.00, steps=3.705e+06
2023-07-07 16:53:03,018 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 300, best=0.61, avg=0.60, std=0.00, steps=5.548e+06
2023-07-07 16:53:19,660 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 400, best=0.63, avg=0.62, std=0.00, steps=7.391e+06
2023-07-07 16:53:36,303 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 500, best=0.64, avg=0.64, std=0.00, steps=9.234e+06
2023-07-07 16:53:52,958 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 600, best=0.65, avg=0.65, std=0.00, steps=1.108e+07
2023-07-07 16:54:09,611 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 700, best=0.66, avg=0.65, std=0.00, steps=1.292e+07
2023-07-07 16:54:26,267 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 800, best=0.67, avg=0.66, std=0.00, steps=1.476e+07
2023-07-07 16:54:42,904 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 900, best=0.67, avg=0.66, std=0.00, steps=1.661e+07
2023-07-07 16:54:59,580 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1000, best=0.68, avg=0.67, std=0.00, steps=1.845e+07
2023-07-07 16:55:16,228 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1100, best=0.68, avg=0.67, std=0.00, steps=2.029e+07
2023-07-07 16:55:32,871 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1200, best=0.68, avg=0.67, std=0.00, steps=2.214e+07
2023-07-07 16:55:49,514 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1300, best=0.68, avg=0.68, std=0.00, steps=2.398e+07
2023-07-07 16:56:06,179 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1400, best=0.69, avg=0.68, std=0.00, steps=2.582e+07
2023-07-07 16:56:22,838 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1500, best=0.69, avg=0.68, std=0.00, steps=2.767e+07
2023-07-07 16:56:39,473 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1600, best=0.69, avg=0.68, std=0.00, steps=2.951e+07
2023-07-07 16:56:56,121 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1700, best=0.69, avg=0.68, std=0.00, steps=3.135e+07
2023-07-07 16:57:12,784 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1800, best=0.69, avg=0.68, std=0.00, steps=3.320e+07
2023-07-07 16:57:29,420 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 1900, best=0.70, avg=0.69, std=0.00, steps=3.504e+07
2023-07-07 16:57:46,073 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2000, best=0.69, avg=0.69, std=0.00, steps=3.688e+07
2023-07-07 16:58:02,726 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2100, best=0.70, avg=0.69, std=0.00, steps=3.873e+07
2023-07-07 16:58:19,382 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2200, best=0.70, avg=0.69, std=0.00, steps=4.057e+07
2023-07-07 16:58:36,040 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2300, best=0.70, avg=0.69, std=0.00, steps=4.241e+07
2023-07-07 16:58:52,683 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2400, best=0.70, avg=0.69, std=0.00, steps=4.426e+07
2023-07-07 16:59:09,329 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2500, best=0.70, avg=0.69, std=0.00, steps=4.610e+07
2023-07-07 16:59:25,987 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2600, best=0.70, avg=0.69, std=0.00, steps=4.794e+07
2023-07-07 16:59:42,655 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2700, best=0.70, avg=0.70, std=0.00, steps=4.978e+07
2023-07-07 16:59:59,313 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2800, best=0.71, avg=0.70, std=0.00, steps=5.163e+07
2023-07-07 17:00:15,971 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 2900, best=0.71, avg=0.70, std=0.00, steps=5.347e+07
2023-07-07 17:00:32,619 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3000, best=0.71, avg=0.70, std=0.00, steps=5.531e+07
2023-07-07 17:00:49,286 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3100, best=0.71, avg=0.70, std=0.00, steps=5.716e+07
2023-07-07 17:01:05,937 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3200, best=0.71, avg=0.70, std=0.00, steps=5.900e+07
2023-07-07 17:01:22,601 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3300, best=0.71, avg=0.70, std=0.00, steps=6.084e+07
2023-07-07 17:01:39,257 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3400, best=0.71, avg=0.70, std=0.00, steps=6.269e+07
2023-07-07 17:01:55,902 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3500, best=0.71, avg=0.70, std=0.00, steps=6.453e+07
2023-07-07 17:02:12,563 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3600, best=0.71, avg=0.70, std=0.00, steps=6.637e+07
2023-07-07 17:02:29,196 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3700, best=0.71, avg=0.70, std=0.00, steps=6.822e+07
2023-07-07 17:02:45,845 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3800, best=0.71, avg=0.70, std=0.00, steps=7.006e+07
2023-07-07 17:03:02,483 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 3900, best=0.71, avg=0.71, std=0.00, steps=7.190e+07
2023-07-07 17:03:19,108 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4000, best=0.71, avg=0.71, std=0.00, steps=7.375e+07
2023-07-07 17:03:35,762 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4100, best=0.71, avg=0.71, std=0.00, steps=7.559e+07
2023-07-07 17:03:52,416 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4200, best=0.71, avg=0.71, std=0.00, steps=7.743e+07
2023-07-07 17:04:09,066 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4300, best=0.71, avg=0.71, std=0.00, steps=7.928e+07
2023-07-07 17:04:25,716 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4400, best=0.71, avg=0.71, std=0.00, steps=8.112e+07
2023-07-07 17:04:42,366 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4500, best=0.72, avg=0.71, std=0.00, steps=8.296e+07
2023-07-07 17:04:59,043 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4600, best=0.72, avg=0.71, std=0.00, steps=8.481e+07
2023-07-07 17:05:15,704 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4700, best=0.72, avg=0.71, std=0.00, steps=8.665e+07
2023-07-07 17:05:32,358 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4800, best=0.72, avg=0.71, std=0.00, steps=8.849e+07
2023-07-07 17:05:49,017 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 4900, best=0.72, avg=0.71, std=0.00, steps=9.034e+07
2023-07-07 17:06:05,662 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5000, best=0.72, avg=0.71, std=0.00, steps=9.218e+07
2023-07-07 17:06:22,301 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5100, best=0.72, avg=0.71, std=0.00, steps=9.402e+07
2023-07-07 17:06:38,988 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5200, best=0.72, avg=0.71, std=0.00, steps=9.586e+07
2023-07-07 17:06:55,642 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5300, best=0.72, avg=0.71, std=0.00, steps=9.771e+07
2023-07-07 17:07:12,275 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5400, best=0.72, avg=0.71, std=0.00, steps=9.955e+07
2023-07-07 17:07:28,911 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5500, best=0.72, avg=0.71, std=0.00, steps=1.014e+08
2023-07-07 17:07:45,536 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5600, best=0.72, avg=0.71, std=0.00, steps=1.032e+08
2023-07-07 17:08:02,324 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5700, best=0.72, avg=0.72, std=0.00, steps=1.051e+08
2023-07-07 17:08:18,974 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5800, best=0.72, avg=0.72, std=0.00, steps=1.069e+08
2023-07-07 17:08:35,670 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 5900, best=0.72, avg=0.72, std=0.00, steps=1.088e+08
2023-07-07 17:08:52,345 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6000, best=0.73, avg=0.72, std=0.00, steps=1.106e+08
2023-07-07 17:09:09,066 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6100, best=0.72, avg=0.72, std=0.00, steps=1.125e+08
2023-07-07 17:09:25,729 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6200, best=0.72, avg=0.72, std=0.00, steps=1.143e+08
2023-07-07 17:09:42,422 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6300, best=0.72, avg=0.72, std=0.00, steps=1.161e+08
2023-07-07 17:09:59,072 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6400, best=0.73, avg=0.72, std=0.00, steps=1.180e+08
2023-07-07 17:10:15,748 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6500, best=0.73, avg=0.72, std=0.00, steps=1.198e+08
2023-07-07 17:10:32,442 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6600, best=0.73, avg=0.72, std=0.00, steps=1.217e+08
2023-07-07 17:10:49,118 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6700, best=0.72, avg=0.72, std=0.00, steps=1.235e+08
2023-07-07 17:11:05,766 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6800, best=0.72, avg=0.72, std=0.00, steps=1.254e+08
2023-07-07 17:11:22,428 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 6900, best=0.73, avg=0.72, std=0.00, steps=1.272e+08
2023-07-07 17:11:39,088 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7000, best=0.73, avg=0.72, std=0.00, steps=1.290e+08
2023-07-07 17:11:55,737 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7100, best=0.73, avg=0.72, std=0.00, steps=1.309e+08
2023-07-07 17:12:12,381 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7200, best=0.73, avg=0.72, std=0.00, steps=1.327e+08
2023-07-07 17:12:29,023 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7300, best=0.73, avg=0.72, std=0.00, steps=1.346e+08
2023-07-07 17:12:45,679 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7400, best=0.73, avg=0.72, std=0.00, steps=1.364e+08
2023-07-07 17:13:02,338 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7500, best=0.73, avg=0.72, std=0.00, steps=1.383e+08
2023-07-07 17:13:18,978 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7600, best=0.73, avg=0.72, std=0.00, steps=1.401e+08
2023-07-07 17:13:35,608 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7700, best=0.73, avg=0.72, std=0.00, steps=1.419e+08
2023-07-07 17:13:52,226 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7800, best=0.73, avg=0.72, std=0.00, steps=1.438e+08
2023-07-07 17:14:08,854 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 7900, best=0.73, avg=0.72, std=0.00, steps=1.456e+08
2023-07-07 17:14:25,604 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8000, best=0.73, avg=0.72, std=0.00, steps=1.475e+08
2023-07-07 17:14:42,268 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8100, best=0.73, avg=0.72, std=0.00, steps=1.493e+08
2023-07-07 17:14:58,920 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8200, best=0.73, avg=0.72, std=0.00, steps=1.512e+08
2023-07-07 17:15:15,560 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8300, best=0.73, avg=0.72, std=0.00, steps=1.530e+08
2023-07-07 17:15:32,191 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8400, best=0.73, avg=0.72, std=0.00, steps=1.548e+08
2023-07-07 17:15:48,817 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8500, best=0.73, avg=0.72, std=0.00, steps=1.567e+08
2023-07-07 17:16:05,432 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8600, best=0.73, avg=0.72, std=0.00, steps=1.585e+08
2023-07-07 17:16:22,070 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8700, best=0.73, avg=0.72, std=0.00, steps=1.604e+08
2023-07-07 17:16:38,761 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8800, best=0.73, avg=0.72, std=0.00, steps=1.622e+08
2023-07-07 17:16:55,403 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 8900, best=0.73, avg=0.72, std=0.00, steps=1.641e+08
2023-07-07 17:17:12,033 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9000, best=0.73, avg=0.73, std=0.00, steps=1.659e+08
2023-07-07 17:17:28,656 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9100, best=0.73, avg=0.73, std=0.00, steps=1.677e+08
2023-07-07 17:17:45,323 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9200, best=0.73, avg=0.73, std=0.00, steps=1.696e+08
2023-07-07 17:18:01,965 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9300, best=0.73, avg=0.73, std=0.00, steps=1.714e+08
2023-07-07 17:18:18,605 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9400, best=0.73, avg=0.73, std=0.00, steps=1.733e+08
2023-07-07 17:18:35,261 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9500, best=0.74, avg=0.73, std=0.00, steps=1.751e+08
2023-07-07 17:18:51,946 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9600, best=0.73, avg=0.73, std=0.00, steps=1.770e+08
2023-07-07 17:19:08,643 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9700, best=0.74, avg=0.73, std=0.00, steps=1.788e+08
2023-07-07 17:19:25,383 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9800, best=0.73, avg=0.73, std=0.00, steps=1.807e+08
2023-07-07 17:19:42,151 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 9900, best=0.74, avg=0.73, std=0.00, steps=1.825e+08
2023-07-07 17:19:58,774 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10000, best=0.74, avg=0.73, std=0.00, steps=1.843e+08
2023-07-07 17:20:15,438 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10100, best=0.74, avg=0.73, std=0.00, steps=1.862e+08
2023-07-07 17:20:32,172 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10200, best=0.74, avg=0.73, std=0.00, steps=1.880e+08
2023-07-07 17:20:48,847 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10300, best=0.74, avg=0.73, std=0.00, steps=1.899e+08
2023-07-07 17:21:05,495 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10400, best=0.74, avg=0.73, std=0.00, steps=1.917e+08
2023-07-07 17:21:22,124 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10500, best=0.74, avg=0.73, std=0.00, steps=1.936e+08
2023-07-07 17:21:38,758 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10600, best=0.74, avg=0.73, std=0.00, steps=1.954e+08
2023-07-07 17:21:55,558 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10700, best=0.74, avg=0.73, std=0.00, steps=1.972e+08
2023-07-07 17:22:12,236 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10800, best=0.74, avg=0.73, std=0.00, steps=1.991e+08
2023-07-07 17:22:28,890 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 10900, best=0.74, avg=0.73, std=0.00, steps=2.009e+08
2023-07-07 17:22:45,544 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11000, best=0.74, avg=0.73, std=0.00, steps=2.028e+08
2023-07-07 17:23:02,212 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11100, best=0.74, avg=0.73, std=0.00, steps=2.046e+08
2023-07-07 17:23:18,864 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11200, best=0.74, avg=0.73, std=0.00, steps=2.065e+08
2023-07-07 17:23:35,564 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11300, best=0.74, avg=0.73, std=0.00, steps=2.083e+08
2023-07-07 17:23:52,261 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11400, best=0.74, avg=0.74, std=0.00, steps=2.101e+08
2023-07-07 17:24:08,905 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11500, best=0.74, avg=0.73, std=0.00, steps=2.120e+08
2023-07-07 17:24:25,611 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11600, best=0.74, avg=0.73, std=0.00, steps=2.138e+08
2023-07-07 17:24:42,399 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11700, best=0.74, avg=0.74, std=0.00, steps=2.157e+08
2023-07-07 17:24:59,073 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11800, best=0.74, avg=0.74, std=0.00, steps=2.175e+08
2023-07-07 17:25:15,768 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11900, best=0.75, avg=0.74, std=0.00, steps=2.194e+08
2023-07-07 17:25:32,274 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 8, 0, [Train]: 11999, best=0.74, avg=0.74, std=0.00, steps=2.212e+08
2023-07-07 17:25:32,275 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135906
2023-07-07 17:25:32,301 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 17:25:32,333 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 17:25:53,179 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=2.068e+06
2023-07-07 17:26:11,675 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 200, best=0.51, avg=0.50, std=0.00, steps=4.116e+06
2023-07-07 17:26:30,137 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 300, best=0.57, avg=0.57, std=0.00, steps=6.164e+06
2023-07-07 17:26:48,625 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 400, best=0.59, avg=0.58, std=0.00, steps=8.212e+06
2023-07-07 17:27:07,124 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 500, best=0.61, avg=0.60, std=0.00, steps=1.026e+07
2023-07-07 17:27:25,615 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 600, best=0.62, avg=0.61, std=0.00, steps=1.231e+07
2023-07-07 17:27:44,077 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 700, best=0.62, avg=0.62, std=0.00, steps=1.436e+07
2023-07-07 17:28:02,557 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 800, best=0.63, avg=0.62, std=0.00, steps=1.640e+07
2023-07-07 17:28:21,075 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 900, best=0.64, avg=0.63, std=0.00, steps=1.845e+07
2023-07-07 17:28:39,564 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1000, best=0.65, avg=0.64, std=0.00, steps=2.050e+07
2023-07-07 17:28:58,043 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1100, best=0.65, avg=0.64, std=0.00, steps=2.255e+07
2023-07-07 17:29:16,536 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1200, best=0.66, avg=0.65, std=0.00, steps=2.460e+07
2023-07-07 17:29:34,993 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1300, best=0.66, avg=0.65, std=0.00, steps=2.664e+07
2023-07-07 17:29:53,463 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1400, best=0.66, avg=0.65, std=0.00, steps=2.869e+07
2023-07-07 17:30:11,938 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1500, best=0.66, avg=0.66, std=0.00, steps=3.074e+07
2023-07-07 17:30:30,416 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1600, best=0.66, avg=0.66, std=0.00, steps=3.279e+07
2023-07-07 17:30:48,932 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1700, best=0.67, avg=0.66, std=0.00, steps=3.484e+07
2023-07-07 17:31:07,419 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1800, best=0.67, avg=0.66, std=0.00, steps=3.688e+07
2023-07-07 17:31:25,913 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 1900, best=0.67, avg=0.66, std=0.00, steps=3.893e+07
2023-07-07 17:31:44,370 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2000, best=0.67, avg=0.66, std=0.00, steps=4.098e+07
2023-07-07 17:32:02,880 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2100, best=0.68, avg=0.67, std=0.00, steps=4.303e+07
2023-07-07 17:32:21,366 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2200, best=0.67, avg=0.67, std=0.00, steps=4.508e+07
2023-07-07 17:32:39,846 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2300, best=0.68, avg=0.67, std=0.00, steps=4.712e+07
2023-07-07 17:32:58,335 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2400, best=0.68, avg=0.67, std=0.00, steps=4.917e+07
2023-07-07 17:33:16,824 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2500, best=0.68, avg=0.67, std=0.00, steps=5.122e+07
2023-07-07 17:33:35,274 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2600, best=0.68, avg=0.67, std=0.00, steps=5.327e+07
2023-07-07 17:33:53,728 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2700, best=0.68, avg=0.67, std=0.00, steps=5.532e+07
2023-07-07 17:34:12,189 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2800, best=0.68, avg=0.67, std=0.00, steps=5.736e+07
2023-07-07 17:34:30,648 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 2900, best=0.68, avg=0.67, std=0.00, steps=5.941e+07
2023-07-07 17:34:49,125 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3000, best=0.68, avg=0.67, std=0.00, steps=6.146e+07
2023-07-07 17:35:07,601 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3100, best=0.68, avg=0.67, std=0.00, steps=6.351e+07
2023-07-07 17:35:26,094 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3200, best=0.68, avg=0.68, std=0.00, steps=6.556e+07
2023-07-07 17:35:44,567 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3300, best=0.68, avg=0.68, std=0.00, steps=6.760e+07
2023-07-07 17:36:03,101 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3400, best=0.68, avg=0.68, std=0.00, steps=6.965e+07
2023-07-07 17:36:21,585 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3500, best=0.69, avg=0.68, std=0.00, steps=7.170e+07
2023-07-07 17:36:40,060 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3600, best=0.69, avg=0.68, std=0.00, steps=7.375e+07
2023-07-07 17:36:58,532 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3700, best=0.69, avg=0.68, std=0.00, steps=7.580e+07
2023-07-07 17:37:17,036 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3800, best=0.69, avg=0.68, std=0.00, steps=7.784e+07
2023-07-07 17:37:35,514 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 3900, best=0.69, avg=0.68, std=0.00, steps=7.989e+07
2023-07-07 17:37:53,988 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4000, best=0.69, avg=0.68, std=0.00, steps=8.194e+07
2023-07-07 17:38:12,444 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4100, best=0.69, avg=0.68, std=0.00, steps=8.399e+07
2023-07-07 17:38:30,914 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4200, best=0.69, avg=0.68, std=0.00, steps=8.604e+07
2023-07-07 17:38:49,407 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4300, best=0.69, avg=0.68, std=0.00, steps=8.808e+07
2023-07-07 17:39:07,891 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4400, best=0.69, avg=0.68, std=0.00, steps=9.013e+07
2023-07-07 17:39:26,367 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4500, best=0.69, avg=0.69, std=0.00, steps=9.218e+07
2023-07-07 17:39:44,850 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4600, best=0.69, avg=0.69, std=0.00, steps=9.423e+07
2023-07-07 17:40:03,367 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4700, best=0.69, avg=0.69, std=0.00, steps=9.628e+07
2023-07-07 17:40:21,869 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4800, best=0.70, avg=0.69, std=0.00, steps=9.832e+07
2023-07-07 17:40:40,391 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 4900, best=0.70, avg=0.69, std=0.00, steps=1.004e+08
2023-07-07 17:40:58,870 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5000, best=0.70, avg=0.69, std=0.00, steps=1.024e+08
2023-07-07 17:41:17,351 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5100, best=0.70, avg=0.69, std=0.00, steps=1.045e+08
2023-07-07 17:41:35,809 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5200, best=0.70, avg=0.69, std=0.00, steps=1.065e+08
2023-07-07 17:41:54,244 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5300, best=0.70, avg=0.69, std=0.00, steps=1.086e+08
2023-07-07 17:42:12,688 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5400, best=0.70, avg=0.69, std=0.00, steps=1.106e+08
2023-07-07 17:42:31,167 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5500, best=0.70, avg=0.69, std=0.00, steps=1.127e+08
2023-07-07 17:42:49,635 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5600, best=0.70, avg=0.69, std=0.00, steps=1.147e+08
2023-07-07 17:43:08,104 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5700, best=0.70, avg=0.69, std=0.00, steps=1.168e+08
2023-07-07 17:43:26,563 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5800, best=0.70, avg=0.69, std=0.00, steps=1.188e+08
2023-07-07 17:43:45,037 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 5900, best=0.70, avg=0.69, std=0.00, steps=1.209e+08
2023-07-07 17:44:03,510 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6000, best=0.70, avg=0.69, std=0.00, steps=1.229e+08
2023-07-07 17:44:22,003 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6100, best=0.71, avg=0.70, std=0.00, steps=1.249e+08
2023-07-07 17:44:40,482 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6200, best=0.71, avg=0.70, std=0.00, steps=1.270e+08
2023-07-07 17:44:58,957 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6300, best=0.71, avg=0.70, std=0.00, steps=1.290e+08
2023-07-07 17:45:17,429 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6400, best=0.71, avg=0.70, std=0.00, steps=1.311e+08
2023-07-07 17:45:35,913 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6500, best=0.70, avg=0.70, std=0.00, steps=1.331e+08
2023-07-07 17:45:54,398 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6600, best=0.70, avg=0.70, std=0.00, steps=1.352e+08
2023-07-07 17:46:12,907 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6700, best=0.71, avg=0.70, std=0.00, steps=1.372e+08
2023-07-07 17:46:31,424 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6800, best=0.71, avg=0.70, std=0.00, steps=1.393e+08
2023-07-07 17:46:49,892 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 6900, best=0.71, avg=0.70, std=0.00, steps=1.413e+08
2023-07-07 17:47:08,378 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7000, best=0.71, avg=0.70, std=0.00, steps=1.434e+08
2023-07-07 17:47:26,842 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7100, best=0.71, avg=0.70, std=0.00, steps=1.454e+08
2023-07-07 17:47:45,319 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7200, best=0.71, avg=0.70, std=0.00, steps=1.475e+08
2023-07-07 17:48:03,799 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7300, best=0.71, avg=0.70, std=0.00, steps=1.495e+08
2023-07-07 17:48:22,263 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7400, best=0.71, avg=0.70, std=0.00, steps=1.516e+08
2023-07-07 17:48:40,761 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7500, best=0.71, avg=0.70, std=0.00, steps=1.536e+08
2023-07-07 17:48:59,237 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7600, best=0.71, avg=0.70, std=0.00, steps=1.557e+08
2023-07-07 17:49:17,721 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7700, best=0.71, avg=0.70, std=0.00, steps=1.577e+08
2023-07-07 17:49:36,211 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7800, best=0.71, avg=0.70, std=0.00, steps=1.598e+08
2023-07-07 17:49:54,691 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 7900, best=0.71, avg=0.71, std=0.00, steps=1.618e+08
2023-07-07 17:50:13,172 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8000, best=0.71, avg=0.71, std=0.00, steps=1.639e+08
2023-07-07 17:50:31,669 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8100, best=0.71, avg=0.71, std=0.00, steps=1.659e+08
2023-07-07 17:50:50,153 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8200, best=0.71, avg=0.71, std=0.00, steps=1.680e+08
2023-07-07 17:51:08,634 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8300, best=0.71, avg=0.71, std=0.00, steps=1.700e+08
2023-07-07 17:51:27,120 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8400, best=0.71, avg=0.71, std=0.00, steps=1.721e+08
2023-07-07 17:51:45,598 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8500, best=0.72, avg=0.71, std=0.00, steps=1.741e+08
2023-07-07 17:52:04,058 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8600, best=0.71, avg=0.71, std=0.00, steps=1.761e+08
2023-07-07 17:52:22,518 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8700, best=0.71, avg=0.71, std=0.00, steps=1.782e+08
2023-07-07 17:52:40,982 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8800, best=0.72, avg=0.71, std=0.00, steps=1.802e+08
2023-07-07 17:52:59,455 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 8900, best=0.72, avg=0.71, std=0.00, steps=1.823e+08
2023-07-07 17:53:17,924 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9000, best=0.71, avg=0.71, std=0.00, steps=1.843e+08
2023-07-07 17:53:36,394 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9100, best=0.72, avg=0.71, std=0.00, steps=1.864e+08
2023-07-07 17:53:54,888 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9200, best=0.72, avg=0.71, std=0.00, steps=1.884e+08
2023-07-07 17:54:13,355 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9300, best=0.72, avg=0.71, std=0.00, steps=1.905e+08
2023-07-07 17:54:31,829 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9400, best=0.72, avg=0.71, std=0.00, steps=1.925e+08
2023-07-07 17:54:50,297 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9500, best=0.72, avg=0.71, std=0.00, steps=1.946e+08
2023-07-07 17:55:08,778 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9600, best=0.72, avg=0.71, std=0.00, steps=1.966e+08
2023-07-07 17:55:27,253 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9700, best=0.72, avg=0.71, std=0.00, steps=1.987e+08
2023-07-07 17:55:45,747 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9800, best=0.72, avg=0.71, std=0.00, steps=2.007e+08
2023-07-07 17:56:04,218 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 9900, best=0.72, avg=0.71, std=0.00, steps=2.028e+08
2023-07-07 17:56:22,665 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10000, best=0.72, avg=0.71, std=0.00, steps=2.048e+08
2023-07-07 17:56:41,102 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10100, best=0.72, avg=0.71, std=0.00, steps=2.069e+08
2023-07-07 17:56:59,556 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10200, best=0.72, avg=0.71, std=0.00, steps=2.089e+08
2023-07-07 17:57:18,054 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10300, best=0.72, avg=0.71, std=0.00, steps=2.110e+08
2023-07-07 17:57:36,536 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10400, best=0.72, avg=0.72, std=0.00, steps=2.130e+08
2023-07-07 17:57:55,039 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10500, best=0.72, avg=0.71, std=0.00, steps=2.151e+08
2023-07-07 17:58:13,538 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10600, best=0.72, avg=0.71, std=0.00, steps=2.171e+08
2023-07-07 17:58:32,010 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10700, best=0.72, avg=0.72, std=0.00, steps=2.192e+08
2023-07-07 17:58:50,486 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10800, best=0.72, avg=0.72, std=0.00, steps=2.212e+08
2023-07-07 17:59:08,949 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 10900, best=0.72, avg=0.72, std=0.00, steps=2.233e+08
2023-07-07 17:59:27,429 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11000, best=0.72, avg=0.72, std=0.00, steps=2.253e+08
2023-07-07 17:59:45,901 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11100, best=0.73, avg=0.72, std=0.00, steps=2.273e+08
2023-07-07 18:00:04,345 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11200, best=0.73, avg=0.72, std=0.00, steps=2.294e+08
2023-07-07 18:00:22,804 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11300, best=0.72, avg=0.72, std=0.00, steps=2.314e+08
2023-07-07 18:00:41,265 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11400, best=0.73, avg=0.72, std=0.00, steps=2.335e+08
2023-07-07 18:00:59,734 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11500, best=0.73, avg=0.72, std=0.00, steps=2.355e+08
2023-07-07 18:01:18,208 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11600, best=0.72, avg=0.72, std=0.00, steps=2.376e+08
2023-07-07 18:01:36,681 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11700, best=0.73, avg=0.72, std=0.00, steps=2.396e+08
2023-07-07 18:01:55,157 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11800, best=0.73, avg=0.72, std=0.00, steps=2.417e+08
2023-07-07 18:02:13,641 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11900, best=0.73, avg=0.72, std=0.00, steps=2.437e+08
2023-07-07 18:02:31,935 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 16, 0, [Train]: 11999, best=0.73, avg=0.72, std=0.00, steps=2.458e+08
2023-07-07 18:02:31,936 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135906
2023-07-07 18:02:31,962 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 18:02:31,994 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 18:02:56,425 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=2.482e+06
2023-07-07 18:03:18,522 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 200, best=0.51, avg=0.50, std=0.00, steps=4.940e+06
2023-07-07 18:03:40,650 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 300, best=0.51, avg=0.50, std=0.00, steps=7.397e+06
2023-07-07 18:04:02,753 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 400, best=0.51, avg=0.50, std=0.00, steps=9.855e+06
2023-07-07 18:04:24,855 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 500, best=0.51, avg=0.50, std=0.00, steps=1.231e+07
2023-07-07 18:04:46,958 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 600, best=0.51, avg=0.50, std=0.00, steps=1.477e+07
2023-07-07 18:05:09,020 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 700, best=0.51, avg=0.50, std=0.00, steps=1.723e+07
2023-07-07 18:05:31,087 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 800, best=0.51, avg=0.50, std=0.00, steps=1.969e+07
2023-07-07 18:05:53,207 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 900, best=0.51, avg=0.50, std=0.00, steps=2.214e+07
2023-07-07 18:06:15,319 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1000, best=0.51, avg=0.50, std=0.00, steps=2.460e+07
2023-07-07 18:06:37,435 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1100, best=0.51, avg=0.50, std=0.00, steps=2.706e+07
2023-07-07 18:06:59,540 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1200, best=0.51, avg=0.50, std=0.00, steps=2.952e+07
2023-07-07 18:07:21,652 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1300, best=0.51, avg=0.50, std=0.00, steps=3.197e+07
2023-07-07 18:07:43,772 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1400, best=0.51, avg=0.50, std=0.00, steps=3.443e+07
2023-07-07 18:08:05,877 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1500, best=0.51, avg=0.50, std=0.00, steps=3.689e+07
2023-07-07 18:08:27,982 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1600, best=0.51, avg=0.50, std=0.00, steps=3.935e+07
2023-07-07 18:08:50,085 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1700, best=0.51, avg=0.50, std=0.00, steps=4.180e+07
2023-07-07 18:09:12,182 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1800, best=0.51, avg=0.50, std=0.00, steps=4.426e+07
2023-07-07 18:09:34,255 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 1900, best=0.51, avg=0.50, std=0.00, steps=4.672e+07
2023-07-07 18:09:56,334 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2000, best=0.51, avg=0.50, std=0.00, steps=4.918e+07
2023-07-07 18:10:18,421 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2100, best=0.51, avg=0.50, std=0.00, steps=5.163e+07
2023-07-07 18:10:40,509 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2200, best=0.51, avg=0.50, std=0.00, steps=5.409e+07
2023-07-07 18:11:02,619 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2300, best=0.51, avg=0.50, std=0.00, steps=5.655e+07
2023-07-07 18:11:24,686 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2400, best=0.51, avg=0.50, std=0.00, steps=5.901e+07
2023-07-07 18:11:46,744 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2500, best=0.51, avg=0.50, std=0.00, steps=6.146e+07
2023-07-07 18:12:08,817 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2600, best=0.51, avg=0.50, std=0.00, steps=6.392e+07
2023-07-07 18:12:30,916 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2700, best=0.51, avg=0.50, std=0.00, steps=6.638e+07
2023-07-07 18:12:53,017 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2800, best=0.51, avg=0.50, std=0.00, steps=6.884e+07
2023-07-07 18:13:15,110 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 2900, best=0.51, avg=0.50, std=0.00, steps=7.129e+07
2023-07-07 18:13:37,196 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3000, best=0.51, avg=0.50, std=0.00, steps=7.375e+07
2023-07-07 18:13:59,278 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3100, best=0.51, avg=0.50, std=0.00, steps=7.621e+07
2023-07-07 18:14:21,364 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3200, best=0.51, avg=0.50, std=0.00, steps=7.867e+07
2023-07-07 18:14:43,433 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3300, best=0.51, avg=0.50, std=0.00, steps=8.113e+07
2023-07-07 18:15:05,497 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3400, best=0.51, avg=0.50, std=0.00, steps=8.358e+07
2023-07-07 18:15:27,568 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3500, best=0.51, avg=0.50, std=0.00, steps=8.604e+07
2023-07-07 18:15:49,649 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3600, best=0.51, avg=0.50, std=0.00, steps=8.850e+07
2023-07-07 18:16:11,722 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3700, best=0.51, avg=0.50, std=0.00, steps=9.096e+07
2023-07-07 18:16:33,817 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3800, best=0.51, avg=0.50, std=0.00, steps=9.341e+07
2023-07-07 18:16:55,929 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 3900, best=0.51, avg=0.50, std=0.00, steps=9.587e+07
2023-07-07 18:17:18,067 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4000, best=0.51, avg=0.50, std=0.00, steps=9.833e+07
2023-07-07 18:17:40,175 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4100, best=0.51, avg=0.50, std=0.00, steps=1.008e+08
2023-07-07 18:18:02,246 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4200, best=0.51, avg=0.50, std=0.00, steps=1.032e+08
2023-07-07 18:18:24,347 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4300, best=0.51, avg=0.50, std=0.00, steps=1.057e+08
2023-07-07 18:18:46,477 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4400, best=0.51, avg=0.50, std=0.00, steps=1.082e+08
2023-07-07 18:19:08,548 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4500, best=0.51, avg=0.50, std=0.00, steps=1.106e+08
2023-07-07 18:19:30,618 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4600, best=0.51, avg=0.50, std=0.00, steps=1.131e+08
2023-07-07 18:19:52,706 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4700, best=0.51, avg=0.50, std=0.00, steps=1.155e+08
2023-07-07 18:20:14,789 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4800, best=0.51, avg=0.50, std=0.00, steps=1.180e+08
2023-07-07 18:20:36,844 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 4900, best=0.51, avg=0.50, std=0.00, steps=1.204e+08
2023-07-07 18:20:58,935 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5000, best=0.51, avg=0.50, std=0.00, steps=1.229e+08
2023-07-07 18:21:21,015 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5100, best=0.51, avg=0.50, std=0.00, steps=1.254e+08
2023-07-07 18:21:43,101 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5200, best=0.51, avg=0.50, std=0.00, steps=1.278e+08
2023-07-07 18:22:05,210 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5300, best=0.51, avg=0.50, std=0.00, steps=1.303e+08
2023-07-07 18:22:27,320 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5400, best=0.51, avg=0.50, std=0.00, steps=1.327e+08
2023-07-07 18:22:49,422 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5500, best=0.51, avg=0.50, std=0.00, steps=1.352e+08
2023-07-07 18:23:11,506 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5600, best=0.51, avg=0.50, std=0.00, steps=1.377e+08
2023-07-07 18:23:33,600 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5700, best=0.51, avg=0.50, std=0.00, steps=1.401e+08
2023-07-07 18:23:55,717 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5800, best=0.51, avg=0.50, std=0.00, steps=1.426e+08
2023-07-07 18:24:17,790 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 5900, best=0.51, avg=0.50, std=0.00, steps=1.450e+08
2023-07-07 18:24:39,855 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6000, best=0.51, avg=0.50, std=0.00, steps=1.475e+08
2023-07-07 18:25:01,952 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6100, best=0.51, avg=0.50, std=0.00, steps=1.499e+08
2023-07-07 18:25:24,052 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6200, best=0.51, avg=0.50, std=0.00, steps=1.524e+08
2023-07-07 18:25:46,146 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6300, best=0.51, avg=0.50, std=0.00, steps=1.549e+08
2023-07-07 18:26:08,230 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6400, best=0.51, avg=0.50, std=0.00, steps=1.573e+08
2023-07-07 18:26:30,309 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6500, best=0.51, avg=0.50, std=0.00, steps=1.598e+08
2023-07-07 18:26:52,385 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6600, best=0.51, avg=0.50, std=0.00, steps=1.622e+08
2023-07-07 18:27:14,473 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6700, best=0.51, avg=0.50, std=0.00, steps=1.647e+08
2023-07-07 18:27:36,555 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6800, best=0.51, avg=0.50, std=0.00, steps=1.671e+08
2023-07-07 18:27:58,660 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 6900, best=0.51, avg=0.50, std=0.00, steps=1.696e+08
2023-07-07 18:28:20,751 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7000, best=0.51, avg=0.50, std=0.00, steps=1.721e+08
2023-07-07 18:28:42,861 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7100, best=0.51, avg=0.50, std=0.00, steps=1.745e+08
2023-07-07 18:29:04,953 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7200, best=0.51, avg=0.50, std=0.00, steps=1.770e+08
2023-07-07 18:29:27,044 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7300, best=0.51, avg=0.50, std=0.00, steps=1.794e+08
2023-07-07 18:29:49,131 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7400, best=0.56, avg=0.55, std=0.00, steps=1.819e+08
2023-07-07 18:30:11,219 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7500, best=0.57, avg=0.57, std=0.00, steps=1.843e+08
2023-07-07 18:30:33,325 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7600, best=0.58, avg=0.57, std=0.00, steps=1.868e+08
2023-07-07 18:30:55,427 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7700, best=0.57, avg=0.57, std=0.00, steps=1.893e+08
2023-07-07 18:31:17,520 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7800, best=0.58, avg=0.57, std=0.00, steps=1.917e+08
2023-07-07 18:31:39,613 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 7900, best=0.58, avg=0.57, std=0.00, steps=1.942e+08
2023-07-07 18:32:01,736 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8000, best=0.58, avg=0.57, std=0.00, steps=1.966e+08
2023-07-07 18:32:23,851 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8100, best=0.58, avg=0.57, std=0.00, steps=1.991e+08
2023-07-07 18:32:45,933 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8200, best=0.59, avg=0.58, std=0.00, steps=2.015e+08
2023-07-07 18:33:08,025 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8300, best=0.59, avg=0.58, std=0.00, steps=2.040e+08
2023-07-07 18:33:30,109 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8400, best=0.59, avg=0.59, std=0.00, steps=2.065e+08
2023-07-07 18:33:52,203 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8500, best=0.60, avg=0.59, std=0.00, steps=2.089e+08
2023-07-07 18:34:14,291 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8600, best=0.60, avg=0.59, std=0.00, steps=2.114e+08
2023-07-07 18:34:36,360 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8700, best=0.60, avg=0.59, std=0.00, steps=2.138e+08
2023-07-07 18:34:58,463 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8800, best=0.60, avg=0.59, std=0.00, steps=2.163e+08
2023-07-07 18:35:20,538 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 8900, best=0.60, avg=0.60, std=0.00, steps=2.188e+08
2023-07-07 18:35:42,626 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9000, best=0.61, avg=0.60, std=0.00, steps=2.212e+08
2023-07-07 18:36:04,700 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9100, best=0.61, avg=0.60, std=0.00, steps=2.237e+08
2023-07-07 18:36:26,803 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9200, best=0.61, avg=0.61, std=0.00, steps=2.261e+08
2023-07-07 18:36:48,883 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9300, best=0.62, avg=0.61, std=0.00, steps=2.286e+08
2023-07-07 18:37:10,964 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9400, best=0.62, avg=0.61, std=0.00, steps=2.310e+08
2023-07-07 18:37:33,058 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9500, best=0.62, avg=0.61, std=0.00, steps=2.335e+08
2023-07-07 18:37:55,130 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9600, best=0.62, avg=0.62, std=0.00, steps=2.360e+08
2023-07-07 18:38:17,220 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9700, best=0.62, avg=0.62, std=0.00, steps=2.384e+08
2023-07-07 18:38:39,292 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9800, best=0.62, avg=0.62, std=0.00, steps=2.409e+08
2023-07-07 18:39:01,388 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 9900, best=0.63, avg=0.62, std=0.00, steps=2.433e+08
2023-07-07 18:39:23,474 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10000, best=0.63, avg=0.62, std=0.00, steps=2.458e+08
2023-07-07 18:39:45,559 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10100, best=0.63, avg=0.62, std=0.00, steps=2.482e+08
2023-07-07 18:40:07,638 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10200, best=0.63, avg=0.62, std=0.00, steps=2.507e+08
2023-07-07 18:40:29,716 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10300, best=0.63, avg=0.62, std=0.00, steps=2.532e+08
2023-07-07 18:40:51,802 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10400, best=0.63, avg=0.62, std=0.00, steps=2.556e+08
2023-07-07 18:41:13,903 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10500, best=0.63, avg=0.62, std=0.00, steps=2.581e+08
2023-07-07 18:41:36,005 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10600, best=0.63, avg=0.62, std=0.00, steps=2.605e+08
2023-07-07 18:41:58,111 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10700, best=0.63, avg=0.62, std=0.00, steps=2.630e+08
2023-07-07 18:42:20,192 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10800, best=0.63, avg=0.62, std=0.00, steps=2.654e+08
2023-07-07 18:42:42,301 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 10900, best=0.63, avg=0.62, std=0.00, steps=2.679e+08
2023-07-07 18:43:04,388 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11000, best=0.63, avg=0.63, std=0.00, steps=2.704e+08
2023-07-07 18:43:26,488 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11100, best=0.63, avg=0.62, std=0.00, steps=2.728e+08
2023-07-07 18:43:48,556 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11200, best=0.63, avg=0.63, std=0.00, steps=2.753e+08
2023-07-07 18:44:10,624 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11300, best=0.63, avg=0.63, std=0.00, steps=2.777e+08
2023-07-07 18:44:32,728 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11400, best=0.63, avg=0.63, std=0.00, steps=2.802e+08
2023-07-07 18:44:54,810 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11500, best=0.63, avg=0.63, std=0.00, steps=2.826e+08
2023-07-07 18:45:16,903 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11600, best=0.63, avg=0.63, std=0.00, steps=2.851e+08
2023-07-07 18:45:39,016 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11700, best=0.63, avg=0.63, std=0.00, steps=2.876e+08
2023-07-07 18:46:01,111 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11800, best=0.63, avg=0.63, std=0.00, steps=2.900e+08
2023-07-07 18:46:23,219 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11900, best=0.63, avg=0.63, std=0.00, steps=2.925e+08
2023-07-07 18:46:45,112 -        meta learning: [    INFO] - [Len Lat Rep]: 32, 32, 0, [Train]: 11999, best=0.64, avg=0.63, std=0.00, steps=2.949e+08
2023-07-07 18:46:45,114 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135906
2023-07-07 18:46:45,140 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 18:46:45,174 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 18:47:17,336 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=3.310e+06
2023-07-07 18:47:46,729 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 200, best=0.53, avg=0.52, std=0.00, steps=6.586e+06
2023-07-07 18:48:16,094 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 300, best=0.56, avg=0.55, std=0.00, steps=9.863e+06
2023-07-07 18:48:45,437 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 400, best=0.56, avg=0.56, std=0.00, steps=1.314e+07
2023-07-07 18:49:14,787 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 500, best=0.57, avg=0.56, std=0.00, steps=1.642e+07
2023-07-07 18:49:44,168 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 600, best=0.57, avg=0.57, std=0.00, steps=1.969e+07
2023-07-07 18:50:13,538 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 700, best=0.59, avg=0.58, std=0.00, steps=2.297e+07
2023-07-07 18:50:42,964 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 800, best=0.60, avg=0.59, std=0.00, steps=2.625e+07
2023-07-07 18:51:12,332 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 900, best=0.60, avg=0.60, std=0.00, steps=2.952e+07
2023-07-07 18:51:41,698 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1000, best=0.61, avg=0.60, std=0.00, steps=3.280e+07
2023-07-07 18:52:11,056 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1100, best=0.61, avg=0.61, std=0.00, steps=3.608e+07
2023-07-07 18:52:40,419 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1200, best=0.61, avg=0.61, std=0.00, steps=3.935e+07
2023-07-07 18:53:09,793 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1300, best=0.62, avg=0.61, std=0.00, steps=4.263e+07
2023-07-07 18:53:39,158 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1400, best=0.62, avg=0.61, std=0.00, steps=4.591e+07
2023-07-07 18:54:08,524 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1500, best=0.62, avg=0.62, std=0.00, steps=4.918e+07
2023-07-07 18:54:37,905 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1600, best=0.62, avg=0.62, std=0.00, steps=5.246e+07
2023-07-07 18:55:07,252 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1700, best=0.62, avg=0.62, std=0.00, steps=5.574e+07
2023-07-07 18:55:36,625 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1800, best=0.63, avg=0.62, std=0.00, steps=5.902e+07
2023-07-07 18:56:05,971 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 1900, best=0.63, avg=0.62, std=0.00, steps=6.229e+07
2023-07-07 18:56:35,353 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2000, best=0.63, avg=0.62, std=0.00, steps=6.557e+07
2023-07-07 18:57:04,702 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2100, best=0.63, avg=0.63, std=0.00, steps=6.885e+07
2023-07-07 18:57:34,073 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2200, best=0.63, avg=0.63, std=0.00, steps=7.212e+07
2023-07-07 18:58:03,450 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2300, best=0.63, avg=0.63, std=0.00, steps=7.540e+07
2023-07-07 18:58:32,825 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2400, best=0.63, avg=0.63, std=0.00, steps=7.868e+07
2023-07-07 18:59:02,191 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2500, best=0.63, avg=0.63, std=0.00, steps=8.195e+07
2023-07-07 18:59:31,552 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2600, best=0.64, avg=0.63, std=0.00, steps=8.523e+07
2023-07-07 19:00:00,920 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2700, best=0.64, avg=0.63, std=0.00, steps=8.851e+07
2023-07-07 19:00:30,274 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2800, best=0.64, avg=0.63, std=0.00, steps=9.178e+07
2023-07-07 19:00:59,635 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 2900, best=0.64, avg=0.63, std=0.00, steps=9.506e+07
2023-07-07 19:01:29,005 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3000, best=0.64, avg=0.63, std=0.00, steps=9.834e+07
2023-07-07 19:01:58,389 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3100, best=0.64, avg=0.63, std=0.00, steps=1.016e+08
2023-07-07 19:02:27,782 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3200, best=0.64, avg=0.64, std=0.00, steps=1.049e+08
2023-07-07 19:02:57,178 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3300, best=0.64, avg=0.64, std=0.00, steps=1.082e+08
2023-07-07 19:03:26,548 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3400, best=0.64, avg=0.64, std=0.00, steps=1.114e+08
2023-07-07 19:03:55,887 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3500, best=0.64, avg=0.64, std=0.00, steps=1.147e+08
2023-07-07 19:04:25,265 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3600, best=0.64, avg=0.64, std=0.00, steps=1.180e+08
2023-07-07 19:04:54,642 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3700, best=0.64, avg=0.64, std=0.00, steps=1.213e+08
2023-07-07 19:05:24,039 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3800, best=0.64, avg=0.64, std=0.00, steps=1.246e+08
2023-07-07 19:05:53,468 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 3900, best=0.64, avg=0.64, std=0.00, steps=1.278e+08
2023-07-07 19:06:22,846 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4000, best=0.64, avg=0.64, std=0.00, steps=1.311e+08
2023-07-07 19:06:52,232 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4100, best=0.65, avg=0.64, std=0.00, steps=1.344e+08
2023-07-07 19:07:21,611 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4200, best=0.65, avg=0.64, std=0.00, steps=1.377e+08
2023-07-07 19:07:51,021 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4300, best=0.64, avg=0.64, std=0.00, steps=1.409e+08
2023-07-07 19:08:20,424 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4400, best=0.65, avg=0.64, std=0.00, steps=1.442e+08
2023-07-07 19:08:49,795 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4500, best=0.65, avg=0.64, std=0.00, steps=1.475e+08
2023-07-07 19:09:19,151 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4600, best=0.65, avg=0.64, std=0.00, steps=1.508e+08
2023-07-07 19:09:48,544 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4700, best=0.65, avg=0.64, std=0.00, steps=1.540e+08
2023-07-07 19:10:17,947 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4800, best=0.65, avg=0.64, std=0.00, steps=1.573e+08
2023-07-07 19:10:47,330 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 4900, best=0.65, avg=0.64, std=0.00, steps=1.606e+08
2023-07-07 19:11:16,705 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5000, best=0.65, avg=0.64, std=0.00, steps=1.639e+08
2023-07-07 19:11:46,069 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5100, best=0.65, avg=0.64, std=0.00, steps=1.671e+08
2023-07-07 19:12:15,453 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5200, best=0.65, avg=0.64, std=0.00, steps=1.704e+08
2023-07-07 19:12:44,842 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5300, best=0.65, avg=0.65, std=0.00, steps=1.737e+08
2023-07-07 19:13:14,220 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5400, best=0.65, avg=0.65, std=0.00, steps=1.770e+08
2023-07-07 19:13:43,585 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5500, best=0.65, avg=0.65, std=0.00, steps=1.803e+08
2023-07-07 19:14:12,948 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5600, best=0.65, avg=0.65, std=0.00, steps=1.835e+08
2023-07-07 19:14:42,339 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5700, best=0.65, avg=0.65, std=0.00, steps=1.868e+08
2023-07-07 19:15:11,739 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5800, best=0.65, avg=0.65, std=0.00, steps=1.901e+08
2023-07-07 19:15:41,103 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 5900, best=0.65, avg=0.65, std=0.00, steps=1.934e+08
2023-07-07 19:16:10,463 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6000, best=0.65, avg=0.65, std=0.00, steps=1.966e+08
2023-07-07 19:16:39,822 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6100, best=0.65, avg=0.65, std=0.00, steps=1.999e+08
2023-07-07 19:17:09,186 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6200, best=0.65, avg=0.65, std=0.00, steps=2.032e+08
2023-07-07 19:17:38,588 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6300, best=0.65, avg=0.65, std=0.00, steps=2.065e+08
2023-07-07 19:18:07,958 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6400, best=0.65, avg=0.65, std=0.00, steps=2.097e+08
2023-07-07 19:18:37,320 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6500, best=0.65, avg=0.65, std=0.00, steps=2.130e+08
2023-07-07 19:19:06,665 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6600, best=0.65, avg=0.65, std=0.00, steps=2.163e+08
2023-07-07 19:19:36,019 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6700, best=0.65, avg=0.65, std=0.00, steps=2.196e+08
2023-07-07 19:20:05,372 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6800, best=0.66, avg=0.65, std=0.00, steps=2.229e+08
2023-07-07 19:20:34,728 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 6900, best=0.65, avg=0.65, std=0.00, steps=2.261e+08
2023-07-07 19:21:04,059 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7000, best=0.66, avg=0.65, std=0.00, steps=2.294e+08
2023-07-07 19:21:33,435 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7100, best=0.66, avg=0.65, std=0.00, steps=2.327e+08
2023-07-07 19:22:02,844 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7200, best=0.65, avg=0.65, std=0.00, steps=2.360e+08
2023-07-07 19:22:32,208 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7300, best=0.66, avg=0.65, std=0.00, steps=2.392e+08
2023-07-07 19:23:01,560 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7400, best=0.66, avg=0.65, std=0.00, steps=2.425e+08
2023-07-07 19:23:30,897 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7500, best=0.66, avg=0.65, std=0.00, steps=2.458e+08
2023-07-07 19:24:00,284 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7600, best=0.66, avg=0.65, std=0.00, steps=2.491e+08
2023-07-07 19:24:29,628 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7700, best=0.66, avg=0.65, std=0.00, steps=2.523e+08
2023-07-07 19:24:59,007 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7800, best=0.66, avg=0.65, std=0.00, steps=2.556e+08
2023-07-07 19:25:28,359 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 7900, best=0.66, avg=0.65, std=0.00, steps=2.589e+08
2023-07-07 19:25:57,722 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8000, best=0.66, avg=0.65, std=0.00, steps=2.622e+08
2023-07-07 19:26:27,108 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8100, best=0.66, avg=0.65, std=0.00, steps=2.655e+08
2023-07-07 19:26:56,460 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8200, best=0.66, avg=0.65, std=0.00, steps=2.687e+08
2023-07-07 19:27:25,828 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8300, best=0.66, avg=0.65, std=0.00, steps=2.720e+08
2023-07-07 19:27:55,200 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8400, best=0.66, avg=0.65, std=0.00, steps=2.753e+08
2023-07-07 19:28:24,564 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8500, best=0.66, avg=0.65, std=0.00, steps=2.786e+08
2023-07-07 19:28:53,932 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8600, best=0.66, avg=0.65, std=0.00, steps=2.818e+08
2023-07-07 19:29:23,302 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8700, best=0.66, avg=0.65, std=0.00, steps=2.851e+08
2023-07-07 19:29:52,656 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8800, best=0.66, avg=0.65, std=0.00, steps=2.884e+08
2023-07-07 19:30:22,034 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 8900, best=0.66, avg=0.65, std=0.00, steps=2.917e+08
2023-07-07 19:30:51,406 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9000, best=0.66, avg=0.65, std=0.00, steps=2.949e+08
2023-07-07 19:31:20,768 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9100, best=0.66, avg=0.65, std=0.00, steps=2.982e+08
2023-07-07 19:31:50,136 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9200, best=0.66, avg=0.66, std=0.00, steps=3.015e+08
2023-07-07 19:32:19,518 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9300, best=0.66, avg=0.66, std=0.00, steps=3.048e+08
2023-07-07 19:32:48,883 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9400, best=0.66, avg=0.66, std=0.00, steps=3.081e+08
2023-07-07 19:33:18,273 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9500, best=0.66, avg=0.66, std=0.00, steps=3.113e+08
2023-07-07 19:33:47,658 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9600, best=0.66, avg=0.66, std=0.00, steps=3.146e+08
2023-07-07 19:34:17,010 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9700, best=0.66, avg=0.66, std=0.00, steps=3.179e+08
2023-07-07 19:34:46,350 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9800, best=0.66, avg=0.66, std=0.00, steps=3.212e+08
2023-07-07 19:35:15,731 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 9900, best=0.66, avg=0.66, std=0.00, steps=3.244e+08
2023-07-07 19:35:45,079 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10000, best=0.66, avg=0.66, std=0.00, steps=3.277e+08
2023-07-07 19:36:14,475 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10100, best=0.66, avg=0.66, std=0.00, steps=3.310e+08
2023-07-07 19:36:43,851 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10200, best=0.66, avg=0.66, std=0.00, steps=3.343e+08
2023-07-07 19:37:13,261 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10300, best=0.66, avg=0.66, std=0.00, steps=3.375e+08
2023-07-07 19:37:42,702 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10400, best=0.66, avg=0.66, std=0.00, steps=3.408e+08
2023-07-07 19:38:12,125 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10500, best=0.66, avg=0.66, std=0.00, steps=3.441e+08
2023-07-07 19:38:41,455 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10600, best=0.66, avg=0.66, std=0.00, steps=3.474e+08
2023-07-07 19:39:10,797 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10700, best=0.66, avg=0.66, std=0.00, steps=3.507e+08
2023-07-07 19:39:40,164 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10800, best=0.66, avg=0.66, std=0.00, steps=3.539e+08
2023-07-07 19:40:09,545 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 10900, best=0.66, avg=0.66, std=0.00, steps=3.572e+08
2023-07-07 19:40:38,925 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11000, best=0.66, avg=0.66, std=0.00, steps=3.605e+08
2023-07-07 19:41:08,281 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11100, best=0.67, avg=0.66, std=0.00, steps=3.638e+08
2023-07-07 19:41:37,663 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11200, best=0.66, avg=0.66, std=0.00, steps=3.670e+08
2023-07-07 19:42:06,966 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11300, best=0.66, avg=0.66, std=0.00, steps=3.703e+08
2023-07-07 19:42:36,332 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11400, best=0.66, avg=0.66, std=0.00, steps=3.736e+08
2023-07-07 19:43:05,786 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11500, best=0.66, avg=0.66, std=0.00, steps=3.769e+08
2023-07-07 19:43:35,163 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11600, best=0.66, avg=0.66, std=0.00, steps=3.801e+08
2023-07-07 19:44:04,512 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11700, best=0.66, avg=0.66, std=0.00, steps=3.834e+08
2023-07-07 19:44:33,877 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11800, best=0.66, avg=0.66, std=0.00, steps=3.867e+08
2023-07-07 19:45:03,256 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11900, best=0.67, avg=0.66, std=0.00, steps=3.900e+08
2023-07-07 19:45:32,366 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 0, 0, [Train]: 11999, best=0.67, avg=0.66, std=0.00, steps=3.932e+08
2023-07-07 19:45:32,367 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135906
2023-07-07 19:45:32,393 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 19:45:32,430 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 19:46:06,118 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=3.516e+06
2023-07-07 19:46:37,262 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 200, best=0.51, avg=0.50, std=0.00, steps=6.998e+06
2023-07-07 19:47:08,427 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 300, best=0.54, avg=0.53, std=0.00, steps=1.048e+07
2023-07-07 19:47:39,608 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 400, best=0.57, avg=0.56, std=0.00, steps=1.396e+07
2023-07-07 19:48:10,776 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 500, best=0.58, avg=0.57, std=0.00, steps=1.744e+07
2023-07-07 19:48:41,953 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 600, best=0.58, avg=0.58, std=0.00, steps=2.092e+07
2023-07-07 19:49:13,147 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 700, best=0.59, avg=0.59, std=0.00, steps=2.441e+07
2023-07-07 19:49:44,354 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 800, best=0.60, avg=0.59, std=0.00, steps=2.789e+07
2023-07-07 19:50:15,513 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 900, best=0.60, avg=0.60, std=0.00, steps=3.137e+07
2023-07-07 19:50:46,753 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1000, best=0.61, avg=0.60, std=0.00, steps=3.485e+07
2023-07-07 19:51:17,960 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1100, best=0.61, avg=0.60, std=0.00, steps=3.833e+07
2023-07-07 19:51:49,155 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1200, best=0.61, avg=0.61, std=0.00, steps=4.181e+07
2023-07-07 19:52:20,335 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1300, best=0.61, avg=0.61, std=0.00, steps=4.530e+07
2023-07-07 19:52:51,487 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1400, best=0.61, avg=0.61, std=0.00, steps=4.878e+07
2023-07-07 19:53:22,650 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1500, best=0.62, avg=0.61, std=0.00, steps=5.226e+07
2023-07-07 19:53:53,821 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1600, best=0.62, avg=0.61, std=0.00, steps=5.574e+07
2023-07-07 19:54:25,028 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1700, best=0.62, avg=0.61, std=0.00, steps=5.922e+07
2023-07-07 19:54:56,187 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1800, best=0.62, avg=0.61, std=0.00, steps=6.270e+07
2023-07-07 19:55:27,375 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 1900, best=0.62, avg=0.62, std=0.00, steps=6.619e+07
2023-07-07 19:55:58,585 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2000, best=0.62, avg=0.62, std=0.00, steps=6.967e+07
2023-07-07 19:56:29,778 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2100, best=0.62, avg=0.62, std=0.00, steps=7.315e+07
2023-07-07 19:57:00,970 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2200, best=0.62, avg=0.62, std=0.00, steps=7.663e+07
2023-07-07 19:57:32,186 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2300, best=0.62, avg=0.62, std=0.00, steps=8.011e+07
2023-07-07 19:58:03,338 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2400, best=0.63, avg=0.62, std=0.00, steps=8.359e+07
2023-07-07 19:58:34,524 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2500, best=0.63, avg=0.62, std=0.00, steps=8.707e+07
2023-07-07 19:59:05,694 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2600, best=0.63, avg=0.62, std=0.00, steps=9.056e+07
2023-07-07 19:59:36,841 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2700, best=0.63, avg=0.62, std=0.00, steps=9.404e+07
2023-07-07 20:00:08,002 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2800, best=0.63, avg=0.62, std=0.00, steps=9.752e+07
2023-07-07 20:00:39,179 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 2900, best=0.63, avg=0.62, std=0.00, steps=1.010e+08
2023-07-07 20:01:10,369 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3000, best=0.63, avg=0.62, std=0.00, steps=1.045e+08
2023-07-07 20:01:41,522 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3100, best=0.63, avg=0.62, std=0.00, steps=1.080e+08
2023-07-07 20:02:12,737 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3200, best=0.63, avg=0.62, std=0.00, steps=1.114e+08
2023-07-07 20:02:43,987 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3300, best=0.63, avg=0.63, std=0.00, steps=1.149e+08
2023-07-07 20:03:15,180 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3400, best=0.63, avg=0.63, std=0.00, steps=1.184e+08
2023-07-07 20:03:46,345 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3500, best=0.63, avg=0.63, std=0.00, steps=1.219e+08
2023-07-07 20:04:17,538 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3600, best=0.63, avg=0.63, std=0.00, steps=1.254e+08
2023-07-07 20:04:48,763 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3700, best=0.63, avg=0.63, std=0.00, steps=1.289e+08
2023-07-07 20:05:20,000 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3800, best=0.63, avg=0.63, std=0.00, steps=1.323e+08
2023-07-07 20:05:51,189 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 3900, best=0.63, avg=0.63, std=0.00, steps=1.358e+08
2023-07-07 20:06:22,402 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4000, best=0.63, avg=0.63, std=0.00, steps=1.393e+08
2023-07-07 20:06:53,624 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4100, best=0.63, avg=0.63, std=0.00, steps=1.428e+08
2023-07-07 20:07:24,802 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4200, best=0.63, avg=0.63, std=0.00, steps=1.463e+08
2023-07-07 20:07:56,014 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4300, best=0.63, avg=0.63, std=0.00, steps=1.497e+08
2023-07-07 20:08:27,217 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4400, best=0.63, avg=0.63, std=0.00, steps=1.532e+08
2023-07-07 20:08:58,404 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4500, best=0.64, avg=0.63, std=0.00, steps=1.567e+08
2023-07-07 20:09:29,574 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4600, best=0.64, avg=0.63, std=0.00, steps=1.602e+08
2023-07-07 20:10:00,765 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4700, best=0.64, avg=0.63, std=0.00, steps=1.637e+08
2023-07-07 20:10:31,943 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4800, best=0.64, avg=0.63, std=0.00, steps=1.672e+08
2023-07-07 20:11:03,155 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 4900, best=0.64, avg=0.63, std=0.00, steps=1.706e+08
2023-07-07 20:11:34,351 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5000, best=0.64, avg=0.63, std=0.00, steps=1.741e+08
2023-07-07 20:12:05,523 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5100, best=0.64, avg=0.63, std=0.00, steps=1.776e+08
2023-07-07 20:12:36,704 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5200, best=0.64, avg=0.63, std=0.00, steps=1.811e+08
2023-07-07 20:13:07,932 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5300, best=0.64, avg=0.63, std=0.00, steps=1.846e+08
2023-07-07 20:13:39,155 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5400, best=0.64, avg=0.63, std=0.00, steps=1.880e+08
2023-07-07 20:14:10,368 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5500, best=0.64, avg=0.63, std=0.00, steps=1.915e+08
2023-07-07 20:14:41,545 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5600, best=0.64, avg=0.63, std=0.00, steps=1.950e+08
2023-07-07 20:15:12,775 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5700, best=0.64, avg=0.63, std=0.00, steps=1.985e+08
2023-07-07 20:15:43,992 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5800, best=0.64, avg=0.64, std=0.00, steps=2.020e+08
2023-07-07 20:16:15,148 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 5900, best=0.64, avg=0.64, std=0.00, steps=2.054e+08
2023-07-07 20:16:46,363 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6000, best=0.64, avg=0.64, std=0.00, steps=2.089e+08
2023-07-07 20:17:17,596 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6100, best=0.64, avg=0.64, std=0.00, steps=2.124e+08
2023-07-07 20:17:48,815 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6200, best=0.64, avg=0.64, std=0.00, steps=2.159e+08
2023-07-07 20:18:20,037 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6300, best=0.64, avg=0.64, std=0.00, steps=2.194e+08
2023-07-07 20:18:51,235 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6400, best=0.64, avg=0.64, std=0.00, steps=2.229e+08
2023-07-07 20:19:22,418 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6500, best=0.64, avg=0.64, std=0.00, steps=2.263e+08
2023-07-07 20:19:53,582 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6600, best=0.64, avg=0.64, std=0.00, steps=2.298e+08
2023-07-07 20:20:24,804 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6700, best=0.64, avg=0.64, std=0.00, steps=2.333e+08
2023-07-07 20:20:55,985 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6800, best=0.64, avg=0.64, std=0.00, steps=2.368e+08
2023-07-07 20:21:27,184 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 6900, best=0.64, avg=0.64, std=0.00, steps=2.403e+08
2023-07-07 20:21:58,365 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7000, best=0.64, avg=0.64, std=0.00, steps=2.437e+08
2023-07-07 20:22:29,571 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7100, best=0.64, avg=0.64, std=0.00, steps=2.472e+08
2023-07-07 20:23:00,777 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7200, best=0.65, avg=0.64, std=0.00, steps=2.507e+08
2023-07-07 20:23:32,008 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7300, best=0.65, avg=0.64, std=0.00, steps=2.542e+08
2023-07-07 20:24:03,202 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7400, best=0.64, avg=0.64, std=0.00, steps=2.577e+08
2023-07-07 20:24:34,455 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7500, best=0.64, avg=0.64, std=0.00, steps=2.612e+08
2023-07-07 20:25:05,662 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7600, best=0.64, avg=0.64, std=0.00, steps=2.646e+08
2023-07-07 20:25:36,849 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7700, best=0.65, avg=0.64, std=0.00, steps=2.681e+08
2023-07-07 20:26:08,045 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7800, best=0.65, avg=0.64, std=0.00, steps=2.716e+08
2023-07-07 20:26:39,258 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 7900, best=0.65, avg=0.64, std=0.00, steps=2.751e+08
2023-07-07 20:27:10,461 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8000, best=0.65, avg=0.64, std=0.00, steps=2.786e+08
2023-07-07 20:27:41,678 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8100, best=0.65, avg=0.64, std=0.00, steps=2.820e+08
2023-07-07 20:28:12,931 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8200, best=0.65, avg=0.64, std=0.00, steps=2.855e+08
2023-07-07 20:28:44,117 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8300, best=0.65, avg=0.64, std=0.00, steps=2.890e+08
2023-07-07 20:29:15,300 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8400, best=0.65, avg=0.64, std=0.00, steps=2.925e+08
2023-07-07 20:29:46,644 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8500, best=0.65, avg=0.64, std=0.00, steps=2.960e+08
2023-07-07 20:30:17,882 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8600, best=0.65, avg=0.64, std=0.00, steps=2.995e+08
2023-07-07 20:30:49,137 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8700, best=0.65, avg=0.64, std=0.00, steps=3.029e+08
2023-07-07 20:31:20,377 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8800, best=0.65, avg=0.64, std=0.00, steps=3.064e+08
2023-07-07 20:31:51,603 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 8900, best=0.65, avg=0.64, std=0.00, steps=3.099e+08
2023-07-07 20:32:22,816 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9000, best=0.65, avg=0.64, std=0.00, steps=3.134e+08
2023-07-07 20:32:54,044 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9100, best=0.65, avg=0.64, std=0.00, steps=3.169e+08
2023-07-07 20:33:25,234 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9200, best=0.65, avg=0.64, std=0.00, steps=3.203e+08
2023-07-07 20:33:56,385 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9300, best=0.65, avg=0.64, std=0.00, steps=3.238e+08
2023-07-07 20:34:27,548 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9400, best=0.65, avg=0.64, std=0.00, steps=3.273e+08
2023-07-07 20:34:58,720 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9500, best=0.65, avg=0.65, std=0.00, steps=3.308e+08
2023-07-07 20:35:29,879 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9600, best=0.65, avg=0.65, std=0.00, steps=3.343e+08
2023-07-07 20:36:01,046 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9700, best=0.65, avg=0.65, std=0.00, steps=3.378e+08
2023-07-07 20:36:32,217 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9800, best=0.65, avg=0.65, std=0.00, steps=3.412e+08
2023-07-07 20:37:03,402 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 9900, best=0.65, avg=0.65, std=0.00, steps=3.447e+08
2023-07-07 20:37:34,599 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10000, best=0.65, avg=0.65, std=0.00, steps=3.482e+08
2023-07-07 20:38:05,799 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10100, best=0.65, avg=0.65, std=0.00, steps=3.517e+08
2023-07-07 20:38:37,029 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10200, best=0.65, avg=0.65, std=0.00, steps=3.552e+08
2023-07-07 20:39:08,266 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10300, best=0.65, avg=0.65, std=0.00, steps=3.586e+08
2023-07-07 20:39:39,507 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10400, best=0.65, avg=0.65, std=0.00, steps=3.621e+08
2023-07-07 20:40:10,761 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10500, best=0.65, avg=0.65, std=0.00, steps=3.656e+08
2023-07-07 20:40:41,963 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10600, best=0.65, avg=0.65, std=0.00, steps=3.691e+08
2023-07-07 20:41:13,192 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10700, best=0.65, avg=0.65, std=0.00, steps=3.726e+08
2023-07-07 20:41:44,408 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10800, best=0.65, avg=0.65, std=0.00, steps=3.760e+08
2023-07-07 20:42:15,637 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 10900, best=0.65, avg=0.65, std=0.00, steps=3.795e+08
2023-07-07 20:42:46,858 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11000, best=0.65, avg=0.65, std=0.00, steps=3.830e+08
2023-07-07 20:43:18,061 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11100, best=0.65, avg=0.65, std=0.00, steps=3.865e+08
2023-07-07 20:43:49,248 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11200, best=0.65, avg=0.65, std=0.00, steps=3.900e+08
2023-07-07 20:44:20,391 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11300, best=0.65, avg=0.65, std=0.00, steps=3.935e+08
2023-07-07 20:44:51,580 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11400, best=0.65, avg=0.65, std=0.00, steps=3.969e+08
2023-07-07 20:45:22,727 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11500, best=0.65, avg=0.65, std=0.00, steps=4.004e+08
2023-07-07 20:45:53,890 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11600, best=0.66, avg=0.65, std=0.00, steps=4.039e+08
2023-07-07 20:46:25,117 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11700, best=0.66, avg=0.65, std=0.00, steps=4.074e+08
2023-07-07 20:46:56,324 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11800, best=0.66, avg=0.65, std=0.00, steps=4.109e+08
2023-07-07 20:47:27,581 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11900, best=0.65, avg=0.65, std=0.00, steps=4.143e+08
2023-07-07 20:47:58,774 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 8, 0, [Train]: 11999, best=0.66, avg=0.65, std=0.00, steps=4.178e+08
2023-07-07 20:47:58,775 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135906
2023-07-07 20:47:58,802 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 20:47:58,841 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 20:48:34,443 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=3.723e+06
2023-07-07 20:49:07,565 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 200, best=0.51, avg=0.50, std=0.00, steps=7.410e+06
2023-07-07 20:49:40,733 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 300, best=0.51, avg=0.50, std=0.00, steps=1.110e+07
2023-07-07 20:50:13,919 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 400, best=0.51, avg=0.50, std=0.00, steps=1.478e+07
2023-07-07 20:50:47,172 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 500, best=0.51, avg=0.50, std=0.00, steps=1.847e+07
2023-07-07 20:51:20,407 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 600, best=0.51, avg=0.50, std=0.00, steps=2.216e+07
2023-07-07 20:51:53,787 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 700, best=0.51, avg=0.50, std=0.00, steps=2.584e+07
2023-07-07 20:52:27,130 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 800, best=0.51, avg=0.50, std=0.00, steps=2.953e+07
2023-07-07 20:53:00,345 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 900, best=0.51, avg=0.50, std=0.00, steps=3.321e+07
2023-07-07 20:53:33,631 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1000, best=0.51, avg=0.50, std=0.00, steps=3.690e+07
2023-07-07 20:54:06,862 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1100, best=0.51, avg=0.50, std=0.00, steps=4.059e+07
2023-07-07 20:54:40,113 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1200, best=0.51, avg=0.50, std=0.00, steps=4.427e+07
2023-07-07 20:55:13,343 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1300, best=0.51, avg=0.50, std=0.00, steps=4.796e+07
2023-07-07 20:55:46,743 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1400, best=0.51, avg=0.50, std=0.00, steps=5.165e+07
2023-07-07 20:56:20,119 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1500, best=0.51, avg=0.50, std=0.00, steps=5.533e+07
2023-07-07 20:56:53,315 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1600, best=0.51, avg=0.50, std=0.00, steps=5.902e+07
2023-07-07 20:57:26,510 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1700, best=0.51, avg=0.50, std=0.00, steps=6.271e+07
2023-07-07 20:57:59,703 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1800, best=0.51, avg=0.50, std=0.00, steps=6.639e+07
2023-07-07 20:58:32,923 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 1900, best=0.51, avg=0.50, std=0.00, steps=7.008e+07
2023-07-07 20:59:06,123 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2000, best=0.51, avg=0.50, std=0.00, steps=7.376e+07
2023-07-07 20:59:39,380 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2100, best=0.51, avg=0.50, std=0.00, steps=7.745e+07
2023-07-07 21:00:12,780 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2200, best=0.51, avg=0.50, std=0.00, steps=8.114e+07
2023-07-07 21:00:46,136 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2300, best=0.51, avg=0.50, std=0.00, steps=8.482e+07
2023-07-07 21:01:19,467 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2400, best=0.51, avg=0.50, std=0.00, steps=8.851e+07
2023-07-07 21:01:52,671 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2500, best=0.51, avg=0.50, std=0.00, steps=9.220e+07
2023-07-07 21:02:25,871 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2600, best=0.51, avg=0.50, std=0.00, steps=9.588e+07
2023-07-07 21:02:59,121 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2700, best=0.51, avg=0.50, std=0.00, steps=9.957e+07
2023-07-07 21:03:32,451 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2800, best=0.51, avg=0.50, std=0.00, steps=1.033e+08
2023-07-07 21:04:05,726 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 2900, best=0.51, avg=0.50, std=0.00, steps=1.069e+08
2023-07-07 21:04:39,019 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3000, best=0.51, avg=0.50, std=0.00, steps=1.106e+08
2023-07-07 21:05:12,279 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3100, best=0.51, avg=0.50, std=0.00, steps=1.143e+08
2023-07-07 21:05:45,464 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3200, best=0.51, avg=0.50, std=0.00, steps=1.180e+08
2023-07-07 21:06:18,682 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3300, best=0.51, avg=0.50, std=0.00, steps=1.217e+08
2023-07-07 21:06:52,032 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3400, best=0.51, avg=0.50, std=0.00, steps=1.254e+08
2023-07-07 21:07:25,259 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3500, best=0.50, avg=0.50, std=0.00, steps=1.291e+08
2023-07-07 21:07:58,526 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3600, best=0.51, avg=0.50, std=0.00, steps=1.327e+08
2023-07-07 21:08:31,808 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3700, best=0.51, avg=0.50, std=0.00, steps=1.364e+08
2023-07-07 21:09:05,066 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3800, best=0.51, avg=0.50, std=0.00, steps=1.401e+08
2023-07-07 21:09:38,522 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 3900, best=0.51, avg=0.50, std=0.00, steps=1.438e+08
2023-07-07 21:10:11,980 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4000, best=0.51, avg=0.50, std=0.00, steps=1.475e+08
2023-07-07 21:10:45,198 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4100, best=0.51, avg=0.50, std=0.00, steps=1.512e+08
2023-07-07 21:11:18,438 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4200, best=0.51, avg=0.50, std=0.00, steps=1.549e+08
2023-07-07 21:11:51,792 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4300, best=0.51, avg=0.50, std=0.00, steps=1.586e+08
2023-07-07 21:12:25,117 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4400, best=0.51, avg=0.50, std=0.00, steps=1.622e+08
2023-07-07 21:12:58,402 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4500, best=0.51, avg=0.50, std=0.00, steps=1.659e+08
2023-07-07 21:13:31,652 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4600, best=0.51, avg=0.50, std=0.00, steps=1.696e+08
2023-07-07 21:14:04,836 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4700, best=0.51, avg=0.50, std=0.00, steps=1.733e+08
2023-07-07 21:14:38,110 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4800, best=0.51, avg=0.50, std=0.00, steps=1.770e+08
2023-07-07 21:15:11,332 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 4900, best=0.51, avg=0.50, std=0.00, steps=1.807e+08
2023-07-07 21:15:44,512 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5000, best=0.51, avg=0.50, std=0.00, steps=1.844e+08
2023-07-07 21:16:17,862 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5100, best=0.51, avg=0.50, std=0.00, steps=1.880e+08
2023-07-07 21:16:51,037 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5200, best=0.51, avg=0.50, std=0.00, steps=1.917e+08
2023-07-07 21:17:24,187 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5300, best=0.51, avg=0.50, std=0.00, steps=1.954e+08
2023-07-07 21:17:57,461 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5400, best=0.51, avg=0.50, std=0.00, steps=1.991e+08
2023-07-07 21:18:30,604 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5500, best=0.51, avg=0.50, std=0.00, steps=2.028e+08
2023-07-07 21:19:03,821 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5600, best=0.51, avg=0.50, std=0.00, steps=2.065e+08
2023-07-07 21:19:36,980 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5700, best=0.51, avg=0.50, std=0.00, steps=2.102e+08
2023-07-07 21:20:10,269 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5800, best=0.51, avg=0.50, std=0.00, steps=2.138e+08
2023-07-07 21:20:43,429 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 5900, best=0.51, avg=0.50, std=0.00, steps=2.175e+08
2023-07-07 21:21:16,667 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6000, best=0.51, avg=0.50, std=0.00, steps=2.212e+08
2023-07-07 21:21:49,932 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6100, best=0.51, avg=0.50, std=0.00, steps=2.249e+08
2023-07-07 21:22:23,173 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6200, best=0.51, avg=0.50, std=0.00, steps=2.286e+08
2023-07-07 21:22:56,503 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6300, best=0.51, avg=0.50, std=0.00, steps=2.323e+08
2023-07-07 21:23:29,777 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6400, best=0.51, avg=0.50, std=0.00, steps=2.360e+08
2023-07-07 21:24:03,097 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6500, best=0.51, avg=0.50, std=0.00, steps=2.397e+08
2023-07-07 21:24:36,431 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6600, best=0.51, avg=0.50, std=0.00, steps=2.433e+08
2023-07-07 21:25:09,686 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6700, best=0.51, avg=0.50, std=0.00, steps=2.470e+08
2023-07-07 21:25:43,025 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6800, best=0.51, avg=0.50, std=0.00, steps=2.507e+08
2023-07-07 21:26:16,420 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 6900, best=0.51, avg=0.50, std=0.00, steps=2.544e+08
2023-07-07 21:26:49,783 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7000, best=0.51, avg=0.50, std=0.00, steps=2.581e+08
2023-07-07 21:27:22,884 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7100, best=0.51, avg=0.50, std=0.00, steps=2.618e+08
2023-07-07 21:27:56,033 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7200, best=0.51, avg=0.50, std=0.00, steps=2.655e+08
2023-07-07 21:28:29,156 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7300, best=0.51, avg=0.50, std=0.00, steps=2.691e+08
2023-07-07 21:29:02,516 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7400, best=0.51, avg=0.50, std=0.00, steps=2.728e+08
2023-07-07 21:29:35,774 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7500, best=0.50, avg=0.50, std=0.00, steps=2.765e+08
2023-07-07 21:30:09,162 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7600, best=0.51, avg=0.50, std=0.00, steps=2.802e+08
2023-07-07 21:30:42,560 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7700, best=0.51, avg=0.50, std=0.00, steps=2.839e+08
2023-07-07 21:31:15,709 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7800, best=0.51, avg=0.50, std=0.00, steps=2.876e+08
2023-07-07 21:31:49,050 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 7900, best=0.51, avg=0.50, std=0.00, steps=2.913e+08
2023-07-07 21:32:22,471 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8000, best=0.51, avg=0.50, std=0.00, steps=2.949e+08
2023-07-07 21:32:55,851 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8100, best=0.51, avg=0.50, std=0.00, steps=2.986e+08
2023-07-07 21:33:29,254 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8200, best=0.51, avg=0.50, std=0.00, steps=3.023e+08
2023-07-07 21:34:02,667 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8300, best=0.51, avg=0.50, std=0.00, steps=3.060e+08
2023-07-07 21:34:36,157 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8400, best=0.51, avg=0.50, std=0.00, steps=3.097e+08
2023-07-07 21:35:09,557 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8500, best=0.51, avg=0.50, std=0.00, steps=3.134e+08
2023-07-07 21:35:42,782 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8600, best=0.51, avg=0.50, std=0.00, steps=3.171e+08
2023-07-07 21:36:15,977 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8700, best=0.51, avg=0.50, std=0.00, steps=3.208e+08
2023-07-07 21:36:49,465 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8800, best=0.51, avg=0.50, std=0.00, steps=3.244e+08
2023-07-07 21:37:22,886 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 8900, best=0.51, avg=0.50, std=0.00, steps=3.281e+08
2023-07-07 21:37:56,219 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9000, best=0.51, avg=0.50, std=0.00, steps=3.318e+08
2023-07-07 21:38:29,580 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9100, best=0.51, avg=0.50, std=0.00, steps=3.355e+08
2023-07-07 21:39:02,994 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9200, best=0.51, avg=0.50, std=0.00, steps=3.392e+08
2023-07-07 21:39:36,511 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9300, best=0.51, avg=0.50, std=0.00, steps=3.429e+08
2023-07-07 21:40:09,937 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9400, best=0.51, avg=0.50, std=0.00, steps=3.466e+08
2023-07-07 21:40:43,218 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9500, best=0.51, avg=0.50, std=0.00, steps=3.502e+08
2023-07-07 21:41:16,680 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9600, best=0.51, avg=0.50, std=0.00, steps=3.539e+08
2023-07-07 21:41:50,163 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9700, best=0.51, avg=0.50, std=0.00, steps=3.576e+08
2023-07-07 21:42:23,471 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9800, best=0.51, avg=0.50, std=0.00, steps=3.613e+08
2023-07-07 21:42:56,736 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 9900, best=0.51, avg=0.50, std=0.00, steps=3.650e+08
2023-07-07 21:43:30,018 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10000, best=0.51, avg=0.50, std=0.00, steps=3.687e+08
2023-07-07 21:44:03,233 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10100, best=0.51, avg=0.50, std=0.00, steps=3.724e+08
2023-07-07 21:44:36,552 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10200, best=0.53, avg=0.53, std=0.00, steps=3.760e+08
2023-07-07 21:45:10,021 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10300, best=0.55, avg=0.54, std=0.00, steps=3.797e+08
2023-07-07 21:45:43,503 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10400, best=0.55, avg=0.54, std=0.00, steps=3.834e+08
2023-07-07 21:46:16,885 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10500, best=0.55, avg=0.55, std=0.00, steps=3.871e+08
2023-07-07 21:46:50,410 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10600, best=0.56, avg=0.55, std=0.00, steps=3.908e+08
2023-07-07 21:47:23,854 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10700, best=0.56, avg=0.55, std=0.00, steps=3.945e+08
2023-07-07 21:47:57,326 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10800, best=0.56, avg=0.56, std=0.00, steps=3.982e+08
2023-07-07 21:48:30,813 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 10900, best=0.58, avg=0.57, std=0.00, steps=4.019e+08
2023-07-07 21:49:04,362 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11000, best=0.58, avg=0.57, std=0.00, steps=4.055e+08
2023-07-07 21:49:37,852 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11100, best=0.58, avg=0.58, std=0.00, steps=4.092e+08
2023-07-07 21:50:11,278 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11200, best=0.59, avg=0.58, std=0.00, steps=4.129e+08
2023-07-07 21:50:44,675 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11300, best=0.59, avg=0.58, std=0.00, steps=4.166e+08
2023-07-07 21:51:18,087 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11400, best=0.59, avg=0.59, std=0.00, steps=4.203e+08
2023-07-07 21:51:51,468 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11500, best=0.59, avg=0.59, std=0.00, steps=4.240e+08
2023-07-07 21:52:24,831 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11600, best=0.60, avg=0.59, std=0.00, steps=4.277e+08
2023-07-07 21:52:58,253 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11700, best=0.60, avg=0.59, std=0.00, steps=4.313e+08
2023-07-07 21:53:31,697 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11800, best=0.60, avg=0.60, std=0.00, steps=4.350e+08
2023-07-07 21:54:04,898 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11900, best=0.60, avg=0.60, std=0.00, steps=4.387e+08
2023-07-07 21:54:37,941 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 16, 0, [Train]: 11999, best=0.60, avg=0.60, std=0.00, steps=4.424e+08
2023-07-07 21:54:37,942 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135906
2023-07-07 21:54:37,967 -        meta learning: [    INFO] - [Total Params]: params=53505
2023-07-07 21:54:38,001 -           SimManager: [    INFO] - use_for_loop=False
2023-07-07 21:55:17,755 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 100, best=0.51, avg=0.50, std=0.00, steps=4.137e+06
2023-07-07 21:55:54,930 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 200, best=0.51, avg=0.50, std=0.00, steps=8.233e+06
2023-07-07 21:56:31,978 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 300, best=0.51, avg=0.50, std=0.00, steps=1.233e+07
2023-07-07 21:57:08,982 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 400, best=0.51, avg=0.50, std=0.00, steps=1.642e+07
2023-07-07 21:57:46,067 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 500, best=0.51, avg=0.50, std=0.00, steps=2.052e+07
2023-07-07 21:58:23,120 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 600, best=0.51, avg=0.50, std=0.00, steps=2.462e+07
2023-07-07 21:59:00,142 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 700, best=0.51, avg=0.50, std=0.00, steps=2.871e+07
2023-07-07 21:59:37,153 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 800, best=0.55, avg=0.54, std=0.00, steps=3.281e+07
2023-07-07 22:00:14,097 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 900, best=0.55, avg=0.55, std=0.00, steps=3.690e+07
2023-07-07 22:00:51,028 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1000, best=0.55, avg=0.55, std=0.00, steps=4.100e+07
2023-07-07 22:01:28,074 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1100, best=0.56, avg=0.55, std=0.00, steps=4.510e+07
2023-07-07 22:02:04,976 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1200, best=0.55, avg=0.55, std=0.00, steps=4.919e+07
2023-07-07 22:02:42,028 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1300, best=0.55, avg=0.55, std=0.00, steps=5.329e+07
2023-07-07 22:03:19,158 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1400, best=0.55, avg=0.55, std=0.00, steps=5.738e+07
2023-07-07 22:03:56,254 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1500, best=0.55, avg=0.55, std=0.00, steps=6.148e+07
2023-07-07 22:04:33,427 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1600, best=0.55, avg=0.55, std=0.00, steps=6.558e+07
2023-07-07 22:05:10,600 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1700, best=0.56, avg=0.55, std=0.00, steps=6.967e+07
2023-07-07 22:05:47,728 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1800, best=0.55, avg=0.55, std=0.00, steps=7.377e+07
2023-07-07 22:06:24,683 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 1900, best=0.55, avg=0.55, std=0.00, steps=7.786e+07
2023-07-07 22:07:01,676 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2000, best=0.56, avg=0.55, std=0.00, steps=8.196e+07
2023-07-07 22:07:38,742 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2100, best=0.55, avg=0.55, std=0.00, steps=8.606e+07
2023-07-07 22:08:15,777 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2200, best=0.55, avg=0.55, std=0.00, steps=9.015e+07
2023-07-07 22:08:52,656 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2300, best=0.55, avg=0.55, std=0.00, steps=9.425e+07
2023-07-07 22:09:29,712 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2400, best=0.55, avg=0.55, std=0.00, steps=9.834e+07
2023-07-07 22:10:06,861 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2500, best=0.55, avg=0.55, std=0.00, steps=1.024e+08
2023-07-07 22:10:43,812 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2600, best=0.56, avg=0.55, std=0.00, steps=1.065e+08
2023-07-07 22:11:20,775 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2700, best=0.56, avg=0.56, std=0.00, steps=1.106e+08
2023-07-07 22:11:57,602 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2800, best=0.56, avg=0.56, std=0.00, steps=1.147e+08
2023-07-07 22:12:34,408 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 2900, best=0.57, avg=0.56, std=0.00, steps=1.188e+08
2023-07-07 22:13:11,249 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3000, best=0.57, avg=0.57, std=0.00, steps=1.229e+08
2023-07-07 22:13:48,156 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3100, best=0.57, avg=0.57, std=0.00, steps=1.270e+08
2023-07-07 22:14:25,065 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3200, best=0.57, avg=0.57, std=0.00, steps=1.311e+08
2023-07-07 22:15:01,953 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3300, best=0.57, avg=0.57, std=0.00, steps=1.352e+08
2023-07-07 22:15:38,886 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3400, best=0.57, avg=0.57, std=0.00, steps=1.393e+08
2023-07-07 22:16:15,799 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3500, best=0.57, avg=0.57, std=0.00, steps=1.434e+08
2023-07-07 22:16:52,656 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3600, best=0.57, avg=0.57, std=0.00, steps=1.475e+08
2023-07-07 22:17:29,505 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3700, best=0.57, avg=0.57, std=0.00, steps=1.516e+08
2023-07-07 22:18:06,404 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3800, best=0.58, avg=0.57, std=0.00, steps=1.557e+08
2023-07-07 22:18:43,334 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 3900, best=0.58, avg=0.57, std=0.00, steps=1.598e+08
2023-07-07 22:19:20,303 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4000, best=0.58, avg=0.57, std=0.00, steps=1.639e+08
2023-07-07 22:19:57,259 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4100, best=0.58, avg=0.57, std=0.00, steps=1.680e+08
2023-07-07 22:20:34,345 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4200, best=0.58, avg=0.57, std=0.00, steps=1.721e+08
2023-07-07 22:21:11,243 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4300, best=0.58, avg=0.57, std=0.00, steps=1.762e+08
2023-07-07 22:21:48,166 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4400, best=0.58, avg=0.57, std=0.00, steps=1.803e+08
2023-07-07 22:22:25,079 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4500, best=0.58, avg=0.57, std=0.00, steps=1.844e+08
2023-07-07 22:23:01,991 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4600, best=0.58, avg=0.57, std=0.00, steps=1.885e+08
2023-07-07 22:23:38,845 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4700, best=0.58, avg=0.57, std=0.00, steps=1.926e+08
2023-07-07 22:24:15,792 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4800, best=0.58, avg=0.57, std=0.00, steps=1.966e+08
2023-07-07 22:24:52,707 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 4900, best=0.58, avg=0.57, std=0.00, steps=2.007e+08
2023-07-07 22:25:29,727 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5000, best=0.58, avg=0.57, std=0.00, steps=2.048e+08
2023-07-07 22:26:06,571 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5100, best=0.58, avg=0.58, std=0.00, steps=2.089e+08
2023-07-07 22:26:43,474 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5200, best=0.58, avg=0.58, std=0.00, steps=2.130e+08
2023-07-07 22:27:20,349 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5300, best=0.58, avg=0.58, std=0.00, steps=2.171e+08
2023-07-07 22:27:57,160 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5400, best=0.58, avg=0.58, std=0.00, steps=2.212e+08
2023-07-07 22:28:34,168 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5500, best=0.58, avg=0.58, std=0.00, steps=2.253e+08
2023-07-07 22:29:11,137 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5600, best=0.58, avg=0.58, std=0.00, steps=2.294e+08
2023-07-07 22:29:48,174 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5700, best=0.58, avg=0.58, std=0.00, steps=2.335e+08
2023-07-07 22:30:25,136 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5800, best=0.58, avg=0.58, std=0.00, steps=2.376e+08
2023-07-07 22:31:02,063 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 5900, best=0.58, avg=0.58, std=0.00, steps=2.417e+08
2023-07-07 22:31:38,910 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6000, best=0.58, avg=0.58, std=0.00, steps=2.458e+08
2023-07-07 22:32:15,727 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6100, best=0.58, avg=0.58, std=0.00, steps=2.499e+08
2023-07-07 22:32:52,584 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6200, best=0.58, avg=0.58, std=0.00, steps=2.540e+08
2023-07-07 22:33:29,426 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6300, best=0.58, avg=0.58, std=0.00, steps=2.581e+08
2023-07-07 22:34:06,367 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6400, best=0.58, avg=0.58, std=0.00, steps=2.622e+08
2023-07-07 22:34:43,351 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6500, best=0.58, avg=0.58, std=0.00, steps=2.663e+08
2023-07-07 22:35:20,323 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6600, best=0.58, avg=0.58, std=0.00, steps=2.704e+08
2023-07-07 22:35:57,156 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6700, best=0.58, avg=0.58, std=0.00, steps=2.745e+08
2023-07-07 22:36:34,002 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6800, best=0.58, avg=0.58, std=0.00, steps=2.786e+08
2023-07-07 22:37:10,901 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 6900, best=0.59, avg=0.58, std=0.00, steps=2.827e+08
2023-07-07 22:37:47,765 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7000, best=0.58, avg=0.58, std=0.00, steps=2.868e+08
2023-07-07 22:38:24,647 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7100, best=0.59, avg=0.58, std=0.00, steps=2.909e+08
2023-07-07 22:39:01,622 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7200, best=0.58, avg=0.58, std=0.00, steps=2.950e+08
2023-07-07 22:39:38,542 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7300, best=0.58, avg=0.58, std=0.00, steps=2.990e+08
2023-07-07 22:40:15,531 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7400, best=0.58, avg=0.58, std=0.00, steps=3.031e+08
2023-07-07 22:40:52,499 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7500, best=0.58, avg=0.58, std=0.00, steps=3.072e+08
2023-07-07 22:41:29,413 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7600, best=0.59, avg=0.58, std=0.00, steps=3.113e+08
2023-07-07 22:42:06,355 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7700, best=0.58, avg=0.58, std=0.00, steps=3.154e+08
2023-07-07 22:42:43,233 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7800, best=0.58, avg=0.58, std=0.00, steps=3.195e+08
2023-07-07 22:43:20,191 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 7900, best=0.58, avg=0.58, std=0.00, steps=3.236e+08
2023-07-07 22:43:57,114 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8000, best=0.58, avg=0.58, std=0.00, steps=3.277e+08
2023-07-07 22:44:34,034 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8100, best=0.59, avg=0.58, std=0.00, steps=3.318e+08
2023-07-07 22:45:10,969 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8200, best=0.58, avg=0.58, std=0.00, steps=3.359e+08
2023-07-07 22:45:47,874 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8300, best=0.58, avg=0.58, std=0.00, steps=3.400e+08
2023-07-07 22:46:24,764 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8400, best=0.58, avg=0.58, std=0.00, steps=3.441e+08
2023-07-07 22:47:01,780 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8500, best=0.59, avg=0.58, std=0.00, steps=3.482e+08
2023-07-07 22:47:38,682 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8600, best=0.58, avg=0.58, std=0.00, steps=3.523e+08
2023-07-07 22:48:15,497 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8700, best=0.58, avg=0.58, std=0.00, steps=3.564e+08
2023-07-07 22:48:52,335 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8800, best=0.58, avg=0.58, std=0.00, steps=3.605e+08
2023-07-07 22:49:29,276 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 8900, best=0.58, avg=0.58, std=0.00, steps=3.646e+08
2023-07-07 22:50:06,201 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9000, best=0.59, avg=0.58, std=0.00, steps=3.687e+08
2023-07-07 22:50:43,004 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9100, best=0.58, avg=0.58, std=0.00, steps=3.728e+08
2023-07-07 22:51:19,816 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9200, best=0.58, avg=0.58, std=0.00, steps=3.769e+08
2023-07-07 22:51:56,804 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9300, best=0.58, avg=0.58, std=0.00, steps=3.810e+08
2023-07-07 22:52:33,723 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9400, best=0.59, avg=0.58, std=0.00, steps=3.851e+08
2023-07-07 22:53:10,549 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9500, best=0.59, avg=0.58, std=0.00, steps=3.892e+08
2023-07-07 22:53:47,511 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9600, best=0.59, avg=0.58, std=0.00, steps=3.933e+08
2023-07-07 22:54:24,461 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9700, best=0.59, avg=0.59, std=0.00, steps=3.974e+08
2023-07-07 22:55:01,298 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9800, best=0.59, avg=0.59, std=0.00, steps=4.014e+08
2023-07-07 22:55:38,191 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 9900, best=0.59, avg=0.59, std=0.00, steps=4.055e+08
2023-07-07 22:56:15,025 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10000, best=0.59, avg=0.59, std=0.00, steps=4.096e+08
2023-07-07 22:56:52,076 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10100, best=0.59, avg=0.59, std=0.00, steps=4.137e+08
2023-07-07 22:57:28,969 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10200, best=0.59, avg=0.59, std=0.00, steps=4.178e+08
2023-07-07 22:58:06,025 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10300, best=0.59, avg=0.59, std=0.00, steps=4.219e+08
2023-07-07 22:58:42,907 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10400, best=0.60, avg=0.59, std=0.00, steps=4.260e+08
2023-07-07 22:59:19,922 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10500, best=0.60, avg=0.59, std=0.00, steps=4.301e+08
2023-07-07 22:59:56,840 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10600, best=0.59, avg=0.59, std=0.00, steps=4.342e+08
2023-07-07 23:00:33,840 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10700, best=0.60, avg=0.59, std=0.00, steps=4.383e+08
2023-07-07 23:01:10,695 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10800, best=0.60, avg=0.59, std=0.00, steps=4.424e+08
2023-07-07 23:01:47,625 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 10900, best=0.60, avg=0.59, std=0.00, steps=4.465e+08
2023-07-07 23:02:24,476 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11000, best=0.60, avg=0.59, std=0.00, steps=4.506e+08
2023-07-07 23:03:01,340 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11100, best=0.60, avg=0.59, std=0.00, steps=4.547e+08
2023-07-07 23:03:38,221 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11200, best=0.60, avg=0.59, std=0.00, steps=4.588e+08
2023-07-07 23:04:15,127 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11300, best=0.60, avg=0.59, std=0.00, steps=4.629e+08
2023-07-07 23:04:52,070 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11400, best=0.60, avg=0.59, std=0.00, steps=4.670e+08
2023-07-07 23:05:28,914 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11500, best=0.60, avg=0.59, std=0.00, steps=4.711e+08
2023-07-07 23:06:05,764 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11600, best=0.60, avg=0.59, std=0.00, steps=4.752e+08
2023-07-07 23:06:42,723 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11700, best=0.60, avg=0.59, std=0.00, steps=4.793e+08
2023-07-07 23:07:19,508 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11800, best=0.60, avg=0.59, std=0.00, steps=4.834e+08
2023-07-07 23:07:56,350 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11900, best=0.60, avg=0.59, std=0.00, steps=4.875e+08
2023-07-07 23:08:33,040 -        meta learning: [    INFO] - [Len Lat Rep]: 64, 32, 0, [Train]: 11999, best=0.60, avg=0.59, std=0.00, steps=4.915e+08
2023-07-07 23:08:33,041 -        meta learning: [    INFO] - [OUTPUT DIR]: /data/anonymous/meta/train/PGPE-BatchedGruMetaStdpMLPPolicy-SeqTask--20230707-135906
