Matrix distribution: unif
Matrix distribution config: {'c': 0.25, 'd': 5000, 'eps': 0.001}
Initial matrix shape: torch.Size([5000, 5000])
Algorithm name: mcts
Algorithm config: {'c_ucb': 5.0, 'alpha_pw': 0.4, 'epsilon': 1e-06, 'EXPLORE_K': 5, 'early_termination_epsilon': 1e-05, 'budget': 80000, 'print_every': 1000, 'max_termination_count': 10, 'tree_initial_capacity': 10000, 'device': 'cuda', 'actions': [['sign_ns', [[0, 0], [5, 5]]], ['sign_newton', [[0], [40]]], ['sign_quintic', [[0, 0, 0], [5, 5, 5]]], ['sign_halley', [[0, 0, 0], [40, 40, 40]]]], 'initialize_with_baselines': True}
Actions: ['sign_halley', 'sign_newton', 'sign_ns', 'sign_quintic']
Action sign_halley took 1.0 times longer than sign_halley
Action sign_newton took 0.4015156458599125 times longer than sign_halley
Action sign_ns took 0.17291880880647825 times longer than sign_halley
Action sign_quintic took 0.2559094953286987 times longer than sign_halley
Skipping sign_newton_variant because not all actions are in the tree
Skipping inv_ns because not all actions are in the tree
Skipping inv_ns_chebyshev because not all actions are in the tree
Skipping sqrt_db because not all actions are in the tree
Skipping sqrt_nsv because not all actions are in the tree
Skipping sqrt_visser because not all actions are in the tree
Skipping sqrt_newton because not all actions are in the tree
Skipping sqrt_visser_coupled because not all actions are in the tree
Skipping sqrt_newton_coupled because not all actions are in the tree
Skipping proot_newton because not all actions are in the tree
Skipping proot_visser because not all actions are in the tree
Skipping proot_iannazzo because not all actions are in the tree
[?25l0/79 [38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m0.0%[0m Elapsed: [33m0:00:00[0m Remaining: [36m-:--:--[0m 501607.34 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 0 ===                                                                                                                                                      │
│ 1  nodes in tree                                                                                                                                                         │
│ [-2.9021069 -2.9021069]                                                                                                                                                  │
│ [-2.13070373 -2.13070373]                                                                                                                                                │
│ [-2.00757823 -2.00757823 -2.00757823]                                                                                                                                    │
│ [-1.4390285 -1.4390285 -1.4390285]                                                                                                                                       │
│ [-1.4390285  -1.4390285  -1.4390285  -1.03751285]                                                                                                                        │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K0/79 [38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m0.0%[0m Elapsed: [33m0:00:01[0m Remaining: [36m-:--:--[0m 1005246.76 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 0 ===                                                                                                                                                      │
│ 1  nodes in tree                                                                                                                                                         │
│ [-2.9021069 -2.9021069]                                                                                                                                                  │
│ [-2.13070373 -2.13070373]                                                                                                                                                │
│ [-2.00757823 -2.00757823 -2.00757823]                                                                                                                                    │
│ [-1.4390285 -1.4390285 -1.4390285]                                                                                                                                       │
│ [-1.4390285  -1.4390285  -1.4390285  -1.03751285]                                                                                                                        │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K0/79 [38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m0.0%[0m Elapsed: [33m0:00:01[0m Remaining: [36m-:--:--[0m 1508757.70 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 0 ===                                                                                                                                                      │
│ 1  nodes in tree                                                                                                                                                         │
│ [-2.9021069 -2.9021069]                                                                                                                                                  │
│ [-2.13070373 -2.13070373]                                                                                                                                                │
│ [-2.00757823 -2.00757823 -2.00757823]                                                                                                                                    │
│ [-1.4390285 -1.4390285 -1.4390285]                                                                                                                                       │
│ [-1.4390285  -1.4390285  -1.4390285  -1.03751285]                                                                                                                        │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K1/79 [38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m1.3%[0m Elapsed: [33m0:00:02[0m Remaining: [36m-:--:--[0m   2.01 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 1000 ===                                                                                                                                                   │
│ 1001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 141, 146, 153, 162, 1000]                                                                                                                                   │
│ Average cumulative reward:       -6.954544601478511                                                                                                                      │
│ Average rollout reward:          -6.537481201486542                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K1/79 [38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m1.3%[0m Elapsed: [33m0:00:02[0m Remaining: [36m-:--:--[0m   2.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 1000 ===                                                                                                                                                   │
│ 1001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 141, 146, 153, 162, 1000]                                                                                                                                   │
│ Average cumulative reward:       -6.954544601478511                                                                                                                      │
│ Average rollout reward:          -6.537481201486542                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K1/79 [38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m1.3%[0m Elapsed: [33m0:00:03[0m Remaining: [36m-:--:--[0m   3.02 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 1000 ===                                                                                                                                                   │
│ 1001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 141, 146, 153, 162, 1000]                                                                                                                                   │
│ Average cumulative reward:       -6.954544601478511                                                                                                                      │
│ Average rollout reward:          -6.537481201486542                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K2/79 [38;2;249;38;114m━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m2.5%[0m Elapsed: [33m0:00:03[0m Remaining: [36m0:02:10[0m   1.76 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 2000 ===                                                                                                                                                   │
│ 2001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 45, 50, 53, 2000]                                                                                                                                           │
│ Average cumulative reward:       -6.582980195070264                                                                                                                      │
│ Average rollout reward:          -6.084658085489628                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K2/79 [38;2;249;38;114m━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m2.5%[0m Elapsed: [33m0:00:04[0m Remaining: [36m0:02:10[0m   2.01 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 2000 ===                                                                                                                                                   │
│ 2001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 45, 50, 53, 2000]                                                                                                                                           │
│ Average cumulative reward:       -6.582980195070264                                                                                                                      │
│ Average rollout reward:          -6.084658085489628                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K2/79 [38;2;249;38;114m━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m2.5%[0m Elapsed: [33m0:00:04[0m Remaining: [36m0:02:10[0m   2.27 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 2000 ===                                                                                                                                                   │
│ 2001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 45, 50, 53, 2000]                                                                                                                                           │
│ Average cumulative reward:       -6.582980195070264                                                                                                                      │
│ Average rollout reward:          -6.084658085489628                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K2/79 [38;2;249;38;114m━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m2.5%[0m Elapsed: [33m0:00:05[0m Remaining: [36m0:02:10[0m   2.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 2000 ===                                                                                                                                                   │
│ 2001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 45, 50, 53, 2000]                                                                                                                                           │
│ Average cumulative reward:       -6.582980195070264                                                                                                                      │
│ Average rollout reward:          -6.084658085489628                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K3/79 [38;2;249;38;114m━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m3.8%[0m Elapsed: [33m0:00:05[0m Remaining: [36m0:02:13[0m   1.85 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 3000 ===                                                                                                                                                   │
│ 3001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 2961, 2965, 2994, 3000]                                                                                                                                     │
│ Average cumulative reward:       -6.967146183909075                                                                                                                      │
│ Average rollout reward:          -6.457757270564835                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K3/79 [38;2;249;38;114m━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m3.8%[0m Elapsed: [33m0:00:06[0m Remaining: [36m0:02:13[0m   2.01 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 3000 ===                                                                                                                                                   │
│ 3001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 2961, 2965, 2994, 3000]                                                                                                                                     │
│ Average cumulative reward:       -6.967146183909075                                                                                                                      │
│ Average rollout reward:          -6.457757270564835                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K3/79 [38;2;249;38;114m━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m3.8%[0m Elapsed: [33m0:00:06[0m Remaining: [36m0:02:13[0m   2.18 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 3000 ===                                                                                                                                                   │
│ 3001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 2961, 2965, 2994, 3000]                                                                                                                                     │
│ Average cumulative reward:       -6.967146183909075                                                                                                                      │
│ Average rollout reward:          -6.457757270564835                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K4/79 [38;2;249;38;114m━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m5.1%[0m Elapsed: [33m0:00:07[0m Remaining: [36m0:02:11[0m   1.76 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 4000 ===                                                                                                                                                   │
│ 4001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 3940, 3942, 3987, 3990, 4000]                                                                                                                               │
│ Average cumulative reward:       -6.216891118063493                                                                                                                      │
│ Average rollout reward:          -5.669838950932916                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K4/79 [38;2;249;38;114m━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m5.1%[0m Elapsed: [33m0:00:07[0m Remaining: [36m0:02:11[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 4000 ===                                                                                                                                                   │
│ 4001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 3940, 3942, 3987, 3990, 4000]                                                                                                                               │
│ Average cumulative reward:       -6.216891118063493                                                                                                                      │
│ Average rollout reward:          -5.669838950932916                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K4/79 [38;2;249;38;114m━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m5.1%[0m Elapsed: [33m0:00:08[0m Remaining: [36m0:02:11[0m   2.01 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 4000 ===                                                                                                                                                   │
│ 4001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 3940, 3942, 3987, 3990, 4000]                                                                                                                               │
│ Average cumulative reward:       -6.216891118063493                                                                                                                      │
│ Average rollout reward:          -5.669838950932916                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K5/79 [38;2;249;38;114m━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m6.3%[0m Elapsed: [33m0:00:08[0m Remaining: [36m0:02:08[0m   1.71 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 5000 ===                                                                                                                                                   │
│ 5001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 4888, 4889, 4900, 4919, 5000]                                                                                                                               │
│ Average cumulative reward:       -6.771515837797187                                                                                                                      │
│ Average rollout reward:          -6.228628207037764                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K5/79 [38;2;249;38;114m━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m6.3%[0m Elapsed: [33m0:00:09[0m Remaining: [36m0:02:08[0m   1.81 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 5000 ===                                                                                                                                                   │
│ 5001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 4888, 4889, 4900, 4919, 5000]                                                                                                                               │
│ Average cumulative reward:       -6.771515837797187                                                                                                                      │
│ Average rollout reward:          -6.228628207037764                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━━━━━━[0m [35m6.3%[0m Elapsed: [33m0:00:09[0m Remaining: [36m0:02:08[0m   1.91 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 5000 ===                                                                                                                                                   │
│ 5001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 4888, 4889, 4900, 4919, 5000]                                                                                                                               │
│ Average cumulative reward:       -6.771515837797187                                                                                                                      │
│ Average rollout reward:          -6.228628207037764                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K5/79 [38;2;249;38;114m━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m6.3%[0m Elapsed: [33m0:00:10[0m Remaining: [36m0:02:08[0m   2.01 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 5000 ===                                                                                                                                                   │
│ 5001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 4888, 4889, 4900, 4919, 5000]                                                                                                                               │
│ Average cumulative reward:       -6.771515837797187                                                                                                                      │
│ Average rollout reward:          -6.228628207037764                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K6/79 [38;2;249;38;114m━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m7.6%[0m Elapsed: [33m0:00:10[0m Remaining: [36m0:02:08[0m   1.76 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 6000 ===                                                                                                                                                   │
│ 6001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 5963, 5964, 5969, 6000]                                                                                                                                     │
│ Average cumulative reward:       -6.7507911515238685                                                                                                                     │
│ Average rollout reward:          -6.212495827189361                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K6/79 [38;2;249;38;114m━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m7.6%[0m Elapsed: [33m0:00:11[0m Remaining: [36m0:02:08[0m   1.85 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 6000 ===                                                                                                                                                   │
│ 6001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 5963, 5964, 5969, 6000]                                                                                                                                     │
│ Average cumulative reward:       -6.7507911515238685                                                                                                                     │
│ Average rollout reward:          -6.212495827189361                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K6/79 [38;2;249;38;114m━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m7.6%[0m Elapsed: [33m0:00:11[0m Remaining: [36m0:02:08[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 6000 ===                                                                                                                                                   │
│ 6001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 5963, 5964, 5969, 6000]                                                                                                                                     │
│ Average cumulative reward:       -6.7507911515238685                                                                                                                     │
│ Average rollout reward:          -6.212495827189361                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K7/79 [38;2;249;38;114m━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m8.9%[0m Elapsed: [33m0:00:12[0m Remaining: [36m0:02:06[0m   1.73 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 7000 ===                                                                                                                                                   │
│ 7001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 6918, 6919, 6986, 6988, 7000]                                                                                                                               │
│ Average cumulative reward:       -6.588840408600137                                                                                                                      │
│ Average rollout reward:          -6.028576249012245                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K7/79 [38;2;249;38;114m━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m8.9%[0m Elapsed: [33m0:00:12[0m Remaining: [36m0:02:06[0m   1.80 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 7000 ===                                                                                                                                                   │
│ 7001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 6918, 6919, 6986, 6988, 7000]                                                                                                                               │
│ Average cumulative reward:       -6.588840408600137                                                                                                                      │
│ Average rollout reward:          -6.028576249012245                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K7/79 [38;2;249;38;114m━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m8.9%[0m Elapsed: [33m0:00:13[0m Remaining: [36m0:02:06[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 7000 ===                                                                                                                                                   │
│ 7001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 6918, 6919, 6986, 6988, 7000]                                                                                                                               │
│ Average cumulative reward:       -6.588840408600137                                                                                                                      │
│ Average rollout reward:          -6.028576249012245                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K7/79 [38;2;249;38;114m━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m8.9%[0m Elapsed: [33m0:00:13[0m Remaining: [36m0:02:06[0m   1.94 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 7000 ===                                                                                                                                                   │
│ 7001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 6918, 6919, 6986, 6988, 7000]                                                                                                                               │
│ Average cumulative reward:       -6.588840408600137                                                                                                                      │
│ Average rollout reward:          -6.028576249012245                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K8/79 [38;2;249;38;114m━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m10.1%[0m Elapsed: [33m0:00:14[0m Remaining: [36m0:02:04[0m   1.76 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 8000 ===                                                                                                                                                   │
│ 8001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 2815, 2816, 2906, 2940, 8000]                                                                                                                               │
│ Average cumulative reward:       -7.0786845412370445                                                                                                                     │
│ Average rollout reward:          -6.5054193318566815                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K8/79 [38;2;249;38;114m━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m10.1%[0m Elapsed: [33m0:00:14[0m Remaining: [36m0:02:04[0m   1.83 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 8000 ===                                                                                                                                                   │
│ 8001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 2815, 2816, 2906, 2940, 8000]                                                                                                                               │
│ Average cumulative reward:       -7.0786845412370445                                                                                                                     │
│ Average rollout reward:          -6.5054193318566815                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K8/79 [38;2;249;38;114m━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m10.1%[0m Elapsed: [33m0:00:15[0m Remaining: [36m0:02:04[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 8000 ===                                                                                                                                                   │
│ 8001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 2815, 2816, 2906, 2940, 8000]                                                                                                                               │
│ Average cumulative reward:       -7.0786845412370445                                                                                                                     │
│ Average rollout reward:          -6.5054193318566815                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K9/79 [38;2;249;38;114m━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m11.4%[0m Elapsed: [33m0:00:15[0m Remaining: [36m0:02:02[0m   1.74 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 9000 ===                                                                                                                                                   │
│ 9001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 4, 15, 9000]                                                                                                                                                │
│ Average cumulative reward:       -6.622525738626829                                                                                                                      │
│ Average rollout reward:          -6.05136769408413                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K9/79 [38;2;249;38;114m━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m11.4%[0m Elapsed: [33m0:00:16[0m Remaining: [36m0:02:02[0m   1.79 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 9000 ===                                                                                                                                                   │
│ 9001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 4, 15, 9000]                                                                                                                                                │
│ Average cumulative reward:       -6.622525738626829                                                                                                                      │
│ Average rollout reward:          -6.05136769408413                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K9/79 [38;2;249;38;114m━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m11.4%[0m Elapsed: [33m0:00:16[0m Remaining: [36m0:02:02[0m   1.85 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 9000 ===                                                                                                                                                   │
│ 9001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 4, 15, 9000]                                                                                                                                                │
│ Average cumulative reward:       -6.622525738626829                                                                                                                      │
│ Average rollout reward:          -6.05136769408413                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K9/79 [38;2;249;38;114m━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m11.4%[0m Elapsed: [33m0:00:17[0m Remaining: [36m0:02:02[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 9000 ===                                                                                                                                                   │
│ 9001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 4, 15, 9000]                                                                                                                                                │
│ Average cumulative reward:       -6.622525738626829                                                                                                                      │
│ Average rollout reward:          -6.05136769408413                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K10/79 [38;2;249;38;114m━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m12.7%[0m Elapsed: [33m0:00:17[0m Remaining: [36m0:02:01[0m   1.76 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 10000 ===                                                                                                                                                  │
│ 10001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2815, 2816, 9011, 9823, 10000]                                                                                                                              │
│ Average cumulative reward:       -6.544382566749189                                                                                                                      │
│ Average rollout reward:          -5.9590248102004955                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K10/79 [38;2;249;38;114m━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m12.7%[0m Elapsed: [33m0:00:18[0m Remaining: [36m0:02:01[0m   1.81 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 10000 ===                                                                                                                                                  │
│ 10001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2815, 2816, 9011, 9823, 10000]                                                                                                                              │
│ Average cumulative reward:       -6.544382566749189                                                                                                                      │
│ Average rollout reward:          -5.9590248102004955                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K10/79 [38;2;249;38;114m━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m12.7%[0m Elapsed: [33m0:00:18[0m Remaining: [36m0:02:01[0m   1.86 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 10000 ===                                                                                                                                                  │
│ 10001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2815, 2816, 9011, 9823, 10000]                                                                                                                              │
│ Average cumulative reward:       -6.544382566749189                                                                                                                      │
│ Average rollout reward:          -5.9590248102004955                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K10/79 [38;2;249;38;114m━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m12.7%[0m Elapsed: [33m0:00:19[0m Remaining: [36m0:02:01[0m   1.91 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 10000 ===                                                                                                                                                  │
│ 10001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2815, 2816, 9011, 9823, 10000]                                                                                                                              │
│ Average cumulative reward:       -6.544382566749189                                                                                                                      │
│ Average rollout reward:          -5.9590248102004955                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━━━━━━━[0m [35m13.9%[0m Elapsed: [33m0:00:19[0m Remaining: [36m0:02:00[0m   1.79 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 11000 ===                                                                                                                                                  │
│ 11001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 10962, 10965, 10982, 11000]                                                                                                                                 │
│ Average cumulative reward:       -6.985173646945893                                                                                                                      │
│ Average rollout reward:          -6.407544274276909                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K11/79 [38;2;249;38;114m━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m13.9%[0m Elapsed: [33m0:00:20[0m Remaining: [36m0:02:00[0m   1.83 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 11000 ===                                                                                                                                                  │
│ 11001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 10962, 10965, 10982, 11000]                                                                                                                                 │
│ Average cumulative reward:       -6.985173646945893                                                                                                                      │
│ Average rollout reward:          -6.407544274276909                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K11/79 [38;2;249;38;114m━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m13.9%[0m Elapsed: [33m0:00:20[0m Remaining: [36m0:02:00[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 11000 ===                                                                                                                                                  │
│ 11001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 10962, 10965, 10982, 11000]                                                                                                                                 │
│ Average cumulative reward:       -6.985173646945893                                                                                                                      │
│ Average rollout reward:          -6.407544274276909                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K12/79 [38;2;249;38;114m━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m15.2%[0m Elapsed: [33m0:00:21[0m Remaining: [36m0:01:59[0m   1.76 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 12000 ===                                                                                                                                                  │
│ 12001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1909, 1911, 10893, 12000]                                                                                                                                   │
│ Average cumulative reward:       -6.98956114310238                                                                                                                       │
│ Average rollout reward:          -6.383427328789741                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K12/79 [38;2;249;38;114m━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m15.2%[0m Elapsed: [33m0:00:21[0m Remaining: [36m0:01:59[0m   1.81 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 12000 ===                                                                                                                                                  │
│ 12001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1909, 1911, 10893, 12000]                                                                                                                                   │
│ Average cumulative reward:       -6.98956114310238                                                                                                                       │
│ Average rollout reward:          -6.383427328789741                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K12/79 [38;2;249;38;114m━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m15.2%[0m Elapsed: [33m0:00:22[0m Remaining: [36m0:01:59[0m   1.85 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 12000 ===                                                                                                                                                  │
│ 12001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1909, 1911, 10893, 12000]                                                                                                                                   │
│ Average cumulative reward:       -6.98956114310238                                                                                                                       │
│ Average rollout reward:          -6.383427328789741                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K12/79 [38;2;249;38;114m━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m15.2%[0m Elapsed: [33m0:00:22[0m Remaining: [36m0:01:59[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 12000 ===                                                                                                                                                  │
│ 12001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1909, 1911, 10893, 12000]                                                                                                                                   │
│ Average cumulative reward:       -6.98956114310238                                                                                                                       │
│ Average rollout reward:          -6.383427328789741                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K13/79 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m16.5%[0m Elapsed: [33m0:00:23[0m Remaining: [36m0:01:58[0m   1.78 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 13000 ===                                                                                                                                                  │
│ 13001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2815, 2816, 11218, 11275, 13000]                                                                                                                            │
│ Average cumulative reward:       -6.507578147423539                                                                                                                      │
│ Average rollout reward:          -5.916648454133788                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K13/79 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m16.5%[0m Elapsed: [33m0:00:23[0m Remaining: [36m0:01:58[0m   1.82 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 13000 ===                                                                                                                                                  │
│ 13001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2815, 2816, 11218, 11275, 13000]                                                                                                                            │
│ Average cumulative reward:       -6.507578147423539                                                                                                                      │
│ Average rollout reward:          -5.916648454133788                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K13/79 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m16.5%[0m Elapsed: [33m0:00:24[0m Remaining: [36m0:01:58[0m   1.86 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 13000 ===                                                                                                                                                  │
│ 13001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2815, 2816, 11218, 11275, 13000]                                                                                                                            │
│ Average cumulative reward:       -6.507578147423539                                                                                                                      │
│ Average rollout reward:          -5.916648454133788                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K14/79 [38;2;249;38;114m━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m17.7%[0m Elapsed: [33m0:00:24[0m Remaining: [36m0:01:56[0m   1.76 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 14000 ===                                                                                                                                                  │
│ 14001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13793, 13795, 13856, 13861, 14000]                                                                                                                          │
│ Average cumulative reward:       -6.7796784041104665                                                                                                                     │
│ Average rollout reward:          -6.2041733394481255                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K14/79 [38;2;249;38;114m━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m17.7%[0m Elapsed: [33m0:00:25[0m Remaining: [36m0:01:56[0m   1.80 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 14000 ===                                                                                                                                                  │
│ 14001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13793, 13795, 13856, 13861, 14000]                                                                                                                          │
│ Average cumulative reward:       -6.7796784041104665                                                                                                                     │
│ Average rollout reward:          -6.2041733394481255                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K14/79 [38;2;249;38;114m━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m17.7%[0m Elapsed: [33m0:00:25[0m Remaining: [36m0:01:56[0m   1.84 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 14000 ===                                                                                                                                                  │
│ 14001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13793, 13795, 13856, 13861, 14000]                                                                                                                          │
│ Average cumulative reward:       -6.7796784041104665                                                                                                                     │
│ Average rollout reward:          -6.2041733394481255                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K14/79 [38;2;249;38;114m━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m17.7%[0m Elapsed: [33m0:00:26[0m Remaining: [36m0:01:56[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 14000 ===                                                                                                                                                  │
│ 14001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13793, 13795, 13856, 13861, 14000]                                                                                                                          │
│ Average cumulative reward:       -6.7796784041104665                                                                                                                     │
│ Average rollout reward:          -6.2041733394481255                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K15/79 [38;2;249;38;114m━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m19.0%[0m Elapsed: [33m0:00:26[0m Remaining: [36m0:01:54[0m   1.78 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 15000 ===                                                                                                                                                  │
│ 15001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 14957, 14959, 14984, 14989, 15000]                                                                                                                          │
│ Average cumulative reward:       -6.860250832041296                                                                                                                      │
│ Average rollout reward:          -6.312790971335654                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K15/79 [38;2;249;38;114m━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m19.0%[0m Elapsed: [33m0:00:27[0m Remaining: [36m0:01:54[0m   1.81 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 15000 ===                                                                                                                                                  │
│ 15001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 14957, 14959, 14984, 14989, 15000]                                                                                                                          │
│ Average cumulative reward:       -6.860250832041296                                                                                                                      │
│ Average rollout reward:          -6.312790971335654                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K15/79 [38;2;249;38;114m━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m19.0%[0m Elapsed: [33m0:00:27[0m Remaining: [36m0:01:54[0m   1.85 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 15000 ===                                                                                                                                                  │
│ 15001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 14957, 14959, 14984, 14989, 15000]                                                                                                                          │
│ Average cumulative reward:       -6.860250832041296                                                                                                                      │
│ Average rollout reward:          -6.312790971335654                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K15/79 [38;2;249;38;114m━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m19.0%[0m Elapsed: [33m0:00:28[0m Remaining: [36m0:01:54[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 15000 ===                                                                                                                                                  │
│ 15001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 14957, 14959, 14984, 14989, 15000]                                                                                                                          │
│ Average cumulative reward:       -6.860250832041296                                                                                                                      │
│ Average rollout reward:          -6.312790971335654                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K16/79 [38;2;249;38;114m━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m20.3%[0m Elapsed: [33m0:00:28[0m Remaining: [36m0:01:53[0m   1.79 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 16000 ===                                                                                                                                                  │
│ 16001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15765, 15766, 15994, 16000]                                                                                                                                 │
│ Average cumulative reward:       -6.866496964754436                                                                                                                      │
│ Average rollout reward:          -6.258307983416772                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K16/79 [38;2;249;38;114m━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m20.3%[0m Elapsed: [33m0:00:29[0m Remaining: [36m0:01:53[0m   1.83 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 16000 ===                                                                                                                                                  │
│ 16001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15765, 15766, 15994, 16000]                                                                                                                                 │
│ Average cumulative reward:       -6.866496964754436                                                                                                                      │
│ Average rollout reward:          -6.258307983416772                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━[0m [35m20.3%[0m Elapsed: [33m0:00:29[0m Remaining: [36m0:01:53[0m   1.86 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 16000 ===                                                                                                                                                  │
│ 16001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15765, 15766, 15994, 16000]                                                                                                                                 │
│ Average cumulative reward:       -6.866496964754436                                                                                                                      │
│ Average rollout reward:          -6.258307983416772                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K17/79 [38;2;249;38;114m━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m21.5%[0m Elapsed: [33m0:00:30[0m Remaining: [36m0:01:51[0m   1.78 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 17000 ===                                                                                                                                                  │
│ 17001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4305, 4307, 6573, 6618, 17000]                                                                                                                              │
│ Average cumulative reward:       -6.667583072866145                                                                                                                      │
│ Average rollout reward:          -6.033025180092475                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K17/79 [38;2;249;38;114m━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m21.5%[0m Elapsed: [33m0:00:30[0m Remaining: [36m0:01:51[0m   1.81 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 17000 ===                                                                                                                                                  │
│ 17001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4305, 4307, 6573, 6618, 17000]                                                                                                                              │
│ Average cumulative reward:       -6.667583072866145                                                                                                                      │
│ Average rollout reward:          -6.033025180092475                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K17/79 [38;2;249;38;114m━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m21.5%[0m Elapsed: [33m0:00:31[0m Remaining: [36m0:01:51[0m   1.84 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 17000 ===                                                                                                                                                  │
│ 17001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4305, 4307, 6573, 6618, 17000]                                                                                                                              │
│ Average cumulative reward:       -6.667583072866145                                                                                                                      │
│ Average rollout reward:          -6.033025180092475                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K17/79 [38;2;249;38;114m━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m21.5%[0m Elapsed: [33m0:00:31[0m Remaining: [36m0:01:51[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 17000 ===                                                                                                                                                  │
│ 17001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4305, 4307, 6573, 6618, 17000]                                                                                                                              │
│ Average cumulative reward:       -6.667583072866145                                                                                                                      │
│ Average rollout reward:          -6.033025180092475                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K18/79 [38;2;249;38;114m━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m22.8%[0m Elapsed: [33m0:00:32[0m Remaining: [36m0:01:50[0m   1.79 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 18000 ===                                                                                                                                                  │
│ 18001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17898, 17899, 17947, 17952, 18000]                                                                                                                          │
│ Average cumulative reward:       -6.657944001240103                                                                                                                      │
│ Average rollout reward:          -6.0799103974136015                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K18/79 [38;2;249;38;114m━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m22.8%[0m Elapsed: [33m0:00:32[0m Remaining: [36m0:01:50[0m   1.82 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 18000 ===                                                                                                                                                  │
│ 18001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17898, 17899, 17947, 17952, 18000]                                                                                                                          │
│ Average cumulative reward:       -6.657944001240103                                                                                                                      │
│ Average rollout reward:          -6.0799103974136015                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K18/79 [38;2;249;38;114m━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m22.8%[0m Elapsed: [33m0:00:33[0m Remaining: [36m0:01:50[0m   1.85 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 18000 ===                                                                                                                                                  │
│ 18001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17898, 17899, 17947, 17952, 18000]                                                                                                                          │
│ Average cumulative reward:       -6.657944001240103                                                                                                                      │
│ Average rollout reward:          -6.0799103974136015                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K18/79 [38;2;249;38;114m━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m22.8%[0m Elapsed: [33m0:00:33[0m Remaining: [36m0:01:50[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 18000 ===                                                                                                                                                  │
│ 18001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17898, 17899, 17947, 17952, 18000]                                                                                                                          │
│ Average cumulative reward:       -6.657944001240103                                                                                                                      │
│ Average rollout reward:          -6.0799103974136015                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K19/79 [38;2;249;38;114m━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m24.1%[0m Elapsed: [33m0:00:34[0m Remaining: [36m0:01:48[0m   1.80 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 19000 ===                                                                                                                                                  │
│ 19001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4120, 4122, 5485, 6157, 19000]                                                                                                                              │
│ Average cumulative reward:       -6.845788239389789                                                                                                                      │
│ Average rollout reward:          -6.219545538948215                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K19/79 [38;2;249;38;114m━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m24.1%[0m Elapsed: [33m0:00:34[0m Remaining: [36m0:01:48[0m   1.83 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 19000 ===                                                                                                                                                  │
│ 19001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4120, 4122, 5485, 6157, 19000]                                                                                                                              │
│ Average cumulative reward:       -6.845788239389789                                                                                                                      │
│ Average rollout reward:          -6.219545538948215                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K19/79 [38;2;249;38;114m━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m24.1%[0m Elapsed: [33m0:00:35[0m Remaining: [36m0:01:48[0m   1.86 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 19000 ===                                                                                                                                                  │
│ 19001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4120, 4122, 5485, 6157, 19000]                                                                                                                              │
│ Average cumulative reward:       -6.845788239389789                                                                                                                      │
│ Average rollout reward:          -6.219545538948215                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K20/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m25.3%[0m Elapsed: [33m0:00:35[0m Remaining: [36m0:01:46[0m   1.79 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 20000 ===                                                                                                                                                  │
│ 20001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 16178, 16179, 16194, 16204, 20000]                                                                                                                          │
│ Average cumulative reward:       -6.85035223340769                                                                                                                       │
│ Average rollout reward:          -6.221802172905313                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K20/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m25.3%[0m Elapsed: [33m0:00:36[0m Remaining: [36m0:01:46[0m   1.81 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 20000 ===                                                                                                                                                  │
│ 20001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 16178, 16179, 16194, 16204, 20000]                                                                                                                          │
│ Average cumulative reward:       -6.85035223340769                                                                                                                       │
│ Average rollout reward:          -6.221802172905313                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K20/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m25.3%[0m Elapsed: [33m0:00:36[0m Remaining: [36m0:01:46[0m   1.84 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 20000 ===                                                                                                                                                  │
│ 20001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 16178, 16179, 16194, 16204, 20000]                                                                                                                          │
│ Average cumulative reward:       -6.85035223340769                                                                                                                       │
│ Average rollout reward:          -6.221802172905313                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K20/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m25.3%[0m Elapsed: [33m0:00:37[0m Remaining: [36m0:01:46[0m   1.86 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 20000 ===                                                                                                                                                  │
│ 20001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 16178, 16179, 16194, 16204, 20000]                                                                                                                          │
│ Average cumulative reward:       -6.85035223340769                                                                                                                       │
│ Average rollout reward:          -6.221802172905313                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K21/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.6%[0m Elapsed: [33m0:00:37[0m Remaining: [36m0:01:45[0m   1.80 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 21000 ===                                                                                                                                                  │
│ 21001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 752, 756, 758, 10373, 21000]                                                                                                                                │
│ Average cumulative reward:       -6.488804006687071                                                                                                                      │
│ Average rollout reward:          -5.901933889439239                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K21/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.6%[0m Elapsed: [33m0:00:38[0m Remaining: [36m0:01:45[0m   1.82 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 21000 ===                                                                                                                                                  │
│ 21001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 752, 756, 758, 10373, 21000]                                                                                                                                │
│ Average cumulative reward:       -6.488804006687071                                                                                                                      │
│ Average rollout reward:          -5.901933889439239                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K21/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.6%[0m Elapsed: [33m0:00:38[0m Remaining: [36m0:01:45[0m   1.85 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 21000 ===                                                                                                                                                  │
│ 21001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 752, 756, 758, 10373, 21000]                                                                                                                                │
│ Average cumulative reward:       -6.488804006687071                                                                                                                      │
│ Average rollout reward:          -5.901933889439239                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K22/79 [38;2;249;38;114m━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m27.8%[0m Elapsed: [33m0:00:39[0m Remaining: [36m0:01:43[0m   1.79 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 22000 ===                                                                                                                                                  │
│ 22001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6194, 6196, 21497, 22000]                                                                                                                                   │
│ Average cumulative reward:       -6.4266631189239405                                                                                                                     │
│ Average rollout reward:          -5.838406935694                                                                                                                         │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━[0m [35m27.8%[0m Elapsed: [33m0:00:39[0m Remaining: [36m0:01:43[0m   1.81 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 22000 ===                                                                                                                                                  │
│ 22001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6194, 6196, 21497, 22000]                                                                                                                                   │
│ Average cumulative reward:       -6.4266631189239405                                                                                                                     │
│ Average rollout reward:          -5.838406935694                                                                                                                         │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K22/79 [38;2;249;38;114m━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m27.8%[0m Elapsed: [33m0:00:40[0m Remaining: [36m0:01:43[0m   1.83 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 22000 ===                                                                                                                                                  │
│ 22001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6194, 6196, 21497, 22000]                                                                                                                                   │
│ Average cumulative reward:       -6.4266631189239405                                                                                                                     │
│ Average rollout reward:          -5.838406935694                                                                                                                         │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K22/79 [38;2;249;38;114m━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m27.8%[0m Elapsed: [33m0:00:40[0m Remaining: [36m0:01:43[0m   1.85 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 22000 ===                                                                                                                                                  │
│ 22001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6194, 6196, 21497, 22000]                                                                                                                                   │
│ Average cumulative reward:       -6.4266631189239405                                                                                                                     │
│ Average rollout reward:          -5.838406935694                                                                                                                         │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K23/79 [38;2;249;38;114m━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m29.1%[0m Elapsed: [33m0:00:41[0m Remaining: [36m0:01:42[0m   1.80 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 23000 ===                                                                                                                                                  │
│ 23001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1585, 1586, 18119, 18458, 23000]                                                                                                                            │
│ Average cumulative reward:       -7.078191307880294                                                                                                                      │
│ Average rollout reward:          -6.456820268708832                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K23/79 [38;2;249;38;114m━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m29.1%[0m Elapsed: [33m0:00:41[0m Remaining: [36m0:01:42[0m   1.82 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 23000 ===                                                                                                                                                  │
│ 23001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1585, 1586, 18119, 18458, 23000]                                                                                                                            │
│ Average cumulative reward:       -7.078191307880294                                                                                                                      │
│ Average rollout reward:          -6.456820268708832                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K23/79 [38;2;249;38;114m━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m29.1%[0m Elapsed: [33m0:00:42[0m Remaining: [36m0:01:42[0m   1.84 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 23000 ===                                                                                                                                                  │
│ 23001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1585, 1586, 18119, 18458, 23000]                                                                                                                            │
│ Average cumulative reward:       -7.078191307880294                                                                                                                      │
│ Average rollout reward:          -6.456820268708832                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K23/79 [38;2;249;38;114m━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m29.1%[0m Elapsed: [33m0:00:42[0m Remaining: [36m0:01:42[0m   1.86 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 23000 ===                                                                                                                                                  │
│ 23001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1585, 1586, 18119, 18458, 23000]                                                                                                                            │
│ Average cumulative reward:       -7.078191307880294                                                                                                                      │
│ Average rollout reward:          -6.456820268708832                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K24/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m30.4%[0m Elapsed: [33m0:00:43[0m Remaining: [36m0:01:41[0m   1.81 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 24000 ===                                                                                                                                                  │
│ 24001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 23700, 23702, 23975, 23991, 24000]                                                                                                                          │
│ Average cumulative reward:       -6.410388627329439                                                                                                                      │
│ Average rollout reward:          -5.830296687265499                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K24/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m30.4%[0m Elapsed: [33m0:00:43[0m Remaining: [36m0:01:41[0m   1.83 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 24000 ===                                                                                                                                                  │
│ 24001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 23700, 23702, 23975, 23991, 24000]                                                                                                                          │
│ Average cumulative reward:       -6.410388627329439                                                                                                                      │
│ Average rollout reward:          -5.830296687265499                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K24/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m30.4%[0m Elapsed: [33m0:00:44[0m Remaining: [36m0:01:41[0m   1.85 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 24000 ===                                                                                                                                                  │
│ 24001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 23700, 23702, 23975, 23991, 24000]                                                                                                                          │
│ Average cumulative reward:       -6.410388627329439                                                                                                                      │
│ Average rollout reward:          -5.830296687265499                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K24/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m30.4%[0m Elapsed: [33m0:00:44[0m Remaining: [36m0:01:41[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 24000 ===                                                                                                                                                  │
│ 24001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 23700, 23702, 23975, 23991, 24000]                                                                                                                          │
│ Average cumulative reward:       -6.410388627329439                                                                                                                      │
│ Average rollout reward:          -5.830296687265499                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K25/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m31.6%[0m Elapsed: [33m0:00:45[0m Remaining: [36m0:01:40[0m   1.81 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 25000 ===                                                                                                                                                  │
│ 25001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 111, 122, 18661, 25000]                                                                                                                                     │
│ Average cumulative reward:       -7.176986589059692                                                                                                                      │
│ Average rollout reward:          -6.561223012771758                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K25/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m31.6%[0m Elapsed: [33m0:00:45[0m Remaining: [36m0:01:40[0m   1.83 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 25000 ===                                                                                                                                                  │
│ 25001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 111, 122, 18661, 25000]                                                                                                                                     │
│ Average cumulative reward:       -7.176986589059692                                                                                                                      │
│ Average rollout reward:          -6.561223012771758                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K25/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m31.6%[0m Elapsed: [33m0:00:46[0m Remaining: [36m0:01:40[0m   1.85 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 25000 ===                                                                                                                                                  │
│ 25001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 111, 122, 18661, 25000]                                                                                                                                     │
│ Average cumulative reward:       -7.176986589059692                                                                                                                      │
│ Average rollout reward:          -6.561223012771758                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K25/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m31.6%[0m Elapsed: [33m0:00:46[0m Remaining: [36m0:01:40[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 25000 ===                                                                                                                                                  │
│ 25001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 111, 122, 18661, 25000]                                                                                                                                     │
│ Average cumulative reward:       -7.176986589059692                                                                                                                      │
│ Average rollout reward:          -6.561223012771758                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K26/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m32.9%[0m Elapsed: [33m0:00:47[0m Remaining: [36m0:01:39[0m   1.82 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 26000 ===                                                                                                                                                  │
│ 26001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 25856, 25857, 25972, 25996, 26000]                                                                                                                          │
│ Average cumulative reward:       -6.68767383378103                                                                                                                       │
│ Average rollout reward:          -6.119798295191308                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K26/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m32.9%[0m Elapsed: [33m0:00:47[0m Remaining: [36m0:01:39[0m   1.84 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 26000 ===                                                                                                                                                  │
│ 26001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 25856, 25857, 25972, 25996, 26000]                                                                                                                          │
│ Average cumulative reward:       -6.68767383378103                                                                                                                       │
│ Average rollout reward:          -6.119798295191308                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K26/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m32.9%[0m Elapsed: [33m0:00:48[0m Remaining: [36m0:01:39[0m   1.86 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 26000 ===                                                                                                                                                  │
│ 26001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 25856, 25857, 25972, 25996, 26000]                                                                                                                          │
│ Average cumulative reward:       -6.68767383378103                                                                                                                       │
│ Average rollout reward:          -6.119798295191308                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K26/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m32.9%[0m Elapsed: [33m0:00:48[0m Remaining: [36m0:01:39[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 26000 ===                                                                                                                                                  │
│ 26001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 25856, 25857, 25972, 25996, 26000]                                                                                                                          │
│ Average cumulative reward:       -6.68767383378103                                                                                                                       │
│ Average rollout reward:          -6.119798295191308                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━━━━━━━[0m [35m34.2%[0m Elapsed: [33m0:00:49[0m Remaining: [36m0:01:37[0m   1.83 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 27000 ===                                                                                                                                                  │
│ 27001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 26976, 26978, 26995, 27000]                                                                                                                                 │
│ Average cumulative reward:       -6.761618500663696                                                                                                                      │
│ Average rollout reward:          -6.180070388932059                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K27/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m34.2%[0m Elapsed: [33m0:00:49[0m Remaining: [36m0:01:37[0m   1.85 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 27000 ===                                                                                                                                                  │
│ 27001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 26976, 26978, 26995, 27000]                                                                                                                                 │
│ Average cumulative reward:       -6.761618500663696                                                                                                                      │
│ Average rollout reward:          -6.180070388932059                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K27/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m34.2%[0m Elapsed: [33m0:00:50[0m Remaining: [36m0:01:37[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 27000 ===                                                                                                                                                  │
│ 27001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 26976, 26978, 26995, 27000]                                                                                                                                 │
│ Average cumulative reward:       -6.761618500663696                                                                                                                      │
│ Average rollout reward:          -6.180070388932059                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K27/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m34.2%[0m Elapsed: [33m0:00:50[0m Remaining: [36m0:01:37[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 27000 ===                                                                                                                                                  │
│ 27001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 26976, 26978, 26995, 27000]                                                                                                                                 │
│ Average cumulative reward:       -6.761618500663696                                                                                                                      │
│ Average rollout reward:          -6.180070388932059                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K28/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m35.4%[0m Elapsed: [33m0:00:51[0m Remaining: [36m0:01:36[0m   1.84 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 28000 ===                                                                                                                                                  │
│ 28001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 964, 965, 27999, 28000]                                                                                                                                     │
│ Average cumulative reward:       -6.914656446758313                                                                                                                      │
│ Average rollout reward:          -6.3139395223482015                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K28/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m35.4%[0m Elapsed: [33m0:00:51[0m Remaining: [36m0:01:36[0m   1.85 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 28000 ===                                                                                                                                                  │
│ 28001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 964, 965, 27999, 28000]                                                                                                                                     │
│ Average cumulative reward:       -6.914656446758313                                                                                                                      │
│ Average rollout reward:          -6.3139395223482015                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K28/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m35.4%[0m Elapsed: [33m0:00:52[0m Remaining: [36m0:01:36[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 28000 ===                                                                                                                                                  │
│ 28001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 964, 965, 27999, 28000]                                                                                                                                     │
│ Average cumulative reward:       -6.914656446758313                                                                                                                      │
│ Average rollout reward:          -6.3139395223482015                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K29/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m36.7%[0m Elapsed: [33m0:00:52[0m Remaining: [36m0:01:35[0m   1.82 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 29000 ===                                                                                                                                                  │
│ 29001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 11633, 11634, 27738, 29000]                                                                                                                                 │
│ Average cumulative reward:       -7.303785357602169                                                                                                                      │
│ Average rollout reward:          -6.715226280630769                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K29/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m36.7%[0m Elapsed: [33m0:00:53[0m Remaining: [36m0:01:35[0m   1.84 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 29000 ===                                                                                                                                                  │
│ 29001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 11633, 11634, 27738, 29000]                                                                                                                                 │
│ Average cumulative reward:       -7.303785357602169                                                                                                                      │
│ Average rollout reward:          -6.715226280630769                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K29/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m36.7%[0m Elapsed: [33m0:00:53[0m Remaining: [36m0:01:35[0m   1.86 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 29000 ===                                                                                                                                                  │
│ 29001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 11633, 11634, 27738, 29000]                                                                                                                                 │
│ Average cumulative reward:       -7.303785357602169                                                                                                                      │
│ Average rollout reward:          -6.715226280630769                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K29/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m36.7%[0m Elapsed: [33m0:00:54[0m Remaining: [36m0:01:35[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 29000 ===                                                                                                                                                  │
│ 29001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 11633, 11634, 27738, 29000]                                                                                                                                 │
│ Average cumulative reward:       -7.303785357602169                                                                                                                      │
│ Average rollout reward:          -6.715226280630769                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K30/79 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m38.0%[0m Elapsed: [33m0:00:54[0m Remaining: [36m0:01:33[0m   1.83 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 30000 ===                                                                                                                                                  │
│ 30001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 29904, 29905, 29986, 30000]                                                                                                                                 │
│ Average cumulative reward:       -6.645843542401485                                                                                                                      │
│ Average rollout reward:          -6.048941418731225                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K30/79 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m38.0%[0m Elapsed: [33m0:00:55[0m Remaining: [36m0:01:33[0m   1.85 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 30000 ===                                                                                                                                                  │
│ 30001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 29904, 29905, 29986, 30000]                                                                                                                                 │
│ Average cumulative reward:       -6.645843542401485                                                                                                                      │
│ Average rollout reward:          -6.048941418731225                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K30/79 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m38.0%[0m Elapsed: [33m0:00:55[0m Remaining: [36m0:01:33[0m   1.86 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 30000 ===                                                                                                                                                  │
│ 30001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 29904, 29905, 29986, 30000]                                                                                                                                 │
│ Average cumulative reward:       -6.645843542401485                                                                                                                      │
│ Average rollout reward:          -6.048941418731225                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K30/79 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m38.0%[0m Elapsed: [33m0:00:56[0m Remaining: [36m0:01:33[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 30000 ===                                                                                                                                                  │
│ 30001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 29904, 29905, 29986, 30000]                                                                                                                                 │
│ Average cumulative reward:       -6.645843542401485                                                                                                                      │
│ Average rollout reward:          -6.048941418731225                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K31/79 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m39.2%[0m Elapsed: [33m0:00:56[0m Remaining: [36m0:01:31[0m   1.84 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 31000 ===                                                                                                                                                  │
│ 31001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21657, 21659, 27835, 31000]                                                                                                                                 │
│ Average cumulative reward:       -6.7342968655481545                                                                                                                     │
│ Average rollout reward:          -6.156018761520098                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K31/79 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m39.2%[0m Elapsed: [33m0:00:57[0m Remaining: [36m0:01:31[0m   1.85 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 31000 ===                                                                                                                                                  │
│ 31001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21657, 21659, 27835, 31000]                                                                                                                                 │
│ Average cumulative reward:       -6.7342968655481545                                                                                                                     │
│ Average rollout reward:          -6.156018761520098                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K31/79 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m39.2%[0m Elapsed: [33m0:00:57[0m Remaining: [36m0:01:31[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 31000 ===                                                                                                                                                  │
│ 31001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21657, 21659, 27835, 31000]                                                                                                                                 │
│ Average cumulative reward:       -6.7342968655481545                                                                                                                     │
│ Average rollout reward:          -6.156018761520098                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K31/79 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m39.2%[0m Elapsed: [33m0:00:58[0m Remaining: [36m0:01:31[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 31000 ===                                                                                                                                                  │
│ 31001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21657, 21659, 27835, 31000]                                                                                                                                 │
│ Average cumulative reward:       -6.7342968655481545                                                                                                                     │
│ Average rollout reward:          -6.156018761520098                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K32/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m40.5%[0m Elapsed: [33m0:00:58[0m Remaining: [36m0:01:30[0m   1.84 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 32000 ===                                                                                                                                                  │
│ 32001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 31747, 31751, 31980, 32000]                                                                                                                                 │
│ Average cumulative reward:       -6.588102948862599                                                                                                                      │
│ Average rollout reward:          -6.0057902833960615                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━[0m [35m40.5%[0m Elapsed: [33m0:00:59[0m Remaining: [36m0:01:30[0m   1.86 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 32000 ===                                                                                                                                                  │
│ 32001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 31747, 31751, 31980, 32000]                                                                                                                                 │
│ Average cumulative reward:       -6.588102948862599                                                                                                                      │
│ Average rollout reward:          -6.0057902833960615                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K32/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m40.5%[0m Elapsed: [33m0:00:59[0m Remaining: [36m0:01:30[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 32000 ===                                                                                                                                                  │
│ 32001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 31747, 31751, 31980, 32000]                                                                                                                                 │
│ Average cumulative reward:       -6.588102948862599                                                                                                                      │
│ Average rollout reward:          -6.0057902833960615                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K32/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m40.5%[0m Elapsed: [33m0:01:00[0m Remaining: [36m0:01:30[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 32000 ===                                                                                                                                                  │
│ 32001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 31747, 31751, 31980, 32000]                                                                                                                                 │
│ Average cumulative reward:       -6.588102948862599                                                                                                                      │
│ Average rollout reward:          -6.0057902833960615                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K33/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m41.8%[0m Elapsed: [33m0:01:01[0m Remaining: [36m0:01:29[0m   1.85 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 33000 ===                                                                                                                                                  │
│ 33001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 45, 49, 27812, 33000]                                                                                                                                       │
│ Average cumulative reward:       -6.776510711406738                                                                                                                      │
│ Average rollout reward:          -6.215244622410905                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K33/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m41.8%[0m Elapsed: [33m0:01:01[0m Remaining: [36m0:01:29[0m   1.86 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 33000 ===                                                                                                                                                  │
│ 33001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 45, 49, 27812, 33000]                                                                                                                                       │
│ Average cumulative reward:       -6.776510711406738                                                                                                                      │
│ Average rollout reward:          -6.215244622410905                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K33/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m41.8%[0m Elapsed: [33m0:01:02[0m Remaining: [36m0:01:29[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 33000 ===                                                                                                                                                  │
│ 33001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 45, 49, 27812, 33000]                                                                                                                                       │
│ Average cumulative reward:       -6.776510711406738                                                                                                                      │
│ Average rollout reward:          -6.215244622410905                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K33/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m41.8%[0m Elapsed: [33m0:01:02[0m Remaining: [36m0:01:29[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 33000 ===                                                                                                                                                  │
│ 33001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 45, 49, 27812, 33000]                                                                                                                                       │
│ Average cumulative reward:       -6.776510711406738                                                                                                                      │
│ Average rollout reward:          -6.215244622410905                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K34/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m43.0%[0m Elapsed: [33m0:01:03[0m Remaining: [36m0:01:27[0m   1.85 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 34000 ===                                                                                                                                                  │
│ 34001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 141, 142, 24583, 24899, 34000]                                                                                                                              │
│ Average cumulative reward:       -7.191285182619103                                                                                                                      │
│ Average rollout reward:          -6.564204142541415                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K34/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m43.0%[0m Elapsed: [33m0:01:03[0m Remaining: [36m0:01:27[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 34000 ===                                                                                                                                                  │
│ 34001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 141, 142, 24583, 24899, 34000]                                                                                                                              │
│ Average cumulative reward:       -7.191285182619103                                                                                                                      │
│ Average rollout reward:          -6.564204142541415                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K34/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m43.0%[0m Elapsed: [33m0:01:04[0m Remaining: [36m0:01:27[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 34000 ===                                                                                                                                                  │
│ 34001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 141, 142, 24583, 24899, 34000]                                                                                                                              │
│ Average cumulative reward:       -7.191285182619103                                                                                                                      │
│ Average rollout reward:          -6.564204142541415                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K34/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m43.0%[0m Elapsed: [33m0:01:04[0m Remaining: [36m0:01:27[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 34000 ===                                                                                                                                                  │
│ 34001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 141, 142, 24583, 24899, 34000]                                                                                                                              │
│ Average cumulative reward:       -7.191285182619103                                                                                                                      │
│ Average rollout reward:          -6.564204142541415                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K35/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m44.3%[0m Elapsed: [33m0:01:05[0m Remaining: [36m0:01:26[0m   1.86 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 35000 ===                                                                                                                                                  │
│ 35001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 34971, 34973, 34979, 35000]                                                                                                                                 │
│ Average cumulative reward:       -6.805056416021939                                                                                                                      │
│ Average rollout reward:          -6.191921637005346                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K35/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m44.3%[0m Elapsed: [33m0:01:05[0m Remaining: [36m0:01:26[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 35000 ===                                                                                                                                                  │
│ 35001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 34971, 34973, 34979, 35000]                                                                                                                                 │
│ Average cumulative reward:       -6.805056416021939                                                                                                                      │
│ Average rollout reward:          -6.191921637005346                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K35/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m44.3%[0m Elapsed: [33m0:01:06[0m Remaining: [36m0:01:26[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 35000 ===                                                                                                                                                  │
│ 35001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 34971, 34973, 34979, 35000]                                                                                                                                 │
│ Average cumulative reward:       -6.805056416021939                                                                                                                      │
│ Average rollout reward:          -6.191921637005346                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K35/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m44.3%[0m Elapsed: [33m0:01:06[0m Remaining: [36m0:01:26[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 35000 ===                                                                                                                                                  │
│ 35001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 34971, 34973, 34979, 35000]                                                                                                                                 │
│ Average cumulative reward:       -6.805056416021939                                                                                                                      │
│ Average rollout reward:          -6.191921637005346                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K36/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m45.6%[0m Elapsed: [33m0:01:07[0m Remaining: [36m0:01:24[0m   1.86 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 36000 ===                                                                                                                                                  │
│ 36001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21163, 21165, 21272, 21273, 36000]                                                                                                                          │
│ Average cumulative reward:       -7.261025708423676                                                                                                                      │
│ Average rollout reward:          -6.6532860493256605                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K36/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m45.6%[0m Elapsed: [33m0:01:07[0m Remaining: [36m0:01:24[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 36000 ===                                                                                                                                                  │
│ 36001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21163, 21165, 21272, 21273, 36000]                                                                                                                          │
│ Average cumulative reward:       -7.261025708423676                                                                                                                      │
│ Average rollout reward:          -6.6532860493256605                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K36/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m45.6%[0m Elapsed: [33m0:01:08[0m Remaining: [36m0:01:24[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 36000 ===                                                                                                                                                  │
│ 36001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21163, 21165, 21272, 21273, 36000]                                                                                                                          │
│ Average cumulative reward:       -7.261025708423676                                                                                                                      │
│ Average rollout reward:          -6.6532860493256605                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K37/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m46.8%[0m Elapsed: [33m0:01:08[0m Remaining: [36m0:01:22[0m   1.85 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 37000 ===                                                                                                                                                  │
│ 37001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 36996, 37000]                                                                                                                                               │
│ Average cumulative reward:       -7.113815599611025                                                                                                                      │
│ Average rollout reward:          -6.483332532884269                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K37/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m46.8%[0m Elapsed: [33m0:01:09[0m Remaining: [36m0:01:22[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 37000 ===                                                                                                                                                  │
│ 37001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 36996, 37000]                                                                                                                                               │
│ Average cumulative reward:       -7.113815599611025                                                                                                                      │
│ Average rollout reward:          -6.483332532884269                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━━━━━━━[0m [35m46.8%[0m Elapsed: [33m0:01:09[0m Remaining: [36m0:01:22[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 37000 ===                                                                                                                                                  │
│ 37001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 36996, 37000]                                                                                                                                               │
│ Average cumulative reward:       -7.113815599611025                                                                                                                      │
│ Average rollout reward:          -6.483332532884269                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K37/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m46.8%[0m Elapsed: [33m0:01:10[0m Remaining: [36m0:01:22[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 37000 ===                                                                                                                                                  │
│ 37001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 36996, 37000]                                                                                                                                               │
│ Average cumulative reward:       -7.113815599611025                                                                                                                      │
│ Average rollout reward:          -6.483332532884269                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K38/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m48.1%[0m Elapsed: [33m0:01:10[0m Remaining: [36m0:01:20[0m   1.86 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 38000 ===                                                                                                                                                  │
│ 38001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 37686, 37689, 37724, 37866, 38000]                                                                                                                          │
│ Average cumulative reward:       -6.281642778339565                                                                                                                      │
│ Average rollout reward:          -5.71758824620496                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K38/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m48.1%[0m Elapsed: [33m0:01:11[0m Remaining: [36m0:01:20[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 38000 ===                                                                                                                                                  │
│ 38001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 37686, 37689, 37724, 37866, 38000]                                                                                                                          │
│ Average cumulative reward:       -6.281642778339565                                                                                                                      │
│ Average rollout reward:          -5.71758824620496                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K38/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m48.1%[0m Elapsed: [33m0:01:11[0m Remaining: [36m0:01:20[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 38000 ===                                                                                                                                                  │
│ 38001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 37686, 37689, 37724, 37866, 38000]                                                                                                                          │
│ Average cumulative reward:       -6.281642778339565                                                                                                                      │
│ Average rollout reward:          -5.71758824620496                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K38/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m48.1%[0m Elapsed: [33m0:01:12[0m Remaining: [36m0:01:20[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 38000 ===                                                                                                                                                  │
│ 38001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 37686, 37689, 37724, 37866, 38000]                                                                                                                          │
│ Average cumulative reward:       -6.281642778339565                                                                                                                      │
│ Average rollout reward:          -5.71758824620496                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K39/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m49.4%[0m Elapsed: [33m0:01:12[0m Remaining: [36m0:01:18[0m   1.86 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 39000 ===                                                                                                                                                  │
│ 39001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5303, 5304, 36299, 37336, 39000]                                                                                                                            │
│ Average cumulative reward:       -6.830236651759328                                                                                                                      │
│ Average rollout reward:          -6.211330384491287                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K39/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m49.4%[0m Elapsed: [33m0:01:13[0m Remaining: [36m0:01:18[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 39000 ===                                                                                                                                                  │
│ 39001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5303, 5304, 36299, 37336, 39000]                                                                                                                            │
│ Average cumulative reward:       -6.830236651759328                                                                                                                      │
│ Average rollout reward:          -6.211330384491287                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K39/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m49.4%[0m Elapsed: [33m0:01:13[0m Remaining: [36m0:01:18[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 39000 ===                                                                                                                                                  │
│ 39001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5303, 5304, 36299, 37336, 39000]                                                                                                                            │
│ Average cumulative reward:       -6.830236651759328                                                                                                                      │
│ Average rollout reward:          -6.211330384491287                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K40/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m50.6%[0m Elapsed: [33m0:01:14[0m Remaining: [36m0:01:16[0m   1.85 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 40000 ===                                                                                                                                                  │
│ 40001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 39802, 39804, 39980, 39988, 40000]                                                                                                                          │
│ Average cumulative reward:       -7.035220808962211                                                                                                                      │
│ Average rollout reward:          -6.352631476848254                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K40/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m50.6%[0m Elapsed: [33m0:01:14[0m Remaining: [36m0:01:16[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 40000 ===                                                                                                                                                  │
│ 40001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 39802, 39804, 39980, 39988, 40000]                                                                                                                          │
│ Average cumulative reward:       -7.035220808962211                                                                                                                      │
│ Average rollout reward:          -6.352631476848254                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K40/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m50.6%[0m Elapsed: [33m0:01:15[0m Remaining: [36m0:01:16[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 40000 ===                                                                                                                                                  │
│ 40001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 39802, 39804, 39980, 39988, 40000]                                                                                                                          │
│ Average cumulative reward:       -7.035220808962211                                                                                                                      │
│ Average rollout reward:          -6.352631476848254                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K40/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m50.6%[0m Elapsed: [33m0:01:15[0m Remaining: [36m0:01:16[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 40000 ===                                                                                                                                                  │
│ 40001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 39802, 39804, 39980, 39988, 40000]                                                                                                                          │
│ Average cumulative reward:       -7.035220808962211                                                                                                                      │
│ Average rollout reward:          -6.352631476848254                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K41/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m51.9%[0m Elapsed: [33m0:01:16[0m Remaining: [36m0:01:13[0m   1.86 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 41000 ===                                                                                                                                                  │
│ 41001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 226, 228, 671, 736, 41000]                                                                                                                                  │
│ Average cumulative reward:       -6.601873278913931                                                                                                                      │
│ Average rollout reward:          -5.944317731816133                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K41/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m51.9%[0m Elapsed: [33m0:01:16[0m Remaining: [36m0:01:13[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 41000 ===                                                                                                                                                  │
│ 41001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 226, 228, 671, 736, 41000]                                                                                                                                  │
│ Average cumulative reward:       -6.601873278913931                                                                                                                      │
│ Average rollout reward:          -5.944317731816133                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K41/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m51.9%[0m Elapsed: [33m0:01:17[0m Remaining: [36m0:01:13[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 41000 ===                                                                                                                                                  │
│ 41001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 226, 228, 671, 736, 41000]                                                                                                                                  │
│ Average cumulative reward:       -6.601873278913931                                                                                                                      │
│ Average rollout reward:          -5.944317731816133                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K41/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m51.9%[0m Elapsed: [33m0:01:17[0m Remaining: [36m0:01:13[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 41000 ===                                                                                                                                                  │
│ 41001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 226, 228, 671, 736, 41000]                                                                                                                                  │
│ Average cumulative reward:       -6.601873278913931                                                                                                                      │
│ Average rollout reward:          -5.944317731816133                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K42/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m53.2%[0m Elapsed: [33m0:01:18[0m Remaining: [36m0:01:12[0m   1.86 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 42000 ===                                                                                                                                                  │
│ 42001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 45, 46, 41567, 42000]                                                                                                                                       │
│ Average cumulative reward:       -6.738104763366761                                                                                                                      │
│ Average rollout reward:          -6.08422306837851                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K42/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m53.2%[0m Elapsed: [33m0:01:18[0m Remaining: [36m0:01:12[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 42000 ===                                                                                                                                                  │
│ 42001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 45, 46, 41567, 42000]                                                                                                                                       │
│ Average cumulative reward:       -6.738104763366761                                                                                                                      │
│ Average rollout reward:          -6.08422306837851                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K42/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m53.2%[0m Elapsed: [33m0:01:19[0m Remaining: [36m0:01:12[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 42000 ===                                                                                                                                                  │
│ 42001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 45, 46, 41567, 42000]                                                                                                                                       │
│ Average cumulative reward:       -6.738104763366761                                                                                                                      │
│ Average rollout reward:          -6.08422306837851                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━[0m [35m53.2%[0m Elapsed: [33m0:01:19[0m Remaining: [36m0:01:12[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 42000 ===                                                                                                                                                  │
│ 42001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 45, 46, 41567, 42000]                                                                                                                                       │
│ Average cumulative reward:       -6.738104763366761                                                                                                                      │
│ Average rollout reward:          -6.08422306837851                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K43/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m54.4%[0m Elapsed: [33m0:01:20[0m Remaining: [36m0:01:10[0m   1.86 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 43000 ===                                                                                                                                                  │
│ 43001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 10962, 10964, 11028, 11030, 12322, 43000]                                                                                                                   │
│ Average cumulative reward:       -6.910900226975306                                                                                                                      │
│ Average rollout reward:          -6.264629550834528                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K43/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m54.4%[0m Elapsed: [33m0:01:20[0m Remaining: [36m0:01:10[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 43000 ===                                                                                                                                                  │
│ 43001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 10962, 10964, 11028, 11030, 12322, 43000]                                                                                                                   │
│ Average cumulative reward:       -6.910900226975306                                                                                                                      │
│ Average rollout reward:          -6.264629550834528                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K43/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m54.4%[0m Elapsed: [33m0:01:21[0m Remaining: [36m0:01:10[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 43000 ===                                                                                                                                                  │
│ 43001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 10962, 10964, 11028, 11030, 12322, 43000]                                                                                                                   │
│ Average cumulative reward:       -6.910900226975306                                                                                                                      │
│ Average rollout reward:          -6.264629550834528                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K43/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m54.4%[0m Elapsed: [33m0:01:21[0m Remaining: [36m0:01:10[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 43000 ===                                                                                                                                                  │
│ 43001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 10962, 10964, 11028, 11030, 12322, 43000]                                                                                                                   │
│ Average cumulative reward:       -6.910900226975306                                                                                                                      │
│ Average rollout reward:          -6.264629550834528                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K44/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m55.7%[0m Elapsed: [33m0:01:22[0m Remaining: [36m0:01:08[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 44000 ===                                                                                                                                                  │
│ 44001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5963, 5965, 42583, 42760, 44000]                                                                                                                            │
│ Average cumulative reward:       -6.886410278449545                                                                                                                      │
│ Average rollout reward:          -6.271382285976957                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K44/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m55.7%[0m Elapsed: [33m0:01:22[0m Remaining: [36m0:01:08[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 44000 ===                                                                                                                                                  │
│ 44001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5963, 5965, 42583, 42760, 44000]                                                                                                                            │
│ Average cumulative reward:       -6.886410278449545                                                                                                                      │
│ Average rollout reward:          -6.271382285976957                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K44/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m55.7%[0m Elapsed: [33m0:01:23[0m Remaining: [36m0:01:08[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 44000 ===                                                                                                                                                  │
│ 44001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5963, 5965, 42583, 42760, 44000]                                                                                                                            │
│ Average cumulative reward:       -6.886410278449545                                                                                                                      │
│ Average rollout reward:          -6.271382285976957                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K44/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m55.7%[0m Elapsed: [33m0:01:23[0m Remaining: [36m0:01:08[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 44000 ===                                                                                                                                                  │
│ 44001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5963, 5965, 42583, 42760, 44000]                                                                                                                            │
│ Average cumulative reward:       -6.886410278449545                                                                                                                      │
│ Average rollout reward:          -6.271382285976957                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K45/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m57.0%[0m Elapsed: [33m0:01:24[0m Remaining: [36m0:01:06[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 45000 ===                                                                                                                                                  │
│ 45001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 30511, 30512, 30624, 37581, 45000]                                                                                                                          │
│ Average cumulative reward:       -6.8264250994697795                                                                                                                     │
│ Average rollout reward:          -6.165510251801009                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K45/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m57.0%[0m Elapsed: [33m0:01:24[0m Remaining: [36m0:01:06[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 45000 ===                                                                                                                                                  │
│ 45001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 30511, 30512, 30624, 37581, 45000]                                                                                                                          │
│ Average cumulative reward:       -6.8264250994697795                                                                                                                     │
│ Average rollout reward:          -6.165510251801009                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K45/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m57.0%[0m Elapsed: [33m0:01:25[0m Remaining: [36m0:01:06[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 45000 ===                                                                                                                                                  │
│ 45001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 30511, 30512, 30624, 37581, 45000]                                                                                                                          │
│ Average cumulative reward:       -6.8264250994697795                                                                                                                     │
│ Average rollout reward:          -6.165510251801009                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K46/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m58.2%[0m Elapsed: [33m0:01:25[0m Remaining: [36m0:01:03[0m   1.86 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 46000 ===                                                                                                                                                  │
│ 46001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 45791, 45795, 45995, 46000]                                                                                                                                 │
│ Average cumulative reward:       -6.652665895591318                                                                                                                      │
│ Average rollout reward:          -5.986025322252044                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K46/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m58.2%[0m Elapsed: [33m0:01:26[0m Remaining: [36m0:01:03[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 46000 ===                                                                                                                                                  │
│ 46001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 45791, 45795, 45995, 46000]                                                                                                                                 │
│ Average cumulative reward:       -6.652665895591318                                                                                                                      │
│ Average rollout reward:          -5.986025322252044                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K46/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m58.2%[0m Elapsed: [33m0:01:26[0m Remaining: [36m0:01:03[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 46000 ===                                                                                                                                                  │
│ 46001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 45791, 45795, 45995, 46000]                                                                                                                                 │
│ Average cumulative reward:       -6.652665895591318                                                                                                                      │
│ Average rollout reward:          -5.986025322252044                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K46/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m58.2%[0m Elapsed: [33m0:01:27[0m Remaining: [36m0:01:03[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 46000 ===                                                                                                                                                  │
│ 46001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 45791, 45795, 45995, 46000]                                                                                                                                 │
│ Average cumulative reward:       -6.652665895591318                                                                                                                      │
│ Average rollout reward:          -5.986025322252044                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K47/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m59.5%[0m Elapsed: [33m0:01:27[0m Remaining: [36m0:01:02[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 47000 ===                                                                                                                                                  │
│ 47001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 46576, 46577, 46603, 46609, 47000]                                                                                                                          │
│ Average cumulative reward:       -6.9255499382340675                                                                                                                     │
│ Average rollout reward:          -6.2950993038444505                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K47/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m59.5%[0m Elapsed: [33m0:01:28[0m Remaining: [36m0:01:02[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 47000 ===                                                                                                                                                  │
│ 47001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 46576, 46577, 46603, 46609, 47000]                                                                                                                          │
│ Average cumulative reward:       -6.9255499382340675                                                                                                                     │
│ Average rollout reward:          -6.2950993038444505                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K47/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m59.5%[0m Elapsed: [33m0:01:28[0m Remaining: [36m0:01:02[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 47000 ===                                                                                                                                                  │
│ 47001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 46576, 46577, 46603, 46609, 47000]                                                                                                                          │
│ Average cumulative reward:       -6.9255499382340675                                                                                                                     │
│ Average rollout reward:          -6.2950993038444505                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K47/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m59.5%[0m Elapsed: [33m0:01:29[0m Remaining: [36m0:01:02[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 47000 ===                                                                                                                                                  │
│ 47001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 46576, 46577, 46603, 46609, 47000]                                                                                                                          │
│ Average cumulative reward:       -6.9255499382340675                                                                                                                     │
│ Average rollout reward:          -6.2950993038444505                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━[0m [35m60.8%[0m Elapsed: [33m0:01:29[0m Remaining: [36m0:00:59[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 48000 ===                                                                                                                                                  │
│ 48001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1041, 1042, 1072, 48000]                                                                                                                                    │
│ Average cumulative reward:       -6.8641354239060774                                                                                                                     │
│ Average rollout reward:          -6.263613605745517                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K48/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m60.8%[0m Elapsed: [33m0:01:30[0m Remaining: [36m0:00:59[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 48000 ===                                                                                                                                                  │
│ 48001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1041, 1042, 1072, 48000]                                                                                                                                    │
│ Average cumulative reward:       -6.8641354239060774                                                                                                                     │
│ Average rollout reward:          -6.263613605745517                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K48/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m60.8%[0m Elapsed: [33m0:01:30[0m Remaining: [36m0:00:59[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 48000 ===                                                                                                                                                  │
│ 48001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1041, 1042, 1072, 48000]                                                                                                                                    │
│ Average cumulative reward:       -6.8641354239060774                                                                                                                     │
│ Average rollout reward:          -6.263613605745517                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K48/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m60.8%[0m Elapsed: [33m0:01:31[0m Remaining: [36m0:00:59[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 48000 ===                                                                                                                                                  │
│ 48001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1041, 1042, 1072, 48000]                                                                                                                                    │
│ Average cumulative reward:       -6.8641354239060774                                                                                                                     │
│ Average rollout reward:          -6.263613605745517                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K49/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m62.0%[0m Elapsed: [33m0:01:31[0m Remaining: [36m0:00:58[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 49000 ===                                                                                                                                                  │
│ 49001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 48977, 48980, 49000]                                                                                                                                        │
│ Average cumulative reward:       -7.063201323245486                                                                                                                      │
│ Average rollout reward:          -6.476266852326597                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K49/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m62.0%[0m Elapsed: [33m0:01:32[0m Remaining: [36m0:00:58[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 49000 ===                                                                                                                                                  │
│ 49001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 48977, 48980, 49000]                                                                                                                                        │
│ Average cumulative reward:       -7.063201323245486                                                                                                                      │
│ Average rollout reward:          -6.476266852326597                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K49/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m62.0%[0m Elapsed: [33m0:01:32[0m Remaining: [36m0:00:58[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 49000 ===                                                                                                                                                  │
│ 49001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 48977, 48980, 49000]                                                                                                                                        │
│ Average cumulative reward:       -7.063201323245486                                                                                                                      │
│ Average rollout reward:          -6.476266852326597                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K49/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m62.0%[0m Elapsed: [33m0:01:33[0m Remaining: [36m0:00:58[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 49000 ===                                                                                                                                                  │
│ 49001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 48977, 48980, 49000]                                                                                                                                        │
│ Average cumulative reward:       -7.063201323245486                                                                                                                      │
│ Average rollout reward:          -6.476266852326597                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K50/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m63.3%[0m Elapsed: [33m0:01:33[0m Remaining: [36m0:00:56[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 50000 ===                                                                                                                                                  │
│ 50001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 49793, 49794, 49985, 50000]                                                                                                                                 │
│ Average cumulative reward:       -6.75938514963372                                                                                                                       │
│ Average rollout reward:          -6.142405256960081                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K50/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m63.3%[0m Elapsed: [33m0:01:34[0m Remaining: [36m0:00:56[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 50000 ===                                                                                                                                                  │
│ 50001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 49793, 49794, 49985, 50000]                                                                                                                                 │
│ Average cumulative reward:       -6.75938514963372                                                                                                                       │
│ Average rollout reward:          -6.142405256960081                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K50/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m63.3%[0m Elapsed: [33m0:01:34[0m Remaining: [36m0:00:56[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 50000 ===                                                                                                                                                  │
│ 50001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 49793, 49794, 49985, 50000]                                                                                                                                 │
│ Average cumulative reward:       -6.75938514963372                                                                                                                       │
│ Average rollout reward:          -6.142405256960081                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K51/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m64.6%[0m Elapsed: [33m0:01:35[0m Remaining: [36m0:00:54[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 51000 ===                                                                                                                                                  │
│ 51001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 50619, 50620, 50992, 51000]                                                                                                                                 │
│ Average cumulative reward:       -6.631342975152575                                                                                                                      │
│ Average rollout reward:          -6.01958646609073                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K51/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m64.6%[0m Elapsed: [33m0:01:35[0m Remaining: [36m0:00:54[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 51000 ===                                                                                                                                                  │
│ 51001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 50619, 50620, 50992, 51000]                                                                                                                                 │
│ Average cumulative reward:       -6.631342975152575                                                                                                                      │
│ Average rollout reward:          -6.01958646609073                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K51/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m64.6%[0m Elapsed: [33m0:01:36[0m Remaining: [36m0:00:54[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 51000 ===                                                                                                                                                  │
│ 51001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 50619, 50620, 50992, 51000]                                                                                                                                 │
│ Average cumulative reward:       -6.631342975152575                                                                                                                      │
│ Average rollout reward:          -6.01958646609073                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K51/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m64.6%[0m Elapsed: [33m0:01:36[0m Remaining: [36m0:00:54[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 51000 ===                                                                                                                                                  │
│ 51001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 50619, 50620, 50992, 51000]                                                                                                                                 │
│ Average cumulative reward:       -6.631342975152575                                                                                                                      │
│ Average rollout reward:          -6.01958646609073                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K51/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m64.6%[0m Elapsed: [33m0:01:37[0m Remaining: [36m0:00:54[0m   1.91 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 51000 ===                                                                                                                                                  │
│ 51001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 50619, 50620, 50992, 51000]                                                                                                                                 │
│ Average cumulative reward:       -6.631342975152575                                                                                                                      │
│ Average rollout reward:          -6.01958646609073                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K52/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━[0m [35m65.8%[0m Elapsed: [33m0:01:37[0m Remaining: [36m0:00:52[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 52000 ===                                                                                                                                                  │
│ 52001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1585, 1586, 1590, 21589, 52000]                                                                                                                             │
│ Average cumulative reward:       -6.9770951600939615                                                                                                                     │
│ Average rollout reward:          -6.352307666616107                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K52/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━[0m [35m65.8%[0m Elapsed: [33m0:01:38[0m Remaining: [36m0:00:52[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 52000 ===                                                                                                                                                  │
│ 52001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1585, 1586, 1590, 21589, 52000]                                                                                                                             │
│ Average cumulative reward:       -6.9770951600939615                                                                                                                     │
│ Average rollout reward:          -6.352307666616107                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K52/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━[0m [35m65.8%[0m Elapsed: [33m0:01:38[0m Remaining: [36m0:00:52[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 52000 ===                                                                                                                                                  │
│ 52001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1585, 1586, 1590, 21589, 52000]                                                                                                                             │
│ Average cumulative reward:       -6.9770951600939615                                                                                                                     │
│ Average rollout reward:          -6.352307666616107                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K53/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━[0m [35m67.1%[0m Elapsed: [33m0:01:39[0m Remaining: [36m0:00:51[0m   1.87 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 53000 ===                                                                                                                                                  │
│ 53001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 11294, 11296, 52006, 53000]                                                                                                                                 │
│ Average cumulative reward:       -7.054546719380756                                                                                                                      │
│ Average rollout reward:          -6.401114676081247                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━━━━━━━[0m [35m67.1%[0m Elapsed: [33m0:01:39[0m Remaining: [36m0:00:51[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 53000 ===                                                                                                                                                  │
│ 53001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 11294, 11296, 52006, 53000]                                                                                                                                 │
│ Average cumulative reward:       -7.054546719380756                                                                                                                      │
│ Average rollout reward:          -6.401114676081247                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K53/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━[0m [35m67.1%[0m Elapsed: [33m0:01:40[0m Remaining: [36m0:00:51[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 53000 ===                                                                                                                                                  │
│ 53001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 11294, 11296, 52006, 53000]                                                                                                                                 │
│ Average cumulative reward:       -7.054546719380756                                                                                                                      │
│ Average rollout reward:          -6.401114676081247                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K53/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━[0m [35m67.1%[0m Elapsed: [33m0:01:40[0m Remaining: [36m0:00:51[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 53000 ===                                                                                                                                                  │
│ 53001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 11294, 11296, 52006, 53000]                                                                                                                                 │
│ Average cumulative reward:       -7.054546719380756                                                                                                                      │
│ Average rollout reward:          -6.401114676081247                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K54/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━[0m [35m68.4%[0m Elapsed: [33m0:01:41[0m Remaining: [36m0:00:49[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 54000 ===                                                                                                                                                  │
│ 54001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 54000]                                                                                                                                                      │
│ Average cumulative reward:       -7.241176325638393                                                                                                                      │
│ Average rollout reward:          -6.590041859661942                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K54/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━[0m [35m68.4%[0m Elapsed: [33m0:01:41[0m Remaining: [36m0:00:49[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 54000 ===                                                                                                                                                  │
│ 54001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 54000]                                                                                                                                                      │
│ Average cumulative reward:       -7.241176325638393                                                                                                                      │
│ Average rollout reward:          -6.590041859661942                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K54/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━[0m [35m68.4%[0m Elapsed: [33m0:01:42[0m Remaining: [36m0:00:49[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 54000 ===                                                                                                                                                  │
│ 54001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 54000]                                                                                                                                                      │
│ Average cumulative reward:       -7.241176325638393                                                                                                                      │
│ Average rollout reward:          -6.590041859661942                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K54/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━[0m [35m68.4%[0m Elapsed: [33m0:01:42[0m Remaining: [36m0:00:49[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 54000 ===                                                                                                                                                  │
│ 54001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 54000]                                                                                                                                                      │
│ Average cumulative reward:       -7.241176325638393                                                                                                                      │
│ Average rollout reward:          -6.590041859661942                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K55/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━[0m [35m69.6%[0m Elapsed: [33m0:01:43[0m Remaining: [36m0:00:47[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 55000 ===                                                                                                                                                  │
│ 55001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 54865, 54867, 54997, 55000]                                                                                                                                 │
│ Average cumulative reward:       -6.768421433610449                                                                                                                      │
│ Average rollout reward:          -6.201042134557295                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K55/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━[0m [35m69.6%[0m Elapsed: [33m0:01:43[0m Remaining: [36m0:00:47[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 55000 ===                                                                                                                                                  │
│ 55001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 54865, 54867, 54997, 55000]                                                                                                                                 │
│ Average cumulative reward:       -6.768421433610449                                                                                                                      │
│ Average rollout reward:          -6.201042134557295                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K55/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━[0m [35m69.6%[0m Elapsed: [33m0:01:44[0m Remaining: [36m0:00:47[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 55000 ===                                                                                                                                                  │
│ 55001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 54865, 54867, 54997, 55000]                                                                                                                                 │
│ Average cumulative reward:       -6.768421433610449                                                                                                                      │
│ Average rollout reward:          -6.201042134557295                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K55/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━[0m [35m69.6%[0m Elapsed: [33m0:01:44[0m Remaining: [36m0:00:47[0m   1.91 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 55000 ===                                                                                                                                                  │
│ 55001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 54865, 54867, 54997, 55000]                                                                                                                                 │
│ Average cumulative reward:       -6.768421433610449                                                                                                                      │
│ Average rollout reward:          -6.201042134557295                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K56/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━[0m [35m70.9%[0m Elapsed: [33m0:01:45[0m Remaining: [36m0:00:46[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 56000 ===                                                                                                                                                  │
│ 56001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4688, 4692, 23138, 55502, 56000]                                                                                                                            │
│ Average cumulative reward:       -6.728801597894628                                                                                                                      │
│ Average rollout reward:          -6.076808933601422                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K56/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━[0m [35m70.9%[0m Elapsed: [33m0:01:45[0m Remaining: [36m0:00:46[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 56000 ===                                                                                                                                                  │
│ 56001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4688, 4692, 23138, 55502, 56000]                                                                                                                            │
│ Average cumulative reward:       -6.728801597894628                                                                                                                      │
│ Average rollout reward:          -6.076808933601422                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K56/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━[0m [35m70.9%[0m Elapsed: [33m0:01:46[0m Remaining: [36m0:00:46[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 56000 ===                                                                                                                                                  │
│ 56001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4688, 4692, 23138, 55502, 56000]                                                                                                                            │
│ Average cumulative reward:       -6.728801597894628                                                                                                                      │
│ Average rollout reward:          -6.076808933601422                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K56/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━[0m [35m70.9%[0m Elapsed: [33m0:01:46[0m Remaining: [36m0:00:46[0m   1.91 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 56000 ===                                                                                                                                                  │
│ 56001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4688, 4692, 23138, 55502, 56000]                                                                                                                            │
│ Average cumulative reward:       -6.728801597894628                                                                                                                      │
│ Average rollout reward:          -6.076808933601422                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K57/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━[0m [35m72.2%[0m Elapsed: [33m0:01:47[0m Remaining: [36m0:00:44[0m   1.88 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 57000 ===                                                                                                                                                  │
│ 57001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17898, 17899, 53773, 57000]                                                                                                                                 │
│ Average cumulative reward:       -6.686081004410871                                                                                                                      │
│ Average rollout reward:          -6.015518522796556                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K57/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━[0m [35m72.2%[0m Elapsed: [33m0:01:47[0m Remaining: [36m0:00:44[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 57000 ===                                                                                                                                                  │
│ 57001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17898, 17899, 53773, 57000]                                                                                                                                 │
│ Average cumulative reward:       -6.686081004410871                                                                                                                      │
│ Average rollout reward:          -6.015518522796556                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K57/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━[0m [35m72.2%[0m Elapsed: [33m0:01:48[0m Remaining: [36m0:00:44[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 57000 ===                                                                                                                                                  │
│ 57001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17898, 17899, 53773, 57000]                                                                                                                                 │
│ Average cumulative reward:       -6.686081004410871                                                                                                                      │
│ Average rollout reward:          -6.015518522796556                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K57/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━[0m [35m72.2%[0m Elapsed: [33m0:01:48[0m Remaining: [36m0:00:44[0m   1.91 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 57000 ===                                                                                                                                                  │
│ 57001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17898, 17899, 53773, 57000]                                                                                                                                 │
│ Average cumulative reward:       -6.686081004410871                                                                                                                      │
│ Average rollout reward:          -6.015518522796556                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K58/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━[0m [35m73.4%[0m Elapsed: [33m0:01:49[0m Remaining: [36m0:00:42[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 58000 ===                                                                                                                                                  │
│ 58001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 3113, 3115, 5648, 5846, 58000]                                                                                                                              │
│ Average cumulative reward:       -7.002433366564283                                                                                                                      │
│ Average rollout reward:          -6.352778317946633                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━[0m [35m73.4%[0m Elapsed: [33m0:01:49[0m Remaining: [36m0:00:42[0m   1.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 58000 ===                                                                                                                                                  │
│ 58001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 3113, 3115, 5648, 5846, 58000]                                                                                                                              │
│ Average cumulative reward:       -7.002433366564283                                                                                                                      │
│ Average rollout reward:          -6.352778317946633                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K58/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━[0m [35m73.4%[0m Elapsed: [33m0:01:50[0m Remaining: [36m0:00:42[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 58000 ===                                                                                                                                                  │
│ 58001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 3113, 3115, 5648, 5846, 58000]                                                                                                                              │
│ Average cumulative reward:       -7.002433366564283                                                                                                                      │
│ Average rollout reward:          -6.352778317946633                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K58/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━[0m [35m73.4%[0m Elapsed: [33m0:01:50[0m Remaining: [36m0:00:42[0m   1.91 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 58000 ===                                                                                                                                                  │
│ 58001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 3113, 3115, 5648, 5846, 58000]                                                                                                                              │
│ Average cumulative reward:       -7.002433366564283                                                                                                                      │
│ Average rollout reward:          -6.352778317946633                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K58/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━[0m [35m73.4%[0m Elapsed: [33m0:01:51[0m Remaining: [36m0:00:42[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 58000 ===                                                                                                                                                  │
│ 58001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 3113, 3115, 5648, 5846, 58000]                                                                                                                              │
│ Average cumulative reward:       -7.002433366564283                                                                                                                      │
│ Average rollout reward:          -6.352778317946633                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K59/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━[0m [35m74.7%[0m Elapsed: [33m0:01:51[0m Remaining: [36m0:00:40[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 59000 ===                                                                                                                                                  │
│ 59001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 890, 891, 55667, 57912, 59000]                                                                                                                              │
│ Average cumulative reward:       -6.995899929885055                                                                                                                      │
│ Average rollout reward:          -6.40558568551818                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K59/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━[0m [35m74.7%[0m Elapsed: [33m0:01:52[0m Remaining: [36m0:00:40[0m   1.91 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 59000 ===                                                                                                                                                  │
│ 59001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 890, 891, 55667, 57912, 59000]                                                                                                                              │
│ Average cumulative reward:       -6.995899929885055                                                                                                                      │
│ Average rollout reward:          -6.40558568551818                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K59/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━[0m [35m74.7%[0m Elapsed: [33m0:01:52[0m Remaining: [36m0:00:40[0m   1.91 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 59000 ===                                                                                                                                                  │
│ 59001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 890, 891, 55667, 57912, 59000]                                                                                                                              │
│ Average cumulative reward:       -6.995899929885055                                                                                                                      │
│ Average rollout reward:          -6.40558568551818                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K59/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━[0m [35m74.7%[0m Elapsed: [33m0:01:53[0m Remaining: [36m0:00:40[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 59000 ===                                                                                                                                                  │
│ 59001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 890, 891, 55667, 57912, 59000]                                                                                                                              │
│ Average cumulative reward:       -6.995899929885055                                                                                                                      │
│ Average rollout reward:          -6.40558568551818                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K60/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━[0m [35m75.9%[0m Elapsed: [33m0:01:53[0m Remaining: [36m0:00:39[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 60000 ===                                                                                                                                                  │
│ 60001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 10001, 10002, 57338, 57947, 60000]                                                                                                                          │
│ Average cumulative reward:       -7.15518731228426                                                                                                                       │
│ Average rollout reward:          -6.508049544385847                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K60/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━[0m [35m75.9%[0m Elapsed: [33m0:01:54[0m Remaining: [36m0:00:39[0m   1.91 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 60000 ===                                                                                                                                                  │
│ 60001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 10001, 10002, 57338, 57947, 60000]                                                                                                                          │
│ Average cumulative reward:       -7.15518731228426                                                                                                                       │
│ Average rollout reward:          -6.508049544385847                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K60/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━[0m [35m75.9%[0m Elapsed: [33m0:01:54[0m Remaining: [36m0:00:39[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 60000 ===                                                                                                                                                  │
│ 60001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 10001, 10002, 57338, 57947, 60000]                                                                                                                          │
│ Average cumulative reward:       -7.15518731228426                                                                                                                       │
│ Average rollout reward:          -6.508049544385847                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K60/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━[0m [35m75.9%[0m Elapsed: [33m0:01:55[0m Remaining: [36m0:00:39[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 60000 ===                                                                                                                                                  │
│ 60001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 10001, 10002, 57338, 57947, 60000]                                                                                                                          │
│ Average cumulative reward:       -7.15518731228426                                                                                                                       │
│ Average rollout reward:          -6.508049544385847                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K61/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━[0m [35m77.2%[0m Elapsed: [33m0:01:55[0m Remaining: [36m0:00:37[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 61000 ===                                                                                                                                                  │
│ 61001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17024, 17026, 30362, 30996, 34858, 61000]                                                                                                                   │
│ Average cumulative reward:       -6.6536263564536515                                                                                                                     │
│ Average rollout reward:          -6.044062171180508                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K61/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━[0m [35m77.2%[0m Elapsed: [33m0:01:56[0m Remaining: [36m0:00:37[0m   1.91 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 61000 ===                                                                                                                                                  │
│ 61001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17024, 17026, 30362, 30996, 34858, 61000]                                                                                                                   │
│ Average cumulative reward:       -6.6536263564536515                                                                                                                     │
│ Average rollout reward:          -6.044062171180508                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K61/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━[0m [35m77.2%[0m Elapsed: [33m0:01:56[0m Remaining: [36m0:00:37[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 61000 ===                                                                                                                                                  │
│ 61001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17024, 17026, 30362, 30996, 34858, 61000]                                                                                                                   │
│ Average cumulative reward:       -6.6536263564536515                                                                                                                     │
│ Average rollout reward:          -6.044062171180508                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K61/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━[0m [35m77.2%[0m Elapsed: [33m0:01:57[0m Remaining: [36m0:00:37[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 61000 ===                                                                                                                                                  │
│ 61001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17024, 17026, 30362, 30996, 34858, 61000]                                                                                                                   │
│ Average cumulative reward:       -6.6536263564536515                                                                                                                     │
│ Average rollout reward:          -6.044062171180508                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K62/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━[0m [35m78.5%[0m Elapsed: [33m0:01:57[0m Remaining: [36m0:00:35[0m   1.90 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 62000 ===                                                                                                                                                  │
│ 62001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15765, 15767, 59213, 59928, 62000]                                                                                                                          │
│ Average cumulative reward:       -6.918081611879703                                                                                                                      │
│ Average rollout reward:          -6.270743999265763                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K62/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━[0m [35m78.5%[0m Elapsed: [33m0:01:58[0m Remaining: [36m0:00:35[0m   1.91 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 62000 ===                                                                                                                                                  │
│ 62001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15765, 15767, 59213, 59928, 62000]                                                                                                                          │
│ Average cumulative reward:       -6.918081611879703                                                                                                                      │
│ Average rollout reward:          -6.270743999265763                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K62/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━[0m [35m78.5%[0m Elapsed: [33m0:01:58[0m Remaining: [36m0:00:35[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 62000 ===                                                                                                                                                  │
│ 62001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15765, 15767, 59213, 59928, 62000]                                                                                                                          │
│ Average cumulative reward:       -6.918081611879703                                                                                                                      │
│ Average rollout reward:          -6.270743999265763                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K62/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━[0m [35m78.5%[0m Elapsed: [33m0:01:59[0m Remaining: [36m0:00:35[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 62000 ===                                                                                                                                                  │
│ 62001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15765, 15767, 59213, 59928, 62000]                                                                                                                          │
│ Average cumulative reward:       -6.918081611879703                                                                                                                      │
│ Average rollout reward:          -6.270743999265763                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━[0m [35m78.5%[0m Elapsed: [33m0:01:59[0m Remaining: [36m0:00:35[0m   1.94 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 62000 ===                                                                                                                                                  │
│ 62001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15765, 15767, 59213, 59928, 62000]                                                                                                                          │
│ Average cumulative reward:       -6.918081611879703                                                                                                                      │
│ Average rollout reward:          -6.270743999265763                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K63/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━[0m [35m79.7%[0m Elapsed: [33m0:02:00[0m Remaining: [36m0:00:33[0m   1.91 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 63000 ===                                                                                                                                                  │
│ 63001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 10001, 10005, 10056, 10074, 63000]                                                                                                                          │
│ Average cumulative reward:       -6.771041314560215                                                                                                                      │
│ Average rollout reward:          -6.113728426263784                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K63/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━[0m [35m79.7%[0m Elapsed: [33m0:02:00[0m Remaining: [36m0:00:33[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 63000 ===                                                                                                                                                  │
│ 63001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 10001, 10005, 10056, 10074, 63000]                                                                                                                          │
│ Average cumulative reward:       -6.771041314560215                                                                                                                      │
│ Average rollout reward:          -6.113728426263784                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K63/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━[0m [35m79.7%[0m Elapsed: [33m0:02:01[0m Remaining: [36m0:00:33[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 63000 ===                                                                                                                                                  │
│ 63001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 10001, 10005, 10056, 10074, 63000]                                                                                                                          │
│ Average cumulative reward:       -6.771041314560215                                                                                                                      │
│ Average rollout reward:          -6.113728426263784                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K63/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━[0m [35m79.7%[0m Elapsed: [33m0:02:01[0m Remaining: [36m0:00:33[0m   1.94 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 63000 ===                                                                                                                                                  │
│ 63001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 10001, 10005, 10056, 10074, 63000]                                                                                                                          │
│ Average cumulative reward:       -6.771041314560215                                                                                                                      │
│ Average rollout reward:          -6.113728426263784                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K64/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━[0m [35m81.0%[0m Elapsed: [33m0:02:02[0m Remaining: [36m0:00:31[0m   1.91 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 64000 ===                                                                                                                                                  │
│ 64001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 63986, 63990, 64000]                                                                                                                                        │
│ Average cumulative reward:       -6.373080042708572                                                                                                                      │
│ Average rollout reward:          -5.725889094495531                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K64/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━[0m [35m81.0%[0m Elapsed: [33m0:02:02[0m Remaining: [36m0:00:31[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 64000 ===                                                                                                                                                  │
│ 64001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 63986, 63990, 64000]                                                                                                                                        │
│ Average cumulative reward:       -6.373080042708572                                                                                                                      │
│ Average rollout reward:          -5.725889094495531                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K64/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━[0m [35m81.0%[0m Elapsed: [33m0:02:03[0m Remaining: [36m0:00:31[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 64000 ===                                                                                                                                                  │
│ 64001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 63986, 63990, 64000]                                                                                                                                        │
│ Average cumulative reward:       -6.373080042708572                                                                                                                      │
│ Average rollout reward:          -5.725889094495531                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K64/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━[0m [35m81.0%[0m Elapsed: [33m0:02:04[0m Remaining: [36m0:00:31[0m   1.94 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 64000 ===                                                                                                                                                  │
│ 64001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 63986, 63990, 64000]                                                                                                                                        │
│ Average cumulative reward:       -6.373080042708572                                                                                                                      │
│ Average rollout reward:          -5.725889094495531                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K65/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━[0m [35m82.3%[0m Elapsed: [33m0:02:04[0m Remaining: [36m0:00:29[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 65000 ===                                                                                                                                                  │
│ 65001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 64944, 64948, 64985, 64996, 65000]                                                                                                                          │
│ Average cumulative reward:       -6.81083177178884                                                                                                                       │
│ Average rollout reward:          -6.194612695332037                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K65/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━[0m [35m82.3%[0m Elapsed: [33m0:02:05[0m Remaining: [36m0:00:29[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 65000 ===                                                                                                                                                  │
│ 65001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 64944, 64948, 64985, 64996, 65000]                                                                                                                          │
│ Average cumulative reward:       -6.81083177178884                                                                                                                       │
│ Average rollout reward:          -6.194612695332037                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K65/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━[0m [35m82.3%[0m Elapsed: [33m0:02:05[0m Remaining: [36m0:00:29[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 65000 ===                                                                                                                                                  │
│ 65001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 64944, 64948, 64985, 64996, 65000]                                                                                                                          │
│ Average cumulative reward:       -6.81083177178884                                                                                                                       │
│ Average rollout reward:          -6.194612695332037                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K65/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━[0m [35m82.3%[0m Elapsed: [33m0:02:06[0m Remaining: [36m0:00:29[0m   1.94 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 65000 ===                                                                                                                                                  │
│ 65001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 64944, 64948, 64985, 64996, 65000]                                                                                                                          │
│ Average cumulative reward:       -6.81083177178884                                                                                                                       │
│ Average rollout reward:          -6.194612695332037                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K66/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━[0m [35m83.5%[0m Elapsed: [33m0:02:06[0m Remaining: [36m0:00:27[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 66000 ===                                                                                                                                                  │
│ 66001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 23700, 23701, 64903, 65692, 66000]                                                                                                                          │
│ Average cumulative reward:       -7.133683680697733                                                                                                                      │
│ Average rollout reward:          -6.499743544548529                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K66/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━[0m [35m83.5%[0m Elapsed: [33m0:02:07[0m Remaining: [36m0:00:27[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 66000 ===                                                                                                                                                  │
│ 66001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 23700, 23701, 64903, 65692, 66000]                                                                                                                          │
│ Average cumulative reward:       -7.133683680697733                                                                                                                      │
│ Average rollout reward:          -6.499743544548529                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K66/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━[0m [35m83.5%[0m Elapsed: [33m0:02:07[0m Remaining: [36m0:00:27[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 66000 ===                                                                                                                                                  │
│ 66001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 23700, 23701, 64903, 65692, 66000]                                                                                                                          │
│ Average cumulative reward:       -7.133683680697733                                                                                                                      │
│ Average rollout reward:          -6.499743544548529                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K66/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━[0m [35m83.5%[0m Elapsed: [33m0:02:08[0m Remaining: [36m0:00:27[0m   1.94 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 66000 ===                                                                                                                                                  │
│ 66001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 23700, 23701, 64903, 65692, 66000]                                                                                                                          │
│ Average cumulative reward:       -7.133683680697733                                                                                                                      │
│ Average rollout reward:          -6.499743544548529                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K67/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━[0m [35m84.8%[0m Elapsed: [33m0:02:08[0m Remaining: [36m0:00:25[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 67000 ===                                                                                                                                                  │
│ 67001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 66887, 66889, 67000]                                                                                                                                        │
│ Average cumulative reward:       -6.439402568816643                                                                                                                      │
│ Average rollout reward:          -5.783023545336195                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K67/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━[0m [35m84.8%[0m Elapsed: [33m0:02:09[0m Remaining: [36m0:00:25[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 67000 ===                                                                                                                                                  │
│ 67001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 66887, 66889, 67000]                                                                                                                                        │
│ Average cumulative reward:       -6.439402568816643                                                                                                                      │
│ Average rollout reward:          -5.783023545336195                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K67/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━[0m [35m84.8%[0m Elapsed: [33m0:02:09[0m Remaining: [36m0:00:25[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 67000 ===                                                                                                                                                  │
│ 67001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 66887, 66889, 67000]                                                                                                                                        │
│ Average cumulative reward:       -6.439402568816643                                                                                                                      │
│ Average rollout reward:          -5.783023545336195                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯;5;237m━━━━━━[0m [35m84.8%[0m Elapsed: [33m0:02:10[0m Remaining: [36m0:00:25[0m   1.94 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 67000 ===                                                                                                                                                  │
│ 67001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 66887, 66889, 67000]                                                                                                                                        │
│ Average cumulative reward:       -6.439402568816643                                                                                                                      │
│ Average rollout reward:          -5.783023545336195                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K68/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━[0m [35m86.1%[0m Elapsed: [33m0:02:10[0m Remaining: [36m0:00:23[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 68000 ===                                                                                                                                                  │
│ 68001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 67872, 67874, 67903, 67905, 67975, 68000]                                                                                                                   │
│ Average cumulative reward:       -6.692607773680873                                                                                                                      │
│ Average rollout reward:          -6.0130433807632535                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K68/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━[0m [35m86.1%[0m Elapsed: [33m0:02:11[0m Remaining: [36m0:00:23[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 68000 ===                                                                                                                                                  │
│ 68001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 67872, 67874, 67903, 67905, 67975, 68000]                                                                                                                   │
│ Average cumulative reward:       -6.692607773680873                                                                                                                      │
│ Average rollout reward:          -6.0130433807632535                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K68/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━[0m [35m86.1%[0m Elapsed: [33m0:02:11[0m Remaining: [36m0:00:23[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 68000 ===                                                                                                                                                  │
│ 68001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 67872, 67874, 67903, 67905, 67975, 68000]                                                                                                                   │
│ Average cumulative reward:       -6.692607773680873                                                                                                                      │
│ Average rollout reward:          -6.0130433807632535                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K69/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━[0m [35m87.3%[0m Elapsed: [33m0:02:12[0m Remaining: [36m0:00:21[0m   1.91 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 69000 ===                                                                                                                                                  │
│ 69001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 68865, 68866, 68898, 69000]                                                                                                                                 │
│ Average cumulative reward:       -6.958693688688045                                                                                                                      │
│ Average rollout reward:          -6.311987277512658                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K69/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━[0m [35m87.3%[0m Elapsed: [33m0:02:12[0m Remaining: [36m0:00:21[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 69000 ===                                                                                                                                                  │
│ 69001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 68865, 68866, 68898, 69000]                                                                                                                                 │
│ Average cumulative reward:       -6.958693688688045                                                                                                                      │
│ Average rollout reward:          -6.311987277512658                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K69/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━[0m [35m87.3%[0m Elapsed: [33m0:02:13[0m Remaining: [36m0:00:21[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 69000 ===                                                                                                                                                  │
│ 69001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 68865, 68866, 68898, 69000]                                                                                                                                 │
│ Average cumulative reward:       -6.958693688688045                                                                                                                      │
│ Average rollout reward:          -6.311987277512658                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K69/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━[0m [35m87.3%[0m Elapsed: [33m0:02:13[0m Remaining: [36m0:00:21[0m   1.94 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 69000 ===                                                                                                                                                  │
│ 69001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 68865, 68866, 68898, 69000]                                                                                                                                 │
│ Average cumulative reward:       -6.958693688688045                                                                                                                      │
│ Average rollout reward:          -6.311987277512658                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K70/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━[0m [35m88.6%[0m Elapsed: [33m0:02:14[0m Remaining: [36m0:00:19[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 70000 ===                                                                                                                                                  │
│ 70001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 69866, 69867, 69878, 70000]                                                                                                                                 │
│ Average cumulative reward:       -6.989070508991316                                                                                                                      │
│ Average rollout reward:          -6.359319345864285                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K70/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━[0m [35m88.6%[0m Elapsed: [33m0:02:14[0m Remaining: [36m0:00:19[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 70000 ===                                                                                                                                                  │
│ 70001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 69866, 69867, 69878, 70000]                                                                                                                                 │
│ Average cumulative reward:       -6.989070508991316                                                                                                                      │
│ Average rollout reward:          -6.359319345864285                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K70/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━[0m [35m88.6%[0m Elapsed: [33m0:02:15[0m Remaining: [36m0:00:19[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 70000 ===                                                                                                                                                  │
│ 70001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 69866, 69867, 69878, 70000]                                                                                                                                 │
│ Average cumulative reward:       -6.989070508991316                                                                                                                      │
│ Average rollout reward:          -6.359319345864285                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K70/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━[0m [35m88.6%[0m Elapsed: [33m0:02:15[0m Remaining: [36m0:00:19[0m   1.94 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 70000 ===                                                                                                                                                  │
│ 70001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 69866, 69867, 69878, 70000]                                                                                                                                 │
│ Average cumulative reward:       -6.989070508991316                                                                                                                      │
│ Average rollout reward:          -6.359319345864285                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K71/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━[0m [35m89.9%[0m Elapsed: [33m0:02:16[0m Remaining: [36m0:00:17[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 71000 ===                                                                                                                                                  │
│ 71001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 70877, 70880, 70921, 70939, 71000]                                                                                                                          │
│ Average cumulative reward:       -6.711107724727655                                                                                                                      │
│ Average rollout reward:          -6.106085191064432                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K71/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━[0m [35m89.9%[0m Elapsed: [33m0:02:16[0m Remaining: [36m0:00:17[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 71000 ===                                                                                                                                                  │
│ 71001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 70877, 70880, 70921, 70939, 71000]                                                                                                                          │
│ Average cumulative reward:       -6.711107724727655                                                                                                                      │
│ Average rollout reward:          -6.106085191064432                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K71/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━[0m [35m89.9%[0m Elapsed: [33m0:02:17[0m Remaining: [36m0:00:17[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 71000 ===                                                                                                                                                  │
│ 71001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 70877, 70880, 70921, 70939, 71000]                                                                                                                          │
│ Average cumulative reward:       -6.711107724727655                                                                                                                      │
│ Average rollout reward:          -6.106085191064432                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K71/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━[0m [35m89.9%[0m Elapsed: [33m0:02:17[0m Remaining: [36m0:00:17[0m   1.94 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 71000 ===                                                                                                                                                  │
│ 71001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 70877, 70880, 70921, 70939, 71000]                                                                                                                          │
│ Average cumulative reward:       -6.711107724727655                                                                                                                      │
│ Average rollout reward:          -6.106085191064432                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K72/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━[0m [35m91.1%[0m Elapsed: [33m0:02:18[0m Remaining: [36m0:00:15[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 72000 ===                                                                                                                                                  │
│ 72001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5518, 5519, 21573, 24685, 72000]                                                                                                                            │
│ Average cumulative reward:       -7.005862633466203                                                                                                                      │
│ Average rollout reward:          -6.3561299009336825                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K72/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━[0m [35m91.1%[0m Elapsed: [33m0:02:18[0m Remaining: [36m0:00:15[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 72000 ===                                                                                                                                                  │
│ 72001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5518, 5519, 21573, 24685, 72000]                                                                                                                            │
│ Average cumulative reward:       -7.005862633466203                                                                                                                      │
│ Average rollout reward:          -6.3561299009336825                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K72/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━[0m [35m91.1%[0m Elapsed: [33m0:02:19[0m Remaining: [36m0:00:15[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 72000 ===                                                                                                                                                  │
│ 72001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5518, 5519, 21573, 24685, 72000]                                                                                                                            │
│ Average cumulative reward:       -7.005862633466203                                                                                                                      │
│ Average rollout reward:          -6.3561299009336825                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K72/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━[0m [35m91.1%[0m Elapsed: [33m0:02:19[0m Remaining: [36m0:00:15[0m   1.94 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 72000 ===                                                                                                                                                  │
│ 72001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5518, 5519, 21573, 24685, 72000]                                                                                                                            │
│ Average cumulative reward:       -7.005862633466203                                                                                                                      │
│ Average rollout reward:          -6.3561299009336825                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯[38;5;237m━━━[0m [35m92.4%[0m Elapsed: [33m0:02:20[0m Remaining: [36m0:00:13[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 73000 ===                                                                                                                                                  │
│ 73001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 72925, 72928, 72932, 73000]                                                                                                                                 │
│ Average cumulative reward:       -7.160945213399076                                                                                                                      │
│ Average rollout reward:          -6.508181524236588                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K73/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━[0m [35m92.4%[0m Elapsed: [33m0:02:20[0m Remaining: [36m0:00:13[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 73000 ===                                                                                                                                                  │
│ 73001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 72925, 72928, 72932, 73000]                                                                                                                                 │
│ Average cumulative reward:       -7.160945213399076                                                                                                                      │
│ Average rollout reward:          -6.508181524236588                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K73/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━[0m [35m92.4%[0m Elapsed: [33m0:02:21[0m Remaining: [36m0:00:13[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 73000 ===                                                                                                                                                  │
│ 73001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 72925, 72928, 72932, 73000]                                                                                                                                 │
│ Average cumulative reward:       -7.160945213399076                                                                                                                      │
│ Average rollout reward:          -6.508181524236588                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K73/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━[0m [35m92.4%[0m Elapsed: [33m0:02:21[0m Remaining: [36m0:00:13[0m   1.94 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 73000 ===                                                                                                                                                  │
│ 73001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 72925, 72928, 72932, 73000]                                                                                                                                 │
│ Average cumulative reward:       -7.160945213399076                                                                                                                      │
│ Average rollout reward:          -6.508181524236588                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K73/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━[0m [35m92.4%[0m Elapsed: [33m0:02:22[0m Remaining: [36m0:00:13[0m   1.95 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 73000 ===                                                                                                                                                  │
│ 73001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 72925, 72928, 72932, 73000]                                                                                                                                 │
│ Average cumulative reward:       -7.160945213399076                                                                                                                      │
│ Average rollout reward:          -6.508181524236588                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K74/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m93.7%[0m Elapsed: [33m0:02:22[0m Remaining: [36m0:00:11[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 74000 ===                                                                                                                                                  │
│ 74001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 73961, 73963, 73999, 74000]                                                                                                                                 │
│ Average cumulative reward:       -6.5662322727797395                                                                                                                     │
│ Average rollout reward:          -5.9496688917149525                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K74/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m93.7%[0m Elapsed: [33m0:02:23[0m Remaining: [36m0:00:11[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 74000 ===                                                                                                                                                  │
│ 74001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 73961, 73963, 73999, 74000]                                                                                                                                 │
│ Average cumulative reward:       -6.5662322727797395                                                                                                                     │
│ Average rollout reward:          -5.9496688917149525                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K74/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m93.7%[0m Elapsed: [33m0:02:23[0m Remaining: [36m0:00:11[0m   1.94 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 74000 ===                                                                                                                                                  │
│ 74001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 73961, 73963, 73999, 74000]                                                                                                                                 │
│ Average cumulative reward:       -6.5662322727797395                                                                                                                     │
│ Average rollout reward:          -5.9496688917149525                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K75/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━[0m [35m94.9%[0m Elapsed: [33m0:02:24[0m Remaining: [36m0:00:09[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 75000 ===                                                                                                                                                  │
│ 75001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 46576, 46577, 74510, 75000]                                                                                                                                 │
│ Average cumulative reward:       -7.084979941944518                                                                                                                      │
│ Average rollout reward:          -6.417117385293524                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K75/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━[0m [35m94.9%[0m Elapsed: [33m0:02:24[0m Remaining: [36m0:00:09[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 75000 ===                                                                                                                                                  │
│ 75001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 46576, 46577, 74510, 75000]                                                                                                                                 │
│ Average cumulative reward:       -7.084979941944518                                                                                                                      │
│ Average rollout reward:          -6.417117385293524                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K75/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━[0m [35m94.9%[0m Elapsed: [33m0:02:25[0m Remaining: [36m0:00:09[0m   1.94 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 75000 ===                                                                                                                                                  │
│ 75001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 46576, 46577, 74510, 75000]                                                                                                                                 │
│ Average cumulative reward:       -7.084979941944518                                                                                                                      │
│ Average rollout reward:          -6.417117385293524                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K75/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━[0m [35m94.9%[0m Elapsed: [33m0:02:25[0m Remaining: [36m0:00:09[0m   1.94 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 75000 ===                                                                                                                                                  │
│ 75001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 46576, 46577, 74510, 75000]                                                                                                                                 │
│ Average cumulative reward:       -7.084979941944518                                                                                                                      │
│ Average rollout reward:          -6.417117385293524                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K75/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━[0m [35m94.9%[0m Elapsed: [33m0:02:26[0m Remaining: [36m0:00:09[0m   1.95 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 75000 ===                                                                                                                                                  │
│ 75001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 46576, 46577, 74510, 75000]                                                                                                                                 │
│ Average cumulative reward:       -7.084979941944518                                                                                                                      │
│ Average rollout reward:          -6.417117385293524                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K76/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━[0m [35m96.2%[0m Elapsed: [33m0:02:26[0m Remaining: [36m0:00:07[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 76000 ===                                                                                                                                                  │
│ 76001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 16178, 16180, 16183, 16191, 16361, 76000]                                                                                                                   │
│ Average cumulative reward:       -6.858149255288991                                                                                                                      │
│ Average rollout reward:          -6.226816297276059                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K76/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━[0m [35m96.2%[0m Elapsed: [33m0:02:27[0m Remaining: [36m0:00:07[0m   1.94 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 76000 ===                                                                                                                                                  │
│ 76001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 16178, 16180, 16183, 16191, 16361, 76000]                                                                                                                   │
│ Average cumulative reward:       -6.858149255288991                                                                                                                      │
│ Average rollout reward:          -6.226816297276059                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K76/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━[0m [35m96.2%[0m Elapsed: [33m0:02:27[0m Remaining: [36m0:00:07[0m   1.94 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 76000 ===                                                                                                                                                  │
│ 76001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 16178, 16180, 16183, 16191, 16361, 76000]                                                                                                                   │
│ Average cumulative reward:       -6.858149255288991                                                                                                                      │
│ Average rollout reward:          -6.226816297276059                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K77/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m97.5%[0m Elapsed: [33m0:02:28[0m Remaining: [36m0:00:05[0m   1.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 77000 ===                                                                                                                                                  │
│ 77001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2146, 2148, 71605, 72324, 77000]                                                                                                                            │
│ Average cumulative reward:       -6.90844997785422                                                                                                                       │
│ Average rollout reward:          -6.345164190673359                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K77/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m97.5%[0m Elapsed: [33m0:02:28[0m Remaining: [36m0:00:05[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 77000 ===                                                                                                                                                  │
│ 77001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2146, 2148, 71605, 72324, 77000]                                                                                                                            │
│ Average cumulative reward:       -6.90844997785422                                                                                                                       │
│ Average rollout reward:          -6.345164190673359                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K77/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m97.5%[0m Elapsed: [33m0:02:29[0m Remaining: [36m0:00:05[0m   1.94 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 77000 ===                                                                                                                                                  │
│ 77001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2146, 2148, 71605, 72324, 77000]                                                                                                                            │
│ Average cumulative reward:       -6.90844997785422                                                                                                                       │
│ Average rollout reward:          -6.345164190673359                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K77/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m97.5%[0m Elapsed: [33m0:02:29[0m Remaining: [36m0:00:05[0m   1.94 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 77000 ===                                                                                                                                                  │
│ 77001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2146, 2148, 71605, 72324, 77000]                                                                                                                            │
│ Average cumulative reward:       -6.90844997785422                                                                                                                       │
│ Average rollout reward:          -6.345164190673359                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K78/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m [35m98.7%[0m Elapsed: [33m0:02:30[0m Remaining: [36m0:00:02[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 78000 ===                                                                                                                                                  │
│ 78001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 11294, 11295, 77562, 77786, 78000]                                                                                                                          │
│ Average cumulative reward:       -7.313845454767168                                                                                                                      │
│ Average rollout reward:          -6.643037904645608                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯m98.7%[0m Elapsed: [33m0:02:30[0m Remaining: [36m0:00:02[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 78000 ===                                                                                                                                                  │
│ 78001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 11294, 11295, 77562, 77786, 78000]                                                                                                                          │
│ Average cumulative reward:       -7.313845454767168                                                                                                                      │
│ Average rollout reward:          -6.643037904645608                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K78/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m [35m98.7%[0m Elapsed: [33m0:02:31[0m Remaining: [36m0:00:02[0m   1.94 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 78000 ===                                                                                                                                                  │
│ 78001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 11294, 11295, 77562, 77786, 78000]                                                                                                                          │
│ Average cumulative reward:       -7.313845454767168                                                                                                                      │
│ Average rollout reward:          -6.643037904645608                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K78/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m [35m98.7%[0m Elapsed: [33m0:02:31[0m Remaining: [36m0:00:02[0m   1.95 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 78000 ===                                                                                                                                                  │
│ 78001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 11294, 11295, 77562, 77786, 78000]                                                                                                                          │
│ Average cumulative reward:       -7.313845454767168                                                                                                                      │
│ Average rollout reward:          -6.643037904645608                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K78/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m [35m98.7%[0m Elapsed: [33m0:02:32[0m Remaining: [36m0:00:02[0m   1.95 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 78000 ===                                                                                                                                                  │
│ 78001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 11294, 11295, 77562, 77786, 78000]                                                                                                                          │
│ Average cumulative reward:       -7.313845454767168                                                                                                                      │
│ Average rollout reward:          -6.643037904645608                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K79/79 [38;2;114;156;31m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m100.0%[0m Elapsed: [33m0:02:32[0m Remaining: [36m0:00:00[0m   1.93 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 78000 ===                                                                                                                                                  │
│ 78001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 11294, 11295, 77562, 77786, 78000]                                                                                                                          │
│ Average cumulative reward:       -7.313845454767168                                                                                                                      │
│ Average rollout reward:          -6.643037904645608                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.439028498698782                                                                                                                              │
│ Best path: [0, 2, 45, 46]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
[?25hNode 0 is not terminal. Continue.
Node 2 is not terminal. Continue.
Node 45 is not terminal. Continue.
Node 46 is not terminal. Continue.
Node 41567 is not terminal. Continue.
Node 41839 is not terminal. Continue.
Node 42098 is not terminal. Continue.
Node 49739 is not terminal. Continue.
No children found. Stop.
Node 0 is not terminal. Continue.
Node 1 is not terminal. Continue.
Node 77836 is not terminal. Continue.
No children found. Stop.
Node 0 is not terminal. Continue.
Node 2 is not terminal. Continue.
Node 45 is not terminal. Continue.
Node 46 is not terminal. Continue.
Node 41567 is not terminal. Continue.
Node 67823 is not terminal. Continue.
Node 77656 is not terminal. Continue.
No children found. Stop.
=== RESULT ===
By Visits: estimated reward: -2.995297777957087
sign_newton [35.536823]
sign_ns [0.48577696 3.2956116 ]
By Value: estimated reward: -21.0
By Best Value: estimated reward: 0
sign_newton [35.536823]
sign_ns [0.5        0.09464214 0.         0.        ]
sign_ns [0.5, 1.6070794260508423]
sign_ns [0.5, 1.4395076317816882]
sign_ns [0.5, 1.1913326509611137]
sign_ns [0.5, 1.029930557486229]
sign_ns [0.5, 1.0006789654828754]
Best value of root node:
-1.439028498698782
Best root policy:
sign_newton [35.536823]
sign_ns [0.5        0.09464214 0.         0.        ]
sign_ns [0.5, 1.6070794260508423]
sign_ns [0.5, 1.4395076317816882]
sign_ns [0.5, 1.1913326509611137]
sign_ns [0.5, 1.029930557486229]
sign_ns [0.5, 1.0006789654828754]
=== END ===
Finished making algorithm
