Files already downloaded and verified
Matrix distribution: CIFAR
Matrix distribution config: {'c': 0.25, 'd': 5000, 'eps': 0.001}
Initial matrix shape: torch.Size([3072, 3072])
Algorithm name: mcts
Algorithm config: {'c_ucb': 5.0, 'alpha_pw': 0.4, 'epsilon': 1e-06, 'EXPLORE_K': 5, 'early_termination_epsilon': 1e-05, 'budget': 150000, 'print_every': 1000, 'max_termination_count': 10, 'tree_initial_capacity': 10000, 'device': 'cuda', 'actions': [['sqrt_db', [[0, 0], [50, 50]]], ['sqrt_nsv', [[0, 0], [5, 5]]], ['sqrt_visser', [[0, 0], [10, 10]]], ['sqrt_visser_coupled', [[0, 0], [10, 10]]], ['sqrt_couple', None]], 'initialize_with_baselines': True}
Actions: ['sqrt_couple', 'sqrt_db', 'sqrt_nsv', 'sqrt_visser', 'sqrt_visser_coupled']
Action sqrt_couple took 1.0 times longer than sqrt_couple
Action sqrt_db took 1.806884528231223 times longer than sqrt_couple
Action sqrt_nsv took 0.3474818314222929 times longer than sqrt_couple
Action sqrt_visser took 0.1354499815609969 times longer than sqrt_couple
Action sqrt_visser_coupled took 0.26909786577085326 times longer than sqrt_couple
Skipping sign_newton because not all actions are in the tree
Skipping sign_scaled_newton because not all actions are in the tree
Skipping sign_ns because not all actions are in the tree
Skipping sign_scaled_ns because not all actions are in the tree
Skipping sign_newton_variant because not all actions are in the tree
Skipping sign_halley because not all actions are in the tree
Skipping inv_ns because not all actions are in the tree
Skipping inv_ns_chebyshev because not all actions are in the tree
Skipping sqrt_newton because not all actions are in the tree
Skipping sqrt_newton_coupled because not all actions are in the tree
Skipping proot_newton because not all actions are in the tree
Skipping proot_visser because not all actions are in the tree
Skipping proot_iannazzo because not all actions are in the tree
/home/sykim/code/make_algorithm/losses.py:39: RuntimeWarning: overflow encountered in multiply
  loss = np.linalg.norm(x * x - y) / np.linalg.norm(y)
/home/sykim/code/make_algorithm/actions.py:878: RuntimeWarning: overflow encountered in multiply
  intermediate = a0 - a1 * Y * Z
/home/sykim/code/make_algorithm/actions.py:879: RuntimeWarning: overflow encountered in multiply
  Yn = 0.5 * Y * intermediate
/home/sykim/code/make_algorithm/actions.py:880: RuntimeWarning: overflow encountered in multiply
  Zn = 0.5 * Z * intermediate
0/149 0.0% Elapsed: 0:00:00 Remaining: -:--:-- 501953.39 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 0 ===                                                                                                                                                      │
│ 1  nodes in tree                                                                                                                                                         │
│ [-10.49393031 -10.49393031]                                                                                                                                              │
│ [-4.16978198 -4.16978198]                                                                                                                                                │
│ [-3.74391618 -3.74391618 -3.74391618 -3.74391618 -3.39643435 -3.39643435]                                                                                                │
│ [-3.39643435 -3.39643435 -3.39643435 -3.39643435 -3.04895252 -3.04895252                                                                                                 │
│  -3.04895252]                                                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K1/149 [38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m0.7%[0m Elapsed: [33m0:00:01[0m Remaining: [36m-:--:--[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 1000 ===                                                                                                                                                   │
│ 1001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 955, 998, 1000]                                                                                                                                             │
│ Average cumulative reward:       -10.717481001004451                                                                                                                     │
│ Average rollout reward:          -10.445541126727257                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.3964343485714896                                                                                                                             │
│ Best path: [0, 2, 14, 643, 650, 655, 663]                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K2/149 [38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m1.3%[0m Elapsed: [33m0:00:02[0m Remaining: [36m0:02:58[0m   1.26 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 2000 ===                                                                                                                                                   │
│ 2001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 1912, 1922, 1926, 2000]                                                                                                                                     │
│ Average cumulative reward:       -10.625800222065951                                                                                                                     │
│ Average rollout reward:          -10.292921606948996                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.3964343485714896                                                                                                                             │
│ Best path: [0, 2, 14, 643, 650, 655, 663]                                                                                                                                │
│ [-3.39643435 -3.39643435 -3.39643435 -3.39643435 -3.04895252 -3.04895252                                                                                                 │
│  -3.04895252 -2.77985465]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K3/149 [38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m2.0%[0m Elapsed: [33m0:00:04[0m Remaining: [36m0:02:57[0m   1.34 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 3000 ===                                                                                                                                                   │
│ 3001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 561, 564, 586, 1574, 3000]                                                                                                                                  │
│ Average cumulative reward:       -10.293385733859493                                                                                                                     │
│ Average rollout reward:          -9.953970542824466                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.396434348571489                                                                                                                              │
│ Best path: [0, 2, 14, 1284, 1387, 1390, 2158, 2243]                                                                                                                      │
│ [-3.04895252 -3.04895252 -3.04895252 -3.04895252 -2.70147069 -2.70147069]                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
4/149 2.7% Elapsed: 0:00:05 Remaining: 0:02:56 1.26 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 4000 ===                                                                                                                                                   │
│ 4001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 3798, 3800, 3806, 3837, 4000]                                                                                                                               │
│ Average cumulative reward:       -10.447750770038358                                                                                                                     │
│ Average rollout reward:          -10.105423878565153                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491967                                                                                                                             │
│ Best path: [0, 2, 3626, 3636, 3648, 3668]                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K5/149 [38;2;249;38;114m━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m3.4%[0m Elapsed: [33m0:00:06[0m Remaining: [36m0:02:56[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 5000 ===                                                                                                                                                   │
│ 5001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 4940, 4956, 4960, 4961, 5000]                                                                                                                               │
│ Average cumulative reward:       -10.501793265511978                                                                                                                     │
│ Average rollout reward:          -10.134652552553531                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491967                                                                                                                             │
│ Best path: [0, 2, 3626, 3636, 3648, 3668]                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K6/149 [38;2;249;38;114m━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m4.0%[0m Elapsed: [33m0:00:07[0m Remaining: [36m0:02:56[0m   1.26 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 6000 ===                                                                                                                                                   │
│ 6001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 5803, 5806, 6000]                                                                                                                                           │
│ Average cumulative reward:       -10.80577367244165                                                                                                                      │
│ Average rollout reward:          -10.439600677339506                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491967                                                                                                                             │
│ Best path: [0, 2, 3626, 3636, 3648, 3668]                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K7/149 [38;2;249;38;114m━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m4.7%[0m Elapsed: [33m0:00:08[0m Remaining: [36m0:02:55[0m   1.22 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 7000 ===                                                                                                                                                   │
│ 7001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 6, 3240, 3521, 5461, 7000]                                                                                                                                  │
│ Average cumulative reward:       -10.38030514899206                                                                                                                      │
│ Average rollout reward:          -10.017486987599241                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491967                                                                                                                             │
│ Best path: [0, 2, 3626, 3636, 3648, 3668]                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K7/149 [38;2;249;38;114m━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m4.7%[0m Elapsed: [33m0:00:09[0m Remaining: [36m0:02:55[0m   1.37 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 7000 ===                                                                                                                                                   │
│ 7001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 6, 3240, 3521, 5461, 7000]                                                                                                                                  │
│ Average cumulative reward:       -10.38030514899206                                                                                                                      │
│ Average rollout reward:          -10.017486987599241                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491967                                                                                                                             │
│ Best path: [0, 2, 3626, 3636, 3648, 3668]                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K8/149 [38;2;249;38;114m━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m5.4%[0m Elapsed: [33m0:00:10[0m Remaining: [36m0:02:53[0m   1.26 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 8000 ===                                                                                                                                                   │
│ 8001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 456, 7998, 8000]                                                                                                                                            │
│ Average cumulative reward:       -10.133224468500398                                                                                                                     │
│ Average rollout reward:          -9.791739402729975                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491967                                                                                                                             │
│ Best path: [0, 2, 3626, 3636, 3648, 3668]                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K9/149 [38;2;249;38;114m━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m6.0%[0m Elapsed: [33m0:00:11[0m Remaining: [36m0:02:51[0m   1.23 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 9000 ===                                                                                                                                                   │
│ 9001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 8059, 8911, 8935, 9000]                                                                                                                                     │
│ Average cumulative reward:       -10.12276923158656                                                                                                                      │
│ Average rollout reward:          -9.786337938488645                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491967                                                                                                                             │
│ Best path: [0, 2, 3626, 3636, 3648, 3668]                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K10/149 [38;2;249;38;114m━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m6.7%[0m Elapsed: [33m0:00:12[0m Remaining: [36m0:02:50[0m   1.26 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 10000 ===                                                                                                                                                  │
│ 10001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 9817, 9873, 10000]                                                                                                                                          │
│ Average cumulative reward:       -10.475736902097937                                                                                                                     │
│ Average rollout reward:          -10.128642397965733                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491967                                                                                                                             │
│ Best path: [0, 2, 3626, 3636, 3648, 3668]                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K11/149 [38;2;249;38;114m━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m7.4%[0m Elapsed: [33m0:00:13[0m Remaining: [36m0:02:49[0m   1.24 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 11000 ===                                                                                                                                                  │
│ 11001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2153, 8364, 8366, 8374, 8375, 11000]                                                                                                                        │
│ Average cumulative reward:       -10.667333057868124                                                                                                                     │
│ Average rollout reward:          -10.290184528196866                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491967                                                                                                                             │
│ Best path: [0, 2, 3626, 3636, 3648, 3668]                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
12/149  8.1%  Elapsed: 0:00:15  Remaining: 0:02:49  1.26 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 12000 ===                                                                                                                                                  │
│ 12001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1384, 10716, 10998, 11964, 12000]                                                                                                                           │
│ Average cumulative reward:       -10.540381825920598                                                                                                                     │
│ Average rollout reward:          -10.180780735196876                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491967                                                                                                                             │
│ Best path: [0, 2, 3626, 3636, 3648, 3668]                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K13/149 [38;2;249;38;114m━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m8.7%[0m Elapsed: [33m0:00:16[0m Remaining: [36m0:02:48[0m   1.24 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 13000 ===                                                                                                                                                  │
│ 13001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 9817, 10997, 11706, 12901, 13000]                                                                                                                           │
│ Average cumulative reward:       -10.658368965687496                                                                                                                     │
│ Average rollout reward:          -10.268993976955084                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491967                                                                                                                             │
│ Best path: [0, 2, 3626, 3636, 3648, 3668]                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K14/149 [38;2;249;38;114m━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m9.4%[0m Elapsed: [33m0:00:17[0m Remaining: [36m0:02:47[0m   1.26 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 14000 ===                                                                                                                                                  │
│ 14001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13976, 13978, 13995, 14000]                                                                                                                                 │
│ Average cumulative reward:       -10.972051539728673                                                                                                                     │
│ Average rollout reward:          -10.595529106776734                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491967                                                                                                                             │
│ Best path: [0, 2, 3626, 3636, 3648, 3668]                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K15/149 [38;2;249;38;114m━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m10.1%[0m Elapsed: [33m0:00:18[0m Remaining: [36m0:02:46[0m   1.24 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 15000 ===                                                                                                                                                  │
│ 15001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2412, 13319, 13898, 15000]                                                                                                                                  │
│ Average cumulative reward:       -10.551931257459389                                                                                                                     │
│ Average rollout reward:          -10.137439343152806                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491967                                                                                                                             │
│ Best path: [0, 2, 3626, 3636, 3648, 3668]                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K16/149 [38;2;249;38;114m━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m10.7%[0m Elapsed: [33m0:00:20[0m Remaining: [36m0:02:45[0m   1.26 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 16000 ===                                                                                                                                                  │
│ 16001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15974, 15997, 16000]                                                                                                                                        │
│ Average cumulative reward:       -10.659985483429667                                                                                                                     │
│ Average rollout reward:          -10.277789673187735                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491967                                                                                                                             │
│ Best path: [0, 2, 3626, 3636, 3648, 3668]                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K17/149 [38;2;249;38;114m━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m11.4%[0m Elapsed: [33m0:00:21[0m Remaining: [36m0:02:44[0m   1.25 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 17000 ===                                                                                                                                                  │
│ 17001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15974, 16045, 16047, 16218, 17000]                                                                                                                          │
│ Average cumulative reward:       -10.156038123617781                                                                                                                     │
│ Average rollout reward:          -9.772269790865053                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491967                                                                                                                             │
│ Best path: [0, 2, 3626, 3636, 3648, 3668]                                                                                                                                │
│ [-3.04895252 -3.04895252 -3.04895252 -3.04895252 -2.70147069 -2.70147069                                                                                                 │
│  -2.70147069 -2.43237282 -2.43237282]                                                                                                                                    │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K18/149 [38;2;249;38;114m━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m12.1%[0m Elapsed: [33m0:00:22[0m Remaining: [36m0:02:43[0m   1.26 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 18000 ===                                                                                                                                                  │
│ 18001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 409, 1474, 1478, 15349, 17552, 18000]                                                                                                                       │
│ Average cumulative reward:       -10.526203338046301                                                                                                                     │
│ Average rollout reward:          -10.109252574432432                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K19/149 [38;2;249;38;114m━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m12.8%[0m Elapsed: [33m0:00:23[0m Remaining: [36m0:02:42[0m   1.25 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 19000 ===                                                                                                                                                  │
│ 19001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5579, 5650, 5652, 19000]                                                                                                                                    │
│ Average cumulative reward:       -10.476404924232515                                                                                                                     │
│ Average rollout reward:          -10.088687925431357                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K20/149 [38;2;249;38;114m━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m13.4%[0m Elapsed: [33m0:00:25[0m Remaining: [36m0:02:41[0m   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 20000 ===                                                                                                                                                  │
│ 20001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 19982, 19998, 20000]                                                                                                                                        │
│ Average cumulative reward:       -10.918683599309093                                                                                                                     │
│ Average rollout reward:          -10.528560315243494                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K21/149 [38;2;249;38;114m━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m14.1%[0m Elapsed: [33m0:00:26[0m Remaining: [36m0:02:40[0m   1.25 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 21000 ===                                                                                                                                                  │
│ 21001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 20947, 20970, 20973, 21000]                                                                                                                                 │
│ Average cumulative reward:       -10.55774604394894                                                                                                                      │
│ Average rollout reward:          -10.16196990260477                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K21/149 [38;2;249;38;114m━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m14.1%[0m Elapsed: [33m0:00:26[0m Remaining: [36m0:02:40[0m   1.27 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 21000 ===                                                                                                                                                  │
│ 21001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 20947, 20970, 20973, 21000]                                                                                                                                 │
│ Average cumulative reward:       -10.55774604394894                                                                                                                      │
│ Average rollout reward:          -10.16196990260477                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K21/149 [38;2;249;38;114m━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m14.1%[0m Elapsed: [33m0:00:27[0m Remaining: [36m0:02:40[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 21000 ===                                                                                                                                                  │
│ 21001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 20947, 20970, 20973, 21000]                                                                                                                                 │
│ Average cumulative reward:       -10.55774604394894                                                                                                                      │
│ Average rollout reward:          -10.16196990260477                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K22/149 [38;2;249;38;114m━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m14.8%[0m Elapsed: [33m0:00:27[0m Remaining: [36m0:02:39[0m   1.26 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 22000 ===                                                                                                                                                  │
│ 22001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21939, 21995, 21999, 22000]                                                                                                                                 │
│ Average cumulative reward:       -10.418335624273034                                                                                                                     │
│ Average rollout reward:          -10.02388954046019                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K22/149 [38;2;249;38;114m━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m14.8%[0m Elapsed: [33m0:00:28[0m Remaining: [36m0:02:39[0m   1.28 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 22000 ===                                                                                                                                                  │
│ 22001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21939, 21995, 21999, 22000]                                                                                                                                 │
│ Average cumulative reward:       -10.418335624273034                                                                                                                     │
│ Average rollout reward:          -10.02388954046019                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K23/149 [38;2;249;38;114m━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m15.4%[0m Elapsed: [33m0:00:28[0m Remaining: [36m0:02:38[0m   1.25 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 23000 ===                                                                                                                                                  │
│ 23001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 22959, 22982, 22986, 23000]                                                                                                                                 │
│ Average cumulative reward:       -10.869445221914127                                                                                                                     │
│ Average rollout reward:          -10.458996186045262                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K23/149 [38;2;249;38;114m━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m15.4%[0m Elapsed: [33m0:00:29[0m Remaining: [36m0:02:38[0m   1.27 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 23000 ===                                                                                                                                                  │
│ 23001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 22959, 22982, 22986, 23000]                                                                                                                                 │
│ Average cumulative reward:       -10.869445221914127                                                                                                                     │
│ Average rollout reward:          -10.458996186045262                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K23/149 [38;2;249;38;114m━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m15.4%[0m Elapsed: [33m0:00:29[0m Remaining: [36m0:02:38[0m   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 23000 ===                                                                                                                                                  │
│ 23001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 22959, 22982, 22986, 23000]                                                                                                                                 │
│ Average cumulative reward:       -10.869445221914127                                                                                                                     │
│ Average rollout reward:          -10.458996186045262                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K24/149 [38;2;249;38;114m━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m16.1%[0m Elapsed: [33m0:00:30[0m Remaining: [36m0:02:37[0m   1.26 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 24000 ===                                                                                                                                                  │
│ 24001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6, 17434, 17715, 19366, 24000]                                                                                                                              │
│ Average cumulative reward:       -10.386674086134404                                                                                                                     │
│ Average rollout reward:          -9.973288748238907                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K24/149 [38;2;249;38;114m━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m16.1%[0m Elapsed: [33m0:00:30[0m Remaining: [36m0:02:37[0m   1.28 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 24000 ===                                                                                                                                                  │
│ 24001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6, 17434, 17715, 19366, 24000]                                                                                                                              │
│ Average cumulative reward:       -10.386674086134404                                                                                                                     │
│ Average rollout reward:          -9.973288748238907                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K25/149 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m16.8%[0m Elapsed: [33m0:00:31[0m Remaining: [36m0:02:35[0m   1.25 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 25000 ===                                                                                                                                                  │
│ 25001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15974, 16977, 25000]                                                                                                                                        │
│ Average cumulative reward:       -10.397186650821045                                                                                                                     │
│ Average rollout reward:          -10.009131258693065                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K25/149 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m16.8%[0m Elapsed: [33m0:00:31[0m Remaining: [36m0:02:35[0m   1.27 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 25000 ===                                                                                                                                                  │
│ 25001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15974, 16977, 25000]                                                                                                                                        │
│ Average cumulative reward:       -10.397186650821045                                                                                                                     │
│ Average rollout reward:          -10.009131258693065                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K25/149 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m16.8%[0m Elapsed: [33m0:00:32[0m Remaining: [36m0:02:35[0m   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 25000 ===                                                                                                                                                  │
│ 25001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15974, 16977, 25000]                                                                                                                                        │
│ Average cumulative reward:       -10.397186650821045                                                                                                                     │
│ Average rollout reward:          -10.009131258693065                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K26/149 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m17.4%[0m Elapsed: [33m0:00:32[0m Remaining: [36m0:02:35[0m   1.26 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 26000 ===                                                                                                                                                  │
│ 26001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 25082, 25979, 25984, 25997, 26000]                                                                                                                          │
│ Average cumulative reward:       -10.520101384929609                                                                                                                     │
│ Average rollout reward:          -10.13672297428633                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K26/149 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m17.4%[0m Elapsed: [33m0:00:33[0m Remaining: [36m0:02:35[0m   1.28 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 26000 ===                                                                                                                                                  │
│ 26001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 25082, 25979, 25984, 25997, 26000]                                                                                                                          │
│ Average cumulative reward:       -10.520101384929609                                                                                                                     │
│ Average rollout reward:          -10.13672297428633                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K27/149 [38;2;249;38;114m━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m18.1%[0m Elapsed: [33m0:00:33[0m Remaining: [36m0:02:34[0m   1.25 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 27000 ===                                                                                                                                                  │
│ 27001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 25631, 25641, 25642, 27000]                                                                                                                                 │
│ Average cumulative reward:       -10.460284029086713                                                                                                                     │
│ Average rollout reward:          -10.044484274664523                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K27/149 [38;2;249;38;114m━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m18.1%[0m Elapsed: [33m0:00:34[0m Remaining: [36m0:02:34[0m   1.27 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 27000 ===                                                                                                                                                  │
│ 27001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 25631, 25641, 25642, 27000]                                                                                                                                 │
│ Average cumulative reward:       -10.460284029086713                                                                                                                     │
│ Average rollout reward:          -10.044484274664523                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K27/149 [38;2;249;38;114m━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m18.1%[0m Elapsed: [33m0:00:34[0m Remaining: [36m0:02:34[0m   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 27000 ===                                                                                                                                                  │
│ 27001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 25631, 25641, 25642, 27000]                                                                                                                                 │
│ Average cumulative reward:       -10.460284029086713                                                                                                                     │
│ Average rollout reward:          -10.044484274664523                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━━━━━━━━[0m [35m18.8%[0m Elapsed: [33m0:00:35[0m Remaining: [36m0:02:33[0m   1.26 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 28000 ===                                                                                                                                                  │
│ 28001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 27897, 27986, 27988, 27992, 28000]                                                                                                                          │
│ Average cumulative reward:       -10.83045069028194                                                                                                                      │
│ Average rollout reward:          -10.420684709977227                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
29/149 ━━━━━━━╸━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 19.5% Elapsed: 0:00:36 Remaining: 0:02:32   1.27 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 29000 ===                                                                                                                                                  │
│ 29001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 881, 17437, 18010, 29000]                                                                                                                                   │
│ Average cumulative reward:       -10.589424970796259                                                                                                                     │
│ Average rollout reward:          -10.182703749044471                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
30/149 ━━━━━━━━╺━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 20.1% Elapsed: 0:00:37 Remaining: 0:02:31   1.26 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 30000 ===                                                                                                                                                  │
│ 30001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 18586, 29664, 29666, 29669, 30000]                                                                                                                          │
│ Average cumulative reward:       -10.436641699874                                                                                                                        │
│ Average rollout reward:          -10.052864737444999                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
31/149 ━━━━━━━━╺━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 20.8% Elapsed: 0:00:39 Remaining: 0:02:30   1.27 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 31000 ===                                                                                                                                                  │
│ 31001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 365, 30975, 30979, 30983, 31000]                                                                                                                            │
│ Average cumulative reward:       -10.695302236998367                                                                                                                     │
│ Average rollout reward:          -10.263105703901385                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
32/149 ━━━━━━━━╸━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 21.5% Elapsed: 0:00:40 Remaining: 0:02:30   1.26 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 32000 ===                                                                                                                                                  │
│ 32001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 286, 30441, 30445, 30446, 31179, 32000]                                                                                                                     │
│ Average cumulative reward:       -11.17755995751568                                                                                                                      │
│ Average rollout reward:          -10.76358194014483                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
33/149 ━━━━━━━━╸━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 22.1% Elapsed: 0:00:41 Remaining: 0:02:29   1.27 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 33000 ===                                                                                                                                                  │
│ 33001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 32779, 32961, 32965, 32967, 32980, 33000]                                                                                                                   │
│ Average cumulative reward:       -11.020912000730789                                                                                                                     │
│ Average rollout reward:          -10.606705468636767                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
34/149 ━━━━━━━━━╺━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 22.8% Elapsed: 0:00:43 Remaining: 0:02:30   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 34000 ===                                                                                                                                                  │
│ 34001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 9817, 9925, 9928, 34000]                                                                                                                                    │
│ Average cumulative reward:       -10.818291273296584                                                                                                                     │
│ Average rollout reward:          -10.38380275713073                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
35/149 ━━━━━━━━━╺━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 23.5% Elapsed: 0:00:44 Remaining: 0:02:29   1.28 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 35000 ===                                                                                                                                                  │
│ 35001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 8622, 28755, 29288, 34533, 34895, 35000]                                                                                                                    │
│ Average cumulative reward:       -10.823982658177053                                                                                                                     │
│ Average rollout reward:          -10.384116686668836                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━[0m [35m23.5%[0m Elapsed: [33m0:00:45[0m Remaining: [36m0:02:29[0m   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 35000 ===                                                                                                                                                  │
│ 35001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 8622, 28755, 29288, 34533, 34895, 35000]                                                                                                                    │
│ Average cumulative reward:       -10.823982658177053                                                                                                                     │
│ Average rollout reward:          -10.384116686668836                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K35/149 [38;2;249;38;114m━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m23.5%[0m Elapsed: [33m0:00:45[0m Remaining: [36m0:02:29[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 35000 ===                                                                                                                                                  │
│ 35001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 8622, 28755, 29288, 34533, 34895, 35000]                                                                                                                    │
│ Average cumulative reward:       -10.823982658177053                                                                                                                     │
│ Average rollout reward:          -10.384116686668836                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K36/149 [38;2;249;38;114m━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m24.2%[0m Elapsed: [33m0:00:46[0m Remaining: [36m0:02:28[0m   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 36000 ===                                                                                                                                                  │
│ 36001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 24541, 31878, 33252, 36000]                                                                                                                                 │
│ Average cumulative reward:       -10.488902034349193                                                                                                                     │
│ Average rollout reward:          -10.06625857859749                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K36/149 [38;2;249;38;114m━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m24.2%[0m Elapsed: [33m0:00:46[0m Remaining: [36m0:02:28[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 36000 ===                                                                                                                                                  │
│ 36001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 24541, 31878, 33252, 36000]                                                                                                                                 │
│ Average cumulative reward:       -10.488902034349193                                                                                                                     │
│ Average rollout reward:          -10.06625857859749                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K37/149 [38;2;249;38;114m━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m24.8%[0m Elapsed: [33m0:00:47[0m Remaining: [36m0:02:27[0m   1.28 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 37000 ===                                                                                                                                                  │
│ 37001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 27897, 36967, 36990, 36999, 37000]                                                                                                                          │
│ Average cumulative reward:       -10.492049226370408                                                                                                                     │
│ Average rollout reward:          -10.077488595512715                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K37/149 [38;2;249;38;114m━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m24.8%[0m Elapsed: [33m0:00:47[0m Remaining: [36m0:02:27[0m   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 37000 ===                                                                                                                                                  │
│ 37001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 27897, 36967, 36990, 36999, 37000]                                                                                                                          │
│ Average cumulative reward:       -10.492049226370408                                                                                                                     │
│ Average rollout reward:          -10.077488595512715                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K37/149 [38;2;249;38;114m━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m24.8%[0m Elapsed: [33m0:00:48[0m Remaining: [36m0:02:27[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 37000 ===                                                                                                                                                  │
│ 37001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 27897, 36967, 36990, 36999, 37000]                                                                                                                          │
│ Average cumulative reward:       -10.492049226370408                                                                                                                     │
│ Average rollout reward:          -10.077488595512715                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K38/149 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m25.5%[0m Elapsed: [33m0:00:48[0m Remaining: [36m0:02:26[0m   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 38000 ===                                                                                                                                                  │
│ 38001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 18134, 35654, 35843, 38000]                                                                                                                                 │
│ Average cumulative reward:       -10.718061328147096                                                                                                                     │
│ Average rollout reward:          -10.315089906249527                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K38/149 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m25.5%[0m Elapsed: [33m0:00:49[0m Remaining: [36m0:02:26[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 38000 ===                                                                                                                                                  │
│ 38001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 18134, 35654, 35843, 38000]                                                                                                                                 │
│ Average cumulative reward:       -10.718061328147096                                                                                                                     │
│ Average rollout reward:          -10.315089906249527                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K39/149 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.2%[0m Elapsed: [33m0:00:49[0m Remaining: [36m0:02:25[0m   1.28 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 39000 ===                                                                                                                                                  │
│ 39001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 38847, 38903, 38906, 38911, 39000]                                                                                                                          │
│ Average cumulative reward:       -10.734881972570388                                                                                                                     │
│ Average rollout reward:          -10.325416853192477                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K39/149 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.2%[0m Elapsed: [33m0:00:50[0m Remaining: [36m0:02:25[0m   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 39000 ===                                                                                                                                                  │
│ 39001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 38847, 38903, 38906, 38911, 39000]                                                                                                                          │
│ Average cumulative reward:       -10.734881972570388                                                                                                                     │
│ Average rollout reward:          -10.325416853192477                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K39/149 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.2%[0m Elapsed: [33m0:00:50[0m Remaining: [36m0:02:25[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 39000 ===                                                                                                                                                  │
│ 39001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 38847, 38903, 38906, 38911, 39000]                                                                                                                          │
│ Average cumulative reward:       -10.734881972570388                                                                                                                     │
│ Average rollout reward:          -10.325416853192477                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K40/149 [38;2;249;38;114m━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.8%[0m Elapsed: [33m0:00:51[0m Remaining: [36m0:02:23[0m   1.28 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 40000 ===                                                                                                                                                  │
│ 40001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21939, 21940, 21947, 22005, 40000]                                                                                                                          │
│ Average cumulative reward:       -10.389488118102642                                                                                                                     │
│ Average rollout reward:          -9.98462352327572                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K40/149 [38;2;249;38;114m━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.8%[0m Elapsed: [33m0:00:51[0m Remaining: [36m0:02:23[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 40000 ===                                                                                                                                                  │
│ 40001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21939, 21940, 21947, 22005, 40000]                                                                                                                          │
│ Average cumulative reward:       -10.389488118102642                                                                                                                     │
│ Average rollout reward:          -9.98462352327572                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K41/149 [38;2;249;38;114m━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m27.5%[0m Elapsed: [33m0:00:52[0m Remaining: [36m0:02:22[0m   1.28 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 41000 ===                                                                                                                                                  │
│ 41001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5148, 40797, 40979, 40988, 41000]                                                                                                                           │
│ Average cumulative reward:       -10.321748542212013                                                                                                                     │
│ Average rollout reward:          -9.888601099904394                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
42/149 ━━━━━━━━━━━╺━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 28.2% Elapsed: 0:00:53 Remaining: 0:02:20   1.28 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 42000 ===                                                                                                                                                  │
│ 42001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 41744, 41987, 41991, 41996, 42000]                                                                                                                          │
│ Average cumulative reward:       -10.39659901248365                                                                                                                      │
│ Average rollout reward:          -9.975005865065391                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
43/149 ━━━━━━━━━━━╸━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 28.9% Elapsed: 0:00:55 Remaining: 0:02:20   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 43000 ===                                                                                                                                                  │
│ 43001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 41744, 41852, 41856, 42402, 43000]                                                                                                                          │
│ Average cumulative reward:       -10.628965897288637                                                                                                                     │
│ Average rollout reward:          -10.215630737575738                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
44/149 ━━━━━━━━━━━╸━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 29.5% Elapsed: 0:00:56 Remaining: 0:02:19   1.28 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 44000 ===                                                                                                                                                  │
│ 44001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 43999, 44000]                                                                                                                                               │
│ Average cumulative reward:       -11.328417638669798                                                                                                                     │
│ Average rollout reward:          -10.902541594739324                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
45/149 ━━━━━━━━━━━━╺━━━━━━━━━━━━━━━━━━━━━━━━━━━ 30.2% Elapsed: 0:00:57 Remaining: 0:02:18   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 45000 ===                                                                                                                                                  │
│ 45001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 25082, 25190, 25192, 35324, 45000]                                                                                                                          │
│ Average cumulative reward:       -10.663471824304912                                                                                                                     │
│ Average rollout reward:          -10.225257204763814                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
46/149 ━━━━━━━━━━━━╺━━━━━━━━━━━━━━━━━━━━━━━━━━━ 30.9% Elapsed: 0:00:59 Remaining: 0:02:16   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 46000 ===                                                                                                                                                  │
│ 46001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 43239, 45384, 46000]                                                                                                                                        │
│ Average cumulative reward:       -10.506581176594894                                                                                                                     │
│ Average rollout reward:          -10.092755975729323                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
47/149 ━━━━━━━━━━━━╸━━━━━━━━━━━━━━━━━━━━━━━━━━━ 31.5% Elapsed: 0:01:00 Remaining: 0:02:16   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 47000 ===                                                                                                                                                  │
│ 47001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13976, 13978, 13995, 14706, 38147, 47000]                                                                                                                   │
│ Average cumulative reward:       -10.64172427477599                                                                                                                      │
│ Average rollout reward:          -10.177553070685605                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
48/149 ━━━━━━━━━━━━╸━━━━━━━━━━━━━━━━━━━━━━━━━━━ 32.2% Elapsed: 0:01:01 Remaining: 0:02:14   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 48000 ===                                                                                                                                                  │
│ 48001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 47918, 47989, 47993, 48000]                                                                                                                                 │
│ Average cumulative reward:       -10.564004817738981                                                                                                                     │
│ Average rollout reward:          -10.145727230577384                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K49/149 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m32.9%[0m Elapsed: [33m0:01:02[0m Remaining: [36m0:02:13[0m   1.28 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 49000 ===                                                                                                                                                  │
│ 49001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 48725, 48907, 48909, 49000]                                                                                                                                 │
│ Average cumulative reward:       -10.777367388205604                                                                                                                     │
│ Average rollout reward:          -10.35913225987585                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K50/149 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m33.6%[0m Elapsed: [33m0:01:04[0m Remaining: [36m0:02:11[0m   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 50000 ===                                                                                                                                                  │
│ 50001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 49541, 49543, 49563, 49982, 50000]                                                                                                                          │
│ Average cumulative reward:       -10.290136776593076                                                                                                                     │
│ Average rollout reward:          -9.85683048001815                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K51/149 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m34.2%[0m Elapsed: [33m0:01:05[0m Remaining: [36m0:02:10[0m   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 51000 ===                                                                                                                                                  │
│ 51001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 8622, 8632, 8635, 51000]                                                                                                                                    │
│ Average cumulative reward:       -10.237100233756502                                                                                                                     │
│ Average rollout reward:          -9.785757677530594                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K52/149 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m34.9%[0m Elapsed: [33m0:01:06[0m Remaining: [36m0:02:09[0m   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 52000 ===                                                                                                                                                  │
│ 52001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 161, 51877, 51881, 51998, 52000]                                                                                                                            │
│ Average cumulative reward:       -11.031229019622213                                                                                                                     │
│ Average rollout reward:          -10.606321809304136                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K53/149 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m35.6%[0m Elapsed: [33m0:01:08[0m Remaining: [36m0:02:08[0m   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 53000 ===                                                                                                                                                  │
│ 53001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 52887, 52958, 52962, 53000]                                                                                                                                 │
│ Average cumulative reward:       -10.772130631286478                                                                                                                     │
│ Average rollout reward:          -10.35024759235163                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K54/149 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m36.2%[0m Elapsed: [33m0:01:09[0m Remaining: [36m0:02:07[0m   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 54000 ===                                                                                                                                                  │
│ 54001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 53744, 53745, 53758, 53829, 54000]                                                                                                                          │
│ Average cumulative reward:       -10.251693016757716                                                                                                                     │
│ Average rollout reward:          -9.844997588680593                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
55/149 ━━━━━━━━━━━━━━╸━━━━━━━━━━━━━━━━━━━━━━━━━ 36.9% Elapsed: 0:01:11 Remaining: 0:02:05   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 55000 ===                                                                                                                                                  │
│ 55001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 54609, 54967, 54971, 54994, 55000]                                                                                                                          │
│ Average cumulative reward:       -10.591728329067218                                                                                                                     │
│ Average rollout reward:          -10.166822537558073                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
56/149 ━━━━━━━━━━━━━━━╺━━━━━━━━━━━━━━━━━━━━━━━━ 37.6% Elapsed: 0:01:12 Remaining: 0:02:02   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 56000 ===                                                                                                                                                  │
│ 56001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13596, 13726, 13730, 33958, 35276, 45974, 56000]                                                                                                            │
│ Average cumulative reward:       -10.891183338910714                                                                                                                     │
│ Average rollout reward:          -10.484696090307148                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
57/149 ━━━━━━━━━━━━━━━╺━━━━━━━━━━━━━━━━━━━━━━━━ 38.3% Elapsed: 0:01:13 Remaining: 0:02:00   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 57000 ===                                                                                                                                                  │
│ 57001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 56365, 56975, 56979, 57000]                                                                                                                                 │
│ Average cumulative reward:       -9.862971149032392                                                                                                                      │
│ Average rollout reward:          -9.477717505937433                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
58/149 ━━━━━━━━━━━━━━━╸━━━━━━━━━━━━━━━━━━━━━━━━ 38.9% Elapsed: 0:01:15 Remaining: 0:01:59   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 58000 ===                                                                                                                                                  │
│ 58001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 56365, 56408, 57740, 57767, 57773, 58000]                                                                                                                   │
│ Average cumulative reward:       -10.68958442520887                                                                                                                      │
│ Average rollout reward:          -10.265546034413374                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
59/149 ━━━━━━━━━━━━━━━╸━━━━━━━━━━━━━━━━━━━━━━━━ 39.6% Elapsed: 0:01:16 Remaining: 0:01:58   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 59000 ===                                                                                                                                                  │
│ 59001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 51198, 51200, 51206, 51229, 51472, 59000]                                                                                                                   │
│ Average cumulative reward:       -10.977560040469173                                                                                                                     │
│ Average rollout reward:          -10.528211172841823                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
60/149 ━━━━━━━━━━━━━━━━╺━━━━━━━━━━━━━━━━━━━━━━━ 40.3% Elapsed: 0:01:17 Remaining: 0:01:57   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 60000 ===                                                                                                                                                  │
│ 60001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 59976, 59999, 60000]                                                                                                                                        │
│ Average cumulative reward:       -10.513396641485215                                                                                                                     │
│ Average rollout reward:          -10.113733067397485                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
61/149 ━━━━━━━━━━━━━━━━╺━━━━━━━━━━━━━━━━━━━━━━━ 40.9% Elapsed: 0:01:18 Remaining: 0:01:55   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 61000 ===                                                                                                                                                  │
│ 61001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 60900, 60910, 60913, 61000]                                                                                                                                 │
│ Average cumulative reward:       -10.262614974049475                                                                                                                     │
│ Average rollout reward:          -9.847773827947115                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K61/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m40.9%[0m Elapsed: [33m0:01:19[0m Remaining: [36m0:01:55[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 61000 ===                                                                                                                                                  │
│ 61001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 60900, 60910, 60913, 61000]                                                                                                                                 │
│ Average cumulative reward:       -10.262614974049475                                                                                                                     │
│ Average rollout reward:          -9.847773827947115                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K61/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m40.9%[0m Elapsed: [33m0:01:19[0m Remaining: [36m0:01:55[0m   1.30 s/iter
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K62/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m41.6%[0m Elapsed: [33m0:01:20[0m Remaining: [36m0:01:54[0m   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 62000 ===                                                                                                                                                  │
│ 62001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 61832, 61848, 61854, 61938, 62000]                                                                                                                          │
│ Average cumulative reward:       -10.872255437170676                                                                                                                     │
│ Average rollout reward:          -10.424756694661637                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K63/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m42.3%[0m Elapsed: [33m0:01:21[0m Remaining: [36m0:01:53[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 63000 ===                                                                                                                                                  │
│ 63001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 61832, 62939, 62941, 62983, 62990, 63000]                                                                                                                   │
│ Average cumulative reward:       -10.448223529353488                                                                                                                     │
│ Average rollout reward:          -10.036831369095244                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K64/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m43.0%[0m Elapsed: [33m0:01:22[0m Remaining: [36m0:01:52[0m   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 64000 ===                                                                                                                                                  │
│ 64001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 22959, 22960, 22964, 64000]                                                                                                                                 │
│ Average cumulative reward:       -10.905943096704638                                                                                                                     │
│ Average rollout reward:          -10.43882474611209                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K65/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m43.6%[0m Elapsed: [33m0:01:24[0m Remaining: [36m0:01:50[0m   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 65000 ===                                                                                                                                                  │
│ 65001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 64681, 64998, 65000]                                                                                                                                        │
│ Average cumulative reward:       -10.38650807219722                                                                                                                      │
│ Average rollout reward:          -9.954503257159429                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K66/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m44.3%[0m Elapsed: [33m0:01:25[0m Remaining: [36m0:01:49[0m   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 66000 ===                                                                                                                                                  │
│ 66001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 65647, 65890, 65892, 65897, 65900, 66000]                                                                                                                   │
│ Average cumulative reward:       -10.473046643973465                                                                                                                     │
│ Average rollout reward:          -10.063052320796915                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K67/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m45.0%[0m Elapsed: [33m0:01:26[0m Remaining: [36m0:01:48[0m   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 67000 ===                                                                                                                                                  │
│ 67001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 64681, 64892, 65380, 65510, 65520, 67000]                                                                                                                   │
│ Average cumulative reward:       -11.389265658676315                                                                                                                     │
│ Average rollout reward:          -10.939384828700957                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K68/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m45.6%[0m Elapsed: [33m0:01:28[0m Remaining: [36m0:01:47[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 68000 ===                                                                                                                                                  │
│ 68001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13976, 13982, 13985, 29566, 68000]                                                                                                                          │
│ Average cumulative reward:       -10.67750395697726                                                                                                                      │
│ Average rollout reward:          -10.203479660260147                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K68/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m45.6%[0m Elapsed: [33m0:01:28[0m Remaining: [36m0:01:47[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 68000 ===                                                                                                                                                  │
│ 68001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13976, 13982, 13985, 29566, 68000]                                                                                                                          │
│ Average cumulative reward:       -10.67750395697726                                                                                                                      │
│ Average rollout reward:          -10.203479660260147                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K69/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m46.3%[0m Elapsed: [33m0:01:29[0m Remaining: [36m0:01:45[0m   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 69000 ===                                                                                                                                                  │
│ 69001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 68599, 68609, 68621, 68641, 69000]                                                                                                                          │
│ Average cumulative reward:       -10.40251330073695                                                                                                                      │
│ Average rollout reward:          -9.980554407975555                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K69/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m46.3%[0m Elapsed: [33m0:01:29[0m Remaining: [36m0:01:45[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 69000 ===                                                                                                                                                  │
│ 69001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 68599, 68609, 68621, 68641, 69000]                                                                                                                          │
│ Average cumulative reward:       -10.40251330073695                                                                                                                      │
│ Average rollout reward:          -9.980554407975555                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K69/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m46.3%[0m Elapsed: [33m0:01:30[0m Remaining: [36m0:01:45[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 69000 ===                                                                                                                                                  │
│ 69001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 68599, 68609, 68621, 68641, 69000]                                                                                                                          │
│ Average cumulative reward:       -10.40251330073695                                                                                                                      │
│ Average rollout reward:          -9.980554407975555                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K70/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m47.0%[0m Elapsed: [33m0:01:30[0m Remaining: [36m0:01:44[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 70000 ===                                                                                                                                                  │
│ 70001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 56365, 57099, 57103, 69964, 70000]                                                                                                                          │
│ Average cumulative reward:       -10.682452246616474                                                                                                                     │
│ Average rollout reward:          -10.208579016867434                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K70/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m47.0%[0m Elapsed: [33m0:01:31[0m Remaining: [36m0:01:44[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 70000 ===                                                                                                                                                  │
│ 70001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 56365, 57099, 57103, 69964, 70000]                                                                                                                          │
│ Average cumulative reward:       -10.682452246616474                                                                                                                     │
│ Average rollout reward:          -10.208579016867434                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K70/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m47.0%[0m Elapsed: [33m0:01:31[0m Remaining: [36m0:01:44[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 70000 ===                                                                                                                                                  │
│ 70001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 56365, 57099, 57103, 69964, 70000]                                                                                                                          │
│ Average cumulative reward:       -10.682452246616474                                                                                                                     │
│ Average rollout reward:          -10.208579016867434                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K71/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m47.7%[0m Elapsed: [33m0:01:32[0m Remaining: [36m0:01:43[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 71000 ===                                                                                                                                                  │
│ 71001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 41744, 42023, 42025, 42475, 60426, 71000]                                                                                                                   │
│ Average cumulative reward:       -11.03513909120628                                                                                                                      │
│ Average rollout reward:          -10.587879488077274                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K71/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m47.7%[0m Elapsed: [33m0:01:32[0m Remaining: [36m0:01:43[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 71000 ===                                                                                                                                                  │
│ 71001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 41744, 42023, 42025, 42475, 60426, 71000]                                                                                                                   │
│ Average cumulative reward:       -11.03513909120628                                                                                                                      │
│ Average rollout reward:          -10.587879488077274                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K72/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m48.3%[0m Elapsed: [33m0:01:33[0m Remaining: [36m0:01:42[0m   1.29 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 72000 ===                                                                                                                                                  │
│ 72001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6751, 6794, 6798, 69537, 69892, 72000]                                                                                                                      │
│ Average cumulative reward:       -10.703385488387603                                                                                                                     │
│ Average rollout reward:          -10.218386963262683                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K72/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m48.3%[0m Elapsed: [33m0:01:33[0m Remaining: [36m0:01:42[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 72000 ===                                                                                                                                                  │
│ 72001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6751, 6794, 6798, 69537, 69892, 72000]                                                                                                                      │
│ Average cumulative reward:       -10.703385488387603                                                                                                                     │
│ Average rollout reward:          -10.218386963262683                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K72/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m48.3%[0m Elapsed: [33m0:01:34[0m Remaining: [36m0:01:42[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 72000 ===                                                                                                                                                  │
│ 72001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6751, 6794, 6798, 69537, 69892, 72000]                                                                                                                      │
│ Average cumulative reward:       -10.703385488387603                                                                                                                     │
│ Average rollout reward:          -10.218386963262683                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K73/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m49.0%[0m Elapsed: [33m0:01:34[0m Remaining: [36m0:01:40[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 73000 ===                                                                                                                                                  │
│ 73001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 11106, 11109, 71767, 73000]                                                                                                                                 │
│ Average cumulative reward:       -10.696744927615415                                                                                                                     │
│ Average rollout reward:          -10.253582572592936                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K73/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m49.0%[0m Elapsed: [33m0:01:35[0m Remaining: [36m0:01:40[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 73000 ===                                                                                                                                                  │
│ 73001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 11106, 11109, 71767, 73000]                                                                                                                                 │
│ Average cumulative reward:       -10.696744927615415                                                                                                                     │
│ Average rollout reward:          -10.253582572592936                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━━━━━━━━[0m [35m49.0%[0m Elapsed: [33m0:01:35[0m Remaining: [36m0:01:40[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 73000 ===                                                                                                                                                  │
│ 73001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 11106, 11109, 71767, 73000]                                                                                                                                 │
│ Average cumulative reward:       -10.696744927615415                                                                                                                     │
│ Average rollout reward:          -10.253582572592936                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K74/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m49.7%[0m Elapsed: [33m0:01:36[0m Remaining: [36m0:01:39[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 74000 ===                                                                                                                                                  │
│ 74001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 73693, 73694, 73991, 73996, 74000]                                                                                                                          │
│ Average cumulative reward:       -10.382794442861247                                                                                                                     │
│ Average rollout reward:          -9.945346741943663                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K74/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m49.7%[0m Elapsed: [33m0:01:36[0m Remaining: [36m0:01:39[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 74000 ===                                                                                                                                                  │
│ 74001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 73693, 73694, 73991, 73996, 74000]                                                                                                                          │
│ Average cumulative reward:       -10.382794442861247                                                                                                                     │
│ Average rollout reward:          -9.945346741943663                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K75/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m50.3%[0m Elapsed: [33m0:01:37[0m Remaining: [36m0:01:38[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 75000 ===                                                                                                                                                  │
│ 75001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 74738, 74981, 74984, 75000]                                                                                                                                 │
│ Average cumulative reward:       -10.727986781567614                                                                                                                     │
│ Average rollout reward:          -10.294342167735785                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
76/149 ━━━━━━━━━━━━━━━━━━━━╺━━━━━━━━━━━━━━━━━━━ 51.0% Elapsed: 0:01:38 Remaining: 0:01:37   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 76000 ===                                                                                                                                                  │
│ 76001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 75792, 75974, 75978, 75979, 76000]                                                                                                                          │
│ Average cumulative reward:       -10.278480566264745                                                                                                                     │
│ Average rollout reward:          -9.832181840398292                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
77/149 ━━━━━━━━━━━━━━━━━━━━╸━━━━━━━━━━━━━━━━━━━ 51.7% Elapsed: 0:01:40 Remaining: 0:01:35   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 77000 ===                                                                                                                                                  │
│ 77001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 76855, 76857, 76864, 76895, 77000]                                                                                                                          │
│ Average cumulative reward:       -10.536944088395218                                                                                                                     │
│ Average rollout reward:          -10.125064724341911                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
78/149 ━━━━━━━━━━━━━━━━━━━━╸━━━━━━━━━━━━━━━━━━━ 52.3% Elapsed: 0:01:41 Remaining: 0:01:34   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 78000 ===                                                                                                                                                  │
│ 78001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 77927, 77950, 77954, 77962, 78000]                                                                                                                          │
│ Average cumulative reward:       -10.595956849893964                                                                                                                     │
│ Average rollout reward:          -10.143476687453912                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
79/149 ━━━━━━━━━━━━━━━━━━━━━╺━━━━━━━━━━━━━━━━━━ 53.0% Elapsed: 0:01:42 Remaining: 0:01:33   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 79000 ===                                                                                                                                                  │
│ 79001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 286, 30362, 67569, 72178, 72358, 79000]                                                                                                                     │
│ Average cumulative reward:       -11.191116830608145                                                                                                                     │
│ Average rollout reward:          -10.738774417870713                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
80/149 ━━━━━━━━━━━━━━━━━━━━━╺━━━━━━━━━━━━━━━━━━ 53.7% Elapsed: 0:01:43 Remaining: 0:01:32   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 80000 ===                                                                                                                                                  │
│ 80001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 27897, 37302, 37305, 37308, 37316, 80000]                                                                                                                   │
│ Average cumulative reward:       -10.776130589013016                                                                                                                     │
│ Average rollout reward:          -10.363228806279155                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
81/149 ━━━━━━━━━━━━━━━━━━━━━╸━━━━━━━━━━━━━━━━━━ 54.4% Elapsed: 0:01:45 Remaining: 0:01:30   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 82000 ===                                                                                                                                                  │
│ 82001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 52887, 79914, 80643, 82000]                                                                                                                                 │
│ Average cumulative reward:       -10.252653307555246                                                                                                                     │
│ Average rollout reward:          -9.829287847768729                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K82/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m55.0%[0m Elapsed: [33m0:01:47[0m Remaining: [36m0:01:29[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 83000 ===                                                                                                                                                  │
│ 83001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 49541, 82982, 83000]                                                                                                                                        │
│ Average cumulative reward:       -10.76150061422097                                                                                                                      │
│ Average rollout reward:          -10.289401039318873                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K83/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m55.7%[0m Elapsed: [33m0:01:48[0m Remaining: [36m0:01:28[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 84000 ===                                                                                                                                                  │
│ 84001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 7519, 82970, 83072, 83881, 83904, 84000]                                                                                                                    │
│ Average cumulative reward:       -10.624590811849279                                                                                                                     │
│ Average rollout reward:          -10.198126187783911                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K84/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m56.4%[0m Elapsed: [33m0:01:49[0m Remaining: [36m0:01:27[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 85000 ===                                                                                                                                                  │
│ 85001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 84546, 84995, 84997, 85000]                                                                                                                                 │
│ Average cumulative reward:       -10.469288065668385                                                                                                                     │
│ Average rollout reward:          -10.03299796498982                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K85/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m57.0%[0m Elapsed: [33m0:01:51[0m Remaining: [36m0:01:26[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 86000 ===                                                                                                                                                  │
│ 86001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 85681, 85892, 85894, 85992, 86000]                                                                                                                          │
│ Average cumulative reward:       -10.388529231342988                                                                                                                     │
│ Average rollout reward:          -9.939949982214818                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K86/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m57.7%[0m Elapsed: [33m0:01:52[0m Remaining: [36m0:01:24[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 87000 ===                                                                                                                                                  │
│ 87001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 86824, 86825, 86836, 86990, 87000]                                                                                                                          │
│ Average cumulative reward:       -10.21595592184198                                                                                                                      │
│ Average rollout reward:          -9.784527027288874                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K87/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m58.4%[0m Elapsed: [33m0:01:53[0m Remaining: [36m0:01:23[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 88000 ===                                                                                                                                                  │
│ 88001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 87977, 88000]                                                                                                                                               │
│ Average cumulative reward:       -10.78656704830354                                                                                                                      │
│ Average rollout reward:          -10.312286661759854                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K88/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m59.1%[0m Elapsed: [33m0:01:54[0m Remaining: [36m0:01:22[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 88000 ===                                                                                                                                                  │
│ 88001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 87977, 88000]                                                                                                                                               │
│ Average cumulative reward:       -10.78656704830354                                                                                                                      │
│ Average rollout reward:          -10.312286661759854                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━━━━━━━━[0m [35m59.7%[0m Elapsed: [33m0:01:55[0m Remaining: [36m0:01:21[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 89000 ===                                                                                                                                                  │
│ 89001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 87977, 88256, 88258, 88881, 88889, 89000]                                                                                                                   │
│ Average cumulative reward:       -10.825663261200074                                                                                                                     │
│ Average rollout reward:          -10.401521884850597                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K90/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m60.4%[0m Elapsed: [33m0:01:57[0m Remaining: [36m0:01:19[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 90000 ===                                                                                                                                                  │
│ 90001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 89139, 89809, 89813, 89868, 90000]                                                                                                                          │
│ Average cumulative reward:       -10.458035222175598                                                                                                                     │
│ Average rollout reward:          -10.043365610492595                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K91/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m61.1%[0m Elapsed: [33m0:01:58[0m Remaining: [36m0:01:18[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 91000 ===                                                                                                                                                  │
│ 91001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 87977, 90944, 90949, 90961, 90968, 91000]                                                                                                                   │
│ Average cumulative reward:       -10.508762035864795                                                                                                                     │
│ Average rollout reward:          -10.066049013917791                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K92/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m61.7%[0m Elapsed: [33m0:01:59[0m Remaining: [36m0:01:16[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 92000 ===                                                                                                                                                  │
│ 92001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 89139, 91145, 91154, 91205, 91320, 92000]                                                                                                                   │
│ Average cumulative reward:       -10.458986842530743                                                                                                                     │
│ Average rollout reward:          -10.023443575885599                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K93/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m62.4%[0m Elapsed: [33m0:02:01[0m Remaining: [36m0:01:15[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 93000 ===                                                                                                                                                  │
│ 93001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 92680, 92703, 92707, 93000]                                                                                                                                 │
│ Average cumulative reward:       -10.164616738553487                                                                                                                     │
│ Average rollout reward:          -9.731606201093435                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K94/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m63.1%[0m Elapsed: [33m0:02:02[0m Remaining: [36m0:01:13[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 94000 ===                                                                                                                                                  │
│ 94001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 93879, 93935, 93938, 93942, 94000]                                                                                                                          │
│ Average cumulative reward:       -10.529217574184166                                                                                                                     │
│ Average rollout reward:          -10.102259035557962                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K95/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m63.8%[0m Elapsed: [33m0:02:03[0m Remaining: [36m0:01:12[0m   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 95000 ===                                                                                                                                                  │
│ 95001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6032, 72481, 75291, 80401, 92651, 95000]                                                                                                                    │
│ Average cumulative reward:       -9.839349438201264                                                                                                                      │
│ Average rollout reward:          -9.372427041592886                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K95/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m63.8%[0m Elapsed: [33m0:02:04[0m Remaining: [36m0:01:12[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 96000 ===                                                                                                                                                  │
│ 96001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6032, 95686, 95695, 95746, 95751, 95757, 96000]                                                                                                             │
│ Average cumulative reward:       -10.448021008860689                                                                                                                     │
│ Average rollout reward:          -10.014851304698615                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K96/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m64.4%[0m Elapsed: [33m0:02:05[0m Remaining: [36m0:01:10[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 97000 ===                                                                                                                                                  │
│ 97001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 3135, 17444, 17725, 24497, 97000]                                                                                                                           │
│ Average cumulative reward:       -10.23522454444375                                                                                                                      │
│ Average rollout reward:          -9.782816999108357                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K97/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━[0m [35m65.1%[0m Elapsed: [33m0:02:06[0m Remaining: [36m0:01:09[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 98000 ===                                                                                                                                                  │
│ 98001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 22959, 23114, 23118, 23126, 98000]                                                                                                                          │
│ Average cumulative reward:       -10.723284696568026                                                                                                                     │
│ Average rollout reward:          -10.24917993403776                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K98/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━[0m [35m65.8%[0m Elapsed: [33m0:02:08[0m Remaining: [36m0:01:08[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 99000 ===                                                                                                                                                  │
│ 99001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 98767, 98978, 98982, 98988, 99000]                                                                                                                          │
│ Average cumulative reward:       -10.466862701107557                                                                                                                     │
│ Average rollout reward:          -9.990144167978983                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K99/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━[0m [35m66.4%[0m Elapsed: [33m0:02:09[0m Remaining: [36m0:01:07[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 100000 ===                                                                                                                                                 │
│ 100001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 5148, 5180, 5183, 100000]                                                                                                                                   │
│ Average cumulative reward:       -10.691365825700716                                                                                                                     │
│ Average rollout reward:          -10.209356237332099                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K100/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━[0m [35m67.1%[0m Elapsed: [33m0:02:11[0m Remaining: [36m0:01:06[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 101000 ===                                                                                                                                                 │
│ 101001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 9509, 100976, 100980, 100997, 101000]                                                                                                                       │
│ Average cumulative reward:       -10.591150624984317                                                                                                                     │
│ Average rollout reward:          -10.166034007049285                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K101/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━[0m [35m67.8%[0m Elapsed: [33m0:02:12[0m Remaining: [36m0:01:04[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 102000 ===                                                                                                                                                 │
│ 102001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 9509, 101961, 101964, 101971, 101992, 102000]                                                                                                               │
│ Average cumulative reward:       -10.475284737094007                                                                                                                     │
│ Average rollout reward:          -10.046910582160555                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
103/149 ━━━━━━━━━━━━━━━━━━━━━━━━━━━╸━━━━━━━━━━━━ 69.1% Elapsed: 0:02:14 Remaining: 0:01:01   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 103000 ===                                                                                                                                                 │
│ 103001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 102531, 102537, 102542, 103000]                                                                                                                             │
│ Average cumulative reward:       -10.388821640151473                                                                                                                     │
│ Average rollout reward:          -9.955944998757287                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
104/149 ━━━━━━━━━━━━━━━━━━━━━━━━━━━╸━━━━━━━━━━━━ 69.8% Elapsed: 0:02:15 Remaining: 0:01:00   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 104000 ===                                                                                                                                                 │
│ 104001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 103805, 103828, 103831, 103967, 104000]                                                                                                                     │
│ Average cumulative reward:       -10.462341158495258                                                                                                                     │
│ Average rollout reward:          -10.022784778754605                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
105/149 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━╺━━━━━━━━━━━ 70.5% Elapsed: 0:02:17 Remaining: 0:00:59   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 105000 ===                                                                                                                                                 │
│ 105001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 7, 80372, 99956, 104997, 105000]                                                                                                                            │
│ Average cumulative reward:       -10.470307915522119                                                                                                                     │
│ Average rollout reward:          -10.006139896020601                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
106/149 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━╺━━━━━━━━━━━ 71.1% Elapsed: 0:02:18 Remaining: 0:00:57   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 106000 ===                                                                                                                                                 │
│ 106001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 5148, 35659, 35850, 40643, 106000]                                                                                                                          │
│ Average cumulative reward:       -10.476520352120364                                                                                                                     │
│ Average rollout reward:          -10.003302566347868                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
107/149 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━╸━━━━━━━━━━━ 71.8% Elapsed: 0:02:19 Remaining: 0:00:56   1.30 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 107000 ===                                                                                                                                                 │
│ 107001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 106380, 106990, 106992, 107000]                                                                                                                             │
│ Average cumulative reward:       -10.44651367097178                                                                                                                      │
│ Average rollout reward:          -10.008017051906492                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
108/149 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━╸━━━━━━━━━━━ 72.5% Elapsed: 0:02:21 Remaining: 0:00:55   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 108000 ===                                                                                                                                                 │
│ 108001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 107682, 107999, 108000]                                                                                                                                     │
│ Average cumulative reward:       -10.420082486005185                                                                                                                     │
│ Average rollout reward:          -9.98220298869953                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
108/149 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━╸━━━━━━━━━━━ 72.5% Elapsed: 0:02:22 Remaining: 0:00:55   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 108000 ===                                                                                                                                                 │
│ 108001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 107682, 107999, 108000]                                                                                                                                     │
│ Average cumulative reward:       -10.420082486005185                                                                                                                     │
│ Average rollout reward:          -9.98220298869953                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K109/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━[0m [35m73.2%[0m Elapsed: [33m0:02:22[0m Remaining: [36m0:00:54[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 109000 ===                                                                                                                                                 │
│ 109001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 11106, 73245, 73403, 73409, 109000]                                                                                                                         │
│ Average cumulative reward:       -10.55910931622705                                                                                                                      │
│ Average rollout reward:          -10.06978371441687                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
109/149 73.2% Elapsed: 0:02:23 Remaining: 0:00:54 1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 110000 ===                                                                                                                                                 │
│ 110001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 89139, 109589, 109777, 110000]                                                                                                                              │
│ Average cumulative reward:       -10.479232071237332                                                                                                                     │
│ Average rollout reward:          -10.022045576359922                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
110/149 73.8% Elapsed: 0:02:24 Remaining: 0:00:52 1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 111000 ===                                                                                                                                                 │
│ 111001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 110314, 110763, 110969, 110993, 111000]                                                                                                                     │
│ Average cumulative reward:       -10.28858446590272                                                                                                                      │
│ Average rollout reward:          -9.886038302269121                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
111/149 74.5% Elapsed: 0:02:25 Remaining: 0:00:50 1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 112000 ===                                                                                                                                                 │
│ 112001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 111645, 111647, 111660, 112000]                                                                                                                             │
│ Average cumulative reward:       -10.73511826607876                                                                                                                      │
│ Average rollout reward:          -10.332851455017044                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
112/149 75.2% Elapsed: 0:02:26 Remaining: 0:00:49 1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 113000 ===                                                                                                                                                 │
│ 113001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 112985, 112987, 113000]                                                                                                                                     │
│ Average cumulative reward:       -10.3674791111249                                                                                                                       │
│ Average rollout reward:          -9.907769747620351                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
113/149 75.8% Elapsed: 0:02:28 Remaining: 0:00:48 1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 114000 ===                                                                                                                                                 │
│ 114001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 4159, 4248, 113965, 114000]                                                                                                                                 │
│ Average cumulative reward:       -10.828077933419324                                                                                                                     │
│ Average rollout reward:          -10.405599153667186                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
114/149 76.5% Elapsed: 0:02:29 Remaining: 0:00:47 1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 115000 ===                                                                                                                                                 │
│ 115001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 4159, 4230, 114235, 114252, 115000]                                                                                                                         │
│ Average cumulative reward:       -10.222422120983214                                                                                                                     │
│ Average rollout reward:          -9.827494978568753                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K116/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━[0m [35m77.9%[0m Elapsed: [33m0:02:31[0m Remaining: [36m0:00:44[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 116000 ===                                                                                                                                                 │
│ 116001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 115694, 115973, 115977, 115999, 116000]                                                                                                                     │
│ Average cumulative reward:       -10.35586867755562                                                                                                                      │
│ Average rollout reward:          -9.91805115493423                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K117/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━[0m [35m78.5%[0m Elapsed: [33m0:02:32[0m Remaining: [36m0:00:43[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 117000 ===                                                                                                                                                 │
│ 117001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 4159, 116499, 116501, 116532, 116543, 116835, 117000]                                                                                                       │
│ Average cumulative reward:       -10.638860693864842                                                                                                                     │
│ Average rollout reward:          -10.172596091566234                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K118/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━[0m [35m79.2%[0m Elapsed: [33m0:02:34[0m Remaining: [36m0:00:42[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 118000 ===                                                                                                                                                 │
│ 118001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 5148, 5237, 40803, 40805, 118000]                                                                                                                           │
│ Average cumulative reward:       -10.888491504092938                                                                                                                     │
│ Average rollout reward:          -10.427774436262949                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K119/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━[0m [35m79.9%[0m Elapsed: [33m0:02:35[0m Remaining: [36m0:00:40[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 119000 ===                                                                                                                                                 │
│ 119001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 118442, 118443, 118454, 118679, 118680, 119000]                                                                                                             │
│ Average cumulative reward:       -10.730533169519767                                                                                                                     │
│ Average rollout reward:          -10.296875022559568                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K120/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━[0m [35m80.5%[0m Elapsed: [33m0:02:37[0m Remaining: [36m0:00:39[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 120000 ===                                                                                                                                                 │
│ 120001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 119830, 119985, 119988, 119991, 119997, 120000]                                                                                                             │
│ Average cumulative reward:       -10.344916667692582                                                                                                                     │
│ Average rollout reward:          -9.890855740779726                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K121/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━[0m [35m81.2%[0m Elapsed: [33m0:02:38[0m Remaining: [36m0:00:38[0m   1.31 s/iter
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K122/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━[0m [35m81.9%[0m Elapsed: [33m0:02:39[0m Remaining: [36m0:00:36[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 122000 ===                                                                                                                                                 │
│ 122001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 121228, 121962, 121966, 121968, 121970, 122000]                                                                                                             │
│ Average cumulative reward:       -10.373391653129913                                                                                                                     │
│ Average rollout reward:          -9.93952630417996                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K122/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━[0m [35m81.9%[0m Elapsed: [33m0:02:40[0m Remaining: [36m0:00:36[0m   1.31 s/iter
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K123/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━[0m [35m82.6%[0m Elapsed: [33m0:02:40[0m Remaining: [36m0:00:35[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 123000 ===                                                                                                                                                 │
│ 123001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 122636, 122994, 122997, 123000]                                                                                                                             │
│ Average cumulative reward:       -10.61374252271736                                                                                                                      │
│ Average rollout reward:          -10.148446485608035                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K123/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━[0m [35m82.6%[0m Elapsed: [33m0:02:41[0m Remaining: [36m0:00:35[0m   1.31 s/iter
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K123/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━[0m [35m82.6%[0m Elapsed: [33m0:02:41[0m Remaining: [36m0:00:35[0m   1.32 s/iter
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K124/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━[0m [35m83.2%[0m Elapsed: [33m0:02:42[0m Remaining: [36m0:00:34[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 124000 ===                                                                                                                                                 │
│ 124001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 119830, 123927, 123931, 123942, 123950, 123963, 123998, 124000]                                                                                             │
│ Average cumulative reward:       -10.715329315502629                                                                                                                     │
│ Average rollout reward:          -10.222207543593136                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K124/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━[0m [35m83.2%[0m Elapsed: [33m0:02:42[0m Remaining: [36m0:00:34[0m   1.31 s/iter
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K124/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━[0m [35m83.2%[0m Elapsed: [33m0:02:43[0m Remaining: [36m0:00:34[0m   1.32 s/iter
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K125/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━[0m [35m83.9%[0m Elapsed: [33m0:02:43[0m Remaining: [36m0:00:32[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 125000 ===                                                                                                                                                 │
│ 125001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 124054, 125000]                                                                                                                                             │
│ Average cumulative reward:       -10.545607509766716                                                                                                                     │
│ Average rollout reward:          -10.130642389628393                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K125/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━[0m [35m83.9%[0m Elapsed: [33m0:02:44[0m Remaining: [36m0:00:32[0m   1.31 s/iter
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K126/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━[0m [35m84.6%[0m Elapsed: [33m0:02:44[0m Remaining: [36m0:00:31[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 126000 ===                                                                                                                                                 │
│ 126001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 125481, 125980, 125982, 126000]                                                                                                                             │
│ Average cumulative reward:       -10.740720077357787                                                                                                                     │
│ Average rollout reward:          -10.311111920652877                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K126/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━[0m [35m84.6%[0m Elapsed: [33m0:02:45[0m Remaining: [36m0:00:31[0m   1.31 s/iter
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K126/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━[0m [35m84.6%[0m Elapsed: [33m0:02:45[0m Remaining: [36m0:00:31[0m   1.32 s/iter
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K127/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━[0m [35m85.2%[0m Elapsed: [33m0:02:46[0m Remaining: [36m0:00:29[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 127000 ===                                                                                                                                                 │
│ 127001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 126918, 126919, 126947, 127000]                                                                                                                             │
│ Average cumulative reward:       -10.539778243287353                                                                                                                     │
│ Average rollout reward:          -10.113274076555873                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K127/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━[0m [35m85.2%[0m Elapsed: [33m0:02:47[0m Remaining: [36m0:00:29[0m   1.32 s/iter
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K128/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━[0m [35m85.9%[0m Elapsed: [33m0:02:47[0m Remaining: [36m0:00:28[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 128000 ===                                                                                                                                                 │
│ 128001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 124054, 124412, 124416, 127829, 128000]                                                                                                                     │
│ Average cumulative reward:       -10.99491173682685                                                                                                                      │
│ Average rollout reward:          -10.524278180398944                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K128/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━[0m [35m85.9%[0m Elapsed: [33m0:02:48[0m Remaining: [36m0:00:28[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 128000 ===                                                                                                                                                 │
│ 128001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 124054, 124412, 124416, 127829, 128000]                                                                                                                     │
│ Average cumulative reward:       -10.99491173682685                                                                                                                      │
│ Average rollout reward:          -10.524278180398944                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K129/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━[0m [35m86.6%[0m Elapsed: [33m0:02:48[0m Remaining: [36m0:00:27[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 129000 ===                                                                                                                                                 │
│ 129001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 121228, 121244, 121246, 122305, 122473, 122491, 129000]                                                                                                     │
│ Average cumulative reward:       -10.80109062107434                                                                                                                      │
│ Average rollout reward:          -10.343486825395921                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K130/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━[0m [35m87.2%[0m Elapsed: [33m0:02:50[0m Remaining: [36m0:00:26[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 130000 ===                                                                                                                                                 │
│ 130001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 189, 129990, 129994, 130000]                                                                                                                                │
│ Average cumulative reward:       -10.422561041542574                                                                                                                     │
│ Average rollout reward:          -9.975827371120879                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K131/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━[0m [35m87.9%[0m Elapsed: [33m0:02:51[0m Remaining: [36m0:00:24[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 131000 ===                                                                                                                                                 │
│ 131001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 26750, 130977, 130979, 130984, 131000]                                                                                                                      │
│ Average cumulative reward:       -10.516955023311139                                                                                                                     │
│ Average rollout reward:          -10.108678450741124                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K132/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━[0m [35m88.6%[0m Elapsed: [33m0:02:52[0m Remaining: [36m0:00:23[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 132000 ===                                                                                                                                                 │
│ 132001  nodes in tree                                                                                                                                                    │
│ Path: [0, 2, 28482, 131879, 131883, 131893, 132000]                                                                                                                      │
│ Average cumulative reward:       -10.46457824711658                                                                                                                      │
│ Average rollout reward:          -10.065360368330518                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K133/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━[0m [35m89.3%[0m Elapsed: [33m0:02:54[0m Remaining: [36m0:00:22[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 133000 ===                                                                                                                                                 │
│ 133001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 132977, 133000]                                                                                                                                             │
│ Average cumulative reward:       -10.58580227674672                                                                                                                      │
│ Average rollout reward:          -10.196821854794932                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0489525171491962                                                                                                                             │
│ Best path: [0, 2, 15974, 16082, 16086, 16113, 16154, 16640, 17156]                                                                                                       │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K134/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━[0m [35m89.9%[0m Elapsed: [33m0:02:55[0m Remaining: [36m0:00:20[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 134000 ===                                                                                                                                                 │
│ 134001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 133182, 133183, 133189, 133193, 134000]                                                                                                                     │
│ Average cumulative reward:       -11.548324287011356                                                                                                                     │
│ Average rollout reward:          -11.3058363016552                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.970568551497757                                                                                                                              │
│ Best path: [0, 4, 133524, 133664, 133678, 133724, 133725, 133770]                                                                                                        │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯;237m━━━[0m [35m90.6%[0m Elapsed: [33m0:02:56[0m Remaining: [36m0:00:19[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 135000 ===                                                                                                                                                 │
│ 135001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 134992, 134998, 135000]                                                                                                                                     │
│ Average cumulative reward:       -12.05994514341613                                                                                                                      │
│ Average rollout reward:          -11.767310141538099                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.970568551497757                                                                                                                              │
│ Best path: [0, 4, 133524, 133664, 133678, 133724, 133725, 133770]                                                                                                        │
136/149 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╸━━━ 91.3% Elapsed: 0:02:58 Remaining: 0:00:18   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 136000 ===                                                                                                                                                 │
│ 136001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 134992, 135035, 135038, 135057, 135261, 135294, 135987, 136000]                                                                                             │
│ Average cumulative reward:       -11.984374833890428                                                                                                                     │
│ Average rollout reward:          -11.681006186781179                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.970568551497757                                                                                                                              │
│ Best path: [0, 4, 133524, 133664, 133678, 133724, 133725, 133770]                                                                                                        │
│ [-2.70147069 -2.70147069 -2.70147069 -2.70147069 -2.43237282 -2.43237282                                                                                                 │
│  -2.43237282]                                                                                                                                                            │
137/149 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╸━━━ 91.9% Elapsed: 0:02:59 Remaining: 0:00:17   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 137000 ===                                                                                                                                                 │
│ 137001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 136418, 136795, 136797, 136832, 136998, 137000]                                                                                                             │
│ Average cumulative reward:       -11.184211355049973                                                                                                                     │
│ Average rollout reward:          -10.860424402357104                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K137/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━[0m [35m91.9%[0m Elapsed: [33m0:03:00[0m Remaining: [36m0:00:17[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 137000 ===                                                                                                                                                 │
│ 137001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 136418, 136795, 136797, 136832, 136998, 137000]                                                                                                             │
│ Average cumulative reward:       -11.184211355049973                                                                                                                     │
│ Average rollout reward:          -10.860424402357104                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K138/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m92.6%[0m Elapsed: [33m0:03:00[0m Remaining: [36m0:00:15[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 138000 ===                                                                                                                                                 │
│ 138001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 137901, 137904, 137909, 137949, 137994, 138000]                                                                                                             │
│ Average cumulative reward:       -11.398822632433927                                                                                                                     │
│ Average rollout reward:          -11.035573720928069                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K138/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m92.6%[0m Elapsed: [33m0:03:01[0m Remaining: [36m0:00:15[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 138000 ===                                                                                                                                                 │
│ 138001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 137901, 137904, 137909, 137949, 137994, 138000]                                                                                                             │
│ Average cumulative reward:       -11.398822632433927                                                                                                                     │
│ Average rollout reward:          -11.035573720928069                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K138/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m92.6%[0m Elapsed: [33m0:03:01[0m Remaining: [36m0:00:15[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 138000 ===                                                                                                                                                 │
│ 138001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 137901, 137904, 137909, 137949, 137994, 138000]                                                                                                             │
│ Average cumulative reward:       -11.398822632433927                                                                                                                     │
│ Average rollout reward:          -11.035573720928069                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K139/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m93.3%[0m Elapsed: [33m0:03:02[0m Remaining: [36m0:00:14[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 139000 ===                                                                                                                                                 │
│ 139001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 138322, 138477, 138481, 138482, 138760, 139000]                                                                                                             │
│ Average cumulative reward:       -11.52452788132209                                                                                                                      │
│ Average rollout reward:          -11.159318183804851                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K139/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m93.3%[0m Elapsed: [33m0:03:02[0m Remaining: [36m0:00:14[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 139000 ===                                                                                                                                                 │
│ 139001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 138322, 138477, 138481, 138482, 138760, 139000]                                                                                                             │
│ Average cumulative reward:       -11.52452788132209                                                                                                                      │
│ Average rollout reward:          -11.159318183804851                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K139/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m93.3%[0m Elapsed: [33m0:03:03[0m Remaining: [36m0:00:14[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 139000 ===                                                                                                                                                 │
│ 139001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 138322, 138477, 138481, 138482, 138760, 139000]                                                                                                             │
│ Average cumulative reward:       -11.52452788132209                                                                                                                      │
│ Average rollout reward:          -11.159318183804851                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K140/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━[0m [35m94.0%[0m Elapsed: [33m0:03:04[0m Remaining: [36m0:00:13[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 140000 ===                                                                                                                                                 │
│ 140001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 139227, 139250, 139701, 139948, 140000]                                                                                                                     │
│ Average cumulative reward:       -12.002817506416848                                                                                                                     │
│ Average rollout reward:          -11.621057021633247                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K140/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━[0m [35m94.0%[0m Elapsed: [33m0:03:04[0m Remaining: [36m0:00:13[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 140000 ===                                                                                                                                                 │
│ 140001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 139227, 139250, 139701, 139948, 140000]                                                                                                                     │
│ Average cumulative reward:       -12.002817506416848                                                                                                                     │
│ Average rollout reward:          -11.621057021633247                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K140/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━[0m [35m94.0%[0m Elapsed: [33m0:03:05[0m Remaining: [36m0:00:13[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 140000 ===                                                                                                                                                 │
│ 140001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 139227, 139250, 139701, 139948, 140000]                                                                                                                     │
│ Average cumulative reward:       -12.002817506416848                                                                                                                     │
│ Average rollout reward:          -11.621057021633247                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K141/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━[0m [35m94.6%[0m Elapsed: [33m0:03:05[0m Remaining: [36m0:00:11[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 141000 ===                                                                                                                                                 │
│ 141001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 140480, 140985, 140987, 141000]                                                                                                                             │
│ Average cumulative reward:       -12.870523954987894                                                                                                                     │
│ Average rollout reward:          -12.5132006140439                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K141/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━[0m [35m94.6%[0m Elapsed: [33m0:03:06[0m Remaining: [36m0:00:11[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 141000 ===                                                                                                                                                 │
│ 141001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 140480, 140985, 140987, 141000]                                                                                                                             │
│ Average cumulative reward:       -12.870523954987894                                                                                                                     │
│ Average rollout reward:          -12.5132006140439                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K142/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━[0m [35m95.3%[0m Elapsed: [33m0:03:06[0m Remaining: [36m0:00:10[0m   1.31 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 142000 ===                                                                                                                                                 │
│ 142001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 141873, 141929, 141931, 142000]                                                                                                                             │
│ Average cumulative reward:       -11.569196790149025                                                                                                                     │
│ Average rollout reward:          -11.223432981440206                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯;5;237m━[0m [35m95.3%[0m Elapsed: [33m0:03:07[0m Remaining: [36m0:00:10[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 142000 ===                                                                                                                                                 │
│ 142001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 141873, 141929, 141931, 142000]                                                                                                                             │
│ Average cumulative reward:       -11.569196790149025                                                                                                                     │
│ Average rollout reward:          -11.223432981440206                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K142/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━[0m [35m95.3%[0m Elapsed: [33m0:03:07[0m Remaining: [36m0:00:10[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 142000 ===                                                                                                                                                 │
│ 142001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 141873, 141929, 141931, 142000]                                                                                                                             │
│ Average cumulative reward:       -11.569196790149025                                                                                                                     │
│ Average rollout reward:          -11.223432981440206                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K143/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━[0m [35m96.0%[0m Elapsed: [33m0:03:08[0m Remaining: [36m0:00:09[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 143000 ===                                                                                                                                                 │
│ 143001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 142778, 142933, 142935, 142977, 142984, 143000]                                                                                                             │
│ Average cumulative reward:       -11.289641567511715                                                                                                                     │
│ Average rollout reward:          -10.936022051222237                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K143/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━[0m [35m96.0%[0m Elapsed: [33m0:03:08[0m Remaining: [36m0:00:09[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 143000 ===                                                                                                                                                 │
│ 143001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 142778, 142933, 142935, 142977, 142984, 143000]                                                                                                             │
│ Average cumulative reward:       -11.289641567511715                                                                                                                     │
│ Average rollout reward:          -10.936022051222237                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K143/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━[0m [35m96.0%[0m Elapsed: [33m0:03:09[0m Remaining: [36m0:00:09[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 143000 ===                                                                                                                                                 │
│ 143001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 142778, 142933, 142935, 142977, 142984, 143000]                                                                                                             │
│ Average cumulative reward:       -11.289641567511715                                                                                                                     │
│ Average rollout reward:          -10.936022051222237                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K144/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m96.6%[0m Elapsed: [33m0:03:09[0m Remaining: [36m0:00:07[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 144000 ===                                                                                                                                                 │
│ 144001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 136418, 143198, 143202, 143960, 144000]                                                                                                                     │
│ Average cumulative reward:       -11.47843393467202                                                                                                                      │
│ Average rollout reward:          -11.124064149100667                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K144/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m96.6%[0m Elapsed: [33m0:03:10[0m Remaining: [36m0:00:07[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 144000 ===                                                                                                                                                 │
│ 144001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 136418, 143198, 143202, 143960, 144000]                                                                                                                     │
│ Average cumulative reward:       -11.47843393467202                                                                                                                      │
│ Average rollout reward:          -11.124064149100667                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K144/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m96.6%[0m Elapsed: [33m0:03:10[0m Remaining: [36m0:00:07[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 144000 ===                                                                                                                                                 │
│ 144001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 136418, 143198, 143202, 143960, 144000]                                                                                                                     │
│ Average cumulative reward:       -11.47843393467202                                                                                                                      │
│ Average rollout reward:          -11.124064149100667                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K145/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m97.3%[0m Elapsed: [33m0:03:11[0m Remaining: [36m0:00:06[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 145000 ===                                                                                                                                                 │
│ 145001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 132981, 133001, 133296, 137838, 138835, 145000]                                                                                                             │
│ Average cumulative reward:       -11.810073329936808                                                                                                                     │
│ Average rollout reward:          -11.433548049183166                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K145/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m97.3%[0m Elapsed: [33m0:03:11[0m Remaining: [36m0:00:06[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 145000 ===                                                                                                                                                 │
│ 145001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 132981, 133001, 133296, 137838, 138835, 145000]                                                                                                             │
│ Average cumulative reward:       -11.810073329936808                                                                                                                     │
│ Average rollout reward:          -11.433548049183166                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K145/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m97.3%[0m Elapsed: [33m0:03:12[0m Remaining: [36m0:00:06[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 145000 ===                                                                                                                                                 │
│ 145001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 132981, 133001, 133296, 137838, 138835, 145000]                                                                                                             │
│ Average cumulative reward:       -11.810073329936808                                                                                                                     │
│ Average rollout reward:          -11.433548049183166                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K146/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m [35m98.0%[0m Elapsed: [33m0:03:12[0m Remaining: [36m0:00:05[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 146000 ===                                                                                                                                                 │
│ 146001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 145813, 145829, 145833, 145984, 145994, 146000]                                                                                                             │
│ Average cumulative reward:       -11.648371180491571                                                                                                                     │
│ Average rollout reward:          -11.279366558781996                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K146/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m [35m98.0%[0m Elapsed: [33m0:03:13[0m Remaining: [36m0:00:05[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 146000 ===                                                                                                                                                 │
│ 146001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 145813, 145829, 145833, 145984, 145994, 146000]                                                                                                             │
│ Average cumulative reward:       -11.648371180491571                                                                                                                     │
│ Average rollout reward:          -11.279366558781996                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K147/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m [35m98.7%[0m Elapsed: [33m0:03:13[0m Remaining: [36m0:00:03[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 147000 ===                                                                                                                                                 │
│ 147001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 142778, 142794, 142796, 142800, 142833, 147000]                                                                                                             │
│ Average cumulative reward:       -11.288141825447267                                                                                                                     │
│ Average rollout reward:          -10.915262405917069                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K147/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m [35m98.7%[0m Elapsed: [33m0:03:14[0m Remaining: [36m0:00:03[0m   1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 147000 ===                                                                                                                                                 │
│ 147001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 142778, 142794, 142796, 142800, 142833, 147000]                                                                                                             │
│ Average cumulative reward:       -11.288141825447267                                                                                                                     │
│ Average rollout reward:          -10.915262405917069                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
147/149  98.7%  Elapsed: 0:03:14  Remaining: 0:00:03  1.32 s/iter
148/149  99.3%  Elapsed: 0:03:15  Remaining: 0:00:02  1.32 s/iter
148/149  99.3%  Elapsed: 0:03:16  Remaining: 0:00:02  1.33 s/iter
149/149  100.0%  Elapsed: 0:03:16  Remaining: 0:00:00  1.32 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 148000 ===                                                                                                                                                 │
│ 148001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 136418, 141522, 145725, 145806, 148000]                                                                                                                     │
│ Average cumulative reward:       -11.089316079208636                                                                                                                     │
│ Average rollout reward:          -10.722498984332328                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.7014706857269033                                                                                                                             │
│ Best path: [0, 4, 136418, 136474, 136476, 136499, 136505]                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
Node 0 is not terminal. Continue.
Node 2 is not terminal. Continue.
Node 3626 is not terminal. Continue.
Node 3636 is not terminal. Continue.
Node 3648 is not terminal. Continue.
Node 3668 is not terminal. Continue.
Node 3672 is not terminal. Continue.
Node 3847 is not terminal. Continue.
Node 3862 is not terminal. Continue.
Node 30556 is not terminal. Continue.
Node 46660 is not terminal. Continue.
No children found. Stop.
Node 0 is not terminal. Continue.
Node 2 is not terminal. Continue.
Node 3135 is not terminal. Continue.
Node 3191 is not terminal. Continue.
Node 3194 is not terminal. Continue.
Node 3200 is not terminal. Continue.
Node 6394 is not terminal. Continue.
No children found. Stop.
Node 0 is not terminal. Continue.
Node 4 is not terminal. Continue.
Node 135649 is not terminal. Continue.
Node 135964 is not terminal. Continue.
Node 135968 is not terminal. Continue.
Node 135980 is not terminal. Continue.
Node 137218 is not terminal. Continue.
Node 137660 is not terminal. Continue.
Node 139184 is not terminal. Continue.
Node 139980 is not terminal. Continue.
No children found. Stop.
=== RESULT ===
By Visits: estimated reward: -11.726984732351337
sqrt_nsv [4.027697  2.5867393]
sqrt_visser_coupled [4.7515054 1.2070429]
sqrt_visser_coupled [1.446072  4.3715467]
By Value: estimated reward: -5.652713790050805
sqrt_nsv [2.9635127 1.011607 ]
By Best Value: estimated reward: 0
sqrt_visser_coupled [3.6766243 0.7222425]
sqrt_nsv [4.76178   2.0180454]
sqrt_nsv [3, 1]
sqrt_nsv [3, 1]
sqrt_nsv [3, 1]
sqrt_nsv [3, 1]
sqrt_nsv [3, 1]
sqrt_nsv [3, 1]
Best value of root node:
-2.7014706857269033
Best root policy:
sqrt_visser_coupled [3.6766243 0.7222425]
sqrt_nsv [4.76178   2.0180454]
sqrt_nsv [3, 1]
sqrt_nsv [3, 1]
sqrt_nsv [3, 1]
sqrt_nsv [3, 1]
sqrt_nsv [3, 1]
sqrt_nsv [3, 1]
=== END ===
Finished making algorithm
