Files already downloaded and verified
Matrix distribution: CIFAR
Matrix distribution config: {'c': 0.25, 'd': 5000, 'eps': 0.001}
Initial matrix shape: torch.Size([3072, 3072])
Algorithm name: mcts
Algorithm config: {'c_ucb': 5.0, 'alpha_pw': 0.4, 'epsilon': 1e-11, 'EXPLORE_K': 5, 'early_termination_epsilon': 1e-05, 'budget': 150000, 'print_every': 1000, 'max_termination_count': 10, 'tree_initial_capacity': 10000, 'device': 'cuda', 'actions': [['sqrt_db', [[0, 0], [50, 50]]], ['sqrt_nsv', [[0, 0], [5, 5]]], ['sqrt_visser', [[0, 0], [10, 10]]], ['sqrt_visser_coupled', [[0, 0], [10, 10]]], ['sqrt_couple', None]], 'initialize_with_baselines': True}
Actions: ['sqrt_couple', 'sqrt_db', 'sqrt_nsv', 'sqrt_visser', 'sqrt_visser_coupled']
Action sqrt_couple took 1.0 times longer than sqrt_couple
Action sqrt_db took 1.8090690666417935 times longer than sqrt_couple
Action sqrt_nsv took 0.3491496612380155 times longer than sqrt_couple
Action sqrt_visser took 0.13452366472988944 times longer than sqrt_couple
Action sqrt_visser_coupled took 0.2672588939912547 times longer than sqrt_couple
Skipping sign_newton because not all actions are in the tree
Skipping sign_scaled_newton because not all actions are in the tree
Skipping sign_ns because not all actions are in the tree
Skipping sign_scaled_ns because not all actions are in the tree
Skipping sign_newton_variant because not all actions are in the tree
Skipping sign_halley because not all actions are in the tree
Skipping inv_ns because not all actions are in the tree
Skipping inv_ns_chebyshev because not all actions are in the tree
Skipping sqrt_newton because not all actions are in the tree
Skipping sqrt_newton_coupled because not all actions are in the tree
Skipping proot_newton because not all actions are in the tree
Skipping proot_visser because not all actions are in the tree
Skipping proot_iannazzo because not all actions are in the tree
[?25l/home/sykim/code/make_algorithm/losses.py:39: RuntimeWarning: overflow encountered in multiply
  loss = np.linalg.norm(x * x - y) / np.linalg.norm(y)
[2K/home/sykim/code/make_algorithm/actions.py:878: RuntimeWarning: overflow encountered in multiply
  intermediate = a0 - a1 * Y * Z
[2K/home/sykim/code/make_algorithm/actions.py:879: RuntimeWarning: overflow encountered in multiply
  Yn = 0.5 * Y * intermediate
[2K/home/sykim/code/make_algorithm/actions.py:880: RuntimeWarning: overflow encountered in multiply
  Zn = 0.5 * Z * intermediate
[2K0/149 [38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m0.0%[0m Elapsed: [33m0:00:00[0m Remaining: [36m-:--:--[0m 501470.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 0 ===                                                                                                                                                      │
│ 1  nodes in tree                                                                                                                                                         │
│ [-16.2816216 -16.2816216]                                                                                                                                                │
│ [-4.5389456 -4.5389456]                                                                                                                                                  │
│ [-4.45705483 -4.45705483]                                                                                                                                                │
│ [-3.75875551 -3.75875551 -3.75875551]                                                                                                                                    │
│ [-3.40960585 -3.40960585 -3.40960585]                                                                                                                                    │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K0/149 [38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m0.0%[0m Elapsed: [33m0:00:01[0m Remaining: [36m-:--:--[0m 1004438.34 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 0 ===                                                                                                                                                      │
│ 1  nodes in tree                                                                                                                                                         │
│ [-16.2816216 -16.2816216]                                                                                                                                                │
│ [-4.5389456 -4.5389456]                                                                                                                                                  │
│ [-4.45705483 -4.45705483]                                                                                                                                                │
│ [-3.75875551 -3.75875551 -3.75875551]                                                                                                                                    │
│ [-3.40960585 -3.40960585 -3.40960585]                                                                                                                                    │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K1/149 [38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m0.7%[0m Elapsed: [33m0:00:01[0m Remaining: [36m-:--:--[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 1000 ===                                                                                                                                                   │
│ 1001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 975, 1000]                                                                                                                                                  │
│ Average cumulative reward:       -12.019686433510692                                                                                                                     │
│ Average rollout reward:          -11.807987190921358                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.409605845133395                                                                                                                              │
│ Best path: [0, 4, 60]                                                                                                                                                    │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K1/149 [38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m0.7%[0m Elapsed: [33m0:00:02[0m Remaining: [36m-:--:--[0m   2.01 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 1000 ===                                                                                                                                                   │
│ 1001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 975, 1000]                                                                                                                                                  │
│ Average cumulative reward:       -12.019686433510692                                                                                                                     │
│ Average rollout reward:          -11.807987190921358                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.409605845133395                                                                                                                              │
│ Best path: [0, 4, 60]                                                                                                                                                    │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K1/149 [38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m0.7%[0m Elapsed: [33m0:00:02[0m Remaining: [36m-:--:--[0m   2.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 1000 ===                                                                                                                                                   │
│ 1001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 975, 1000]                                                                                                                                                  │
│ Average cumulative reward:       -12.019686433510692                                                                                                                     │
│ Average rollout reward:          -11.807987190921358                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.409605845133395                                                                                                                              │
│ Best path: [0, 4, 60]                                                                                                                                                    │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K2/149 [38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m1.3%[0m Elapsed: [33m0:00:03[0m Remaining: [36m0:03:18[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 2000 ===                                                                                                                                                   │
│ 2001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 1811, 1868, 1872, 2000]                                                                                                                                     │
│ Average cumulative reward:       -12.352836458637295                                                                                                                     │
│ Average rollout reward:          -12.059557767201905                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.409605845133395                                                                                                                              │
│ Best path: [0, 4, 60]                                                                                                                                                    │
│ [-3.40960585 -3.40960585 -3.40960585 -3.40960585 -3.14234695 -3.14234695                                                                                                 │
│  -3.14234695 -2.79319729]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K2/149 [38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m1.3%[0m Elapsed: [33m0:00:03[0m Remaining: [36m0:03:18[0m   1.76 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 2000 ===                                                                                                                                                   │
│ 2001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 1811, 1868, 1872, 2000]                                                                                                                                     │
│ Average cumulative reward:       -12.352836458637295                                                                                                                     │
│ Average rollout reward:          -12.059557767201905                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.409605845133395                                                                                                                              │
│ Best path: [0, 4, 60]                                                                                                                                                    │
│ [-3.40960585 -3.40960585 -3.40960585 -3.40960585 -3.14234695 -3.14234695                                                                                                 │
│  -3.14234695 -2.79319729]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━[0m [35m1.3%[0m Elapsed: [33m0:00:04[0m Remaining: [36m0:03:18[0m   2.01 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 2000 ===                                                                                                                                                   │
│ 2001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 1811, 1868, 1872, 2000]                                                                                                                                     │
│ Average cumulative reward:       -12.352836458637295                                                                                                                     │
│ Average rollout reward:          -12.059557767201905                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.409605845133395                                                                                                                              │
│ Best path: [0, 4, 60]                                                                                                                                                    │
│ [-3.40960585 -3.40960585 -3.40960585 -3.40960585 -3.14234695 -3.14234695                                                                                                 │
│  -3.14234695 -2.79319729]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K3/149 [38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m2.0%[0m Elapsed: [33m0:00:04[0m Remaining: [36m0:03:22[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 3000 ===                                                                                                                                                   │
│ 3001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 2546, 2603, 2605, 2646, 2675, 2681, 3000]                                                                                                                   │
│ Average cumulative reward:       -13.576984283962252                                                                                                                     │
│ Average rollout reward:          -13.25964549009921                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.4096058451333944                                                                                                                             │
│ Best path: [0, 4, 1309, 1310, 1315, 1329, 1364, 2150]                                                                                                                    │
│ [-3.32771508 -3.32771508 -3.32771508 -3.32771508 -3.06045618 -3.06045618]                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K3/149 [38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m2.0%[0m Elapsed: [33m0:00:05[0m Remaining: [36m0:03:22[0m   1.68 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 3000 ===                                                                                                                                                   │
│ 3001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 2546, 2603, 2605, 2646, 2675, 2681, 3000]                                                                                                                   │
│ Average cumulative reward:       -13.576984283962252                                                                                                                     │
│ Average rollout reward:          -13.25964549009921                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.4096058451333944                                                                                                                             │
│ Best path: [0, 4, 1309, 1310, 1315, 1329, 1364, 2150]                                                                                                                    │
│ [-3.32771508 -3.32771508 -3.32771508 -3.32771508 -3.06045618 -3.06045618]                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K4/149 [38;2;249;38;114m━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m2.7%[0m Elapsed: [33m0:00:05[0m Remaining: [36m0:03:20[0m   1.38 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 4000 ===                                                                                                                                                   │
│ 4001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 975, 981, 984, 3985, 3988, 4000]                                                                                                                            │
│ Average cumulative reward:       -12.572677020517945                                                                                                                     │
│ Average rollout reward:          -12.221930257629051                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.3277150778866336                                                                                                                             │
│ Best path: [0, 4, 975, 981, 984, 1400, 3938, 3948, 3959]                                                                                                                 │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K4/149 [38;2;249;38;114m━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m2.7%[0m Elapsed: [33m0:00:06[0m Remaining: [36m0:03:20[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 4000 ===                                                                                                                                                   │
│ 4001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 975, 981, 984, 3985, 3988, 4000]                                                                                                                            │
│ Average cumulative reward:       -12.572677020517945                                                                                                                     │
│ Average rollout reward:          -12.221930257629051                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.3277150778866336                                                                                                                             │
│ Best path: [0, 4, 975, 981, 984, 1400, 3938, 3948, 3959]                                                                                                                 │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K4/149 [38;2;249;38;114m━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m2.7%[0m Elapsed: [33m0:00:06[0m Remaining: [36m0:03:20[0m   1.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 4000 ===                                                                                                                                                   │
│ 4001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 975, 981, 984, 3985, 3988, 4000]                                                                                                                            │
│ Average cumulative reward:       -12.572677020517945                                                                                                                     │
│ Average rollout reward:          -12.221930257629051                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.3277150778866336                                                                                                                             │
│ Best path: [0, 4, 975, 981, 984, 1400, 3938, 3948, 3959]                                                                                                                 │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K5/149 [38;2;249;38;114m━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m3.4%[0m Elapsed: [33m0:00:07[0m Remaining: [36m0:03:18[0m   1.41 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 5000 ===                                                                                                                                                   │
│ 5001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 975, 4036, 4038, 4502, 4604, 5000]                                                                                                                          │
│ Average cumulative reward:       -12.588402232306048                                                                                                                     │
│ Average rollout reward:          -12.248235073040343                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.3277150778866336                                                                                                                             │
│ Best path: [0, 4, 975, 981, 984, 1400, 3938, 3948, 3959]                                                                                                                 │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K5/149 [38;2;249;38;114m━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m3.4%[0m Elapsed: [33m0:00:07[0m Remaining: [36m0:03:18[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 5000 ===                                                                                                                                                   │
│ 5001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 975, 4036, 4038, 4502, 4604, 5000]                                                                                                                          │
│ Average cumulative reward:       -12.588402232306048                                                                                                                     │
│ Average rollout reward:          -12.248235073040343                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.3277150778866336                                                                                                                             │
│ Best path: [0, 4, 975, 981, 984, 1400, 3938, 3948, 3959]                                                                                                                 │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K5/149 [38;2;249;38;114m━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m3.4%[0m Elapsed: [33m0:00:08[0m Remaining: [36m0:03:18[0m   1.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 5000 ===                                                                                                                                                   │
│ 5001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 975, 4036, 4038, 4502, 4604, 5000]                                                                                                                          │
│ Average cumulative reward:       -12.588402232306048                                                                                                                     │
│ Average rollout reward:          -12.248235073040343                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.3277150778866336                                                                                                                             │
│ Best path: [0, 4, 975, 981, 984, 1400, 3938, 3948, 3959]                                                                                                                 │
│ [-3.06045618 -3.06045618 -3.06045618 -3.06045618 -2.79319729 -2.79319729                                                                                                 │
│  -2.79319729 -2.44404763 -2.44404763]                                                                                                                                    │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K6/149 [38;2;249;38;114m━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m4.0%[0m Elapsed: [33m0:00:08[0m Remaining: [36m0:03:17[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 6000 ===                                                                                                                                                   │
│ 6001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 975, 4036, 4038, 4041, 5849, 6000]                                                                                                                          │
│ Average cumulative reward:       -12.074049373973436                                                                                                                     │
│ Average rollout reward:          -11.719009831863723                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K6/149 [38;2;249;38;114m━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m4.0%[0m Elapsed: [33m0:00:09[0m Remaining: [36m0:03:17[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 6000 ===                                                                                                                                                   │
│ 6001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 975, 4036, 4038, 4041, 5849, 6000]                                                                                                                          │
│ Average cumulative reward:       -12.074049373973436                                                                                                                     │
│ Average rollout reward:          -11.719009831863723                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K7/149 [38;2;249;38;114m━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m4.7%[0m Elapsed: [33m0:00:09[0m Remaining: [36m0:03:16[0m   1.37 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 7000 ===                                                                                                                                                   │
│ 7001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 1309, 1334, 1336, 7000]                                                                                                                                     │
│ Average cumulative reward:       -12.506396228164624                                                                                                                     │
│ Average rollout reward:          -12.145680543706156                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K7/149 [38;2;249;38;114m━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m4.7%[0m Elapsed: [33m0:00:10[0m Remaining: [36m0:03:16[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 7000 ===                                                                                                                                                   │
│ 7001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 1309, 1334, 1336, 7000]                                                                                                                                     │
│ Average cumulative reward:       -12.506396228164624                                                                                                                     │
│ Average rollout reward:          -12.145680543706156                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K7/149 [38;2;249;38;114m━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m4.7%[0m Elapsed: [33m0:00:10[0m Remaining: [36m0:03:16[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 7000 ===                                                                                                                                                   │
│ 7001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 1309, 1334, 1336, 7000]                                                                                                                                     │
│ Average cumulative reward:       -12.506396228164624                                                                                                                     │
│ Average rollout reward:          -12.145680543706156                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K8/149 [38;2;249;38;114m━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m5.4%[0m Elapsed: [33m0:00:11[0m Remaining: [36m0:03:15[0m   1.38 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 8000 ===                                                                                                                                                   │
│ 8001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 975, 5938, 5940, 8000]                                                                                                                                      │
│ Average cumulative reward:       -12.458365780408666                                                                                                                     │
│ Average rollout reward:          -12.11032399344406                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K8/149 [38;2;249;38;114m━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m5.4%[0m Elapsed: [33m0:00:11[0m Remaining: [36m0:03:15[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 8000 ===                                                                                                                                                   │
│ 8001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 975, 5938, 5940, 8000]                                                                                                                                      │
│ Average cumulative reward:       -12.458365780408666                                                                                                                     │
│ Average rollout reward:          -12.11032399344406                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K8/149 [38;2;249;38;114m━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m5.4%[0m Elapsed: [33m0:00:12[0m Remaining: [36m0:03:15[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 8000 ===                                                                                                                                                   │
│ 8001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 975, 5938, 5940, 8000]                                                                                                                                      │
│ Average cumulative reward:       -12.458365780408666                                                                                                                     │
│ Average rollout reward:          -12.11032399344406                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K9/149 [38;2;249;38;114m━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m6.0%[0m Elapsed: [33m0:00:12[0m Remaining: [36m0:03:15[0m   1.40 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 9000 ===                                                                                                                                                   │
│ 9001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 87, 7742, 8425, 9000]                                                                                                                                       │
│ Average cumulative reward:       -13.143971332805975                                                                                                                     │
│ Average rollout reward:          -12.811895507238711                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K9/149 [38;2;249;38;114m━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m6.0%[0m Elapsed: [33m0:00:13[0m Remaining: [36m0:03:15[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 9000 ===                                                                                                                                                   │
│ 9001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 87, 7742, 8425, 9000]                                                                                                                                       │
│ Average cumulative reward:       -13.143971332805975                                                                                                                     │
│ Average rollout reward:          -12.811895507238711                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K9/149 [38;2;249;38;114m━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m6.0%[0m Elapsed: [33m0:00:13[0m Remaining: [36m0:03:15[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 9000 ===                                                                                                                                                   │
│ 9001  nodes in tree                                                                                                                                                      │
│ Path: [0, 4, 87, 7742, 8425, 9000]                                                                                                                                       │
│ Average cumulative reward:       -13.143971332805975                                                                                                                     │
│ Average rollout reward:          -12.811895507238711                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━━━━━━━━[0m [35m6.7%[0m Elapsed: [33m0:00:14[0m Remaining: [36m0:03:15[0m   1.41 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 10000 ===                                                                                                                                                  │
│ 10001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 9968, 10000]                                                                                                                                                │
│ Average cumulative reward:       -13.558010294723816                                                                                                                     │
│ Average rollout reward:          -13.188762893353443                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K10/149 [38;2;249;38;114m━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m6.7%[0m Elapsed: [33m0:00:14[0m Remaining: [36m0:03:15[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 10000 ===                                                                                                                                                  │
│ 10001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 9968, 10000]                                                                                                                                                │
│ Average cumulative reward:       -13.558010294723816                                                                                                                     │
│ Average rollout reward:          -13.188762893353443                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K10/149 [38;2;249;38;114m━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m6.7%[0m Elapsed: [33m0:00:15[0m Remaining: [36m0:03:15[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 10000 ===                                                                                                                                                  │
│ 10001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 9968, 10000]                                                                                                                                                │
│ Average cumulative reward:       -13.558010294723816                                                                                                                     │
│ Average rollout reward:          -13.188762893353443                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K11/149 [38;2;249;38;114m━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m7.4%[0m Elapsed: [33m0:00:15[0m Remaining: [36m0:03:13[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 11000 ===                                                                                                                                                  │
│ 11001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 10925, 10982, 10984, 10992, 11000]                                                                                                                          │
│ Average cumulative reward:       -12.187928806849119                                                                                                                     │
│ Average rollout reward:          -11.854830206019344                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K11/149 [38;2;249;38;114m━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m7.4%[0m Elapsed: [33m0:00:16[0m Remaining: [36m0:03:13[0m   1.47 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 11000 ===                                                                                                                                                  │
│ 11001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 10925, 10982, 10984, 10992, 11000]                                                                                                                          │
│ Average cumulative reward:       -12.187928806849119                                                                                                                     │
│ Average rollout reward:          -11.854830206019344                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K11/149 [38;2;249;38;114m━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m7.4%[0m Elapsed: [33m0:00:16[0m Remaining: [36m0:03:13[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 11000 ===                                                                                                                                                  │
│ 11001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 10925, 10982, 10984, 10992, 11000]                                                                                                                          │
│ Average cumulative reward:       -12.187928806849119                                                                                                                     │
│ Average rollout reward:          -11.854830206019344                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K12/149 [38;2;249;38;114m━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m8.1%[0m Elapsed: [33m0:00:17[0m Remaining: [36m0:03:12[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 12000 ===                                                                                                                                                  │
│ 12001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 2546, 2656, 11548, 11817, 12000]                                                                                                                            │
│ Average cumulative reward:       -12.637832421234387                                                                                                                     │
│ Average rollout reward:          -12.28580248492491                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K12/149 [38;2;249;38;114m━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m8.1%[0m Elapsed: [33m0:00:17[0m Remaining: [36m0:03:12[0m   1.47 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 12000 ===                                                                                                                                                  │
│ 12001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 2546, 2656, 11548, 11817, 12000]                                                                                                                            │
│ Average cumulative reward:       -12.637832421234387                                                                                                                     │
│ Average rollout reward:          -12.28580248492491                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K13/149 [38;2;249;38;114m━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m8.7%[0m Elapsed: [33m0:00:18[0m Remaining: [36m0:03:11[0m   1.39 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 13000 ===                                                                                                                                                  │
│ 13001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 46, 9903, 10357, 11812, 11910, 13000]                                                                                                                       │
│ Average cumulative reward:       -12.464540605782751                                                                                                                     │
│ Average rollout reward:          -12.100961116720944                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K13/149 [38;2;249;38;114m━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m8.7%[0m Elapsed: [33m0:00:18[0m Remaining: [36m0:03:11[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 13000 ===                                                                                                                                                  │
│ 13001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 46, 9903, 10357, 11812, 11910, 13000]                                                                                                                       │
│ Average cumulative reward:       -12.464540605782751                                                                                                                     │
│ Average rollout reward:          -12.100961116720944                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K13/149 [38;2;249;38;114m━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m8.7%[0m Elapsed: [33m0:00:19[0m Remaining: [36m0:03:11[0m   1.47 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 13000 ===                                                                                                                                                  │
│ 13001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 46, 9903, 10357, 11812, 11910, 13000]                                                                                                                       │
│ Average cumulative reward:       -12.464540605782751                                                                                                                     │
│ Average rollout reward:          -12.100961116720944                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K14/149 [38;2;249;38;114m━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m9.4%[0m Elapsed: [33m0:00:19[0m Remaining: [36m0:03:09[0m   1.40 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 14000 ===                                                                                                                                                  │
│ 14001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 60, 62, 250, 14000]                                                                                                                                         │
│ Average cumulative reward:       -12.063869444837623                                                                                                                     │
│ Average rollout reward:          -11.706959007408981                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K14/149 [38;2;249;38;114m━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m9.4%[0m Elapsed: [33m0:00:20[0m Remaining: [36m0:03:09[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 14000 ===                                                                                                                                                  │
│ 14001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 60, 62, 250, 14000]                                                                                                                                         │
│ Average cumulative reward:       -12.063869444837623                                                                                                                     │
│ Average rollout reward:          -11.706959007408981                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K14/149 [38;2;249;38;114m━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m9.4%[0m Elapsed: [33m0:00:20[0m Remaining: [36m0:03:09[0m   1.48 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 14000 ===                                                                                                                                                  │
│ 14001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 60, 62, 250, 14000]                                                                                                                                         │
│ Average cumulative reward:       -12.063869444837623                                                                                                                     │
│ Average rollout reward:          -11.706959007408981                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K15/149 [38;2;249;38;114m━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m10.1%[0m Elapsed: [33m0:00:21[0m Remaining: [36m0:03:09[0m   1.41 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 15000 ===                                                                                                                                                  │
│ 15001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 14907, 14923, 14925, 14973, 15000]                                                                                                                          │
│ Average cumulative reward:       -14.296537044525712                                                                                                                     │
│ Average rollout reward:          -13.978730940535431                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K15/149 [38;2;249;38;114m━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m10.1%[0m Elapsed: [33m0:00:21[0m Remaining: [36m0:03:09[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 15000 ===                                                                                                                                                  │
│ 15001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 14907, 14923, 14925, 14973, 15000]                                                                                                                          │
│ Average cumulative reward:       -14.296537044525712                                                                                                                     │
│ Average rollout reward:          -13.978730940535431                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K15/149 [38;2;249;38;114m━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m10.1%[0m Elapsed: [33m0:00:22[0m Remaining: [36m0:03:09[0m   1.48 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 15000 ===                                                                                                                                                  │
│ 15001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 14907, 14923, 14925, 14973, 15000]                                                                                                                          │
│ Average cumulative reward:       -14.296537044525712                                                                                                                     │
│ Average rollout reward:          -13.978730940535431                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K16/149 [38;2;249;38;114m━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m10.7%[0m Elapsed: [33m0:00:22[0m Remaining: [36m0:03:07[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 16000 ===                                                                                                                                                  │
│ 16001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 177, 15944, 15946, 15952, 15955, 16000]                                                                                                                     │
│ Average cumulative reward:       -12.143498150869148                                                                                                                     │
│ Average rollout reward:          -11.764861625715216                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K16/149 [38;2;249;38;114m━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m10.7%[0m Elapsed: [33m0:00:23[0m Remaining: [36m0:03:07[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 16000 ===                                                                                                                                                  │
│ 16001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 177, 15944, 15946, 15952, 15955, 16000]                                                                                                                     │
│ Average cumulative reward:       -12.143498150869148                                                                                                                     │
│ Average rollout reward:          -11.764861625715216                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K16/149 [38;2;249;38;114m━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m10.7%[0m Elapsed: [33m0:00:23[0m Remaining: [36m0:03:07[0m   1.48 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 16000 ===                                                                                                                                                  │
│ 16001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 177, 15944, 15946, 15952, 15955, 16000]                                                                                                                     │
│ Average cumulative reward:       -12.143498150869148                                                                                                                     │
│ Average rollout reward:          -11.764861625715216                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━━━━━━━━[0m [35m11.4%[0m Elapsed: [33m0:00:24[0m Remaining: [36m0:03:05[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 17000 ===                                                                                                                                                  │
│ 17001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 16969, 16985, 16987, 16990, 17000]                                                                                                                          │
│ Average cumulative reward:       -12.271219607203133                                                                                                                     │
│ Average rollout reward:          -11.94072940591798                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K17/149 [38;2;249;38;114m━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m11.4%[0m Elapsed: [33m0:00:24[0m Remaining: [36m0:03:05[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 17000 ===                                                                                                                                                  │
│ 17001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 16969, 16985, 16987, 16990, 17000]                                                                                                                          │
│ Average cumulative reward:       -12.271219607203133                                                                                                                     │
│ Average rollout reward:          -11.94072940591798                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K17/149 [38;2;249;38;114m━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m11.4%[0m Elapsed: [33m0:00:25[0m Remaining: [36m0:03:05[0m   1.48 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 17000 ===                                                                                                                                                  │
│ 17001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 16969, 16985, 16987, 16990, 17000]                                                                                                                          │
│ Average cumulative reward:       -12.271219607203133                                                                                                                     │
│ Average rollout reward:          -11.94072940591798                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K18/149 [38;2;249;38;114m━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m12.1%[0m Elapsed: [33m0:00:25[0m Remaining: [36m0:03:05[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 18000 ===                                                                                                                                                  │
│ 18001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 17840, 17970, 17983, 17999, 18000]                                                                                                                          │
│ Average cumulative reward:       -13.357691879046698                                                                                                                     │
│ Average rollout reward:          -13.012287003644024                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K18/149 [38;2;249;38;114m━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m12.1%[0m Elapsed: [33m0:00:26[0m Remaining: [36m0:03:05[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 18000 ===                                                                                                                                                  │
│ 18001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 17840, 17970, 17983, 17999, 18000]                                                                                                                          │
│ Average cumulative reward:       -13.357691879046698                                                                                                                     │
│ Average rollout reward:          -13.012287003644024                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K19/149 [38;2;249;38;114m━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m12.8%[0m Elapsed: [33m0:00:26[0m Remaining: [36m0:03:03[0m   1.41 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 19000 ===                                                                                                                                                  │
│ 19001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 13004, 13321, 13325, 13361, 18993, 18994, 19000]                                                                                                            │
│ Average cumulative reward:       -12.770839660672356                                                                                                                     │
│ Average rollout reward:          -12.411133051269585                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K19/149 [38;2;249;38;114m━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m12.8%[0m Elapsed: [33m0:00:27[0m Remaining: [36m0:03:03[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 19000 ===                                                                                                                                                  │
│ 19001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 13004, 13321, 13325, 13361, 18993, 18994, 19000]                                                                                                            │
│ Average cumulative reward:       -12.770839660672356                                                                                                                     │
│ Average rollout reward:          -12.411133051269585                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K19/149 [38;2;249;38;114m━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m12.8%[0m Elapsed: [33m0:00:27[0m Remaining: [36m0:03:03[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 19000 ===                                                                                                                                                  │
│ 19001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 13004, 13321, 13325, 13361, 18993, 18994, 19000]                                                                                                            │
│ Average cumulative reward:       -12.770839660672356                                                                                                                     │
│ Average rollout reward:          -12.411133051269585                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K20/149 [38;2;249;38;114m━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m13.4%[0m Elapsed: [33m0:00:28[0m Remaining: [36m0:03:02[0m   1.41 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 20000 ===                                                                                                                                                  │
│ 20001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 19661, 19769, 19982, 19998, 20000]                                                                                                                          │
│ Average cumulative reward:       -11.94565759811913                                                                                                                      │
│ Average rollout reward:          -11.553311790267864                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K20/149 [38;2;249;38;114m━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m13.4%[0m Elapsed: [33m0:00:28[0m Remaining: [36m0:03:02[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 20000 ===                                                                                                                                                  │
│ 20001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 19661, 19769, 19982, 19998, 20000]                                                                                                                          │
│ Average cumulative reward:       -11.94565759811913                                                                                                                      │
│ Average rollout reward:          -11.553311790267864                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K20/149 [38;2;249;38;114m━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m13.4%[0m Elapsed: [33m0:00:29[0m Remaining: [36m0:03:02[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 20000 ===                                                                                                                                                  │
│ 20001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 19661, 19769, 19982, 19998, 20000]                                                                                                                          │
│ Average cumulative reward:       -11.94565759811913                                                                                                                      │
│ Average rollout reward:          -11.553311790267864                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K21/149 [38;2;249;38;114m━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m14.1%[0m Elapsed: [33m0:00:29[0m Remaining: [36m0:03:00[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 21000 ===                                                                                                                                                  │
│ 21001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 20612, 20970, 20973, 20977, 21000]                                                                                                                          │
│ Average cumulative reward:       -12.602109385143223                                                                                                                     │
│ Average rollout reward:          -12.2384056536831                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K21/149 [38;2;249;38;114m━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m14.1%[0m Elapsed: [33m0:00:30[0m Remaining: [36m0:03:00[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 21000 ===                                                                                                                                                  │
│ 21001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 20612, 20970, 20973, 20977, 21000]                                                                                                                          │
│ Average cumulative reward:       -12.602109385143223                                                                                                                     │
│ Average rollout reward:          -12.2384056536831                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K21/149 [38;2;249;38;114m━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m14.1%[0m Elapsed: [33m0:00:30[0m Remaining: [36m0:03:00[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 21000 ===                                                                                                                                                  │
│ 21001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 20612, 20970, 20973, 20977, 21000]                                                                                                                          │
│ Average cumulative reward:       -12.602109385143223                                                                                                                     │
│ Average rollout reward:          -12.2384056536831                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K22/149 [38;2;249;38;114m━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m14.8%[0m Elapsed: [33m0:00:31[0m Remaining: [36m0:02:59[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 22000 ===                                                                                                                                                  │
│ 22001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 975, 21050, 21097, 21970, 22000]                                                                                                                            │
│ Average cumulative reward:       -12.616168743851564                                                                                                                     │
│ Average rollout reward:          -12.254148457310809                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K22/149 [38;2;249;38;114m━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m14.8%[0m Elapsed: [33m0:00:31[0m Remaining: [36m0:02:59[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 22000 ===                                                                                                                                                  │
│ 22001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 975, 21050, 21097, 21970, 22000]                                                                                                                            │
│ Average cumulative reward:       -12.616168743851564                                                                                                                     │
│ Average rollout reward:          -12.254148457310809                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K22/149 [38;2;249;38;114m━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m14.8%[0m Elapsed: [33m0:00:32[0m Remaining: [36m0:02:59[0m   1.47 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 22000 ===                                                                                                                                                  │
│ 22001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 975, 21050, 21097, 21970, 22000]                                                                                                                            │
│ Average cumulative reward:       -12.616168743851564                                                                                                                     │
│ Average rollout reward:          -12.254148457310809                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K23/149 [38;2;249;38;114m━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m15.4%[0m Elapsed: [33m0:00:32[0m Remaining: [36m0:02:58[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 23000 ===                                                                                                                                                  │
│ 23001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 7153, 7396, 22492, 23000]                                                                                                                                   │
│ Average cumulative reward:       -13.362995513372907                                                                                                                     │
│ Average rollout reward:          -13.003648038324458                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K23/149 [38;2;249;38;114m━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m15.4%[0m Elapsed: [33m0:00:33[0m Remaining: [36m0:02:58[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 23000 ===                                                                                                                                                  │
│ 23001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 7153, 7396, 22492, 23000]                                                                                                                                   │
│ Average cumulative reward:       -13.362995513372907                                                                                                                     │
│ Average rollout reward:          -13.003648038324458                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K24/149 [38;2;249;38;114m━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m16.1%[0m Elapsed: [33m0:00:33[0m Remaining: [36m0:02:57[0m   1.41 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 24000 ===                                                                                                                                                  │
│ 24001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 5510, 9807, 12191, 14695, 14866, 24000]                                                                                                                     │
│ Average cumulative reward:       -12.562594432847119                                                                                                                     │
│ Average rollout reward:          -12.176737143508342                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━[0m [35m16.1%[0m Elapsed: [33m0:00:34[0m Remaining: [36m0:02:57[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 24000 ===                                                                                                                                                  │
│ 24001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 5510, 9807, 12191, 14695, 14866, 24000]                                                                                                                     │
│ Average cumulative reward:       -12.562594432847119                                                                                                                     │
│ Average rollout reward:          -12.176737143508342                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K24/149 [38;2;249;38;114m━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m16.1%[0m Elapsed: [33m0:00:34[0m Remaining: [36m0:02:57[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 24000 ===                                                                                                                                                  │
│ 24001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 5510, 9807, 12191, 14695, 14866, 24000]                                                                                                                     │
│ Average cumulative reward:       -12.562594432847119                                                                                                                     │
│ Average rollout reward:          -12.176737143508342                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K25/149 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m16.8%[0m Elapsed: [33m0:00:35[0m Remaining: [36m0:02:56[0m   1.41 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 25000 ===                                                                                                                                                  │
│ 25001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 87, 24949, 24952, 24998, 25000]                                                                                                                             │
│ Average cumulative reward:       -12.258400343329582                                                                                                                     │
│ Average rollout reward:          -11.88588131001295                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K25/149 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m16.8%[0m Elapsed: [33m0:00:35[0m Remaining: [36m0:02:56[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 25000 ===                                                                                                                                                  │
│ 25001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 87, 24949, 24952, 24998, 25000]                                                                                                                             │
│ Average cumulative reward:       -12.258400343329582                                                                                                                     │
│ Average rollout reward:          -11.88588131001295                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K25/149 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m16.8%[0m Elapsed: [33m0:00:36[0m Remaining: [36m0:02:56[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 25000 ===                                                                                                                                                  │
│ 25001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 87, 24949, 24952, 24998, 25000]                                                                                                                             │
│ Average cumulative reward:       -12.258400343329582                                                                                                                     │
│ Average rollout reward:          -11.88588131001295                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K26/149 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m17.4%[0m Elapsed: [33m0:00:36[0m Remaining: [36m0:02:55[0m   1.41 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 26000 ===                                                                                                                                                  │
│ 26001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 24158, 24713, 24715, 25868, 26000]                                                                                                                          │
│ Average cumulative reward:       -12.513517660957099                                                                                                                     │
│ Average rollout reward:          -12.157389404559606                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K26/149 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m17.4%[0m Elapsed: [33m0:00:37[0m Remaining: [36m0:02:55[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 26000 ===                                                                                                                                                  │
│ 26001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 24158, 24713, 24715, 25868, 26000]                                                                                                                          │
│ Average cumulative reward:       -12.513517660957099                                                                                                                     │
│ Average rollout reward:          -12.157389404559606                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K26/149 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m17.4%[0m Elapsed: [33m0:00:37[0m Remaining: [36m0:02:55[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 26000 ===                                                                                                                                                  │
│ 26001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 24158, 24713, 24715, 25868, 26000]                                                                                                                          │
│ Average cumulative reward:       -12.513517660957099                                                                                                                     │
│ Average rollout reward:          -12.157389404559606                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K27/149 [38;2;249;38;114m━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m18.1%[0m Elapsed: [33m0:00:38[0m Remaining: [36m0:02:54[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 27000 ===                                                                                                                                                  │
│ 27001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 177, 25916, 25979, 26482, 26881, 27000]                                                                                                                     │
│ Average cumulative reward:       -12.641412553928964                                                                                                                     │
│ Average rollout reward:          -12.259589512530866                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K27/149 [38;2;249;38;114m━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m18.1%[0m Elapsed: [33m0:00:38[0m Remaining: [36m0:02:54[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 27000 ===                                                                                                                                                  │
│ 27001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 177, 25916, 25979, 26482, 26881, 27000]                                                                                                                     │
│ Average cumulative reward:       -12.641412553928964                                                                                                                     │
│ Average rollout reward:          -12.259589512530866                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K27/149 [38;2;249;38;114m━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m18.1%[0m Elapsed: [33m0:00:39[0m Remaining: [36m0:02:54[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 27000 ===                                                                                                                                                  │
│ 27001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 177, 25916, 25979, 26482, 26881, 27000]                                                                                                                     │
│ Average cumulative reward:       -12.641412553928964                                                                                                                     │
│ Average rollout reward:          -12.259589512530866                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K28/149 [38;2;249;38;114m━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m18.8%[0m Elapsed: [33m0:00:39[0m Remaining: [36m0:02:52[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 28000 ===                                                                                                                                                  │
│ 28001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 177, 27003, 27033, 27631, 28000]                                                                                                                            │
│ Average cumulative reward:       -12.50685636276101                                                                                                                      │
│ Average rollout reward:          -12.116985023706333                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K28/149 [38;2;249;38;114m━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m18.8%[0m Elapsed: [33m0:00:40[0m Remaining: [36m0:02:52[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 28000 ===                                                                                                                                                  │
│ 28001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 177, 27003, 27033, 27631, 28000]                                                                                                                            │
│ Average cumulative reward:       -12.50685636276101                                                                                                                      │
│ Average rollout reward:          -12.116985023706333                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K29/149 [38;2;249;38;114m━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m19.5%[0m Elapsed: [33m0:00:40[0m Remaining: [36m0:02:51[0m   1.41 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 29000 ===                                                                                                                                                  │
│ 29001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 46, 3875, 4048, 4125, 29000]                                                                                                                                │
│ Average cumulative reward:       -12.676264097093899                                                                                                                     │
│ Average rollout reward:          -12.296929188797858                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K29/149 [38;2;249;38;114m━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m19.5%[0m Elapsed: [33m0:00:41[0m Remaining: [36m0:02:51[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 29000 ===                                                                                                                                                  │
│ 29001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 46, 3875, 4048, 4125, 29000]                                                                                                                                │
│ Average cumulative reward:       -12.676264097093899                                                                                                                     │
│ Average rollout reward:          -12.296929188797858                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K29/149 [38;2;249;38;114m━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m19.5%[0m Elapsed: [33m0:00:41[0m Remaining: [36m0:02:51[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 29000 ===                                                                                                                                                  │
│ 29001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 46, 3875, 4048, 4125, 29000]                                                                                                                                │
│ Average cumulative reward:       -12.676264097093899                                                                                                                     │
│ Average rollout reward:          -12.296929188797858                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K30/149 [38;2;249;38;114m━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m20.1%[0m Elapsed: [33m0:00:42[0m Remaining: [36m0:02:50[0m   1.41 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 30000 ===                                                                                                                                                  │
│ 30001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 13747, 26131, 26527, 28531, 30000]                                                                                                                          │
│ Average cumulative reward:       -12.898034564548757                                                                                                                     │
│ Average rollout reward:          -12.514165957669425                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K30/149 [38;2;249;38;114m━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m20.1%[0m Elapsed: [33m0:00:42[0m Remaining: [36m0:02:50[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 30000 ===                                                                                                                                                  │
│ 30001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 13747, 26131, 26527, 28531, 30000]                                                                                                                          │
│ Average cumulative reward:       -12.898034564548757                                                                                                                     │
│ Average rollout reward:          -12.514165957669425                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K30/149 [38;2;249;38;114m━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m20.1%[0m Elapsed: [33m0:00:43[0m Remaining: [36m0:02:50[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 30000 ===                                                                                                                                                  │
│ 30001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 13747, 26131, 26527, 28531, 30000]                                                                                                                          │
│ Average cumulative reward:       -12.898034564548757                                                                                                                     │
│ Average rollout reward:          -12.514165957669425                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K31/149 [38;2;249;38;114m━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m20.8%[0m Elapsed: [33m0:00:43[0m Remaining: [36m0:02:48[0m   1.41 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 31000 ===                                                                                                                                                  │
│ 31001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 24158, 24288, 24290, 24327, 24362, 31000]                                                                                                                   │
│ Average cumulative reward:       -12.72298536326777                                                                                                                      │
│ Average rollout reward:          -12.294436380618846                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━[0m [35m20.8%[0m Elapsed: [33m0:00:44[0m Remaining: [36m0:02:48[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 31000 ===                                                                                                                                                  │
│ 31001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 24158, 24288, 24290, 24327, 24362, 31000]                                                                                                                   │
│ Average cumulative reward:       -12.72298536326777                                                                                                                      │
│ Average rollout reward:          -12.294436380618846                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K31/149 [38;2;249;38;114m━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m20.8%[0m Elapsed: [33m0:00:44[0m Remaining: [36m0:02:48[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 31000 ===                                                                                                                                                  │
│ 31001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 24158, 24288, 24290, 24327, 24362, 31000]                                                                                                                   │
│ Average cumulative reward:       -12.72298536326777                                                                                                                      │
│ Average rollout reward:          -12.294436380618846                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K32/149 [38;2;249;38;114m━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m21.5%[0m Elapsed: [33m0:00:45[0m Remaining: [36m0:02:47[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 32000 ===                                                                                                                                                  │
│ 32001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 31045, 31864, 31866, 32000]                                                                                                                                 │
│ Average cumulative reward:       -12.744111624921512                                                                                                                     │
│ Average rollout reward:          -12.36380286769134                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K32/149 [38;2;249;38;114m━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m21.5%[0m Elapsed: [33m0:00:45[0m Remaining: [36m0:02:47[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 32000 ===                                                                                                                                                  │
│ 32001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 31045, 31864, 31866, 32000]                                                                                                                                 │
│ Average cumulative reward:       -12.744111624921512                                                                                                                     │
│ Average rollout reward:          -12.36380286769134                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K32/149 [38;2;249;38;114m━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m21.5%[0m Elapsed: [33m0:00:46[0m Remaining: [36m0:02:47[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 32000 ===                                                                                                                                                  │
│ 32001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 31045, 31864, 31866, 32000]                                                                                                                                 │
│ Average cumulative reward:       -12.744111624921512                                                                                                                     │
│ Average rollout reward:          -12.36380286769134                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K33/149 [38;2;249;38;114m━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m22.1%[0m Elapsed: [33m0:00:46[0m Remaining: [36m0:02:46[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 33000 ===                                                                                                                                                  │
│ 33001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 32931, 32987, 32990, 32992, 32997, 33000]                                                                                                                   │
│ Average cumulative reward:       -12.508264826292255                                                                                                                     │
│ Average rollout reward:          -12.117892560615886                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K33/149 [38;2;249;38;114m━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m22.1%[0m Elapsed: [33m0:00:47[0m Remaining: [36m0:02:46[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 33000 ===                                                                                                                                                  │
│ 33001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 32931, 32987, 32990, 32992, 32997, 33000]                                                                                                                   │
│ Average cumulative reward:       -12.508264826292255                                                                                                                     │
│ Average rollout reward:          -12.117892560615886                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K33/149 [38;2;249;38;114m━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m22.1%[0m Elapsed: [33m0:00:47[0m Remaining: [36m0:02:46[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 33000 ===                                                                                                                                                  │
│ 33001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 32931, 32987, 32990, 32992, 32997, 33000]                                                                                                                   │
│ Average cumulative reward:       -12.508264826292255                                                                                                                     │
│ Average rollout reward:          -12.117892560615886                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K34/149 [38;2;249;38;114m━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m22.8%[0m Elapsed: [33m0:00:48[0m Remaining: [36m0:02:45[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 34000 ===                                                                                                                                                  │
│ 34001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 24158, 30543, 30547, 30563, 30576, 34000]                                                                                                                   │
│ Average cumulative reward:       -13.51508258593522                                                                                                                      │
│ Average rollout reward:          -13.107396138259956                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
│ [-3.06045618 -3.06045618 -3.06045618 -3.06045618 -2.79319729 -2.79319729                                                                                                 │
│  -2.79319729 -2.44404763 -2.44404763 -2.44404763 -2.09489797]                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K34/149 [38;2;249;38;114m━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m22.8%[0m Elapsed: [33m0:00:48[0m Remaining: [36m0:02:45[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 34000 ===                                                                                                                                                  │
│ 34001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 24158, 30543, 30547, 30563, 30576, 34000]                                                                                                                   │
│ Average cumulative reward:       -13.51508258593522                                                                                                                      │
│ Average rollout reward:          -13.107396138259956                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
│ [-3.06045618 -3.06045618 -3.06045618 -3.06045618 -2.79319729 -2.79319729                                                                                                 │
│  -2.79319729 -2.44404763 -2.44404763 -2.44404763 -2.09489797]                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K34/149 [38;2;249;38;114m━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m22.8%[0m Elapsed: [33m0:00:49[0m Remaining: [36m0:02:45[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 34000 ===                                                                                                                                                  │
│ 34001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 24158, 30543, 30547, 30563, 30576, 34000]                                                                                                                   │
│ Average cumulative reward:       -13.51508258593522                                                                                                                      │
│ Average rollout reward:          -13.107396138259956                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.060456183895379                                                                                                                              │
│ Best path: [0, 4, 975, 4036, 4038, 4502, 4604, 4649, 5817]                                                                                                               │
│ [-3.06045618 -3.06045618 -3.06045618 -3.06045618 -2.79319729 -2.79319729                                                                                                 │
│  -2.79319729 -2.44404763 -2.44404763 -2.44404763 -2.09489797]                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K35/149 [38;2;249;38;114m━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m23.5%[0m Elapsed: [33m0:00:49[0m Remaining: [36m0:02:43[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 35000 ===                                                                                                                                                  │
│ 35001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 34886, 34909, 34913, 34998, 35000]                                                                                                                          │
│ Average cumulative reward:       -13.085296572196617                                                                                                                     │
│ Average rollout reward:          -12.678703595248468                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K35/149 [38;2;249;38;114m━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m23.5%[0m Elapsed: [33m0:00:50[0m Remaining: [36m0:02:43[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 35000 ===                                                                                                                                                  │
│ 35001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 34886, 34909, 34913, 34998, 35000]                                                                                                                          │
│ Average cumulative reward:       -13.085296572196617                                                                                                                     │
│ Average rollout reward:          -12.678703595248468                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K35/149 [38;2;249;38;114m━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m23.5%[0m Elapsed: [33m0:00:50[0m Remaining: [36m0:02:43[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 35000 ===                                                                                                                                                  │
│ 35001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 34886, 34909, 34913, 34998, 35000]                                                                                                                          │
│ Average cumulative reward:       -13.085296572196617                                                                                                                     │
│ Average rollout reward:          -12.678703595248468                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K36/149 [38;2;249;38;114m━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m24.2%[0m Elapsed: [33m0:00:51[0m Remaining: [36m0:02:42[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 36000 ===                                                                                                                                                  │
│ 36001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 177, 25916, 25979, 26480, 26583, 36000]                                                                                                                     │
│ Average cumulative reward:       -12.404928807307726                                                                                                                     │
│ Average rollout reward:          -12.000780445628262                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K36/149 [38;2;249;38;114m━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m24.2%[0m Elapsed: [33m0:00:51[0m Remaining: [36m0:02:42[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 36000 ===                                                                                                                                                  │
│ 36001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 177, 25916, 25979, 26480, 26583, 36000]                                                                                                                     │
│ Average cumulative reward:       -12.404928807307726                                                                                                                     │
│ Average rollout reward:          -12.000780445628262                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K36/149 [38;2;249;38;114m━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m24.2%[0m Elapsed: [33m0:00:52[0m Remaining: [36m0:02:42[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 36000 ===                                                                                                                                                  │
│ 36001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 177, 25916, 25979, 26480, 26583, 36000]                                                                                                                     │
│ Average cumulative reward:       -12.404928807307726                                                                                                                     │
│ Average rollout reward:          -12.000780445628262                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K37/149 [38;2;249;38;114m━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m24.8%[0m Elapsed: [33m0:00:52[0m Remaining: [36m0:02:41[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 37000 ===                                                                                                                                                  │
│ 37001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 8773, 8919, 9090, 9151, 34802, 36310, 37000]                                                                                                                │
│ Average cumulative reward:       -12.653037659973485                                                                                                                     │
│ Average rollout reward:          -12.24408181813383                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K37/149 [38;2;249;38;114m━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m24.8%[0m Elapsed: [33m0:00:53[0m Remaining: [36m0:02:41[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 37000 ===                                                                                                                                                  │
│ 37001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 8773, 8919, 9090, 9151, 34802, 36310, 37000]                                                                                                                │
│ Average cumulative reward:       -12.653037659973485                                                                                                                     │
│ Average rollout reward:          -12.24408181813383                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K38/149 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m25.5%[0m Elapsed: [33m0:00:53[0m Remaining: [36m0:02:39[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 38000 ===                                                                                                                                                  │
│ 38001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 37598, 38000]                                                                                                                                               │
│ Average cumulative reward:       -12.280724657546502                                                                                                                     │
│ Average rollout reward:          -11.889313974805043                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━[0m [35m25.5%[0m Elapsed: [33m0:00:54[0m Remaining: [36m0:02:39[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 38000 ===                                                                                                                                                  │
│ 38001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 37598, 38000]                                                                                                                                               │
│ Average cumulative reward:       -12.280724657546502                                                                                                                     │
│ Average rollout reward:          -11.889313974805043                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K38/149 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m25.5%[0m Elapsed: [33m0:00:54[0m Remaining: [36m0:02:39[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 38000 ===                                                                                                                                                  │
│ 38001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 37598, 38000]                                                                                                                                               │
│ Average cumulative reward:       -12.280724657546502                                                                                                                     │
│ Average rollout reward:          -11.889313974805043                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K39/149 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.2%[0m Elapsed: [33m0:00:55[0m Remaining: [36m0:02:38[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 39000 ===                                                                                                                                                  │
│ 39001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 206, 36757, 36997, 37164, 39000]                                                                                                                            │
│ Average cumulative reward:       -11.934392794135503                                                                                                                     │
│ Average rollout reward:          -11.573633271279578                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K39/149 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.2%[0m Elapsed: [33m0:00:55[0m Remaining: [36m0:02:38[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 39000 ===                                                                                                                                                  │
│ 39001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 206, 36757, 36997, 37164, 39000]                                                                                                                            │
│ Average cumulative reward:       -11.934392794135503                                                                                                                     │
│ Average rollout reward:          -11.573633271279578                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K39/149 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.2%[0m Elapsed: [33m0:00:56[0m Remaining: [36m0:02:38[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 39000 ===                                                                                                                                                  │
│ 39001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 206, 36757, 36997, 37164, 39000]                                                                                                                            │
│ Average cumulative reward:       -11.934392794135503                                                                                                                     │
│ Average rollout reward:          -11.573633271279578                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K40/149 [38;2;249;38;114m━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.8%[0m Elapsed: [33m0:00:56[0m Remaining: [36m0:02:37[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 40000 ===                                                                                                                                                  │
│ 40001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 25782, 25814, 26414, 40000]                                                                                                                                 │
│ Average cumulative reward:       -12.474961491206003                                                                                                                     │
│ Average rollout reward:          -12.06009763491716                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K40/149 [38;2;249;38;114m━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.8%[0m Elapsed: [33m0:00:57[0m Remaining: [36m0:02:37[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 40000 ===                                                                                                                                                  │
│ 40001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 25782, 25814, 26414, 40000]                                                                                                                                 │
│ Average cumulative reward:       -12.474961491206003                                                                                                                     │
│ Average rollout reward:          -12.06009763491716                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K40/149 [38;2;249;38;114m━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.8%[0m Elapsed: [33m0:00:57[0m Remaining: [36m0:02:37[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 40000 ===                                                                                                                                                  │
│ 40001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 25782, 25814, 26414, 40000]                                                                                                                                 │
│ Average cumulative reward:       -12.474961491206003                                                                                                                     │
│ Average rollout reward:          -12.06009763491716                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K41/149 [38;2;249;38;114m━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m27.5%[0m Elapsed: [33m0:00:58[0m Remaining: [36m0:02:36[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 41000 ===                                                                                                                                                  │
│ 41001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 40434, 40987, 40991, 40994, 41000]                                                                                                                          │
│ Average cumulative reward:       -12.543875695316256                                                                                                                     │
│ Average rollout reward:          -12.178888443280709                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K41/149 [38;2;249;38;114m━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m27.5%[0m Elapsed: [33m0:00:58[0m Remaining: [36m0:02:36[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 41000 ===                                                                                                                                                  │
│ 41001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 40434, 40987, 40991, 40994, 41000]                                                                                                                          │
│ Average cumulative reward:       -12.543875695316256                                                                                                                     │
│ Average rollout reward:          -12.178888443280709                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K41/149 [38;2;249;38;114m━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m27.5%[0m Elapsed: [33m0:00:59[0m Remaining: [36m0:02:36[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 41000 ===                                                                                                                                                  │
│ 41001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 40434, 40987, 40991, 40994, 41000]                                                                                                                          │
│ Average cumulative reward:       -12.543875695316256                                                                                                                     │
│ Average rollout reward:          -12.178888443280709                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K42/149 [38;2;249;38;114m━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m28.2%[0m Elapsed: [33m0:00:59[0m Remaining: [36m0:02:34[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 42000 ===                                                                                                                                                  │
│ 42001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 40434, 40589, 40591, 40595, 40611, 41102, 41103, 41782, 42000]                                                                                              │
│ Average cumulative reward:       -11.914596648210498                                                                                                                     │
│ Average rollout reward:          -11.520003045900527                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K42/149 [38;2;249;38;114m━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m28.2%[0m Elapsed: [33m0:01:00[0m Remaining: [36m0:02:34[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 42000 ===                                                                                                                                                  │
│ 42001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 40434, 40589, 40591, 40595, 40611, 41102, 41103, 41782, 42000]                                                                                              │
│ Average cumulative reward:       -11.914596648210498                                                                                                                     │
│ Average rollout reward:          -11.520003045900527                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K42/149 [38;2;249;38;114m━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m28.2%[0m Elapsed: [33m0:01:00[0m Remaining: [36m0:02:34[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 42000 ===                                                                                                                                                  │
│ 42001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 40434, 40589, 40591, 40595, 40611, 41102, 41103, 41782, 42000]                                                                                              │
│ Average cumulative reward:       -11.914596648210498                                                                                                                     │
│ Average rollout reward:          -11.520003045900527                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K43/149 [38;2;249;38;114m━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m28.9%[0m Elapsed: [33m0:01:01[0m Remaining: [36m0:02:33[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 43000 ===                                                                                                                                                  │
│ 43001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 177, 2044, 2108, 36546, 37360, 40037, 43000]                                                                                                                │
│ Average cumulative reward:       -12.175039157442116                                                                                                                     │
│ Average rollout reward:          -11.763859891605819                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K43/149 [38;2;249;38;114m━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m28.9%[0m Elapsed: [33m0:01:01[0m Remaining: [36m0:02:33[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 43000 ===                                                                                                                                                  │
│ 43001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 177, 2044, 2108, 36546, 37360, 40037, 43000]                                                                                                                │
│ Average cumulative reward:       -12.175039157442116                                                                                                                     │
│ Average rollout reward:          -11.763859891605819                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K44/149 [38;2;249;38;114m━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m29.5%[0m Elapsed: [33m0:01:02[0m Remaining: [36m0:02:31[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 44000 ===                                                                                                                                                  │
│ 44001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 43394, 43843, 43845, 43870, 43880, 43882, 44000]                                                                                                            │
│ Average cumulative reward:       -12.267676652211252                                                                                                                     │
│ Average rollout reward:          -11.891312960444019                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K44/149 [38;2;249;38;114m━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m29.5%[0m Elapsed: [33m0:01:02[0m Remaining: [36m0:02:31[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 44000 ===                                                                                                                                                  │
│ 44001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 43394, 43843, 43845, 43870, 43880, 43882, 44000]                                                                                                            │
│ Average cumulative reward:       -12.267676652211252                                                                                                                     │
│ Average rollout reward:          -11.891312960444019                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K44/149 [38;2;249;38;114m━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m29.5%[0m Elapsed: [33m0:01:03[0m Remaining: [36m0:02:31[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 44000 ===                                                                                                                                                  │
│ 44001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 43394, 43843, 43845, 43870, 43880, 43882, 44000]                                                                                                            │
│ Average cumulative reward:       -12.267676652211252                                                                                                                     │
│ Average rollout reward:          -11.891312960444019                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K45/149 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m30.2%[0m Elapsed: [33m0:01:03[0m Remaining: [36m0:02:30[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 45000 ===                                                                                                                                                  │
│ 45001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 44922, 44924, 44929, 44941, 45000]                                                                                                                          │
│ Average cumulative reward:       -12.450229874844606                                                                                                                     │
│ Average rollout reward:          -12.051387940898302                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━[0m [35m30.2%[0m Elapsed: [33m0:01:04[0m Remaining: [36m0:02:30[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 45000 ===                                                                                                                                                  │
│ 45001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 44922, 44924, 44929, 44941, 45000]                                                                                                                          │
│ Average cumulative reward:       -12.450229874844606                                                                                                                     │
│ Average rollout reward:          -12.051387940898302                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K45/149 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m30.2%[0m Elapsed: [33m0:01:05[0m Remaining: [36m0:02:30[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 45000 ===                                                                                                                                                  │
│ 45001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 44922, 44924, 44929, 44941, 45000]                                                                                                                          │
│ Average cumulative reward:       -12.450229874844606                                                                                                                     │
│ Average rollout reward:          -12.051387940898302                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K46/149 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m30.9%[0m Elapsed: [33m0:01:05[0m Remaining: [36m0:02:29[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 46000 ===                                                                                                                                                  │
│ 46001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 43394, 43410, 45644, 45672, 45956, 46000]                                                                                                                   │
│ Average cumulative reward:       -12.801254365080474                                                                                                                     │
│ Average rollout reward:          -12.467549236961203                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K46/149 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m30.9%[0m Elapsed: [33m0:01:06[0m Remaining: [36m0:02:29[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 46000 ===                                                                                                                                                  │
│ 46001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 43394, 43410, 45644, 45672, 45956, 46000]                                                                                                                   │
│ Average cumulative reward:       -12.801254365080474                                                                                                                     │
│ Average rollout reward:          -12.467549236961203                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K46/149 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m30.9%[0m Elapsed: [33m0:01:06[0m Remaining: [36m0:02:29[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 46000 ===                                                                                                                                                  │
│ 46001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 43394, 43410, 45644, 45672, 45956, 46000]                                                                                                                   │
│ Average cumulative reward:       -12.801254365080474                                                                                                                     │
│ Average rollout reward:          -12.467549236961203                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K47/149 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m31.5%[0m Elapsed: [33m0:01:07[0m Remaining: [36m0:02:28[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 47000 ===                                                                                                                                                  │
│ 47001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 17840, 46983, 46987, 46993, 47000]                                                                                                                          │
│ Average cumulative reward:       -12.909784026756602                                                                                                                     │
│ Average rollout reward:          -12.503149438939646                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K47/149 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m31.5%[0m Elapsed: [33m0:01:07[0m Remaining: [36m0:02:28[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 47000 ===                                                                                                                                                  │
│ 47001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 17840, 46983, 46987, 46993, 47000]                                                                                                                          │
│ Average cumulative reward:       -12.909784026756602                                                                                                                     │
│ Average rollout reward:          -12.503149438939646                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K47/149 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m31.5%[0m Elapsed: [33m0:01:08[0m Remaining: [36m0:02:28[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 47000 ===                                                                                                                                                  │
│ 47001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 17840, 46983, 46987, 46993, 47000]                                                                                                                          │
│ Average cumulative reward:       -12.909784026756602                                                                                                                     │
│ Average rollout reward:          -12.503149438939646                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K48/149 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m32.2%[0m Elapsed: [33m0:01:08[0m Remaining: [36m0:02:26[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 48000 ===                                                                                                                                                  │
│ 48001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 17840, 47973, 47976, 47993, 48000]                                                                                                                          │
│ Average cumulative reward:       -12.067100581757611                                                                                                                     │
│ Average rollout reward:          -11.616771189213562                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K48/149 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m32.2%[0m Elapsed: [33m0:01:09[0m Remaining: [36m0:02:26[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 48000 ===                                                                                                                                                  │
│ 48001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 17840, 47973, 47976, 47993, 48000]                                                                                                                          │
│ Average cumulative reward:       -12.067100581757611                                                                                                                     │
│ Average rollout reward:          -11.616771189213562                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K48/149 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m32.2%[0m Elapsed: [33m0:01:09[0m Remaining: [36m0:02:26[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 48000 ===                                                                                                                                                  │
│ 48001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 17840, 47973, 47976, 47993, 48000]                                                                                                                          │
│ Average cumulative reward:       -12.067100581757611                                                                                                                     │
│ Average rollout reward:          -11.616771189213562                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K49/149 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m32.9%[0m Elapsed: [33m0:01:10[0m Remaining: [36m0:02:25[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 49000 ===                                                                                                                                                  │
│ 49001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 48882, 48953, 48955, 48967, 48969, 49000]                                                                                                                   │
│ Average cumulative reward:       -12.250108335247495                                                                                                                     │
│ Average rollout reward:          -11.882991741870926                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K49/149 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m32.9%[0m Elapsed: [33m0:01:10[0m Remaining: [36m0:02:25[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 49000 ===                                                                                                                                                  │
│ 49001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 48882, 48953, 48955, 48967, 48969, 49000]                                                                                                                   │
│ Average cumulative reward:       -12.250108335247495                                                                                                                     │
│ Average rollout reward:          -11.882991741870926                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K50/149 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m33.6%[0m Elapsed: [33m0:01:11[0m Remaining: [36m0:02:23[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 50000 ===                                                                                                                                                  │
│ 50001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 49699, 49701, 49704, 49740, 49960, 50000]                                                                                                                   │
│ Average cumulative reward:       -12.006888182507765                                                                                                                     │
│ Average rollout reward:          -11.666937744433291                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K50/149 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m33.6%[0m Elapsed: [33m0:01:11[0m Remaining: [36m0:02:23[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 50000 ===                                                                                                                                                  │
│ 50001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 49699, 49701, 49704, 49740, 49960, 50000]                                                                                                                   │
│ Average cumulative reward:       -12.006888182507765                                                                                                                     │
│ Average rollout reward:          -11.666937744433291                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K50/149 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m33.6%[0m Elapsed: [33m0:01:12[0m Remaining: [36m0:02:23[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 50000 ===                                                                                                                                                  │
│ 50001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 49699, 49701, 49704, 49740, 49960, 50000]                                                                                                                   │
│ Average cumulative reward:       -12.006888182507765                                                                                                                     │
│ Average rollout reward:          -11.666937744433291                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K51/149 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m34.2%[0m Elapsed: [33m0:01:12[0m Remaining: [36m0:02:21[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 51000 ===                                                                                                                                                  │
│ 51001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 50524, 50973, 50977, 50999, 51000]                                                                                                                          │
│ Average cumulative reward:       -11.867443040232956                                                                                                                     │
│ Average rollout reward:          -11.49398871269589                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K51/149 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m34.2%[0m Elapsed: [33m0:01:13[0m Remaining: [36m0:02:21[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 51000 ===                                                                                                                                                  │
│ 51001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 50524, 50973, 50977, 50999, 51000]                                                                                                                          │
│ Average cumulative reward:       -11.867443040232956                                                                                                                     │
│ Average rollout reward:          -11.49398871269589                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K51/149 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m34.2%[0m Elapsed: [33m0:01:13[0m Remaining: [36m0:02:21[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 51000 ===                                                                                                                                                  │
│ 51001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 50524, 50973, 50977, 50999, 51000]                                                                                                                          │
│ Average cumulative reward:       -11.867443040232956                                                                                                                     │
│ Average rollout reward:          -11.49398871269589                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K52/149 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m34.9%[0m Elapsed: [33m0:01:14[0m Remaining: [36m0:02:19[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 52000 ===                                                                                                                                                  │
│ 52001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 40434, 40677, 43383, 46805, 46818, 52000]                                                                                                                   │
│ Average cumulative reward:       -11.950101846363681                                                                                                                     │
│ Average rollout reward:          -11.55826744823228                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━━━━━━━━[0m [35m34.9%[0m Elapsed: [33m0:01:14[0m Remaining: [36m0:02:19[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 52000 ===                                                                                                                                                  │
│ 52001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 40434, 40677, 43383, 46805, 46818, 52000]                                                                                                                   │
│ Average cumulative reward:       -11.950101846363681                                                                                                                     │
│ Average rollout reward:          -11.55826744823228                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K52/149 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m34.9%[0m Elapsed: [33m0:01:15[0m Remaining: [36m0:02:19[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 52000 ===                                                                                                                                                  │
│ 52001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 40434, 40677, 43383, 46805, 46818, 52000]                                                                                                                   │
│ Average cumulative reward:       -11.950101846363681                                                                                                                     │
│ Average rollout reward:          -11.55826744823228                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K53/149 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m35.6%[0m Elapsed: [33m0:01:15[0m Remaining: [36m0:02:19[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 53000 ===                                                                                                                                                  │
│ 53001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 23630, 37374, 39293, 53000]                                                                                                                                 │
│ Average cumulative reward:       -13.188491368410967                                                                                                                     │
│ Average rollout reward:          -12.823676527950765                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K53/149 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m35.6%[0m Elapsed: [33m0:01:16[0m Remaining: [36m0:02:19[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 53000 ===                                                                                                                                                  │
│ 53001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 23630, 37374, 39293, 53000]                                                                                                                                 │
│ Average cumulative reward:       -13.188491368410967                                                                                                                     │
│ Average rollout reward:          -12.823676527950765                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K53/149 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m35.6%[0m Elapsed: [33m0:01:16[0m Remaining: [36m0:02:19[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 53000 ===                                                                                                                                                  │
│ 53001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 23630, 37374, 39293, 53000]                                                                                                                                 │
│ Average cumulative reward:       -13.188491368410967                                                                                                                     │
│ Average rollout reward:          -12.823676527950765                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K54/149 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m36.2%[0m Elapsed: [33m0:01:17[0m Remaining: [36m0:02:17[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 54000 ===                                                                                                                                                  │
│ 54001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 53903, 53913, 53915, 53920, 53924, 54000]                                                                                                                   │
│ Average cumulative reward:       -12.257237775276568                                                                                                                     │
│ Average rollout reward:          -11.906131846385298                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K54/149 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m36.2%[0m Elapsed: [33m0:01:17[0m Remaining: [36m0:02:17[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 54000 ===                                                                                                                                                  │
│ 54001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 53903, 53913, 53915, 53920, 53924, 54000]                                                                                                                   │
│ Average cumulative reward:       -12.257237775276568                                                                                                                     │
│ Average rollout reward:          -11.906131846385298                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K54/149 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m36.2%[0m Elapsed: [33m0:01:18[0m Remaining: [36m0:02:17[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 54000 ===                                                                                                                                                  │
│ 54001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 53903, 53913, 53915, 53920, 53924, 54000]                                                                                                                   │
│ Average cumulative reward:       -12.257237775276568                                                                                                                     │
│ Average rollout reward:          -11.906131846385298                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K55/149 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m36.9%[0m Elapsed: [33m0:01:18[0m Remaining: [36m0:02:15[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 55000 ===                                                                                                                                                  │
│ 55001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 32295, 34338, 34361, 34575, 55000]                                                                                                                          │
│ Average cumulative reward:       -12.048180783588881                                                                                                                     │
│ Average rollout reward:          -11.676554761489628                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K55/149 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m36.9%[0m Elapsed: [33m0:01:19[0m Remaining: [36m0:02:15[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 55000 ===                                                                                                                                                  │
│ 55001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 32295, 34338, 34361, 34575, 55000]                                                                                                                          │
│ Average cumulative reward:       -12.048180783588881                                                                                                                     │
│ Average rollout reward:          -11.676554761489628                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K56/149 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m37.6%[0m Elapsed: [33m0:01:19[0m Remaining: [36m0:02:14[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 56000 ===                                                                                                                                                  │
│ 56001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 13372, 55985, 55987, 55997, 56000]                                                                                                                          │
│ Average cumulative reward:       -12.13036931459292                                                                                                                      │
│ Average rollout reward:          -11.690990006422023                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K56/149 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m37.6%[0m Elapsed: [33m0:01:20[0m Remaining: [36m0:02:14[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 56000 ===                                                                                                                                                  │
│ 56001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 13372, 55985, 55987, 55997, 56000]                                                                                                                          │
│ Average cumulative reward:       -12.13036931459292                                                                                                                      │
│ Average rollout reward:          -11.690990006422023                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K56/149 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m37.6%[0m Elapsed: [33m0:01:20[0m Remaining: [36m0:02:14[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 56000 ===                                                                                                                                                  │
│ 56001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 13372, 55985, 55987, 55997, 56000]                                                                                                                          │
│ Average cumulative reward:       -12.13036931459292                                                                                                                      │
│ Average rollout reward:          -11.690990006422023                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K57/149 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m38.3%[0m Elapsed: [33m0:01:21[0m Remaining: [36m0:02:12[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 57000 ===                                                                                                                                                  │
│ 57001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 13372, 55502, 55506, 55627, 55811, 55923, 56979, 57000]                                                                                                     │
│ Average cumulative reward:       -12.397320873626994                                                                                                                     │
│ Average rollout reward:          -11.971878842545298                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K57/149 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m38.3%[0m Elapsed: [33m0:01:21[0m Remaining: [36m0:02:12[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 57000 ===                                                                                                                                                  │
│ 57001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 13372, 55502, 55506, 55627, 55811, 55923, 56979, 57000]                                                                                                     │
│ Average cumulative reward:       -12.397320873626994                                                                                                                     │
│ Average rollout reward:          -11.971878842545298                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K57/149 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m38.3%[0m Elapsed: [33m0:01:22[0m Remaining: [36m0:02:12[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 57000 ===                                                                                                                                                  │
│ 57001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 13372, 55502, 55506, 55627, 55811, 55923, 56979, 57000]                                                                                                     │
│ Average cumulative reward:       -12.397320873626994                                                                                                                     │
│ Average rollout reward:          -11.971878842545298                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K58/149 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m38.9%[0m Elapsed: [33m0:01:22[0m Remaining: [36m0:02:11[0m   1.42 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 58000 ===                                                                                                                                                  │
│ 58001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 57416, 57733, 57797, 57994, 58000]                                                                                                                          │
│ Average cumulative reward:       -13.748561642175883                                                                                                                     │
│ Average rollout reward:          -13.350600699090698                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K58/149 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m38.9%[0m Elapsed: [33m0:01:23[0m Remaining: [36m0:02:11[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 58000 ===                                                                                                                                                  │
│ 58001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 57416, 57733, 57797, 57994, 58000]                                                                                                                          │
│ Average cumulative reward:       -13.748561642175883                                                                                                                     │
│ Average rollout reward:          -13.350600699090698                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K58/149 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m38.9%[0m Elapsed: [33m0:01:23[0m Remaining: [36m0:02:11[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 58000 ===                                                                                                                                                  │
│ 58001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 57416, 57733, 57797, 57994, 58000]                                                                                                                          │
│ Average cumulative reward:       -13.748561642175883                                                                                                                     │
│ Average rollout reward:          -13.350600699090698                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K59/149 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m39.6%[0m Elapsed: [33m0:01:24[0m Remaining: [36m0:02:10[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 59000 ===                                                                                                                                                  │
│ 59001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 177, 57256, 57274, 58855, 58901, 59000]                                                                                                                     │
│ Average cumulative reward:       -12.84855270073108                                                                                                                      │
│ Average rollout reward:          -12.447146098800085                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━━━━━━━━[0m [35m39.6%[0m Elapsed: [33m0:01:24[0m Remaining: [36m0:02:10[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 59000 ===                                                                                                                                                  │
│ 59001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 177, 57256, 57274, 58855, 58901, 59000]                                                                                                                     │
│ Average cumulative reward:       -12.84855270073108                                                                                                                      │
│ Average rollout reward:          -12.447146098800085                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K59/149 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m39.6%[0m Elapsed: [33m0:01:25[0m Remaining: [36m0:02:10[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 59000 ===                                                                                                                                                  │
│ 59001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 177, 57256, 57274, 58855, 58901, 59000]                                                                                                                     │
│ Average cumulative reward:       -12.84855270073108                                                                                                                      │
│ Average rollout reward:          -12.447146098800085                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K60/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m40.3%[0m Elapsed: [33m0:01:25[0m Remaining: [36m0:02:09[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 60000 ===                                                                                                                                                  │
│ 60001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 345, 59932, 59934, 59995, 60000]                                                                                                                            │
│ Average cumulative reward:       -12.673470006550234                                                                                                                     │
│ Average rollout reward:          -12.278976823854569                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K60/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m40.3%[0m Elapsed: [33m0:01:26[0m Remaining: [36m0:02:09[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 60000 ===                                                                                                                                                  │
│ 60001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 345, 59932, 59934, 59995, 60000]                                                                                                                            │
│ Average cumulative reward:       -12.673470006550234                                                                                                                     │
│ Average rollout reward:          -12.278976823854569                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K60/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m40.3%[0m Elapsed: [33m0:01:26[0m Remaining: [36m0:02:09[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 60000 ===                                                                                                                                                  │
│ 60001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 345, 59932, 59934, 59995, 60000]                                                                                                                            │
│ Average cumulative reward:       -12.673470006550234                                                                                                                     │
│ Average rollout reward:          -12.278976823854569                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K61/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m40.9%[0m Elapsed: [33m0:01:27[0m Remaining: [36m0:02:08[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 61000 ===                                                                                                                                                  │
│ 61001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 345, 59808, 59810, 60061, 60669, 60703, 61000]                                                                                                              │
│ Average cumulative reward:       -12.705358210929292                                                                                                                     │
│ Average rollout reward:          -12.310969202931311                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K61/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m40.9%[0m Elapsed: [33m0:01:27[0m Remaining: [36m0:02:08[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 61000 ===                                                                                                                                                  │
│ 61001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 345, 59808, 59810, 60061, 60669, 60703, 61000]                                                                                                              │
│ Average cumulative reward:       -12.705358210929292                                                                                                                     │
│ Average rollout reward:          -12.310969202931311                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K61/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m40.9%[0m Elapsed: [33m0:01:28[0m Remaining: [36m0:02:08[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 61000 ===                                                                                                                                                  │
│ 61001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 345, 59808, 59810, 60061, 60669, 60703, 61000]                                                                                                              │
│ Average cumulative reward:       -12.705358210929292                                                                                                                     │
│ Average rollout reward:          -12.310969202931311                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K62/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m41.6%[0m Elapsed: [33m0:01:28[0m Remaining: [36m0:02:06[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 62000 ===                                                                                                                                                  │
│ 62001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 61994, 62000]                                                                                                                                               │
│ Average cumulative reward:       -12.353199438022914                                                                                                                     │
│ Average rollout reward:          -11.93087037645891                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K62/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m41.6%[0m Elapsed: [33m0:01:29[0m Remaining: [36m0:02:06[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 62000 ===                                                                                                                                                  │
│ 62001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 61994, 62000]                                                                                                                                               │
│ Average cumulative reward:       -12.353199438022914                                                                                                                     │
│ Average rollout reward:          -11.93087037645891                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K62/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m41.6%[0m Elapsed: [33m0:01:29[0m Remaining: [36m0:02:06[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 62000 ===                                                                                                                                                  │
│ 62001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 61994, 62000]                                                                                                                                               │
│ Average cumulative reward:       -12.353199438022914                                                                                                                     │
│ Average rollout reward:          -11.93087037645891                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K63/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m42.3%[0m Elapsed: [33m0:01:30[0m Remaining: [36m0:02:05[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 63000 ===                                                                                                                                                  │
│ 63001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 19196, 62975, 62979, 62980, 62983, 63000]                                                                                                                   │
│ Average cumulative reward:       -12.711412667042408                                                                                                                     │
│ Average rollout reward:          -12.290493806437206                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K63/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m42.3%[0m Elapsed: [33m0:01:30[0m Remaining: [36m0:02:05[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 63000 ===                                                                                                                                                  │
│ 63001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 19196, 62975, 62979, 62980, 62983, 63000]                                                                                                                   │
│ Average cumulative reward:       -12.711412667042408                                                                                                                     │
│ Average rollout reward:          -12.290493806437206                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K63/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m42.3%[0m Elapsed: [33m0:01:31[0m Remaining: [36m0:02:05[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 63000 ===                                                                                                                                                  │
│ 63001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 19196, 62975, 62979, 62980, 62983, 63000]                                                                                                                   │
│ Average cumulative reward:       -12.711412667042408                                                                                                                     │
│ Average rollout reward:          -12.290493806437206                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K64/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m43.0%[0m Elapsed: [33m0:01:31[0m Remaining: [36m0:02:04[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 64000 ===                                                                                                                                                  │
│ 64001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 19196, 19304, 64000]                                                                                                                                        │
│ Average cumulative reward:       -12.393911996765292                                                                                                                     │
│ Average rollout reward:          -11.949900807347223                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K64/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m43.0%[0m Elapsed: [33m0:01:32[0m Remaining: [36m0:02:04[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 64000 ===                                                                                                                                                  │
│ 64001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 19196, 19304, 64000]                                                                                                                                        │
│ Average cumulative reward:       -12.393911996765292                                                                                                                     │
│ Average rollout reward:          -11.949900807347223                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K64/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m43.0%[0m Elapsed: [33m0:01:32[0m Remaining: [36m0:02:04[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 64000 ===                                                                                                                                                  │
│ 64001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 19196, 19304, 64000]                                                                                                                                        │
│ Average cumulative reward:       -12.393911996765292                                                                                                                     │
│ Average rollout reward:          -11.949900807347223                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K65/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m43.6%[0m Elapsed: [33m0:01:33[0m Remaining: [36m0:02:02[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 65000 ===                                                                                                                                                  │
│ 65001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 64845, 65000]                                                                                                                                               │
│ Average cumulative reward:       -12.45387902148687                                                                                                                      │
│ Average rollout reward:          -12.031382694877587                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K65/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m43.6%[0m Elapsed: [33m0:01:33[0m Remaining: [36m0:02:02[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 65000 ===                                                                                                                                                  │
│ 65001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 64845, 65000]                                                                                                                                               │
│ Average cumulative reward:       -12.45387902148687                                                                                                                      │
│ Average rollout reward:          -12.031382694877587                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K66/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m44.3%[0m Elapsed: [33m0:01:34[0m Remaining: [36m0:02:00[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 66000 ===                                                                                                                                                  │
│ 66001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 65811, 65993, 65996, 66000]                                                                                                                                 │
│ Average cumulative reward:       -12.286955667446533                                                                                                                     │
│ Average rollout reward:          -11.897142523213008                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━━━━━━━━[0m [35m44.3%[0m Elapsed: [33m0:01:34[0m Remaining: [36m0:02:00[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 66000 ===                                                                                                                                                  │
│ 66001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 65811, 65993, 65996, 66000]                                                                                                                                 │
│ Average cumulative reward:       -12.286955667446533                                                                                                                     │
│ Average rollout reward:          -11.897142523213008                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K66/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m44.3%[0m Elapsed: [33m0:01:35[0m Remaining: [36m0:02:00[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 66000 ===                                                                                                                                                  │
│ 66001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 65811, 65993, 65996, 66000]                                                                                                                                 │
│ Average cumulative reward:       -12.286955667446533                                                                                                                     │
│ Average rollout reward:          -11.897142523213008                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K66/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m44.3%[0m Elapsed: [33m0:01:35[0m Remaining: [36m0:02:00[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 66000 ===                                                                                                                                                  │
│ 66001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 65811, 65993, 65996, 66000]                                                                                                                                 │
│ Average cumulative reward:       -12.286955667446533                                                                                                                     │
│ Average rollout reward:          -11.897142523213008                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K67/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m45.0%[0m Elapsed: [33m0:01:36[0m Remaining: [36m0:01:59[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 67000 ===                                                                                                                                                  │
│ 67001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 66787, 66917, 66919, 66984, 66987, 67000]                                                                                                                   │
│ Average cumulative reward:       -12.379826900782822                                                                                                                     │
│ Average rollout reward:          -12.004785287810341                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K67/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m45.0%[0m Elapsed: [33m0:01:36[0m Remaining: [36m0:01:59[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 67000 ===                                                                                                                                                  │
│ 67001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 66787, 66917, 66919, 66984, 66987, 67000]                                                                                                                   │
│ Average cumulative reward:       -12.379826900782822                                                                                                                     │
│ Average rollout reward:          -12.004785287810341                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K68/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m45.6%[0m Elapsed: [33m0:01:37[0m Remaining: [36m0:01:58[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 68000 ===                                                                                                                                                  │
│ 68001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 67771, 67860, 67863, 68000]                                                                                                                                 │
│ Average cumulative reward:       -12.27063142268424                                                                                                                      │
│ Average rollout reward:          -11.899795738304096                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K68/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m45.6%[0m Elapsed: [33m0:01:37[0m Remaining: [36m0:01:58[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 68000 ===                                                                                                                                                  │
│ 68001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 67771, 67860, 67863, 68000]                                                                                                                                 │
│ Average cumulative reward:       -12.27063142268424                                                                                                                      │
│ Average rollout reward:          -11.899795738304096                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K68/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m45.6%[0m Elapsed: [33m0:01:38[0m Remaining: [36m0:01:58[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 68000 ===                                                                                                                                                  │
│ 68001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 67771, 67860, 67863, 68000]                                                                                                                                 │
│ Average cumulative reward:       -12.27063142268424                                                                                                                      │
│ Average rollout reward:          -11.899795738304096                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K69/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m46.3%[0m Elapsed: [33m0:01:38[0m Remaining: [36m0:01:56[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 69000 ===                                                                                                                                                  │
│ 69001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 43394, 45680, 45682, 54657, 54965, 65713, 69000]                                                                                                            │
│ Average cumulative reward:       -11.923644540205613                                                                                                                     │
│ Average rollout reward:          -11.51922272367049                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K69/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m46.3%[0m Elapsed: [33m0:01:39[0m Remaining: [36m0:01:56[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 69000 ===                                                                                                                                                  │
│ 69001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 43394, 45680, 45682, 54657, 54965, 65713, 69000]                                                                                                            │
│ Average cumulative reward:       -11.923644540205613                                                                                                                     │
│ Average rollout reward:          -11.51922272367049                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K69/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m46.3%[0m Elapsed: [33m0:01:39[0m Remaining: [36m0:01:56[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 69000 ===                                                                                                                                                  │
│ 69001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 43394, 45680, 45682, 54657, 54965, 65713, 69000]                                                                                                            │
│ Average cumulative reward:       -11.923644540205613                                                                                                                     │
│ Average rollout reward:          -11.51922272367049                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K70/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m47.0%[0m Elapsed: [33m0:01:40[0m Remaining: [36m0:01:55[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 70000 ===                                                                                                                                                  │
│ 70001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 69765, 69854, 69856, 69902, 70000]                                                                                                                          │
│ Average cumulative reward:       -12.628799423140608                                                                                                                     │
│ Average rollout reward:          -12.213835178211319                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K70/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m47.0%[0m Elapsed: [33m0:01:40[0m Remaining: [36m0:01:55[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 70000 ===                                                                                                                                                  │
│ 70001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 69765, 69854, 69856, 69902, 70000]                                                                                                                          │
│ Average cumulative reward:       -12.628799423140608                                                                                                                     │
│ Average rollout reward:          -12.213835178211319                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K70/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m47.0%[0m Elapsed: [33m0:01:41[0m Remaining: [36m0:01:55[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 70000 ===                                                                                                                                                  │
│ 70001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 69765, 69854, 69856, 69902, 70000]                                                                                                                          │
│ Average cumulative reward:       -12.628799423140608                                                                                                                     │
│ Average rollout reward:          -12.213835178211319                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K71/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m47.7%[0m Elapsed: [33m0:01:41[0m Remaining: [36m0:01:54[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 71000 ===                                                                                                                                                  │
│ 71001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 8489, 8619, 8623, 8658, 71000]                                                                                                                              │
│ Average cumulative reward:       -12.29857435232899                                                                                                                      │
│ Average rollout reward:          -11.953919249859739                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K71/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m47.7%[0m Elapsed: [33m0:01:42[0m Remaining: [36m0:01:54[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 71000 ===                                                                                                                                                  │
│ 71001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 8489, 8619, 8623, 8658, 71000]                                                                                                                              │
│ Average cumulative reward:       -12.29857435232899                                                                                                                      │
│ Average rollout reward:          -11.953919249859739                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K71/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m47.7%[0m Elapsed: [33m0:01:42[0m Remaining: [36m0:01:54[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 71000 ===                                                                                                                                                  │
│ 71001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 8489, 8619, 8623, 8658, 71000]                                                                                                                              │
│ Average cumulative reward:       -12.29857435232899                                                                                                                      │
│ Average rollout reward:          -11.953919249859739                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K72/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m48.3%[0m Elapsed: [33m0:01:43[0m Remaining: [36m0:01:52[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 72000 ===                                                                                                                                                  │
│ 72001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 71794, 71924, 71928, 71958, 72000]                                                                                                                          │
│ Average cumulative reward:       -12.705572202011798                                                                                                                     │
│ Average rollout reward:          -12.307759804580916                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K72/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m48.3%[0m Elapsed: [33m0:01:43[0m Remaining: [36m0:01:52[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 72000 ===                                                                                                                                                  │
│ 72001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 71794, 71924, 71928, 71958, 72000]                                                                                                                          │
│ Average cumulative reward:       -12.705572202011798                                                                                                                     │
│ Average rollout reward:          -12.307759804580916                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K72/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m48.3%[0m Elapsed: [33m0:01:44[0m Remaining: [36m0:01:52[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 72000 ===                                                                                                                                                  │
│ 72001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 71794, 71924, 71928, 71958, 72000]                                                                                                                          │
│ Average cumulative reward:       -12.705572202011798                                                                                                                     │
│ Average rollout reward:          -12.307759804580916                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━━━━━━━━[0m [35m49.0%[0m Elapsed: [33m0:01:44[0m Remaining: [36m0:01:51[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 73000 ===                                                                                                                                                  │
│ 73001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 72822, 72930, 72932, 73000]                                                                                                                                 │
│ Average cumulative reward:       -12.93043480354269                                                                                                                      │
│ Average rollout reward:          -12.575774969247547                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K73/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m49.0%[0m Elapsed: [33m0:01:45[0m Remaining: [36m0:01:51[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 73000 ===                                                                                                                                                  │
│ 73001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 72822, 72930, 72932, 73000]                                                                                                                                 │
│ Average cumulative reward:       -12.93043480354269                                                                                                                      │
│ Average rollout reward:          -12.575774969247547                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K73/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m49.0%[0m Elapsed: [33m0:01:45[0m Remaining: [36m0:01:51[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 73000 ===                                                                                                                                                  │
│ 73001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 72822, 72930, 72932, 73000]                                                                                                                                 │
│ Average cumulative reward:       -12.93043480354269                                                                                                                      │
│ Average rollout reward:          -12.575774969247547                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K74/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m49.7%[0m Elapsed: [33m0:01:46[0m Remaining: [36m0:01:50[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 74000 ===                                                                                                                                                  │
│ 74001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 43394, 43502, 43504, 73969, 74000]                                                                                                                          │
│ Average cumulative reward:       -12.293732765481925                                                                                                                     │
│ Average rollout reward:          -11.881874992558284                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K74/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m49.7%[0m Elapsed: [33m0:01:46[0m Remaining: [36m0:01:50[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 74000 ===                                                                                                                                                  │
│ 74001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 43394, 43502, 43504, 73969, 74000]                                                                                                                          │
│ Average cumulative reward:       -12.293732765481925                                                                                                                     │
│ Average rollout reward:          -11.881874992558284                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K74/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m49.7%[0m Elapsed: [33m0:01:47[0m Remaining: [36m0:01:50[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 74000 ===                                                                                                                                                  │
│ 74001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 43394, 43502, 43504, 73969, 74000]                                                                                                                          │
│ Average cumulative reward:       -12.293732765481925                                                                                                                     │
│ Average rollout reward:          -11.881874992558284                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K75/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m50.3%[0m Elapsed: [33m0:01:47[0m Remaining: [36m0:01:48[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 75000 ===                                                                                                                                                  │
│ 75001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 74904, 74993, 74996, 75000]                                                                                                                                 │
│ Average cumulative reward:       -12.347004505364229                                                                                                                     │
│ Average rollout reward:          -11.936191280019559                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K75/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m50.3%[0m Elapsed: [33m0:01:48[0m Remaining: [36m0:01:48[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 75000 ===                                                                                                                                                  │
│ 75001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 74904, 74993, 74996, 75000]                                                                                                                                 │
│ Average cumulative reward:       -12.347004505364229                                                                                                                     │
│ Average rollout reward:          -11.936191280019559                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K76/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m51.0%[0m Elapsed: [33m0:01:48[0m Remaining: [36m0:01:47[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 76000 ===                                                                                                                                                  │
│ 76001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 24158, 39395, 39404, 40343, 40347, 76000]                                                                                                                   │
│ Average cumulative reward:       -12.226317325008985                                                                                                                     │
│ Average rollout reward:          -11.827407326700303                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K76/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m51.0%[0m Elapsed: [33m0:01:49[0m Remaining: [36m0:01:47[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 76000 ===                                                                                                                                                  │
│ 76001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 24158, 39395, 39404, 40343, 40347, 76000]                                                                                                                   │
│ Average cumulative reward:       -12.226317325008985                                                                                                                     │
│ Average rollout reward:          -11.827407326700303                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K76/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m51.0%[0m Elapsed: [33m0:01:49[0m Remaining: [36m0:01:47[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 76000 ===                                                                                                                                                  │
│ 76001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 24158, 39395, 39404, 40343, 40347, 76000]                                                                                                                   │
│ Average cumulative reward:       -12.226317325008985                                                                                                                     │
│ Average rollout reward:          -11.827407326700303                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K77/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m51.7%[0m Elapsed: [33m0:01:50[0m Remaining: [36m0:01:46[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 77000 ===                                                                                                                                                  │
│ 77001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 66787, 67340, 67342, 67387, 77000]                                                                                                                          │
│ Average cumulative reward:       -12.796354787746607                                                                                                                     │
│ Average rollout reward:          -12.339664931270867                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K77/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m51.7%[0m Elapsed: [33m0:01:50[0m Remaining: [36m0:01:46[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 77000 ===                                                                                                                                                  │
│ 77001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 66787, 67340, 67342, 67387, 77000]                                                                                                                          │
│ Average cumulative reward:       -12.796354787746607                                                                                                                     │
│ Average rollout reward:          -12.339664931270867                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K77/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m51.7%[0m Elapsed: [33m0:01:51[0m Remaining: [36m0:01:46[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 77000 ===                                                                                                                                                  │
│ 77001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 66787, 67340, 67342, 67387, 77000]                                                                                                                          │
│ Average cumulative reward:       -12.796354787746607                                                                                                                     │
│ Average rollout reward:          -12.339664931270867                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K78/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m52.3%[0m Elapsed: [33m0:01:51[0m Remaining: [36m0:01:44[0m   1.43 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 78000 ===                                                                                                                                                  │
│ 78001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 345, 60803, 60805, 60837, 60838, 60844, 78000]                                                                                                              │
│ Average cumulative reward:       -12.661786779094404                                                                                                                     │
│ Average rollout reward:          -12.230469326554049                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K78/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m52.3%[0m Elapsed: [33m0:01:52[0m Remaining: [36m0:01:44[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 78000 ===                                                                                                                                                  │
│ 78001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 345, 60803, 60805, 60837, 60838, 60844, 78000]                                                                                                              │
│ Average cumulative reward:       -12.661786779094404                                                                                                                     │
│ Average rollout reward:          -12.230469326554049                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K78/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m52.3%[0m Elapsed: [33m0:01:52[0m Remaining: [36m0:01:44[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 78000 ===                                                                                                                                                  │
│ 78001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 345, 60803, 60805, 60837, 60838, 60844, 78000]                                                                                                              │
│ Average cumulative reward:       -12.661786779094404                                                                                                                     │
│ Average rollout reward:          -12.230469326554049                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K79/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m53.0%[0m Elapsed: [33m0:01:53[0m Remaining: [36m0:01:43[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 79000 ===                                                                                                                                                  │
│ 79001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 17840, 78554, 78575, 79000]                                                                                                                                 │
│ Average cumulative reward:       -12.393320120152223                                                                                                                     │
│ Average rollout reward:          -11.967260071882398                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K79/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m53.0%[0m Elapsed: [33m0:01:53[0m Remaining: [36m0:01:43[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 79000 ===                                                                                                                                                  │
│ 79001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 17840, 78554, 78575, 79000]                                                                                                                                 │
│ Average cumulative reward:       -12.393320120152223                                                                                                                     │
│ Average rollout reward:          -11.967260071882398                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K79/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m53.0%[0m Elapsed: [33m0:01:54[0m Remaining: [36m0:01:43[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 79000 ===                                                                                                                                                  │
│ 79001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 17840, 78554, 78575, 79000]                                                                                                                                 │
│ Average cumulative reward:       -12.393320120152223                                                                                                                     │
│ Average rollout reward:          -11.967260071882398                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━[0m [35m53.7%[0m Elapsed: [33m0:01:54[0m Remaining: [36m0:01:41[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 80000 ===                                                                                                                                                  │
│ 80001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 19196, 78555, 78576, 80000]                                                                                                                                 │
│ Average cumulative reward:       -13.334511280520092                                                                                                                     │
│ Average rollout reward:          -12.94310184362629                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K80/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m53.7%[0m Elapsed: [33m0:01:55[0m Remaining: [36m0:01:41[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 80000 ===                                                                                                                                                  │
│ 80001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 19196, 78555, 78576, 80000]                                                                                                                                 │
│ Average cumulative reward:       -13.334511280520092                                                                                                                     │
│ Average rollout reward:          -12.94310184362629                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K80/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m53.7%[0m Elapsed: [33m0:01:55[0m Remaining: [36m0:01:41[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 80000 ===                                                                                                                                                  │
│ 80001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 19196, 78555, 78576, 80000]                                                                                                                                 │
│ Average cumulative reward:       -13.334511280520092                                                                                                                     │
│ Average rollout reward:          -12.94310184362629                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K81/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m54.4%[0m Elapsed: [33m0:01:56[0m Remaining: [36m0:01:40[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 81000 ===                                                                                                                                                  │
│ 81001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 80265, 80999, 81000]                                                                                                                                        │
│ Average cumulative reward:       -12.140032173047713                                                                                                                     │
│ Average rollout reward:          -11.766279918288411                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K81/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m54.4%[0m Elapsed: [33m0:01:56[0m Remaining: [36m0:01:40[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 81000 ===                                                                                                                                                  │
│ 81001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 80265, 80999, 81000]                                                                                                                                        │
│ Average cumulative reward:       -12.140032173047713                                                                                                                     │
│ Average rollout reward:          -11.766279918288411                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K81/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m54.4%[0m Elapsed: [33m0:01:57[0m Remaining: [36m0:01:40[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 81000 ===                                                                                                                                                  │
│ 81001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 80265, 80999, 81000]                                                                                                                                        │
│ Average cumulative reward:       -12.140032173047713                                                                                                                     │
│ Average rollout reward:          -11.766279918288411                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K82/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m55.0%[0m Elapsed: [33m0:01:57[0m Remaining: [36m0:01:38[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 82000 ===                                                                                                                                                  │
│ 82001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 81363, 81419, 81893, 81900, 81908, 82000]                                                                                                                   │
│ Average cumulative reward:       -12.034704905559332                                                                                                                     │
│ Average rollout reward:          -11.685441064222273                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K82/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m55.0%[0m Elapsed: [33m0:01:58[0m Remaining: [36m0:01:38[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 82000 ===                                                                                                                                                  │
│ 82001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 81363, 81419, 81893, 81900, 81908, 82000]                                                                                                                   │
│ Average cumulative reward:       -12.034704905559332                                                                                                                     │
│ Average rollout reward:          -11.685441064222273                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
│ [-2.97856542 -2.97856542 -2.97856542 -2.97856542 -2.71130652 -2.71130652                                                                                                 │
│  -2.71130652 -2.44404763 -2.44404763 -2.44404763]                                                                                                                        │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K82/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m55.0%[0m Elapsed: [33m0:01:58[0m Remaining: [36m0:01:38[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 82000 ===                                                                                                                                                  │
│ 82001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 81363, 81419, 81893, 81900, 81908, 82000]                                                                                                                   │
│ Average cumulative reward:       -12.034704905559332                                                                                                                     │
│ Average rollout reward:          -11.685441064222273                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -3.0604561838953788                                                                                                                             │
│ Best path: [0, 4, 31045, 32165, 32167, 32877, 32878, 32884, 32907, 33534, 34135]                                                                                         │
│ [-2.97856542 -2.97856542 -2.97856542 -2.97856542 -2.71130652 -2.71130652                                                                                                 │
│  -2.71130652 -2.44404763 -2.44404763 -2.44404763]                                                                                                                        │
│ [-2.97856542 -2.97856542 -2.97856542 -2.97856542 -2.71130652 -2.71130652                                                                                                 │
│  -2.71130652 -2.44404763 -2.44404763 -2.44404763 -2.09489797 -2.09489797]                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K83/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m55.7%[0m Elapsed: [33m0:01:59[0m Remaining: [36m0:01:37[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 83000 ===                                                                                                                                                  │
│ 83001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 31045, 31288, 31292, 82302, 82308, 82322, 82327, 83000]                                                                                                     │
│ Average cumulative reward:       -14.025202384489953                                                                                                                     │
│ Average rollout reward:          -13.596689755960824                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K83/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m55.7%[0m Elapsed: [33m0:01:59[0m Remaining: [36m0:01:37[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 83000 ===                                                                                                                                                  │
│ 83001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 31045, 31288, 31292, 82302, 82308, 82322, 82327, 83000]                                                                                                     │
│ Average cumulative reward:       -14.025202384489953                                                                                                                     │
│ Average rollout reward:          -13.596689755960824                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K83/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m55.7%[0m Elapsed: [33m0:02:00[0m Remaining: [36m0:01:37[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 83000 ===                                                                                                                                                  │
│ 83001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 31045, 31288, 31292, 82302, 82308, 82322, 82327, 83000]                                                                                                     │
│ Average cumulative reward:       -14.025202384489953                                                                                                                     │
│ Average rollout reward:          -13.596689755960824                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K84/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m56.4%[0m Elapsed: [33m0:02:00[0m Remaining: [36m0:01:36[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 84000 ===                                                                                                                                                  │
│ 84001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 31045, 83844, 83848, 83984, 84000]                                                                                                                          │
│ Average cumulative reward:       -12.326151618420926                                                                                                                     │
│ Average rollout reward:          -11.896266114103645                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K84/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m56.4%[0m Elapsed: [33m0:02:01[0m Remaining: [36m0:01:36[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 84000 ===                                                                                                                                                  │
│ 84001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 31045, 83844, 83848, 83984, 84000]                                                                                                                          │
│ Average cumulative reward:       -12.326151618420926                                                                                                                     │
│ Average rollout reward:          -11.896266114103645                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K84/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m56.4%[0m Elapsed: [33m0:02:01[0m Remaining: [36m0:01:36[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 84000 ===                                                                                                                                                  │
│ 84001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 31045, 83844, 83848, 83984, 84000]                                                                                                                          │
│ Average cumulative reward:       -12.326151618420926                                                                                                                     │
│ Average rollout reward:          -11.896266114103645                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K85/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m57.0%[0m Elapsed: [33m0:02:02[0m Remaining: [36m0:01:34[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 85000 ===                                                                                                                                                  │
│ 85001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 82909, 85000]                                                                                                     │
│ Average cumulative reward:       -12.779955319211757                                                                                                                     │
│ Average rollout reward:          -12.334431610122024                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K85/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m57.0%[0m Elapsed: [33m0:02:02[0m Remaining: [36m0:01:34[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 85000 ===                                                                                                                                                  │
│ 85001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 82909, 85000]                                                                                                     │
│ Average cumulative reward:       -12.779955319211757                                                                                                                     │
│ Average rollout reward:          -12.334431610122024                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K85/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m57.0%[0m Elapsed: [33m0:02:03[0m Remaining: [36m0:01:34[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 85000 ===                                                                                                                                                  │
│ 85001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 82909, 85000]                                                                                                     │
│ Average cumulative reward:       -12.779955319211757                                                                                                                     │
│ Average rollout reward:          -12.334431610122024                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K86/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m57.7%[0m Elapsed: [33m0:02:03[0m Remaining: [36m0:01:33[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 86000 ===                                                                                                                                                  │
│ 86001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 85848, 85919, 85923, 85974, 86000]                                                                                                                          │
│ Average cumulative reward:       -12.482499480693692                                                                                                                     │
│ Average rollout reward:          -12.057396290213212                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K86/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m57.7%[0m Elapsed: [33m0:02:04[0m Remaining: [36m0:01:33[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 86000 ===                                                                                                                                                  │
│ 86001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 85848, 85919, 85923, 85974, 86000]                                                                                                                          │
│ Average cumulative reward:       -12.482499480693692                                                                                                                     │
│ Average rollout reward:          -12.057396290213212                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━[0m [35m57.7%[0m Elapsed: [33m0:02:05[0m Remaining: [36m0:01:33[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 86000 ===                                                                                                                                                  │
│ 86001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 85848, 85919, 85923, 85974, 86000]                                                                                                                          │
│ Average cumulative reward:       -12.482499480693692                                                                                                                     │
│ Average rollout reward:          -12.057396290213212                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K87/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m58.4%[0m Elapsed: [33m0:02:05[0m Remaining: [36m0:01:32[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 87000 ===                                                                                                                                                  │
│ 87001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 19196, 78555, 78653, 78695, 78854, 87000]                                                                                                                   │
│ Average cumulative reward:       -11.904099960230958                                                                                                                     │
│ Average rollout reward:          -11.510140966573221                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K87/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m58.4%[0m Elapsed: [33m0:02:06[0m Remaining: [36m0:01:32[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 87000 ===                                                                                                                                                  │
│ 87001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 19196, 78555, 78653, 78695, 78854, 87000]                                                                                                                   │
│ Average cumulative reward:       -11.904099960230958                                                                                                                     │
│ Average rollout reward:          -11.510140966573221                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K87/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m58.4%[0m Elapsed: [33m0:02:06[0m Remaining: [36m0:01:32[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 87000 ===                                                                                                                                                  │
│ 87001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 19196, 78555, 78653, 78695, 78854, 87000]                                                                                                                   │
│ Average cumulative reward:       -11.904099960230958                                                                                                                     │
│ Average rollout reward:          -11.510140966573221                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K88/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m59.1%[0m Elapsed: [33m0:02:07[0m Remaining: [36m0:01:30[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 88000 ===                                                                                                                                                  │
│ 88001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 24158, 30543, 30547, 30563, 88000]                                                                                                                          │
│ Average cumulative reward:       -12.100476913311594                                                                                                                     │
│ Average rollout reward:          -11.69571795454952                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K88/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m59.1%[0m Elapsed: [33m0:02:07[0m Remaining: [36m0:01:30[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 88000 ===                                                                                                                                                  │
│ 88001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 24158, 30543, 30547, 30563, 88000]                                                                                                                          │
│ Average cumulative reward:       -12.100476913311594                                                                                                                     │
│ Average rollout reward:          -11.69571795454952                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K88/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m59.1%[0m Elapsed: [33m0:02:08[0m Remaining: [36m0:01:30[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 88000 ===                                                                                                                                                  │
│ 88001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 24158, 30543, 30547, 30563, 88000]                                                                                                                          │
│ Average cumulative reward:       -12.100476913311594                                                                                                                     │
│ Average rollout reward:          -11.69571795454952                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K89/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m59.7%[0m Elapsed: [33m0:02:08[0m Remaining: [36m0:01:29[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 89000 ===                                                                                                                                                  │
│ 89001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 74904, 75705, 75707, 78726, 82207, 85715, 89000]                                                                                                            │
│ Average cumulative reward:       -12.534592991934552                                                                                                                     │
│ Average rollout reward:          -12.118965349824546                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K89/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m59.7%[0m Elapsed: [33m0:02:09[0m Remaining: [36m0:01:29[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 89000 ===                                                                                                                                                  │
│ 89001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 74904, 75705, 75707, 78726, 82207, 85715, 89000]                                                                                                            │
│ Average cumulative reward:       -12.534592991934552                                                                                                                     │
│ Average rollout reward:          -12.118965349824546                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K90/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m60.4%[0m Elapsed: [33m0:02:09[0m Remaining: [36m0:01:28[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 90000 ===                                                                                                                                                  │
│ 90001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 89306, 89414, 90000]                                                                                                                                        │
│ Average cumulative reward:       -12.810422133715127                                                                                                                     │
│ Average rollout reward:          -12.427782221746016                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K90/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m60.4%[0m Elapsed: [33m0:02:10[0m Remaining: [36m0:01:28[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 90000 ===                                                                                                                                                  │
│ 90001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 89306, 89414, 90000]                                                                                                                                        │
│ Average cumulative reward:       -12.810422133715127                                                                                                                     │
│ Average rollout reward:          -12.427782221746016                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K90/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m60.4%[0m Elapsed: [33m0:02:10[0m Remaining: [36m0:01:28[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 90000 ===                                                                                                                                                  │
│ 90001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 89306, 89414, 90000]                                                                                                                                        │
│ Average cumulative reward:       -12.810422133715127                                                                                                                     │
│ Average rollout reward:          -12.427782221746016                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K91/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m61.1%[0m Elapsed: [33m0:02:11[0m Remaining: [36m0:01:26[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 91000 ===                                                                                                                                                  │
│ 91001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 975, 78552, 78573, 85591, 91000]                                                                                                                            │
│ Average cumulative reward:       -12.155316651736374                                                                                                                     │
│ Average rollout reward:          -11.743125774966359                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K91/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m61.1%[0m Elapsed: [33m0:02:11[0m Remaining: [36m0:01:26[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 91000 ===                                                                                                                                                  │
│ 91001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 975, 78552, 78573, 85591, 91000]                                                                                                                            │
│ Average cumulative reward:       -12.155316651736374                                                                                                                     │
│ Average rollout reward:          -11.743125774966359                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K91/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m61.1%[0m Elapsed: [33m0:02:12[0m Remaining: [36m0:01:26[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 91000 ===                                                                                                                                                  │
│ 91001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 975, 78552, 78573, 85591, 91000]                                                                                                                            │
│ Average cumulative reward:       -12.155316651736374                                                                                                                     │
│ Average rollout reward:          -11.743125774966359                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K92/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m61.7%[0m Elapsed: [33m0:02:12[0m Remaining: [36m0:01:25[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 92000 ===                                                                                                                                                  │
│ 92001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 91658, 91840, 91859, 91912, 91914, 92000]                                                                                                                   │
│ Average cumulative reward:       -12.337430432120373                                                                                                                     │
│ Average rollout reward:          -11.90471792137871                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K92/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m61.7%[0m Elapsed: [33m0:02:13[0m Remaining: [36m0:01:25[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 92000 ===                                                                                                                                                  │
│ 92001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 91658, 91840, 91859, 91912, 91914, 92000]                                                                                                                   │
│ Average cumulative reward:       -12.337430432120373                                                                                                                     │
│ Average rollout reward:          -11.90471792137871                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K92/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m61.7%[0m Elapsed: [33m0:02:13[0m Remaining: [36m0:01:25[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 92000 ===                                                                                                                                                  │
│ 92001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 91658, 91840, 91859, 91912, 91914, 92000]                                                                                                                   │
│ Average cumulative reward:       -12.337430432120373                                                                                                                     │
│ Average rollout reward:          -11.90471792137871                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K93/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m62.4%[0m Elapsed: [33m0:02:14[0m Remaining: [36m0:01:24[0m   1.44 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 93000 ===                                                                                                                                                  │
│ 93001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 91658, 92060, 92995, 92997, 93000]                                                                                                                          │
│ Average cumulative reward:       -14.758585598118447                                                                                                                     │
│ Average rollout reward:          -14.30090420795563                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K93/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m62.4%[0m Elapsed: [33m0:02:14[0m Remaining: [36m0:01:24[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 93000 ===                                                                                                                                                  │
│ 93001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 91658, 92060, 92995, 92997, 93000]                                                                                                                          │
│ Average cumulative reward:       -14.758585598118447                                                                                                                     │
│ Average rollout reward:          -14.30090420795563                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K93/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m62.4%[0m Elapsed: [33m0:02:15[0m Remaining: [36m0:01:24[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 93000 ===                                                                                                                                                  │
│ 93001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 91658, 92060, 92995, 92997, 93000]                                                                                                                          │
│ Average cumulative reward:       -14.758585598118447                                                                                                                     │
│ Average rollout reward:          -14.30090420795563                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━━━━━━━━[0m [35m62.4%[0m Elapsed: [33m0:02:15[0m Remaining: [36m0:01:24[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 93000 ===                                                                                                                                                  │
│ 93001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 91658, 92060, 92995, 92997, 93000]                                                                                                                          │
│ Average cumulative reward:       -14.758585598118447                                                                                                                     │
│ Average rollout reward:          -14.30090420795563                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K94/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m63.1%[0m Elapsed: [33m0:02:16[0m Remaining: [36m0:01:23[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 94000 ===                                                                                                                                                  │
│ 94001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 91658, 93644, 93759, 93978, 93981, 93988, 94000]                                                                                                            │
│ Average cumulative reward:       -14.572482755393562                                                                                                                     │
│ Average rollout reward:          -14.140721937882478                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K94/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m63.1%[0m Elapsed: [33m0:02:16[0m Remaining: [36m0:01:23[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 94000 ===                                                                                                                                                  │
│ 94001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 91658, 93644, 93759, 93978, 93981, 93988, 94000]                                                                                                            │
│ Average cumulative reward:       -14.572482755393562                                                                                                                     │
│ Average rollout reward:          -14.140721937882478                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K94/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m63.1%[0m Elapsed: [33m0:02:17[0m Remaining: [36m0:01:23[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 94000 ===                                                                                                                                                  │
│ 94001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 91658, 93644, 93759, 93978, 93981, 93988, 94000]                                                                                                            │
│ Average cumulative reward:       -14.572482755393562                                                                                                                     │
│ Average rollout reward:          -14.140721937882478                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K95/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m63.8%[0m Elapsed: [33m0:02:17[0m Remaining: [36m0:01:22[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 95000 ===                                                                                                                                                  │
│ 95001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 91658, 91664, 91686, 93656, 94721, 94763, 94771, 94792, 94817, 94885, 95000]                                                                                │
│ Average cumulative reward:       -14.973476265486704                                                                                                                     │
│ Average rollout reward:          -14.516924986872707                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K95/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m63.8%[0m Elapsed: [33m0:02:18[0m Remaining: [36m0:01:22[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 95000 ===                                                                                                                                                  │
│ 95001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 91658, 91664, 91686, 93656, 94721, 94763, 94771, 94792, 94817, 94885, 95000]                                                                                │
│ Average cumulative reward:       -14.973476265486704                                                                                                                     │
│ Average rollout reward:          -14.516924986872707                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K95/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m63.8%[0m Elapsed: [33m0:02:18[0m Remaining: [36m0:01:22[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 95000 ===                                                                                                                                                  │
│ 95001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 91658, 91664, 91686, 93656, 94721, 94763, 94771, 94792, 94817, 94885, 95000]                                                                                │
│ Average cumulative reward:       -14.973476265486704                                                                                                                     │
│ Average rollout reward:          -14.516924986872707                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K96/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m64.4%[0m Elapsed: [33m0:02:19[0m Remaining: [36m0:01:21[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 96000 ===                                                                                                                                                  │
│ 96001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 91658, 95728, 95732, 95813, 95817, 95823, 96000]                                                                                                            │
│ Average cumulative reward:       -14.789936667956527                                                                                                                     │
│ Average rollout reward:          -14.332439993309421                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K96/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m64.4%[0m Elapsed: [33m0:02:19[0m Remaining: [36m0:01:21[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 96000 ===                                                                                                                                                  │
│ 96001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 91658, 95728, 95732, 95813, 95817, 95823, 96000]                                                                                                            │
│ Average cumulative reward:       -14.789936667956527                                                                                                                     │
│ Average rollout reward:          -14.332439993309421                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K96/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m64.4%[0m Elapsed: [33m0:02:20[0m Remaining: [36m0:01:21[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 96000 ===                                                                                                                                                  │
│ 96001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 91658, 95728, 95732, 95813, 95817, 95823, 96000]                                                                                                            │
│ Average cumulative reward:       -14.789936667956527                                                                                                                     │
│ Average rollout reward:          -14.332439993309421                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K96/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m64.4%[0m Elapsed: [33m0:02:20[0m Remaining: [36m0:01:21[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 96000 ===                                                                                                                                                  │
│ 96001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 91658, 95728, 95732, 95813, 95817, 95823, 96000]                                                                                                            │
│ Average cumulative reward:       -14.789936667956527                                                                                                                     │
│ Average rollout reward:          -14.332439993309421                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K97/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━[0m [35m65.1%[0m Elapsed: [33m0:02:21[0m Remaining: [36m0:01:20[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 97000 ===                                                                                                                                                  │
│ 97001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 91658, 96101, 96105, 96479, 96677, 96988, 97000]                                                                                                            │
│ Average cumulative reward:       -14.77709440538722                                                                                                                      │
│ Average rollout reward:          -14.292108457874773                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K97/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━[0m [35m65.1%[0m Elapsed: [33m0:02:21[0m Remaining: [36m0:01:20[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 97000 ===                                                                                                                                                  │
│ 97001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 91658, 96101, 96105, 96479, 96677, 96988, 97000]                                                                                                            │
│ Average cumulative reward:       -14.77709440538722                                                                                                                      │
│ Average rollout reward:          -14.292108457874773                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K97/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━[0m [35m65.1%[0m Elapsed: [33m0:02:22[0m Remaining: [36m0:01:20[0m   1.47 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 97000 ===                                                                                                                                                  │
│ 97001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 91658, 96101, 96105, 96479, 96677, 96988, 97000]                                                                                                            │
│ Average cumulative reward:       -14.77709440538722                                                                                                                      │
│ Average rollout reward:          -14.292108457874773                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K98/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━[0m [35m65.8%[0m Elapsed: [33m0:02:22[0m Remaining: [36m0:01:18[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 98000 ===                                                                                                                                                  │
│ 98001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 97698, 97741, 97743, 97764, 97880, 97913, 97955, 98000]                                                                                                     │
│ Average cumulative reward:       -12.49895523766054                                                                                                                      │
│ Average rollout reward:          -12.04896214913247                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K98/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━[0m [35m65.8%[0m Elapsed: [33m0:02:23[0m Remaining: [36m0:01:18[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 98000 ===                                                                                                                                                  │
│ 98001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 97698, 97741, 97743, 97764, 97880, 97913, 97955, 98000]                                                                                                     │
│ Average cumulative reward:       -12.49895523766054                                                                                                                      │
│ Average rollout reward:          -12.04896214913247                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K99/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━[0m [35m66.4%[0m Elapsed: [33m0:02:23[0m Remaining: [36m0:01:16[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 99000 ===                                                                                                                                                  │
│ 99001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 2158, 97867, 99000]                                                                                                                                         │
│ Average cumulative reward:       -12.600581000493806                                                                                                                     │
│ Average rollout reward:          -12.23426752562976                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K99/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━[0m [35m66.4%[0m Elapsed: [33m0:02:24[0m Remaining: [36m0:01:16[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 99000 ===                                                                                                                                                  │
│ 99001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 2158, 97867, 99000]                                                                                                                                         │
│ Average cumulative reward:       -12.600581000493806                                                                                                                     │
│ Average rollout reward:          -12.23426752562976                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K99/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━[0m [35m66.4%[0m Elapsed: [33m0:02:24[0m Remaining: [36m0:01:16[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 99000 ===                                                                                                                                                  │
│ 99001  nodes in tree                                                                                                                                                     │
│ Path: [0, 4, 2158, 97867, 99000]                                                                                                                                         │
│ Average cumulative reward:       -12.600581000493806                                                                                                                     │
│ Average rollout reward:          -12.23426752562976                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K100/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━[0m [35m67.1%[0m Elapsed: [33m0:02:25[0m Remaining: [36m0:01:15[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 100000 ===                                                                                                                                                 │
│ 100001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2158, 100000]                                                                                                                                               │
│ Average cumulative reward:       -12.473410960932613                                                                                                                     │
│ Average rollout reward:          -12.059781284305402                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯7m━━━━━━━━━━━━━[0m [35m67.1%[0m Elapsed: [33m0:02:25[0m Remaining: [36m0:01:15[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 100000 ===                                                                                                                                                 │
│ 100001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2158, 100000]                                                                                                                                               │
│ Average cumulative reward:       -12.473410960932613                                                                                                                     │
│ Average rollout reward:          -12.059781284305402                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K100/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━[0m [35m67.1%[0m Elapsed: [33m0:02:26[0m Remaining: [36m0:01:15[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 100000 ===                                                                                                                                                 │
│ 100001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2158, 100000]                                                                                                                                               │
│ Average cumulative reward:       -12.473410960932613                                                                                                                     │
│ Average rollout reward:          -12.059781284305402                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K101/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━[0m [35m67.8%[0m Elapsed: [33m0:02:26[0m Remaining: [36m0:01:14[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 101000 ===                                                                                                                                                 │
│ 101001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2158, 100970, 100975, 100979, 100992, 101000]                                                                                                               │
│ Average cumulative reward:       -12.603374246486133                                                                                                                     │
│ Average rollout reward:          -12.184668136880846                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K101/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━[0m [35m67.8%[0m Elapsed: [33m0:02:27[0m Remaining: [36m0:01:14[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 101000 ===                                                                                                                                                 │
│ 101001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2158, 100970, 100975, 100979, 100992, 101000]                                                                                                               │
│ Average cumulative reward:       -12.603374246486133                                                                                                                     │
│ Average rollout reward:          -12.184668136880846                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K101/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━[0m [35m67.8%[0m Elapsed: [33m0:02:27[0m Remaining: [36m0:01:14[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 101000 ===                                                                                                                                                 │
│ 101001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2158, 100970, 100975, 100979, 100992, 101000]                                                                                                               │
│ Average cumulative reward:       -12.603374246486133                                                                                                                     │
│ Average rollout reward:          -12.184668136880846                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K102/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━[0m [35m68.5%[0m Elapsed: [33m0:02:28[0m Remaining: [36m0:01:12[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 102000 ===                                                                                                                                                 │
│ 102001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2827, 101523, 101525, 101577, 101612, 102000]                                                                                                               │
│ Average cumulative reward:       -11.886861307297401                                                                                                                     │
│ Average rollout reward:          -11.526581823719095                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K102/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━[0m [35m68.5%[0m Elapsed: [33m0:02:28[0m Remaining: [36m0:01:12[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 102000 ===                                                                                                                                                 │
│ 102001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2827, 101523, 101525, 101577, 101612, 102000]                                                                                                               │
│ Average cumulative reward:       -11.886861307297401                                                                                                                     │
│ Average rollout reward:          -11.526581823719095                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K102/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━[0m [35m68.5%[0m Elapsed: [33m0:02:29[0m Remaining: [36m0:01:12[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 102000 ===                                                                                                                                                 │
│ 102001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2827, 101523, 101525, 101577, 101612, 102000]                                                                                                               │
│ Average cumulative reward:       -11.886861307297401                                                                                                                     │
│ Average rollout reward:          -11.526581823719095                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K103/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━[0m [35m69.1%[0m Elapsed: [33m0:02:29[0m Remaining: [36m0:01:10[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 103000 ===                                                                                                                                                 │
│ 103001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2827, 101523, 101527, 102823, 102844, 103000]                                                                                                               │
│ Average cumulative reward:       -11.738685767727864                                                                                                                     │
│ Average rollout reward:          -11.346681088908445                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K103/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━[0m [35m69.1%[0m Elapsed: [33m0:02:30[0m Remaining: [36m0:01:10[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 103000 ===                                                                                                                                                 │
│ 103001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2827, 101523, 101527, 102823, 102844, 103000]                                                                                                               │
│ Average cumulative reward:       -11.738685767727864                                                                                                                     │
│ Average rollout reward:          -11.346681088908445                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K103/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━[0m [35m69.1%[0m Elapsed: [33m0:02:30[0m Remaining: [36m0:01:10[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 103000 ===                                                                                                                                                 │
│ 103001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2827, 101523, 101527, 102823, 102844, 103000]                                                                                                               │
│ Average cumulative reward:       -11.738685767727864                                                                                                                     │
│ Average rollout reward:          -11.346681088908445                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K104/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━[0m [35m69.8%[0m Elapsed: [33m0:02:31[0m Remaining: [36m0:01:08[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 104000 ===                                                                                                                                                 │
│ 104001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2827, 101244, 101249, 103880, 104000]                                                                                                                       │
│ Average cumulative reward:       -12.199242229005359                                                                                                                     │
│ Average rollout reward:          -11.777508841611592                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K104/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━[0m [35m69.8%[0m Elapsed: [33m0:02:31[0m Remaining: [36m0:01:08[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 104000 ===                                                                                                                                                 │
│ 104001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2827, 101244, 101249, 103880, 104000]                                                                                                                       │
│ Average cumulative reward:       -12.199242229005359                                                                                                                     │
│ Average rollout reward:          -11.777508841611592                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K104/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━[0m [35m69.8%[0m Elapsed: [33m0:02:32[0m Remaining: [36m0:01:08[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 104000 ===                                                                                                                                                 │
│ 104001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2827, 101244, 101249, 103880, 104000]                                                                                                                       │
│ Average cumulative reward:       -12.199242229005359                                                                                                                     │
│ Average rollout reward:          -11.777508841611592                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K105/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━[0m [35m70.5%[0m Elapsed: [33m0:02:32[0m Remaining: [36m0:01:07[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 105000 ===                                                                                                                                                 │
│ 105001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2827, 101561, 102077, 102107, 102244, 102403, 103515, 105000]                                                                                               │
│ Average cumulative reward:       -12.487776667099286                                                                                                                     │
│ Average rollout reward:          -12.038169433557192                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K105/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━[0m [35m70.5%[0m Elapsed: [33m0:02:33[0m Remaining: [36m0:01:07[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 105000 ===                                                                                                                                                 │
│ 105001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2827, 101561, 102077, 102107, 102244, 102403, 103515, 105000]                                                                                               │
│ Average cumulative reward:       -12.487776667099286                                                                                                                     │
│ Average rollout reward:          -12.038169433557192                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K106/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━[0m [35m71.1%[0m Elapsed: [33m0:02:33[0m Remaining: [36m0:01:05[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 106000 ===                                                                                                                                                 │
│ 106001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 105255, 105989, 105992, 106000]                                                                                                                             │
│ Average cumulative reward:       -12.121765833926299                                                                                                                     │
│ Average rollout reward:          -11.700376507870164                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K106/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━[0m [35m71.1%[0m Elapsed: [33m0:02:34[0m Remaining: [36m0:01:05[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 106000 ===                                                                                                                                                 │
│ 106001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 105255, 105989, 105992, 106000]                                                                                                                             │
│ Average cumulative reward:       -12.121765833926299                                                                                                                     │
│ Average rollout reward:          -11.700376507870164                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K106/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━[0m [35m71.1%[0m Elapsed: [33m0:02:34[0m Remaining: [36m0:01:05[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 106000 ===                                                                                                                                                 │
│ 106001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 105255, 105989, 105992, 106000]                                                                                                                             │
│ Average cumulative reward:       -12.121765833926299                                                                                                                     │
│ Average rollout reward:          -11.700376507870164                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K107/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━[0m [35m71.8%[0m Elapsed: [33m0:02:35[0m Remaining: [36m0:01:03[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 107000 ===                                                                                                                                                 │
│ 107001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 106547, 106563, 106566, 107000]                                                                                                                             │
│ Average cumulative reward:       -12.15077710750722                                                                                                                      │
│ Average rollout reward:          -11.796491553772578                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯237m━━━━━━━━━━━[0m [35m71.8%[0m Elapsed: [33m0:02:35[0m Remaining: [36m0:01:03[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 107000 ===                                                                                                                                                 │
│ 107001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 106547, 106563, 106566, 107000]                                                                                                                             │
│ Average cumulative reward:       -12.15077710750722                                                                                                                      │
│ Average rollout reward:          -11.796491553772578                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K107/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━[0m [35m71.8%[0m Elapsed: [33m0:02:36[0m Remaining: [36m0:01:03[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 107000 ===                                                                                                                                                 │
│ 107001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 106547, 106563, 106566, 107000]                                                                                                                             │
│ Average cumulative reward:       -12.15077710750722                                                                                                                      │
│ Average rollout reward:          -11.796491553772578                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K108/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━[0m [35m72.5%[0m Elapsed: [33m0:02:36[0m Remaining: [36m0:01:02[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 108000 ===                                                                                                                                                 │
│ 108001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 5295, 107952, 107954, 107983, 108000]                                                                                                                       │
│ Average cumulative reward:       -12.39209117384894                                                                                                                      │
│ Average rollout reward:          -12.006950248377096                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K108/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━[0m [35m72.5%[0m Elapsed: [33m0:02:37[0m Remaining: [36m0:01:02[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 108000 ===                                                                                                                                                 │
│ 108001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 5295, 107952, 107954, 107983, 108000]                                                                                                                       │
│ Average cumulative reward:       -12.39209117384894                                                                                                                      │
│ Average rollout reward:          -12.006950248377096                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K108/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━[0m [35m72.5%[0m Elapsed: [33m0:02:37[0m Remaining: [36m0:01:02[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 108000 ===                                                                                                                                                 │
│ 108001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 5295, 107952, 107954, 107983, 108000]                                                                                                                       │
│ Average cumulative reward:       -12.39209117384894                                                                                                                      │
│ Average rollout reward:          -12.006950248377096                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K109/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━[0m [35m73.2%[0m Elapsed: [33m0:02:38[0m Remaining: [36m0:01:01[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 109000 ===                                                                                                                                                 │
│ 109001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 5295, 107097, 108951, 108959, 108987, 109000]                                                                                                               │
│ Average cumulative reward:       -12.221143496366928                                                                                                                     │
│ Average rollout reward:          -11.836897388919754                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K109/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━[0m [35m73.2%[0m Elapsed: [33m0:02:38[0m Remaining: [36m0:01:01[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 109000 ===                                                                                                                                                 │
│ 109001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 5295, 107097, 108951, 108959, 108987, 109000]                                                                                                               │
│ Average cumulative reward:       -12.221143496366928                                                                                                                     │
│ Average rollout reward:          -11.836897388919754                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K109/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━[0m [35m73.2%[0m Elapsed: [33m0:02:39[0m Remaining: [36m0:01:01[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 109000 ===                                                                                                                                                 │
│ 109001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 5295, 107097, 108951, 108959, 108987, 109000]                                                                                                               │
│ Average cumulative reward:       -12.221143496366928                                                                                                                     │
│ Average rollout reward:          -11.836897388919754                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K110/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━[0m [35m73.8%[0m Elapsed: [33m0:02:39[0m Remaining: [36m0:00:59[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 110000 ===                                                                                                                                                 │
│ 110001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 27471, 109912, 109918, 110000]                                                                                                                              │
│ Average cumulative reward:       -12.68173607832015                                                                                                                      │
│ Average rollout reward:          -12.333657006934729                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K110/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━[0m [35m73.8%[0m Elapsed: [33m0:02:40[0m Remaining: [36m0:00:59[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 110000 ===                                                                                                                                                 │
│ 110001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 27471, 109912, 109918, 110000]                                                                                                                              │
│ Average cumulative reward:       -12.68173607832015                                                                                                                      │
│ Average rollout reward:          -12.333657006934729                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K110/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━[0m [35m73.8%[0m Elapsed: [33m0:02:40[0m Remaining: [36m0:00:59[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 110000 ===                                                                                                                                                 │
│ 110001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 27471, 109912, 109918, 110000]                                                                                                                              │
│ Average cumulative reward:       -12.68173607832015                                                                                                                      │
│ Average rollout reward:          -12.333657006934729                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K111/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━[0m [35m74.5%[0m Elapsed: [33m0:02:41[0m Remaining: [36m0:00:58[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 111000 ===                                                                                                                                                 │
│ 111001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 46483, 110995, 110997, 111000]                                                                                                                              │
│ Average cumulative reward:       -11.978660623836                                                                                                                        │
│ Average rollout reward:          -11.617278442465926                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K111/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━[0m [35m74.5%[0m Elapsed: [33m0:02:41[0m Remaining: [36m0:00:58[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 111000 ===                                                                                                                                                 │
│ 111001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 46483, 110995, 110997, 111000]                                                                                                                              │
│ Average cumulative reward:       -11.978660623836                                                                                                                        │
│ Average rollout reward:          -11.617278442465926                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K111/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━[0m [35m74.5%[0m Elapsed: [33m0:02:42[0m Remaining: [36m0:00:58[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 111000 ===                                                                                                                                                 │
│ 111001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 46483, 110995, 110997, 111000]                                                                                                                              │
│ Average cumulative reward:       -11.978660623836                                                                                                                        │
│ Average rollout reward:          -11.617278442465926                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K112/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━[0m [35m75.2%[0m Elapsed: [33m0:02:42[0m Remaining: [36m0:00:56[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 112000 ===                                                                                                                                                 │
│ 112001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 59221, 111984, 111988, 111991, 112000]                                                                                                                      │
│ Average cumulative reward:       -11.838372829479189                                                                                                                     │
│ Average rollout reward:          -11.48691019285255                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K112/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━[0m [35m75.2%[0m Elapsed: [33m0:02:43[0m Remaining: [36m0:00:56[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 112000 ===                                                                                                                                                 │
│ 112001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 59221, 111984, 111988, 111991, 112000]                                                                                                                      │
│ Average cumulative reward:       -11.838372829479189                                                                                                                     │
│ Average rollout reward:          -11.48691019285255                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K113/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━[0m [35m75.8%[0m Elapsed: [33m0:02:43[0m Remaining: [36m0:00:54[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 113000 ===                                                                                                                                                 │
│ 113001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2827, 105048, 105050, 105087, 105101, 105106, 105117, 113000]                                                                                               │
│ Average cumulative reward:       -12.435916776745719                                                                                                                     │
│ Average rollout reward:          -12.039961589258894                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K113/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━[0m [35m75.8%[0m Elapsed: [33m0:02:44[0m Remaining: [36m0:00:54[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 113000 ===                                                                                                                                                 │
│ 113001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2827, 105048, 105050, 105087, 105101, 105106, 105117, 113000]                                                                                               │
│ Average cumulative reward:       -12.435916776745719                                                                                                                     │
│ Average rollout reward:          -12.039961589258894                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K113/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━[0m [35m75.8%[0m Elapsed: [33m0:02:44[0m Remaining: [36m0:00:54[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 113000 ===                                                                                                                                                 │
│ 113001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2827, 105048, 105050, 105087, 105101, 105106, 105117, 113000]                                                                                               │
│ Average cumulative reward:       -12.435916776745719                                                                                                                     │
│ Average rollout reward:          -12.039961589258894                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K114/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━[0m [35m76.5%[0m Elapsed: [33m0:02:45[0m Remaining: [36m0:00:52[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 114000 ===                                                                                                                                                 │
│ 114001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 5295, 106184, 106188, 112691, 113785, 114000]                                                                                                               │
│ Average cumulative reward:       -12.24491408104977                                                                                                                      │
│ Average rollout reward:          -11.809300147051033                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯5;237m━━━━━━━━━[0m [35m76.5%[0m Elapsed: [33m0:02:45[0m Remaining: [36m0:00:52[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 114000 ===                                                                                                                                                 │
│ 114001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 5295, 106184, 106188, 112691, 113785, 114000]                                                                                                               │
│ Average cumulative reward:       -12.24491408104977                                                                                                                      │
│ Average rollout reward:          -11.809300147051033                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K114/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━[0m [35m76.5%[0m Elapsed: [33m0:02:46[0m Remaining: [36m0:00:52[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 114000 ===                                                                                                                                                 │
│ 114001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 5295, 106184, 106188, 112691, 113785, 114000]                                                                                                               │
│ Average cumulative reward:       -12.24491408104977                                                                                                                      │
│ Average rollout reward:          -11.809300147051033                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K114/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━[0m [35m76.5%[0m Elapsed: [33m0:02:46[0m Remaining: [36m0:00:52[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 114000 ===                                                                                                                                                 │
│ 114001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 5295, 106184, 106188, 112691, 113785, 114000]                                                                                                               │
│ Average cumulative reward:       -12.24491408104977                                                                                                                      │
│ Average rollout reward:          -11.809300147051033                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K115/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━[0m [35m77.2%[0m Elapsed: [33m0:02:47[0m Remaining: [36m0:00:50[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 115000 ===                                                                                                                                                 │
│ 115001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2158, 97656, 100196, 100199, 114378, 115000]                                                                                                                │
│ Average cumulative reward:       -12.181552909319384                                                                                                                     │
│ Average rollout reward:          -11.726309007116939                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K115/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━[0m [35m77.2%[0m Elapsed: [33m0:02:47[0m Remaining: [36m0:00:50[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 115000 ===                                                                                                                                                 │
│ 115001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2158, 97656, 100196, 100199, 114378, 115000]                                                                                                                │
│ Average cumulative reward:       -12.181552909319384                                                                                                                     │
│ Average rollout reward:          -11.726309007116939                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K116/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━[0m [35m77.9%[0m Elapsed: [33m0:02:48[0m Remaining: [36m0:00:49[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 116000 ===                                                                                                                                                 │
│ 116001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 115862, 115933, 115935, 115943, 116000]                                                                                                                     │
│ Average cumulative reward:       -12.92246911934095                                                                                                                      │
│ Average rollout reward:          -12.538746521693424                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K116/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━[0m [35m77.9%[0m Elapsed: [33m0:02:48[0m Remaining: [36m0:00:49[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 116000 ===                                                                                                                                                 │
│ 116001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 115862, 115933, 115935, 115943, 116000]                                                                                                                     │
│ Average cumulative reward:       -12.92246911934095                                                                                                                      │
│ Average rollout reward:          -12.538746521693424                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K116/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━[0m [35m77.9%[0m Elapsed: [33m0:02:49[0m Remaining: [36m0:00:49[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 116000 ===                                                                                                                                                 │
│ 116001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 115862, 115933, 115935, 115943, 116000]                                                                                                                     │
│ Average cumulative reward:       -12.92246911934095                                                                                                                      │
│ Average rollout reward:          -12.538746521693424                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K117/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━[0m [35m78.5%[0m Elapsed: [33m0:02:49[0m Remaining: [36m0:00:47[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 117000 ===                                                                                                                                                 │
│ 117001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 61062, 61560, 61655, 116986, 117000]                                                                                                                        │
│ Average cumulative reward:       -13.007604266841275                                                                                                                     │
│ Average rollout reward:          -12.59456250018536                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K117/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━[0m [35m78.5%[0m Elapsed: [33m0:02:50[0m Remaining: [36m0:00:47[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 117000 ===                                                                                                                                                 │
│ 117001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 61062, 61560, 61655, 116986, 117000]                                                                                                                        │
│ Average cumulative reward:       -13.007604266841275                                                                                                                     │
│ Average rollout reward:          -12.59456250018536                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K117/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━[0m [35m78.5%[0m Elapsed: [33m0:02:50[0m Remaining: [36m0:00:47[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 117000 ===                                                                                                                                                 │
│ 117001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 61062, 61560, 61655, 116986, 117000]                                                                                                                        │
│ Average cumulative reward:       -13.007604266841275                                                                                                                     │
│ Average rollout reward:          -12.59456250018536                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K118/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━[0m [35m79.2%[0m Elapsed: [33m0:02:51[0m Remaining: [36m0:00:45[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 118000 ===                                                                                                                                                 │
│ 118001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 61062, 117945, 117949, 117958, 117961, 118000]                                                                                                              │
│ Average cumulative reward:       -12.184085040294237                                                                                                                     │
│ Average rollout reward:          -11.807894140428058                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K118/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━[0m [35m79.2%[0m Elapsed: [33m0:02:51[0m Remaining: [36m0:00:45[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 118000 ===                                                                                                                                                 │
│ 118001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 61062, 117945, 117949, 117958, 117961, 118000]                                                                                                              │
│ Average cumulative reward:       -12.184085040294237                                                                                                                     │
│ Average rollout reward:          -11.807894140428058                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K118/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━[0m [35m79.2%[0m Elapsed: [33m0:02:52[0m Remaining: [36m0:00:45[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 118000 ===                                                                                                                                                 │
│ 118001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 61062, 117945, 117949, 117958, 117961, 118000]                                                                                                              │
│ Average cumulative reward:       -12.184085040294237                                                                                                                     │
│ Average rollout reward:          -11.807894140428058                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K119/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━[0m [35m79.9%[0m Elapsed: [33m0:02:52[0m Remaining: [36m0:00:44[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 119000 ===                                                                                                                                                 │
│ 119001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 61062, 118237, 118992, 118995, 119000]                                                                                                                      │
│ Average cumulative reward:       -12.25578234750334                                                                                                                      │
│ Average rollout reward:          -11.813967502906596                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K119/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━[0m [35m79.9%[0m Elapsed: [33m0:02:53[0m Remaining: [36m0:00:44[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 119000 ===                                                                                                                                                 │
│ 119001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 61062, 118237, 118992, 118995, 119000]                                                                                                                      │
│ Average cumulative reward:       -12.25578234750334                                                                                                                      │
│ Average rollout reward:          -11.813967502906596                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K119/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━[0m [35m79.9%[0m Elapsed: [33m0:02:53[0m Remaining: [36m0:00:44[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 119000 ===                                                                                                                                                 │
│ 119001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 61062, 118237, 118992, 118995, 119000]                                                                                                                      │
│ Average cumulative reward:       -12.25578234750334                                                                                                                      │
│ Average rollout reward:          -11.813967502906596                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K120/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━[0m [35m80.5%[0m Elapsed: [33m0:02:54[0m Remaining: [36m0:00:43[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 120000 ===                                                                                                                                                 │
│ 120001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 119998, 120000]                                                                                                                                             │
│ Average cumulative reward:       -12.273631249439823                                                                                                                     │
│ Average rollout reward:          -11.832308398709195                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K120/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━[0m [35m80.5%[0m Elapsed: [33m0:02:54[0m Remaining: [36m0:00:43[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 120000 ===                                                                                                                                                 │
│ 120001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 119998, 120000]                                                                                                                                             │
│ Average cumulative reward:       -12.273631249439823                                                                                                                     │
│ Average rollout reward:          -11.832308398709195                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K120/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━[0m [35m80.5%[0m Elapsed: [33m0:02:55[0m Remaining: [36m0:00:43[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 120000 ===                                                                                                                                                 │
│ 120001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 119998, 120000]                                                                                                                                             │
│ Average cumulative reward:       -12.273631249439823                                                                                                                     │
│ Average rollout reward:          -11.832308398709195                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯m━━━━━━━[0m [35m81.2%[0m Elapsed: [33m0:02:55[0m Remaining: [36m0:00:41[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 121000 ===                                                                                                                                                 │
│ 121001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2827, 120771, 120776, 120967, 120995, 121000]                                                                                                               │
│ Average cumulative reward:       -12.414996936053715                                                                                                                     │
│ Average rollout reward:          -12.054045103695604                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K121/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━[0m [35m81.2%[0m Elapsed: [33m0:02:56[0m Remaining: [36m0:00:41[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 121000 ===                                                                                                                                                 │
│ 121001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2827, 120771, 120776, 120967, 120995, 121000]                                                                                                               │
│ Average cumulative reward:       -12.414996936053715                                                                                                                     │
│ Average rollout reward:          -12.054045103695604                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K121/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━[0m [35m81.2%[0m Elapsed: [33m0:02:56[0m Remaining: [36m0:00:41[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 121000 ===                                                                                                                                                 │
│ 121001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2827, 120771, 120776, 120967, 120995, 121000]                                                                                                               │
│ Average cumulative reward:       -12.414996936053715                                                                                                                     │
│ Average rollout reward:          -12.054045103695604                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K122/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━[0m [35m81.9%[0m Elapsed: [33m0:02:57[0m Remaining: [36m0:00:40[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 122000 ===                                                                                                                                                 │
│ 122001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 5295, 106296, 121783, 121875, 122000]                                                                                                                       │
│ Average cumulative reward:       -12.557852875334838                                                                                                                     │
│ Average rollout reward:          -12.14115735343528                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K122/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━[0m [35m81.9%[0m Elapsed: [33m0:02:57[0m Remaining: [36m0:00:40[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 122000 ===                                                                                                                                                 │
│ 122001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 5295, 106296, 121783, 121875, 122000]                                                                                                                       │
│ Average cumulative reward:       -12.557852875334838                                                                                                                     │
│ Average rollout reward:          -12.14115735343528                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K122/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━[0m [35m81.9%[0m Elapsed: [33m0:02:58[0m Remaining: [36m0:00:40[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 122000 ===                                                                                                                                                 │
│ 122001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 5295, 106296, 121783, 121875, 122000]                                                                                                                       │
│ Average cumulative reward:       -12.557852875334838                                                                                                                     │
│ Average rollout reward:          -12.14115735343528                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K123/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━[0m [35m82.6%[0m Elapsed: [33m0:02:58[0m Remaining: [36m0:00:38[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 123000 ===                                                                                                                                                 │
│ 123001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2158, 120870, 120896, 120952, 120980, 121231, 121674, 122167, 122763, 123000]                                                                               │
│ Average cumulative reward:       -12.361359219002686                                                                                                                     │
│ Average rollout reward:          -11.958899760031743                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K123/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━[0m [35m82.6%[0m Elapsed: [33m0:02:59[0m Remaining: [36m0:00:38[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 123000 ===                                                                                                                                                 │
│ 123001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2158, 120870, 120896, 120952, 120980, 121231, 121674, 122167, 122763, 123000]                                                                               │
│ Average cumulative reward:       -12.361359219002686                                                                                                                     │
│ Average rollout reward:          -11.958899760031743                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K123/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━[0m [35m82.6%[0m Elapsed: [33m0:02:59[0m Remaining: [36m0:00:38[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 123000 ===                                                                                                                                                 │
│ 123001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 2158, 120870, 120896, 120952, 120980, 121231, 121674, 122167, 122763, 123000]                                                                               │
│ Average cumulative reward:       -12.361359219002686                                                                                                                     │
│ Average rollout reward:          -11.958899760031743                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K124/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━[0m [35m83.2%[0m Elapsed: [33m0:03:00[0m Remaining: [36m0:00:37[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 124000 ===                                                                                                                                                 │
│ 124001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 1133, 123894, 124000]                                                                                                                                       │
│ Average cumulative reward:       -12.12765798488488                                                                                                                      │
│ Average rollout reward:          -11.72296739966596                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K124/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━[0m [35m83.2%[0m Elapsed: [33m0:03:00[0m Remaining: [36m0:00:37[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 124000 ===                                                                                                                                                 │
│ 124001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 1133, 123894, 124000]                                                                                                                                       │
│ Average cumulative reward:       -12.12765798488488                                                                                                                      │
│ Average rollout reward:          -11.72296739966596                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K124/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━[0m [35m83.2%[0m Elapsed: [33m0:03:01[0m Remaining: [36m0:00:37[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 124000 ===                                                                                                                                                 │
│ 124001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 1133, 123894, 124000]                                                                                                                                       │
│ Average cumulative reward:       -12.12765798488488                                                                                                                      │
│ Average rollout reward:          -11.72296739966596                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K125/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━[0m [35m83.9%[0m Elapsed: [33m0:03:01[0m Remaining: [36m0:00:36[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 125000 ===                                                                                                                                                 │
│ 125001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 31045, 91517, 91531, 125000]                                                                                                                                │
│ Average cumulative reward:       -13.237539085927857                                                                                                                     │
│ Average rollout reward:          -12.82869270154276                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K125/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━[0m [35m83.9%[0m Elapsed: [33m0:03:02[0m Remaining: [36m0:00:36[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 125000 ===                                                                                                                                                 │
│ 125001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 31045, 91517, 91531, 125000]                                                                                                                                │
│ Average cumulative reward:       -13.237539085927857                                                                                                                     │
│ Average rollout reward:          -12.82869270154276                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K125/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━[0m [35m83.9%[0m Elapsed: [33m0:03:03[0m Remaining: [36m0:00:36[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 125000 ===                                                                                                                                                 │
│ 125001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 31045, 91517, 91531, 125000]                                                                                                                                │
│ Average cumulative reward:       -13.237539085927857                                                                                                                     │
│ Average rollout reward:          -12.82869270154276                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K126/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━[0m [35m84.6%[0m Elapsed: [33m0:03:03[0m Remaining: [36m0:00:34[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 126000 ===                                                                                                                                                 │
│ 126001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 125649, 125705, 125708, 125713, 126000]                                                                                                                     │
│ Average cumulative reward:       -12.4272667465845                                                                                                                       │
│ Average rollout reward:          -12.025188530427318                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K126/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━[0m [35m84.6%[0m Elapsed: [33m0:03:04[0m Remaining: [36m0:00:34[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 126000 ===                                                                                                                                                 │
│ 126001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 125649, 125705, 125708, 125713, 126000]                                                                                                                     │
│ Average cumulative reward:       -12.4272667465845                                                                                                                       │
│ Average rollout reward:          -12.025188530427318                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K127/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━[0m [35m85.2%[0m Elapsed: [33m0:03:04[0m Remaining: [36m0:00:33[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 127000 ===                                                                                                                                                 │
│ 127001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 91658, 91664, 91686, 95358, 95360, 95409, 127000]                                                                                                           │
│ Average cumulative reward:       -12.246336594066939                                                                                                                     │
│ Average rollout reward:          -11.837412372094164                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K127/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━[0m [35m85.2%[0m Elapsed: [33m0:03:05[0m Remaining: [36m0:00:33[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 127000 ===                                                                                                                                                 │
│ 127001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 91658, 91664, 91686, 95358, 95360, 95409, 127000]                                                                                                           │
│ Average cumulative reward:       -12.246336594066939                                                                                                                     │
│ Average rollout reward:          -11.837412372094164                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K127/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━[0m [35m85.2%[0m Elapsed: [33m0:03:05[0m Remaining: [36m0:00:33[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 127000 ===                                                                                                                                                 │
│ 127001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 91658, 91664, 91686, 95358, 95360, 95409, 127000]                                                                                                           │
│ Average cumulative reward:       -12.246336594066939                                                                                                                     │
│ Average rollout reward:          -11.837412372094164                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K128/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━[0m [35m85.9%[0m Elapsed: [33m0:03:06[0m Remaining: [36m0:00:31[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 128000 ===                                                                                                                                                 │
│ 128001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 432, 11931, 11933, 11946, 128000]                                                                                                                           │
│ Average cumulative reward:       -11.809581604205441                                                                                                                     │
│ Average rollout reward:          -11.41666823113907                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯37m━━━━━[0m [35m85.9%[0m Elapsed: [33m0:03:06[0m Remaining: [36m0:00:31[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 128000 ===                                                                                                                                                 │
│ 128001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 432, 11931, 11933, 11946, 128000]                                                                                                                           │
│ Average cumulative reward:       -11.809581604205441                                                                                                                     │
│ Average rollout reward:          -11.41666823113907                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K128/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━[0m [35m85.9%[0m Elapsed: [33m0:03:07[0m Remaining: [36m0:00:31[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 128000 ===                                                                                                                                                 │
│ 128001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 432, 11931, 11933, 11946, 128000]                                                                                                                           │
│ Average cumulative reward:       -11.809581604205441                                                                                                                     │
│ Average rollout reward:          -11.41666823113907                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K129/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━[0m [35m86.6%[0m Elapsed: [33m0:03:07[0m Remaining: [36m0:00:30[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 129000 ===                                                                                                                                                 │
│ 129001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 177, 15829, 16405, 91532, 91582, 129000]                                                                                                                    │
│ Average cumulative reward:       -12.370514334239326                                                                                                                     │
│ Average rollout reward:          -11.926020995988642                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K129/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━[0m [35m86.6%[0m Elapsed: [33m0:03:08[0m Remaining: [36m0:00:30[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 129000 ===                                                                                                                                                 │
│ 129001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 177, 15829, 16405, 91532, 91582, 129000]                                                                                                                    │
│ Average cumulative reward:       -12.370514334239326                                                                                                                     │
│ Average rollout reward:          -11.926020995988642                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K129/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━[0m [35m86.6%[0m Elapsed: [33m0:03:08[0m Remaining: [36m0:00:30[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 129000 ===                                                                                                                                                 │
│ 129001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 177, 15829, 16405, 91532, 91582, 129000]                                                                                                                    │
│ Average cumulative reward:       -12.370514334239326                                                                                                                     │
│ Average rollout reward:          -11.926020995988642                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K130/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━[0m [35m87.2%[0m Elapsed: [33m0:03:09[0m Remaining: [36m0:00:28[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 130000 ===                                                                                                                                                 │
│ 130001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 129990, 130000]                                                                                                                                             │
│ Average cumulative reward:       -12.476210273480179                                                                                                                     │
│ Average rollout reward:          -12.002168342998857                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K130/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━[0m [35m87.2%[0m Elapsed: [33m0:03:09[0m Remaining: [36m0:00:28[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 130000 ===                                                                                                                                                 │
│ 130001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 129990, 130000]                                                                                                                                             │
│ Average cumulative reward:       -12.476210273480179                                                                                                                     │
│ Average rollout reward:          -12.002168342998857                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K130/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━[0m [35m87.2%[0m Elapsed: [33m0:03:10[0m Remaining: [36m0:00:28[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 130000 ===                                                                                                                                                 │
│ 130001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 129990, 130000]                                                                                                                                             │
│ Average cumulative reward:       -12.476210273480179                                                                                                                     │
│ Average rollout reward:          -12.002168342998857                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K131/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━[0m [35m87.9%[0m Elapsed: [33m0:03:10[0m Remaining: [36m0:00:27[0m   1.45 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 131000 ===                                                                                                                                                 │
│ 131001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 31045, 128500, 128518, 128566, 129614, 129772, 131000]                                                                                                      │
│ Average cumulative reward:       -12.618463938124716                                                                                                                     │
│ Average rollout reward:          -12.183521920341166                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K131/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━[0m [35m87.9%[0m Elapsed: [33m0:03:11[0m Remaining: [36m0:00:27[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 131000 ===                                                                                                                                                 │
│ 131001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 31045, 128500, 128518, 128566, 129614, 129772, 131000]                                                                                                      │
│ Average cumulative reward:       -12.618463938124716                                                                                                                     │
│ Average rollout reward:          -12.183521920341166                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K131/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━[0m [35m87.9%[0m Elapsed: [33m0:03:11[0m Remaining: [36m0:00:27[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 131000 ===                                                                                                                                                 │
│ 131001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 31045, 128500, 128518, 128566, 129614, 129772, 131000]                                                                                                      │
│ Average cumulative reward:       -12.618463938124716                                                                                                                     │
│ Average rollout reward:          -12.183521920341166                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K132/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━[0m [35m88.6%[0m Elapsed: [33m0:03:12[0m Remaining: [36m0:00:26[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 132000 ===                                                                                                                                                 │
│ 132001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 131457, 131467, 131469, 131660, 131664, 132000]                                                                                                             │
│ Average cumulative reward:       -12.224324224296401                                                                                                                     │
│ Average rollout reward:          -11.834670358207026                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K132/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━[0m [35m88.6%[0m Elapsed: [33m0:03:12[0m Remaining: [36m0:00:26[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 132000 ===                                                                                                                                                 │
│ 132001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 131457, 131467, 131469, 131660, 131664, 132000]                                                                                                             │
│ Average cumulative reward:       -12.224324224296401                                                                                                                     │
│ Average rollout reward:          -11.834670358207026                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K132/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━[0m [35m88.6%[0m Elapsed: [33m0:03:13[0m Remaining: [36m0:00:26[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 132000 ===                                                                                                                                                 │
│ 132001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 131457, 131467, 131469, 131660, 131664, 132000]                                                                                                             │
│ Average cumulative reward:       -12.224324224296401                                                                                                                     │
│ Average rollout reward:          -11.834670358207026                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K133/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━[0m [35m89.3%[0m Elapsed: [33m0:03:13[0m Remaining: [36m0:00:24[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 133000 ===                                                                                                                                                 │
│ 133001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 131457, 132258, 132262, 132344, 133000]                                                                                                                     │
│ Average cumulative reward:       -12.026438424340197                                                                                                                     │
│ Average rollout reward:          -11.631043992268332                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K133/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━[0m [35m89.3%[0m Elapsed: [33m0:03:14[0m Remaining: [36m0:00:24[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 133000 ===                                                                                                                                                 │
│ 133001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 131457, 132258, 132262, 132344, 133000]                                                                                                                     │
│ Average cumulative reward:       -12.026438424340197                                                                                                                     │
│ Average rollout reward:          -11.631043992268332                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K133/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━[0m [35m89.3%[0m Elapsed: [33m0:03:14[0m Remaining: [36m0:00:24[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 133000 ===                                                                                                                                                 │
│ 133001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 131457, 132258, 132262, 132344, 133000]                                                                                                                     │
│ Average cumulative reward:       -12.026438424340197                                                                                                                     │
│ Average rollout reward:          -11.631043992268332                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K134/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━[0m [35m89.9%[0m Elapsed: [33m0:03:15[0m Remaining: [36m0:00:23[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 134000 ===                                                                                                                                                 │
│ 134001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 131457, 131467, 131469, 133967, 133980, 134000]                                                                                                             │
│ Average cumulative reward:       -12.8248321011942                                                                                                                       │
│ Average rollout reward:          -12.389605042505245                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K134/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━[0m [35m89.9%[0m Elapsed: [33m0:03:15[0m Remaining: [36m0:00:23[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 134000 ===                                                                                                                                                 │
│ 134001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 131457, 131467, 131469, 133967, 133980, 134000]                                                                                                             │
│ Average cumulative reward:       -12.8248321011942                                                                                                                       │
│ Average rollout reward:          -12.389605042505245                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K134/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━[0m [35m89.9%[0m Elapsed: [33m0:03:16[0m Remaining: [36m0:00:23[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 134000 ===                                                                                                                                                 │
│ 134001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 131457, 131467, 131469, 133967, 133980, 134000]                                                                                                             │
│ Average cumulative reward:       -12.8248321011942                                                                                                                       │
│ Average rollout reward:          -12.389605042505245                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯;237m━━━[0m [35m90.6%[0m Elapsed: [33m0:03:16[0m Remaining: [36m0:00:21[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 135000 ===                                                                                                                                                 │
│ 135001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 34886, 85195, 87348, 123123, 124846, 135000]                                                                                                                │
│ Average cumulative reward:       -12.198518907285415                                                                                                                     │
│ Average rollout reward:          -11.782952204420987                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K135/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━[0m [35m90.6%[0m Elapsed: [33m0:03:17[0m Remaining: [36m0:00:21[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 135000 ===                                                                                                                                                 │
│ 135001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 34886, 85195, 87348, 123123, 124846, 135000]                                                                                                                │
│ Average cumulative reward:       -12.198518907285415                                                                                                                     │
│ Average rollout reward:          -11.782952204420987                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K135/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━[0m [35m90.6%[0m Elapsed: [33m0:03:17[0m Remaining: [36m0:00:21[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 135000 ===                                                                                                                                                 │
│ 135001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 34886, 85195, 87348, 123123, 124846, 135000]                                                                                                                │
│ Average cumulative reward:       -12.198518907285415                                                                                                                     │
│ Average rollout reward:          -11.782952204420987                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K136/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━[0m [35m91.3%[0m Elapsed: [33m0:03:18[0m Remaining: [36m0:00:20[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 136000 ===                                                                                                                                                 │
│ 136001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 135917, 135949, 135995, 136000]                                                                                                                             │
│ Average cumulative reward:       -12.668540670037247                                                                                                                     │
│ Average rollout reward:          -12.27372741415747                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K136/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━[0m [35m91.3%[0m Elapsed: [33m0:03:18[0m Remaining: [36m0:00:20[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 136000 ===                                                                                                                                                 │
│ 136001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 135917, 135949, 135995, 136000]                                                                                                                             │
│ Average cumulative reward:       -12.668540670037247                                                                                                                     │
│ Average rollout reward:          -12.27372741415747                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K136/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━[0m [35m91.3%[0m Elapsed: [33m0:03:19[0m Remaining: [36m0:00:20[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 136000 ===                                                                                                                                                 │
│ 136001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 135917, 135949, 135995, 136000]                                                                                                                             │
│ Average cumulative reward:       -12.668540670037247                                                                                                                     │
│ Average rollout reward:          -12.27372741415747                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K137/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━[0m [35m91.9%[0m Elapsed: [33m0:03:19[0m Remaining: [36m0:00:18[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 137000 ===                                                                                                                                                 │
│ 137001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 59221, 111937, 124947, 129030, 129213, 135866, 137000]                                                                                                      │
│ Average cumulative reward:       -16.481319636244695                                                                                                                     │
│ Average rollout reward:          -16.13654019221406                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K137/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━[0m [35m91.9%[0m Elapsed: [33m0:03:20[0m Remaining: [36m0:00:18[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 137000 ===                                                                                                                                                 │
│ 137001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 59221, 111937, 124947, 129030, 129213, 135866, 137000]                                                                                                      │
│ Average cumulative reward:       -16.481319636244695                                                                                                                     │
│ Average rollout reward:          -16.13654019221406                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K137/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━[0m [35m91.9%[0m Elapsed: [33m0:03:20[0m Remaining: [36m0:00:18[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 137000 ===                                                                                                                                                 │
│ 137001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 59221, 111937, 124947, 129030, 129213, 135866, 137000]                                                                                                      │
│ Average cumulative reward:       -16.481319636244695                                                                                                                     │
│ Average rollout reward:          -16.13654019221406                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K138/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m92.6%[0m Elapsed: [33m0:03:21[0m Remaining: [36m0:00:17[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 138000 ===                                                                                                                                                 │
│ 138001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 40434, 41175, 41342, 42303, 42326, 42448, 42519, 138000]                                                                                                    │
│ Average cumulative reward:       -12.440540235658718                                                                                                                     │
│ Average rollout reward:          -11.99321088812879                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K138/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m92.6%[0m Elapsed: [33m0:03:21[0m Remaining: [36m0:00:17[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 138000 ===                                                                                                                                                 │
│ 138001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 40434, 41175, 41342, 42303, 42326, 42448, 42519, 138000]                                                                                                    │
│ Average cumulative reward:       -12.440540235658718                                                                                                                     │
│ Average rollout reward:          -11.99321088812879                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K138/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m92.6%[0m Elapsed: [33m0:03:22[0m Remaining: [36m0:00:17[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 138000 ===                                                                                                                                                 │
│ 138001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 40434, 41175, 41342, 42303, 42326, 42448, 42519, 138000]                                                                                                    │
│ Average cumulative reward:       -12.440540235658718                                                                                                                     │
│ Average rollout reward:          -11.99321088812879                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K139/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m93.3%[0m Elapsed: [33m0:03:22[0m Remaining: [36m0:00:15[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 139000 ===                                                                                                                                                 │
│ 139001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 138940, 138996, 139000]                                                                                                                                     │
│ Average cumulative reward:       -12.714612499631185                                                                                                                     │
│ Average rollout reward:          -12.269115264474122                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K139/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m93.3%[0m Elapsed: [33m0:03:23[0m Remaining: [36m0:00:15[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 139000 ===                                                                                                                                                 │
│ 139001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 138940, 138996, 139000]                                                                                                                                     │
│ Average cumulative reward:       -12.714612499631185                                                                                                                     │
│ Average rollout reward:          -12.269115264474122                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K139/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m93.3%[0m Elapsed: [33m0:03:23[0m Remaining: [36m0:00:15[0m   1.47 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 139000 ===                                                                                                                                                 │
│ 139001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 138940, 138996, 139000]                                                                                                                                     │
│ Average cumulative reward:       -12.714612499631185                                                                                                                     │
│ Average rollout reward:          -12.269115264474122                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K140/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━[0m [35m94.0%[0m Elapsed: [33m0:03:24[0m Remaining: [36m0:00:14[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 140000 ===                                                                                                                                                 │
│ 140001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 28048, 86791, 87305, 87695, 88085, 90378, 128220, 140000]                                                                                                   │
│ Average cumulative reward:       -12.413607252928022                                                                                                                     │
│ Average rollout reward:          -12.041534926574673                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K140/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━[0m [35m94.0%[0m Elapsed: [33m0:03:24[0m Remaining: [36m0:00:14[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 140000 ===                                                                                                                                                 │
│ 140001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 28048, 86791, 87305, 87695, 88085, 90378, 128220, 140000]                                                                                                   │
│ Average cumulative reward:       -12.413607252928022                                                                                                                     │
│ Average rollout reward:          -12.041534926574673                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K140/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━[0m [35m94.0%[0m Elapsed: [33m0:03:25[0m Remaining: [36m0:00:14[0m   1.47 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 140000 ===                                                                                                                                                 │
│ 140001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 28048, 86791, 87305, 87695, 88085, 90378, 128220, 140000]                                                                                                   │
│ Average cumulative reward:       -12.413607252928022                                                                                                                     │
│ Average rollout reward:          -12.041534926574673                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K141/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━[0m [35m94.6%[0m Elapsed: [33m0:03:25[0m Remaining: [36m0:00:12[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 141000 ===                                                                                                                                                 │
│ 141001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 69765, 69797, 122786, 139958, 141000]                                                                                                                       │
│ Average cumulative reward:       -11.943935084076399                                                                                                                     │
│ Average rollout reward:          -11.518469969382851                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K141/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━[0m [35m94.6%[0m Elapsed: [33m0:03:26[0m Remaining: [36m0:00:12[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 141000 ===                                                                                                                                                 │
│ 141001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 69765, 69797, 122786, 139958, 141000]                                                                                                                       │
│ Average cumulative reward:       -11.943935084076399                                                                                                                     │
│ Average rollout reward:          -11.518469969382851                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯0m[38;5;237m━━[0m [35m94.6%[0m Elapsed: [33m0:03:26[0m Remaining: [36m0:00:12[0m   1.47 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 141000 ===                                                                                                                                                 │
│ 141001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 69765, 69797, 122786, 139958, 141000]                                                                                                                       │
│ Average cumulative reward:       -11.943935084076399                                                                                                                     │
│ Average rollout reward:          -11.518469969382851                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K142/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━[0m [35m95.3%[0m Elapsed: [33m0:03:27[0m Remaining: [36m0:00:11[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 142000 ===                                                                                                                                                 │
│ 142001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 19196, 141349, 141416, 141981, 142000]                                                                                                                      │
│ Average cumulative reward:       -12.493375635279989                                                                                                                     │
│ Average rollout reward:          -12.072608401536353                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K142/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━[0m [35m95.3%[0m Elapsed: [33m0:03:27[0m Remaining: [36m0:00:11[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 142000 ===                                                                                                                                                 │
│ 142001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 19196, 141349, 141416, 141981, 142000]                                                                                                                      │
│ Average cumulative reward:       -12.493375635279989                                                                                                                     │
│ Average rollout reward:          -12.072608401536353                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K142/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━[0m [35m95.3%[0m Elapsed: [33m0:03:28[0m Remaining: [36m0:00:11[0m   1.47 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 142000 ===                                                                                                                                                 │
│ 142001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 19196, 141349, 141416, 141981, 142000]                                                                                                                      │
│ Average cumulative reward:       -12.493375635279989                                                                                                                     │
│ Average rollout reward:          -12.072608401536353                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K143/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━[0m [35m96.0%[0m Elapsed: [33m0:03:28[0m Remaining: [36m0:00:09[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 143000 ===                                                                                                                                                 │
│ 143001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 91658, 91664, 91686, 93656, 143000]                                                                                                                         │
│ Average cumulative reward:       -12.654967914883755                                                                                                                     │
│ Average rollout reward:          -12.202429168408171                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K143/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━[0m [35m96.0%[0m Elapsed: [33m0:03:29[0m Remaining: [36m0:00:09[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 143000 ===                                                                                                                                                 │
│ 143001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 91658, 91664, 91686, 93656, 143000]                                                                                                                         │
│ Average cumulative reward:       -12.654967914883755                                                                                                                     │
│ Average rollout reward:          -12.202429168408171                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K143/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━[0m [35m96.0%[0m Elapsed: [33m0:03:29[0m Remaining: [36m0:00:09[0m   1.47 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 143000 ===                                                                                                                                                 │
│ 143001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 91658, 91664, 91686, 93656, 143000]                                                                                                                         │
│ Average cumulative reward:       -12.654967914883755                                                                                                                     │
│ Average rollout reward:          -12.202429168408171                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K144/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m96.6%[0m Elapsed: [33m0:03:30[0m Remaining: [36m0:00:08[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 144000 ===                                                                                                                                                 │
│ 144001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 91658, 143106, 143133, 143248, 143556, 144000]                                                                                                              │
│ Average cumulative reward:       -12.900889888227589                                                                                                                     │
│ Average rollout reward:          -12.46883229284756                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K144/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m96.6%[0m Elapsed: [33m0:03:30[0m Remaining: [36m0:00:08[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 144000 ===                                                                                                                                                 │
│ 144001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 91658, 143106, 143133, 143248, 143556, 144000]                                                                                                              │
│ Average cumulative reward:       -12.900889888227589                                                                                                                     │
│ Average rollout reward:          -12.46883229284756                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K144/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m96.6%[0m Elapsed: [33m0:03:31[0m Remaining: [36m0:00:08[0m   1.47 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 144000 ===                                                                                                                                                 │
│ 144001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 91658, 143106, 143133, 143248, 143556, 144000]                                                                                                              │
│ Average cumulative reward:       -12.900889888227589                                                                                                                     │
│ Average rollout reward:          -12.46883229284756                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K145/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m97.3%[0m Elapsed: [33m0:03:31[0m Remaining: [36m0:00:06[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 145000 ===                                                                                                                                                 │
│ 145001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 5510, 15522, 15692, 74599, 86916, 145000]                                                                                                                   │
│ Average cumulative reward:       -12.860285117081515                                                                                                                     │
│ Average rollout reward:          -12.396444483445567                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K145/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m97.3%[0m Elapsed: [33m0:03:32[0m Remaining: [36m0:00:06[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 145000 ===                                                                                                                                                 │
│ 145001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 5510, 15522, 15692, 74599, 86916, 145000]                                                                                                                   │
│ Average cumulative reward:       -12.860285117081515                                                                                                                     │
│ Average rollout reward:          -12.396444483445567                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K145/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m97.3%[0m Elapsed: [33m0:03:32[0m Remaining: [36m0:00:06[0m   1.47 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 145000 ===                                                                                                                                                 │
│ 145001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 5510, 15522, 15692, 74599, 86916, 145000]                                                                                                                   │
│ Average cumulative reward:       -12.860285117081515                                                                                                                     │
│ Average rollout reward:          -12.396444483445567                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K146/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m [35m98.0%[0m Elapsed: [33m0:03:33[0m Remaining: [36m0:00:05[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 146000 ===                                                                                                                                                 │
│ 146001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 71794, 86806, 87321, 143418, 145356, 146000]                                                                                                                │
│ Average cumulative reward:       -12.545357805714133                                                                                                                     │
│ Average rollout reward:          -12.079264432595666                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K146/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m [35m98.0%[0m Elapsed: [33m0:03:33[0m Remaining: [36m0:00:05[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 146000 ===                                                                                                                                                 │
│ 146001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 71794, 86806, 87321, 143418, 145356, 146000]                                                                                                                │
│ Average cumulative reward:       -12.545357805714133                                                                                                                     │
│ Average rollout reward:          -12.079264432595666                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K146/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m [35m98.0%[0m Elapsed: [33m0:03:34[0m Remaining: [36m0:00:05[0m   1.47 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 146000 ===                                                                                                                                                 │
│ 146001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 71794, 86806, 87321, 143418, 145356, 146000]                                                                                                                │
│ Average cumulative reward:       -12.545357805714133                                                                                                                     │
│ Average rollout reward:          -12.079264432595666                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K147/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m [35m98.7%[0m Elapsed: [33m0:03:34[0m Remaining: [36m0:00:04[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 147000 ===                                                                                                                                                 │
│ 147001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 146673, 146952, 146956, 146965, 147000]                                                                                                                     │
│ Average cumulative reward:       -12.405520485898087                                                                                                                     │
│ Average rollout reward:          -11.991847245507444                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K147/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m [35m98.7%[0m Elapsed: [33m0:03:35[0m Remaining: [36m0:00:04[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 147000 ===                                                                                                                                                 │
│ 147001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 146673, 146952, 146956, 146965, 147000]                                                                                                                     │
│ Average cumulative reward:       -12.405520485898087                                                                                                                     │
│ Average rollout reward:          -11.991847245507444                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K147/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m [35m98.7%[0m Elapsed: [33m0:03:35[0m Remaining: [36m0:00:04[0m   1.47 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 147000 ===                                                                                                                                                 │
│ 147001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 146673, 146952, 146956, 146965, 147000]                                                                                                                     │
│ Average cumulative reward:       -12.405520485898087                                                                                                                     │
│ Average rollout reward:          -11.991847245507444                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K148/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m [35m99.3%[0m Elapsed: [33m0:03:36[0m Remaining: [36m0:00:02[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 148000 ===                                                                                                                                                 │
│ 148001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 177, 25916, 25979, 26480, 148000]                                                                                                                           │
│ Average cumulative reward:       -12.312117029849809                                                                                                                     │
│ Average rollout reward:          -11.914671173832629                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K148/149 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m [35m99.3%[0m Elapsed: [33m0:03:36[0m Remaining: [36m0:00:02[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 148000 ===                                                                                                                                                 │
│ 148001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 177, 25916, 25979, 26480, 148000]                                                                                                                           │
│ Average cumulative reward:       -12.312117029849809                                                                                                                     │
│ Average rollout reward:          -11.914671173832629                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯[0m [35m99.3%[0m Elapsed: [33m0:03:37[0m Remaining: [36m0:00:02[0m   1.47 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 148000 ===                                                                                                                                                 │
│ 148001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 177, 25916, 25979, 26480, 148000]                                                                                                                           │
│ Average cumulative reward:       -12.312117029849809                                                                                                                     │
│ Average rollout reward:          -11.914671173832629                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K149/149 [38;2;114;156;31m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m100.0%[0m Elapsed: [33m0:03:37[0m Remaining: [36m0:00:00[0m   1.46 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 148000 ===                                                                                                                                                 │
│ 148001  nodes in tree                                                                                                                                                    │
│ Path: [0, 4, 177, 25916, 25979, 26480, 148000]                                                                                                                           │
│ Average cumulative reward:       -12.312117029849809                                                                                                                     │
│ Average rollout reward:          -11.914671173832629                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -2.978565416648618                                                                                                                              │
│ Best path: [0, 4, 31045, 31288, 31292, 32110, 34373, 35530, 73890, 82287, 82298, 82720]                                                                                  │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
[?25hNode 0 is not terminal. Continue.
Node 4 is not terminal. Continue.
Node 2827 is not terminal. Continue.
Node 101523 is not terminal. Continue.
Node 101527 is not terminal. Continue.
Node 101537 is not terminal. Continue.
Node 101540 is not terminal. Continue.
Node 101542 is not terminal. Continue.
Node 101551 is not terminal. Continue.
Node 101563 is not terminal. Continue.
Node 101673 is not terminal. Continue.
Node 101684 is not terminal. Continue.
No children found. Stop.
Node 0 is not terminal. Continue.
Node 2 is not terminal. Continue.
Node 353 is not terminal. Continue.
Node 383 is not terminal. Continue.
Node 471 is not terminal. Continue.
No children found. Stop.
Node 0 is not terminal. Continue.
Node 4 is not terminal. Continue.
Node 2827 is not terminal. Continue.
Node 101523 is not terminal. Continue.
Node 101527 is not terminal. Continue.
Node 101537 is not terminal. Continue.
Node 101540 is not terminal. Continue.
Node 101542 is not terminal. Continue.
Node 101551 is not terminal. Continue.
Node 101563 is not terminal. Continue.
Node 101673 is not terminal. Continue.
Node 101684 is not terminal. Continue.
No children found. Stop.
=== RESULT ===
By Visits: estimated reward: -12.00534074306254
sqrt_visser_coupled [3.6924827 0.6034996]
sqrt_visser_coupled [1.3687549 1.3759973]
sqrt_nsv [4.1735826 1.5983189]
By Value: estimated reward: -13.361782788968586
sqrt_nsv [2.9560091 4.2181187]
By Best Value: estimated reward: 0
sqrt_visser_coupled [3.87682   0.7539997]
sqrt_visser_coupled [1.4710367 1.5965489]
sqrt_nsv [3.6262176 0.8792589]
sqrt_nsv [3.3663144 1.        0.        0.       ]
sqrt_nsv [3, 1]
sqrt_nsv [3, 1]
sqrt_nsv [3, 1]
sqrt_nsv [3, 1]
sqrt_nsv [3, 1]
Best value of root node:
-2.978565416648618
Best root policy:
sqrt_visser_coupled [3.87682   0.7539997]
sqrt_visser_coupled [1.4710367 1.5965489]
sqrt_nsv [3.6262176 0.8792589]
sqrt_nsv [3.3663144 1.        0.        0.       ]
sqrt_nsv [3, 1]
sqrt_nsv [3, 1]
sqrt_nsv [3, 1]
sqrt_nsv [3, 1]
sqrt_nsv [3, 1]
=== END ===
Finished making algorithm
