Matrix distribution: unif
Matrix distribution config: {'c': 0.25, 'd': 10000, 'eps': 0.001}
Initial matrix shape: torch.Size([10000, 10000])
Algorithm name: mcts
Algorithm config: {'c_ucb': 5.0, 'alpha_pw': 0.4, 'epsilon': 1e-06, 'EXPLORE_K': 5, 'early_termination_epsilon': 1e-05, 'budget': 80000, 'print_every': 1000, 'max_termination_count': 10, 'tree_initial_capacity': 10000, 'device': 'cuda', 'actions': [['sign_ns', [[0, 0], [5, 5]]], ['sign_newton', [[0], [40]]], ['sign_quintic', [[0, 0, 0], [5, 5, 5]]], ['sign_halley', [[0, 0, 0], [40, 40, 40]]]], 'initialize_with_baselines': True}
Actions: ['sign_halley', 'sign_newton', 'sign_ns', 'sign_quintic']
Action sign_halley took 1.0 times longer than sign_halley
Action sign_newton took 0.37188211073323263 times longer than sign_halley
Action sign_ns took 0.2719659412661313 times longer than sign_halley
Action sign_quintic took 0.40676209954893716 times longer than sign_halley
Skipping sign_newton_variant because not all actions are in the tree
Skipping inv_ns because not all actions are in the tree
Skipping inv_ns_chebyshev because not all actions are in the tree
Skipping sqrt_db because not all actions are in the tree
Skipping sqrt_nsv because not all actions are in the tree
Skipping sqrt_visser because not all actions are in the tree
Skipping sqrt_newton because not all actions are in the tree
Skipping sqrt_visser_coupled because not all actions are in the tree
Skipping sqrt_newton_coupled because not all actions are in the tree
Skipping proot_newton because not all actions are in the tree
Skipping proot_visser because not all actions are in the tree
Skipping proot_iannazzo because not all actions are in the tree
[?25l0/79 [38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m0.0%[0m Elapsed: [33m0:00:00[0m Remaining: [36m-:--:--[0m 501971.96 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 0 ===                                                                                                                                                      │
│ 1  nodes in tree                                                                                                                                                         │
│ [-3.99162535 -3.99162535]                                                                                                                                                │
│ [-3.09154152 -3.09154152]                                                                                                                                                │
│ [-2.50325861 -2.50325861]                                                                                                                                                │
│ [-2.00367776 -2.00367776 -2.00367776]                                                                                                                                    │
│ [-1.85941055 -1.85941055 -1.85941055]                                                                                                                                    │
│ [-1.83162799 -1.83162799 -1.83162799 -1.45974588]                                                                                                                        │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K0/79 [38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m0.0%[0m Elapsed: [33m0:00:01[0m Remaining: [36m-:--:--[0m 1005130.07 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 0 ===                                                                                                                                                      │
│ 1  nodes in tree                                                                                                                                                         │
│ [-3.99162535 -3.99162535]                                                                                                                                                │
│ [-3.09154152 -3.09154152]                                                                                                                                                │
│ [-2.50325861 -2.50325861]                                                                                                                                                │
│ [-2.00367776 -2.00367776 -2.00367776]                                                                                                                                    │
│ [-1.85941055 -1.85941055 -1.85941055]                                                                                                                                    │
│ [-1.83162799 -1.83162799 -1.83162799 -1.45974588]                                                                                                                        │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K0/79 [38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m0.0%[0m Elapsed: [33m0:00:01[0m Remaining: [36m-:--:--[0m 1507793.96 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 0 ===                                                                                                                                                      │
│ 1  nodes in tree                                                                                                                                                         │
│ [-3.99162535 -3.99162535]                                                                                                                                                │
│ [-3.09154152 -3.09154152]                                                                                                                                                │
│ [-2.50325861 -2.50325861]                                                                                                                                                │
│ [-2.00367776 -2.00367776 -2.00367776]                                                                                                                                    │
│ [-1.85941055 -1.85941055 -1.85941055]                                                                                                                                    │
│ [-1.83162799 -1.83162799 -1.83162799 -1.45974588]                                                                                                                        │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯m0:00:02[0m Remaining: [36m-:--:--[0m 2010877.84 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 0 ===                                                                                                                                                      │
│ 1  nodes in tree                                                                                                                                                         │
│ [-3.99162535 -3.99162535]                                                                                                                                                │
│ [-3.09154152 -3.09154152]                                                                                                                                                │
│ [-2.50325861 -2.50325861]                                                                                                                                                │
│ [-2.00367776 -2.00367776 -2.00367776]                                                                                                                                    │
│ [-1.85941055 -1.85941055 -1.85941055]                                                                                                                                    │
│ [-1.83162799 -1.83162799 -1.83162799 -1.45974588]                                                                                                                        │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K1/79 [38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m1.3%[0m Elapsed: [33m0:00:02[0m Remaining: [36m-:--:--[0m   2.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 1000 ===                                                                                                                                                   │
│ 1001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 173, 174, 1000]                                                                                                                                             │
│ Average cumulative reward:       -6.778709387978023                                                                                                                      │
│ Average rollout reward:          -6.418627255674581                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K1/79 [38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m1.3%[0m Elapsed: [33m0:00:03[0m Remaining: [36m-:--:--[0m   3.02 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 1000 ===                                                                                                                                                   │
│ 1001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 173, 174, 1000]                                                                                                                                             │
│ Average cumulative reward:       -6.778709387978023                                                                                                                      │
│ Average rollout reward:          -6.418627255674581                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K1/79 [38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m1.3%[0m Elapsed: [33m0:00:03[0m Remaining: [36m-:--:--[0m   3.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 1000 ===                                                                                                                                                   │
│ 1001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 173, 174, 1000]                                                                                                                                             │
│ Average cumulative reward:       -6.778709387978023                                                                                                                      │
│ Average rollout reward:          -6.418627255674581                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K1/79 [38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m1.3%[0m Elapsed: [33m0:00:04[0m Remaining: [36m-:--:--[0m   4.02 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 1000 ===                                                                                                                                                   │
│ 1001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 173, 174, 1000]                                                                                                                                             │
│ Average cumulative reward:       -6.778709387978023                                                                                                                      │
│ Average rollout reward:          -6.418627255674581                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K1/79 [38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m1.3%[0m Elapsed: [33m0:00:04[0m Remaining: [36m-:--:--[0m   4.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 1000 ===                                                                                                                                                   │
│ 1001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 173, 174, 1000]                                                                                                                                             │
│ Average cumulative reward:       -6.778709387978023                                                                                                                      │
│ Average rollout reward:          -6.418627255674581                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K2/79 [38;2;249;38;114m━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m2.5%[0m Elapsed: [33m0:00:05[0m Remaining: [36m0:03:17[0m   2.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 2000 ===                                                                                                                                                   │
│ 2001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 733, 735, 2000]                                                                                                                                             │
│ Average cumulative reward:       -7.0658014307937345                                                                                                                     │
│ Average rollout reward:          -6.615312948915904                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K2/79 [38;2;249;38;114m━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m2.5%[0m Elapsed: [33m0:00:05[0m Remaining: [36m0:03:17[0m   2.77 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 2000 ===                                                                                                                                                   │
│ 2001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 733, 735, 2000]                                                                                                                                             │
│ Average cumulative reward:       -7.0658014307937345                                                                                                                     │
│ Average rollout reward:          -6.615312948915904                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K2/79 [38;2;249;38;114m━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m2.5%[0m Elapsed: [33m0:00:06[0m Remaining: [36m0:03:17[0m   3.02 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 2000 ===                                                                                                                                                   │
│ 2001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 733, 735, 2000]                                                                                                                                             │
│ Average cumulative reward:       -7.0658014307937345                                                                                                                     │
│ Average rollout reward:          -6.615312948915904                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K2/79 [38;2;249;38;114m━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m2.5%[0m Elapsed: [33m0:00:06[0m Remaining: [36m0:03:17[0m   3.27 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 2000 ===                                                                                                                                                   │
│ 2001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 733, 735, 2000]                                                                                                                                             │
│ Average cumulative reward:       -7.0658014307937345                                                                                                                     │
│ Average rollout reward:          -6.615312948915904                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K2/79 [38;2;249;38;114m━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m2.5%[0m Elapsed: [33m0:00:07[0m Remaining: [36m0:03:17[0m   3.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 2000 ===                                                                                                                                                   │
│ 2001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 733, 735, 2000]                                                                                                                                             │
│ Average cumulative reward:       -7.0658014307937345                                                                                                                     │
│ Average rollout reward:          -6.615312948915904                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K3/79 [38;2;249;38;114m━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m3.8%[0m Elapsed: [33m0:00:07[0m Remaining: [36m0:03:12[0m   2.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 3000 ===                                                                                                                                                   │
│ 3001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 79, 82, 1566, 2028, 3000]                                                                                                                                   │
│ Average cumulative reward:       -6.994701823382031                                                                                                                      │
│ Average rollout reward:          -6.520935733768088                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K3/79 [38;2;249;38;114m━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m3.8%[0m Elapsed: [33m0:00:08[0m Remaining: [36m0:03:12[0m   2.68 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 3000 ===                                                                                                                                                   │
│ 3001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 79, 82, 1566, 2028, 3000]                                                                                                                                   │
│ Average cumulative reward:       -6.994701823382031                                                                                                                      │
│ Average rollout reward:          -6.520935733768088                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K3/79 [38;2;249;38;114m━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m3.8%[0m Elapsed: [33m0:00:08[0m Remaining: [36m0:03:12[0m   2.85 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 3000 ===                                                                                                                                                   │
│ 3001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 79, 82, 1566, 2028, 3000]                                                                                                                                   │
│ Average cumulative reward:       -6.994701823382031                                                                                                                      │
│ Average rollout reward:          -6.520935733768088                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K3/79 [38;2;249;38;114m━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m3.8%[0m Elapsed: [33m0:00:09[0m Remaining: [36m0:03:12[0m   3.02 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 3000 ===                                                                                                                                                   │
│ 3001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 79, 82, 1566, 2028, 3000]                                                                                                                                   │
│ Average cumulative reward:       -6.994701823382031                                                                                                                      │
│ Average rollout reward:          -6.520935733768088                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K3/79 [38;2;249;38;114m━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m3.8%[0m Elapsed: [33m0:00:09[0m Remaining: [36m0:03:12[0m   3.19 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 3000 ===                                                                                                                                                   │
│ 3001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 79, 82, 1566, 2028, 3000]                                                                                                                                   │
│ Average cumulative reward:       -6.994701823382031                                                                                                                      │
│ Average rollout reward:          -6.520935733768088                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K4/79 [38;2;249;38;114m━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m5.1%[0m Elapsed: [33m0:00:10[0m Remaining: [36m0:03:12[0m   2.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 4000 ===                                                                                                                                                   │
│ 4001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 3919, 3920, 3970, 4000]                                                                                                                                     │
│ Average cumulative reward:       -6.658987504815949                                                                                                                      │
│ Average rollout reward:          -6.2738093723333135                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K4/79 [38;2;249;38;114m━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m5.1%[0m Elapsed: [33m0:00:10[0m Remaining: [36m0:03:12[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 4000 ===                                                                                                                                                   │
│ 4001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 3919, 3920, 3970, 4000]                                                                                                                                     │
│ Average cumulative reward:       -6.658987504815949                                                                                                                      │
│ Average rollout reward:          -6.2738093723333135                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K4/79 [38;2;249;38;114m━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m5.1%[0m Elapsed: [33m0:00:11[0m Remaining: [36m0:03:12[0m   2.77 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 4000 ===                                                                                                                                                   │
│ 4001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 3919, 3920, 3970, 4000]                                                                                                                                     │
│ Average cumulative reward:       -6.658987504815949                                                                                                                      │
│ Average rollout reward:          -6.2738093723333135                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━[0m [35m5.1%[0m Elapsed: [33m0:00:11[0m Remaining: [36m0:03:12[0m   2.89 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 4000 ===                                                                                                                                                   │
│ 4001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 3919, 3920, 3970, 4000]                                                                                                                                     │
│ Average cumulative reward:       -6.658987504815949                                                                                                                      │
│ Average rollout reward:          -6.2738093723333135                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K4/79 [38;2;249;38;114m━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m5.1%[0m Elapsed: [33m0:00:12[0m Remaining: [36m0:03:12[0m   3.02 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 4000 ===                                                                                                                                                   │
│ 4001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 3919, 3920, 3970, 4000]                                                                                                                                     │
│ Average cumulative reward:       -6.658987504815949                                                                                                                      │
│ Average rollout reward:          -6.2738093723333135                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K5/79 [38;2;249;38;114m━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m6.3%[0m Elapsed: [33m0:00:12[0m Remaining: [36m0:03:09[0m   2.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 5000 ===                                                                                                                                                   │
│ 5001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 555, 557, 1746, 1779, 5000]                                                                                                                                 │
│ Average cumulative reward:       -6.929924315155034                                                                                                                      │
│ Average rollout reward:          -6.44501348167927                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K5/79 [38;2;249;38;114m━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m6.3%[0m Elapsed: [33m0:00:13[0m Remaining: [36m0:03:09[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 5000 ===                                                                                                                                                   │
│ 5001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 555, 557, 1746, 1779, 5000]                                                                                                                                 │
│ Average cumulative reward:       -6.929924315155034                                                                                                                      │
│ Average rollout reward:          -6.44501348167927                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K5/79 [38;2;249;38;114m━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m6.3%[0m Elapsed: [33m0:00:13[0m Remaining: [36m0:03:09[0m   2.72 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 5000 ===                                                                                                                                                   │
│ 5001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 555, 557, 1746, 1779, 5000]                                                                                                                                 │
│ Average cumulative reward:       -6.929924315155034                                                                                                                      │
│ Average rollout reward:          -6.44501348167927                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K5/79 [38;2;249;38;114m━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m6.3%[0m Elapsed: [33m0:00:14[0m Remaining: [36m0:03:09[0m   2.82 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 5000 ===                                                                                                                                                   │
│ 5001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 555, 557, 1746, 1779, 5000]                                                                                                                                 │
│ Average cumulative reward:       -6.929924315155034                                                                                                                      │
│ Average rollout reward:          -6.44501348167927                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K5/79 [38;2;249;38;114m━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m6.3%[0m Elapsed: [33m0:00:14[0m Remaining: [36m0:03:09[0m   2.92 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 5000 ===                                                                                                                                                   │
│ 5001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 555, 557, 1746, 1779, 5000]                                                                                                                                 │
│ Average cumulative reward:       -6.929924315155034                                                                                                                      │
│ Average rollout reward:          -6.44501348167927                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K6/79 [38;2;249;38;114m━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m7.6%[0m Elapsed: [33m0:00:15[0m Remaining: [36m0:03:06[0m   2.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 6000 ===                                                                                                                                                   │
│ 6001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 5910, 5914, 5950, 5955, 6000]                                                                                                                               │
│ Average cumulative reward:       -7.168386387990393                                                                                                                      │
│ Average rollout reward:          -6.67814150622775                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K6/79 [38;2;249;38;114m━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m7.6%[0m Elapsed: [33m0:00:15[0m Remaining: [36m0:03:06[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 6000 ===                                                                                                                                                   │
│ 6001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 5910, 5914, 5950, 5955, 6000]                                                                                                                               │
│ Average cumulative reward:       -7.168386387990393                                                                                                                      │
│ Average rollout reward:          -6.67814150622775                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K6/79 [38;2;249;38;114m━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m7.6%[0m Elapsed: [33m0:00:16[0m Remaining: [36m0:03:06[0m   2.68 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 6000 ===                                                                                                                                                   │
│ 6001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 5910, 5914, 5950, 5955, 6000]                                                                                                                               │
│ Average cumulative reward:       -7.168386387990393                                                                                                                      │
│ Average rollout reward:          -6.67814150622775                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K6/79 [38;2;249;38;114m━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m7.6%[0m Elapsed: [33m0:00:16[0m Remaining: [36m0:03:06[0m   2.77 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 6000 ===                                                                                                                                                   │
│ 6001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 5910, 5914, 5950, 5955, 6000]                                                                                                                               │
│ Average cumulative reward:       -7.168386387990393                                                                                                                      │
│ Average rollout reward:          -6.67814150622775                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K6/79 [38;2;249;38;114m━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m7.6%[0m Elapsed: [33m0:00:17[0m Remaining: [36m0:03:06[0m   2.85 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 6000 ===                                                                                                                                                   │
│ 6001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 5910, 5914, 5950, 5955, 6000]                                                                                                                               │
│ Average cumulative reward:       -7.168386387990393                                                                                                                      │
│ Average rollout reward:          -6.67814150622775                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K7/79 [38;2;249;38;114m━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m8.9%[0m Elapsed: [33m0:00:17[0m Remaining: [36m0:03:03[0m   2.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 7000 ===                                                                                                                                                   │
│ 7001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 146, 148, 153, 2013, 7000]                                                                                                                                  │
│ Average cumulative reward:       -7.057755401077561                                                                                                                      │
│ Average rollout reward:          -6.571301474024574                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K7/79 [38;2;249;38;114m━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m8.9%[0m Elapsed: [33m0:00:18[0m Remaining: [36m0:03:03[0m   2.59 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 7000 ===                                                                                                                                                   │
│ 7001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 146, 148, 153, 2013, 7000]                                                                                                                                  │
│ Average cumulative reward:       -7.057755401077561                                                                                                                      │
│ Average rollout reward:          -6.571301474024574                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K7/79 [38;2;249;38;114m━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m8.9%[0m Elapsed: [33m0:00:18[0m Remaining: [36m0:03:03[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 7000 ===                                                                                                                                                   │
│ 7001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 146, 148, 153, 2013, 7000]                                                                                                                                  │
│ Average cumulative reward:       -7.057755401077561                                                                                                                      │
│ Average rollout reward:          -6.571301474024574                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K7/79 [38;2;249;38;114m━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m8.9%[0m Elapsed: [33m0:00:19[0m Remaining: [36m0:03:03[0m   2.73 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 7000 ===                                                                                                                                                   │
│ 7001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 146, 148, 153, 2013, 7000]                                                                                                                                  │
│ Average cumulative reward:       -7.057755401077561                                                                                                                      │
│ Average rollout reward:          -6.571301474024574                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K7/79 [38;2;249;38;114m━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m8.9%[0m Elapsed: [33m0:00:19[0m Remaining: [36m0:03:03[0m   2.80 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 7000 ===                                                                                                                                                   │
│ 7001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 146, 148, 153, 2013, 7000]                                                                                                                                  │
│ Average cumulative reward:       -7.057755401077561                                                                                                                      │
│ Average rollout reward:          -6.571301474024574                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K8/79 [38;2;249;38;114m━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m10.1%[0m Elapsed: [33m0:00:20[0m Remaining: [36m0:03:01[0m   2.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 8000 ===                                                                                                                                                   │
│ 8001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 62, 65, 859, 1007, 8000]                                                                                                                                    │
│ Average cumulative reward:       -7.140295972932251                                                                                                                      │
│ Average rollout reward:          -6.644583151471818                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K8/79 [38;2;249;38;114m━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m10.1%[0m Elapsed: [33m0:00:20[0m Remaining: [36m0:03:01[0m   2.58 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 8000 ===                                                                                                                                                   │
│ 8001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 62, 65, 859, 1007, 8000]                                                                                                                                    │
│ Average cumulative reward:       -7.140295972932251                                                                                                                      │
│ Average rollout reward:          -6.644583151471818                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K8/79 [38;2;249;38;114m━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m10.1%[0m Elapsed: [33m0:00:21[0m Remaining: [36m0:03:01[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 8000 ===                                                                                                                                                   │
│ 8001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 62, 65, 859, 1007, 8000]                                                                                                                                    │
│ Average cumulative reward:       -7.140295972932251                                                                                                                      │
│ Average rollout reward:          -6.644583151471818                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━[0m [35m10.1%[0m Elapsed: [33m0:00:21[0m Remaining: [36m0:03:01[0m   2.71 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 8000 ===                                                                                                                                                   │
│ 8001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 62, 65, 859, 1007, 8000]                                                                                                                                    │
│ Average cumulative reward:       -7.140295972932251                                                                                                                      │
│ Average rollout reward:          -6.644583151471818                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K8/79 [38;2;249;38;114m━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m10.1%[0m Elapsed: [33m0:00:22[0m Remaining: [36m0:03:01[0m   2.77 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 8000 ===                                                                                                                                                   │
│ 8001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 62, 65, 859, 1007, 8000]                                                                                                                                    │
│ Average cumulative reward:       -7.140295972932251                                                                                                                      │
│ Average rollout reward:          -6.644583151471818                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K9/79 [38;2;249;38;114m━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m11.4%[0m Elapsed: [33m0:00:22[0m Remaining: [36m0:02:58[0m   2.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 9000 ===                                                                                                                                                   │
│ 9001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 7091, 7095, 8916, 9000]                                                                                                                                     │
│ Average cumulative reward:       -7.408220971198451                                                                                                                      │
│ Average rollout reward:          -6.878505137241204                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K9/79 [38;2;249;38;114m━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m11.4%[0m Elapsed: [33m0:00:23[0m Remaining: [36m0:02:58[0m   2.57 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 9000 ===                                                                                                                                                   │
│ 9001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 7091, 7095, 8916, 9000]                                                                                                                                     │
│ Average cumulative reward:       -7.408220971198451                                                                                                                      │
│ Average rollout reward:          -6.878505137241204                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K9/79 [38;2;249;38;114m━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m11.4%[0m Elapsed: [33m0:00:23[0m Remaining: [36m0:02:58[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 9000 ===                                                                                                                                                   │
│ 9001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 7091, 7095, 8916, 9000]                                                                                                                                     │
│ Average cumulative reward:       -7.408220971198451                                                                                                                      │
│ Average rollout reward:          -6.878505137241204                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K9/79 [38;2;249;38;114m━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m11.4%[0m Elapsed: [33m0:00:24[0m Remaining: [36m0:02:58[0m   2.69 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 9000 ===                                                                                                                                                   │
│ 9001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 7091, 7095, 8916, 9000]                                                                                                                                     │
│ Average cumulative reward:       -7.408220971198451                                                                                                                      │
│ Average rollout reward:          -6.878505137241204                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K9/79 [38;2;249;38;114m━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m11.4%[0m Elapsed: [33m0:00:24[0m Remaining: [36m0:02:58[0m   2.74 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 9000 ===                                                                                                                                                   │
│ 9001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 7091, 7095, 8916, 9000]                                                                                                                                     │
│ Average cumulative reward:       -7.408220971198451                                                                                                                      │
│ Average rollout reward:          -6.878505137241204                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K9/79 [38;2;249;38;114m━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m11.4%[0m Elapsed: [33m0:00:25[0m Remaining: [36m0:02:58[0m   2.80 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 9000 ===                                                                                                                                                   │
│ 9001  nodes in tree                                                                                                                                                      │
│ Path: [0, 2, 7091, 7095, 8916, 9000]                                                                                                                                     │
│ Average cumulative reward:       -7.408220971198451                                                                                                                      │
│ Average rollout reward:          -6.878505137241204                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K10/79 [38;2;249;38;114m━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m12.7%[0m Elapsed: [33m0:00:25[0m Remaining: [36m0:02:56[0m   2.57 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 10000 ===                                                                                                                                                  │
│ 10001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 9848, 9850, 9854, 9878, 10000]                                                                                                                              │
│ Average cumulative reward:       -6.836452589005848                                                                                                                      │
│ Average rollout reward:          -6.286212848408626                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K10/79 [38;2;249;38;114m━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m12.7%[0m Elapsed: [33m0:00:26[0m Remaining: [36m0:02:56[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 10000 ===                                                                                                                                                  │
│ 10001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 9848, 9850, 9854, 9878, 10000]                                                                                                                              │
│ Average cumulative reward:       -6.836452589005848                                                                                                                      │
│ Average rollout reward:          -6.286212848408626                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K10/79 [38;2;249;38;114m━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m12.7%[0m Elapsed: [33m0:00:26[0m Remaining: [36m0:02:56[0m   2.67 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 10000 ===                                                                                                                                                  │
│ 10001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 9848, 9850, 9854, 9878, 10000]                                                                                                                              │
│ Average cumulative reward:       -6.836452589005848                                                                                                                      │
│ Average rollout reward:          -6.286212848408626                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K10/79 [38;2;249;38;114m━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m12.7%[0m Elapsed: [33m0:00:27[0m Remaining: [36m0:02:56[0m   2.72 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 10000 ===                                                                                                                                                  │
│ 10001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 9848, 9850, 9854, 9878, 10000]                                                                                                                              │
│ Average cumulative reward:       -6.836452589005848                                                                                                                      │
│ Average rollout reward:          -6.286212848408626                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K10/79 [38;2;249;38;114m━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m12.7%[0m Elapsed: [33m0:00:27[0m Remaining: [36m0:02:56[0m   2.77 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 10000 ===                                                                                                                                                  │
│ 10001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 9848, 9850, 9854, 9878, 10000]                                                                                                                              │
│ Average cumulative reward:       -6.836452589005848                                                                                                                      │
│ Average rollout reward:          -6.286212848408626                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K11/79 [38;2;249;38;114m━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m13.9%[0m Elapsed: [33m0:00:28[0m Remaining: [36m0:02:53[0m   2.56 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 11000 ===                                                                                                                                                  │
│ 11001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1262, 1263, 11000]                                                                                                                                          │
│ Average cumulative reward:       -6.70778837528341                                                                                                                       │
│ Average rollout reward:          -6.190640282811359                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K11/79 [38;2;249;38;114m━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m13.9%[0m Elapsed: [33m0:00:28[0m Remaining: [36m0:02:53[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 11000 ===                                                                                                                                                  │
│ 11001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1262, 1263, 11000]                                                                                                                                          │
│ Average cumulative reward:       -6.70778837528341                                                                                                                       │
│ Average rollout reward:          -6.190640282811359                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K11/79 [38;2;249;38;114m━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m13.9%[0m Elapsed: [33m0:00:29[0m Remaining: [36m0:02:53[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 11000 ===                                                                                                                                                  │
│ 11001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1262, 1263, 11000]                                                                                                                                          │
│ Average cumulative reward:       -6.70778837528341                                                                                                                       │
│ Average rollout reward:          -6.190640282811359                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K11/79 [38;2;249;38;114m━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m13.9%[0m Elapsed: [33m0:00:30[0m Remaining: [36m0:02:53[0m   2.73 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 11000 ===                                                                                                                                                  │
│ 11001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1262, 1263, 11000]                                                                                                                                          │
│ Average cumulative reward:       -6.70778837528341                                                                                                                       │
│ Average rollout reward:          -6.190640282811359                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K11/79 [38;2;249;38;114m━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m13.9%[0m Elapsed: [33m0:00:30[0m Remaining: [36m0:02:53[0m   2.78 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 11000 ===                                                                                                                                                  │
│ 11001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1262, 1263, 11000]                                                                                                                                          │
│ Average cumulative reward:       -6.70778837528341                                                                                                                       │
│ Average rollout reward:          -6.190640282811359                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K12/79 [38;2;249;38;114m━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m15.2%[0m Elapsed: [33m0:00:31[0m Remaining: [36m0:02:52[0m   2.59 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 12000 ===                                                                                                                                                  │
│ 12001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5472, 5473, 7312, 12000]                                                                                                                                    │
│ Average cumulative reward:       -7.102755914738757                                                                                                                      │
│ Average rollout reward:          -6.56211247005717                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K12/79 [38;2;249;38;114m━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m15.2%[0m Elapsed: [33m0:00:31[0m Remaining: [36m0:02:52[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 12000 ===                                                                                                                                                  │
│ 12001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5472, 5473, 7312, 12000]                                                                                                                                    │
│ Average cumulative reward:       -7.102755914738757                                                                                                                      │
│ Average rollout reward:          -6.56211247005717                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━[0m [35m15.2%[0m Elapsed: [33m0:00:32[0m Remaining: [36m0:02:52[0m   2.67 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 12000 ===                                                                                                                                                  │
│ 12001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5472, 5473, 7312, 12000]                                                                                                                                    │
│ Average cumulative reward:       -7.102755914738757                                                                                                                      │
│ Average rollout reward:          -6.56211247005717                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K12/79 [38;2;249;38;114m━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m15.2%[0m Elapsed: [33m0:00:32[0m Remaining: [36m0:02:52[0m   2.71 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 12000 ===                                                                                                                                                  │
│ 12001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5472, 5473, 7312, 12000]                                                                                                                                    │
│ Average cumulative reward:       -7.102755914738757                                                                                                                      │
│ Average rollout reward:          -6.56211247005717                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K12/79 [38;2;249;38;114m━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m15.2%[0m Elapsed: [33m0:00:33[0m Remaining: [36m0:02:52[0m   2.76 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 12000 ===                                                                                                                                                  │
│ 12001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5472, 5473, 7312, 12000]                                                                                                                                    │
│ Average cumulative reward:       -7.102755914738757                                                                                                                      │
│ Average rollout reward:          -6.56211247005717                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K13/79 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m16.5%[0m Elapsed: [33m0:00:33[0m Remaining: [36m0:02:50[0m   2.58 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 13000 ===                                                                                                                                                  │
│ 13001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6366, 6370, 6397, 13000]                                                                                                                                    │
│ Average cumulative reward:       -7.3466854804040125                                                                                                                     │
│ Average rollout reward:          -6.816103537857699                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K13/79 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m16.5%[0m Elapsed: [33m0:00:34[0m Remaining: [36m0:02:50[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 13000 ===                                                                                                                                                  │
│ 13001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6366, 6370, 6397, 13000]                                                                                                                                    │
│ Average cumulative reward:       -7.3466854804040125                                                                                                                     │
│ Average rollout reward:          -6.816103537857699                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K13/79 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m16.5%[0m Elapsed: [33m0:00:34[0m Remaining: [36m0:02:50[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 13000 ===                                                                                                                                                  │
│ 13001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6366, 6370, 6397, 13000]                                                                                                                                    │
│ Average cumulative reward:       -7.3466854804040125                                                                                                                     │
│ Average rollout reward:          -6.816103537857699                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K13/79 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m16.5%[0m Elapsed: [33m0:00:35[0m Remaining: [36m0:02:50[0m   2.70 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 13000 ===                                                                                                                                                  │
│ 13001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6366, 6370, 6397, 13000]                                                                                                                                    │
│ Average cumulative reward:       -7.3466854804040125                                                                                                                     │
│ Average rollout reward:          -6.816103537857699                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K13/79 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m16.5%[0m Elapsed: [33m0:00:35[0m Remaining: [36m0:02:50[0m   2.74 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 13000 ===                                                                                                                                                  │
│ 13001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6366, 6370, 6397, 13000]                                                                                                                                    │
│ Average cumulative reward:       -7.3466854804040125                                                                                                                     │
│ Average rollout reward:          -6.816103537857699                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K14/79 [38;2;249;38;114m━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m17.7%[0m Elapsed: [33m0:00:36[0m Remaining: [36m0:02:48[0m   2.58 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 14000 ===                                                                                                                                                  │
│ 14001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13902, 13904, 13971, 13972, 14000]                                                                                                                          │
│ Average cumulative reward:       -7.016375414222832                                                                                                                      │
│ Average rollout reward:          -6.477280084543222                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K14/79 [38;2;249;38;114m━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m17.7%[0m Elapsed: [33m0:00:36[0m Remaining: [36m0:02:48[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 14000 ===                                                                                                                                                  │
│ 14001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13902, 13904, 13971, 13972, 14000]                                                                                                                          │
│ Average cumulative reward:       -7.016375414222832                                                                                                                      │
│ Average rollout reward:          -6.477280084543222                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K14/79 [38;2;249;38;114m━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m17.7%[0m Elapsed: [33m0:00:37[0m Remaining: [36m0:02:48[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 14000 ===                                                                                                                                                  │
│ 14001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13902, 13904, 13971, 13972, 14000]                                                                                                                          │
│ Average cumulative reward:       -7.016375414222832                                                                                                                      │
│ Average rollout reward:          -6.477280084543222                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K14/79 [38;2;249;38;114m━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m17.7%[0m Elapsed: [33m0:00:37[0m Remaining: [36m0:02:48[0m   2.69 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 14000 ===                                                                                                                                                  │
│ 14001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13902, 13904, 13971, 13972, 14000]                                                                                                                          │
│ Average cumulative reward:       -7.016375414222832                                                                                                                      │
│ Average rollout reward:          -6.477280084543222                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K14/79 [38;2;249;38;114m━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m17.7%[0m Elapsed: [33m0:00:38[0m Remaining: [36m0:02:48[0m   2.72 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 14000 ===                                                                                                                                                  │
│ 14001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13902, 13904, 13971, 13972, 14000]                                                                                                                          │
│ Average cumulative reward:       -7.016375414222832                                                                                                                      │
│ Average rollout reward:          -6.477280084543222                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K15/79 [38;2;249;38;114m━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m19.0%[0m Elapsed: [33m0:00:38[0m Remaining: [36m0:02:45[0m   2.57 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 15000 ===                                                                                                                                                  │
│ 15001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 939, 942, 12356, 13120, 15000]                                                                                                                              │
│ Average cumulative reward:       -7.125217291298612                                                                                                                      │
│ Average rollout reward:          -6.578207166755121                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K15/79 [38;2;249;38;114m━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m19.0%[0m Elapsed: [33m0:00:39[0m Remaining: [36m0:02:45[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 15000 ===                                                                                                                                                  │
│ 15001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 939, 942, 12356, 13120, 15000]                                                                                                                              │
│ Average cumulative reward:       -7.125217291298612                                                                                                                      │
│ Average rollout reward:          -6.578207166755121                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K15/79 [38;2;249;38;114m━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m19.0%[0m Elapsed: [33m0:00:39[0m Remaining: [36m0:02:45[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 15000 ===                                                                                                                                                  │
│ 15001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 939, 942, 12356, 13120, 15000]                                                                                                                              │
│ Average cumulative reward:       -7.125217291298612                                                                                                                      │
│ Average rollout reward:          -6.578207166755121                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K15/79 [38;2;249;38;114m━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m19.0%[0m Elapsed: [33m0:00:40[0m Remaining: [36m0:02:45[0m   2.67 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 15000 ===                                                                                                                                                  │
│ 15001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 939, 942, 12356, 13120, 15000]                                                                                                                              │
│ Average cumulative reward:       -7.125217291298612                                                                                                                      │
│ Average rollout reward:          -6.578207166755121                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K15/79 [38;2;249;38;114m━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m19.0%[0m Elapsed: [33m0:00:40[0m Remaining: [36m0:02:45[0m   2.71 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 15000 ===                                                                                                                                                  │
│ 15001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 939, 942, 12356, 13120, 15000]                                                                                                                              │
│ Average cumulative reward:       -7.125217291298612                                                                                                                      │
│ Average rollout reward:          -6.578207166755121                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K16/79 [38;2;249;38;114m━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m20.3%[0m Elapsed: [33m0:00:41[0m Remaining: [36m0:02:42[0m   2.57 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 16000 ===                                                                                                                                                  │
│ 16001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15845, 15846, 15851, 16000]                                                                                                                                 │
│ Average cumulative reward:       -7.068833141678113                                                                                                                      │
│ Average rollout reward:          -6.516395245352619                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K16/79 [38;2;249;38;114m━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m20.3%[0m Elapsed: [33m0:00:41[0m Remaining: [36m0:02:42[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 16000 ===                                                                                                                                                  │
│ 16001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15845, 15846, 15851, 16000]                                                                                                                                 │
│ Average cumulative reward:       -7.068833141678113                                                                                                                      │
│ Average rollout reward:          -6.516395245352619                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━[0m [35m20.3%[0m Elapsed: [33m0:00:42[0m Remaining: [36m0:02:42[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 16000 ===                                                                                                                                                  │
│ 16001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15845, 15846, 15851, 16000]                                                                                                                                 │
│ Average cumulative reward:       -7.068833141678113                                                                                                                      │
│ Average rollout reward:          -6.516395245352619                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K16/79 [38;2;249;38;114m━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m20.3%[0m Elapsed: [33m0:00:42[0m Remaining: [36m0:02:42[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 16000 ===                                                                                                                                                  │
│ 16001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15845, 15846, 15851, 16000]                                                                                                                                 │
│ Average cumulative reward:       -7.068833141678113                                                                                                                      │
│ Average rollout reward:          -6.516395245352619                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K16/79 [38;2;249;38;114m━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m20.3%[0m Elapsed: [33m0:00:43[0m Remaining: [36m0:02:42[0m   2.70 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 16000 ===                                                                                                                                                  │
│ 16001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15845, 15846, 15851, 16000]                                                                                                                                 │
│ Average cumulative reward:       -7.068833141678113                                                                                                                      │
│ Average rollout reward:          -6.516395245352619                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K17/79 [38;2;249;38;114m━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m21.5%[0m Elapsed: [33m0:00:43[0m Remaining: [36m0:02:40[0m   2.57 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 17000 ===                                                                                                                                                  │
│ 17001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15845, 15847, 16222, 17000]                                                                                                                                 │
│ Average cumulative reward:       -7.244658049043444                                                                                                                      │
│ Average rollout reward:          -6.6899768076491615                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K17/79 [38;2;249;38;114m━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m21.5%[0m Elapsed: [33m0:00:44[0m Remaining: [36m0:02:40[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 17000 ===                                                                                                                                                  │
│ 17001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15845, 15847, 16222, 17000]                                                                                                                                 │
│ Average cumulative reward:       -7.244658049043444                                                                                                                      │
│ Average rollout reward:          -6.6899768076491615                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K17/79 [38;2;249;38;114m━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m21.5%[0m Elapsed: [33m0:00:44[0m Remaining: [36m0:02:40[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 17000 ===                                                                                                                                                  │
│ 17001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15845, 15847, 16222, 17000]                                                                                                                                 │
│ Average cumulative reward:       -7.244658049043444                                                                                                                      │
│ Average rollout reward:          -6.6899768076491615                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K17/79 [38;2;249;38;114m━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m21.5%[0m Elapsed: [33m0:00:45[0m Remaining: [36m0:02:40[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 17000 ===                                                                                                                                                  │
│ 17001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15845, 15847, 16222, 17000]                                                                                                                                 │
│ Average cumulative reward:       -7.244658049043444                                                                                                                      │
│ Average rollout reward:          -6.6899768076491615                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K17/79 [38;2;249;38;114m━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m21.5%[0m Elapsed: [33m0:00:45[0m Remaining: [36m0:02:40[0m   2.69 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 17000 ===                                                                                                                                                  │
│ 17001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15845, 15847, 16222, 17000]                                                                                                                                 │
│ Average cumulative reward:       -7.244658049043444                                                                                                                      │
│ Average rollout reward:          -6.6899768076491615                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K18/79 [38;2;249;38;114m━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m22.8%[0m Elapsed: [33m0:00:46[0m Remaining: [36m0:02:38[0m   2.56 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 18000 ===                                                                                                                                                  │
│ 18001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17944, 17948, 17995, 18000]                                                                                                                                 │
│ Average cumulative reward:       -7.337747220617681                                                                                                                      │
│ Average rollout reward:          -6.779675527496911                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K18/79 [38;2;249;38;114m━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m22.8%[0m Elapsed: [33m0:00:46[0m Remaining: [36m0:02:38[0m   2.59 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 18000 ===                                                                                                                                                  │
│ 18001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17944, 17948, 17995, 18000]                                                                                                                                 │
│ Average cumulative reward:       -7.337747220617681                                                                                                                      │
│ Average rollout reward:          -6.779675527496911                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K18/79 [38;2;249;38;114m━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m22.8%[0m Elapsed: [33m0:00:47[0m Remaining: [36m0:02:38[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 18000 ===                                                                                                                                                  │
│ 18001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17944, 17948, 17995, 18000]                                                                                                                                 │
│ Average cumulative reward:       -7.337747220617681                                                                                                                      │
│ Average rollout reward:          -6.779675527496911                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K18/79 [38;2;249;38;114m━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m22.8%[0m Elapsed: [33m0:00:47[0m Remaining: [36m0:02:38[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 18000 ===                                                                                                                                                  │
│ 18001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17944, 17948, 17995, 18000]                                                                                                                                 │
│ Average cumulative reward:       -7.337747220617681                                                                                                                      │
│ Average rollout reward:          -6.779675527496911                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K18/79 [38;2;249;38;114m━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m22.8%[0m Elapsed: [33m0:00:48[0m Remaining: [36m0:02:38[0m   2.68 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 18000 ===                                                                                                                                                  │
│ 18001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17944, 17948, 17995, 18000]                                                                                                                                 │
│ Average cumulative reward:       -7.337747220617681                                                                                                                      │
│ Average rollout reward:          -6.779675527496911                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K19/79 [38;2;249;38;114m━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m24.1%[0m Elapsed: [33m0:00:48[0m Remaining: [36m0:02:35[0m   2.56 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 19000 ===                                                                                                                                                  │
│ 19001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 18829, 18832, 18897, 19000]                                                                                                                                 │
│ Average cumulative reward:       -7.075626819577882                                                                                                                      │
│ Average rollout reward:          -6.509871957201906                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K19/79 [38;2;249;38;114m━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m24.1%[0m Elapsed: [33m0:00:49[0m Remaining: [36m0:02:35[0m   2.59 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 19000 ===                                                                                                                                                  │
│ 19001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 18829, 18832, 18897, 19000]                                                                                                                                 │
│ Average cumulative reward:       -7.075626819577882                                                                                                                      │
│ Average rollout reward:          -6.509871957201906                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K19/79 [38;2;249;38;114m━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m24.1%[0m Elapsed: [33m0:00:49[0m Remaining: [36m0:02:35[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 19000 ===                                                                                                                                                  │
│ 19001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 18829, 18832, 18897, 19000]                                                                                                                                 │
│ Average cumulative reward:       -7.075626819577882                                                                                                                      │
│ Average rollout reward:          -6.509871957201906                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K19/79 [38;2;249;38;114m━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m24.1%[0m Elapsed: [33m0:00:50[0m Remaining: [36m0:02:35[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 19000 ===                                                                                                                                                  │
│ 19001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 18829, 18832, 18897, 19000]                                                                                                                                 │
│ Average cumulative reward:       -7.075626819577882                                                                                                                      │
│ Average rollout reward:          -6.509871957201906                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K19/79 [38;2;249;38;114m━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m24.1%[0m Elapsed: [33m0:00:50[0m Remaining: [36m0:02:35[0m   2.67 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 19000 ===                                                                                                                                                  │
│ 19001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 18829, 18832, 18897, 19000]                                                                                                                                 │
│ Average cumulative reward:       -7.075626819577882                                                                                                                      │
│ Average rollout reward:          -6.509871957201906                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K20/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m25.3%[0m Elapsed: [33m0:00:51[0m Remaining: [36m0:02:33[0m   2.56 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 20000 ===                                                                                                                                                  │
│ 20001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 8680, 8682, 8716, 8719, 8780, 20000]                                                                                                                        │
│ Average cumulative reward:       -7.253034298859383                                                                                                                      │
│ Average rollout reward:          -6.66883680154791                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K20/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m25.3%[0m Elapsed: [33m0:00:51[0m Remaining: [36m0:02:33[0m   2.58 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 20000 ===                                                                                                                                                  │
│ 20001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 8680, 8682, 8716, 8719, 8780, 20000]                                                                                                                        │
│ Average cumulative reward:       -7.253034298859383                                                                                                                      │
│ Average rollout reward:          -6.66883680154791                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━[0m [35m25.3%[0m Elapsed: [33m0:00:52[0m Remaining: [36m0:02:33[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 20000 ===                                                                                                                                                  │
│ 20001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 8680, 8682, 8716, 8719, 8780, 20000]                                                                                                                        │
│ Average cumulative reward:       -7.253034298859383                                                                                                                      │
│ Average rollout reward:          -6.66883680154791                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K20/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m25.3%[0m Elapsed: [33m0:00:52[0m Remaining: [36m0:02:33[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 20000 ===                                                                                                                                                  │
│ 20001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 8680, 8682, 8716, 8719, 8780, 20000]                                                                                                                        │
│ Average cumulative reward:       -7.253034298859383                                                                                                                      │
│ Average rollout reward:          -6.66883680154791                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K20/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m25.3%[0m Elapsed: [33m0:00:53[0m Remaining: [36m0:02:33[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 20000 ===                                                                                                                                                  │
│ 20001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 8680, 8682, 8716, 8719, 8780, 20000]                                                                                                                        │
│ Average cumulative reward:       -7.253034298859383                                                                                                                      │
│ Average rollout reward:          -6.66883680154791                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K20/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m25.3%[0m Elapsed: [33m0:00:53[0m Remaining: [36m0:02:33[0m   2.69 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 20000 ===                                                                                                                                                  │
│ 20001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 8680, 8682, 8716, 8719, 8780, 20000]                                                                                                                        │
│ Average cumulative reward:       -7.253034298859383                                                                                                                      │
│ Average rollout reward:          -6.66883680154791                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K21/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.6%[0m Elapsed: [33m0:00:54[0m Remaining: [36m0:02:31[0m   2.58 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 21000 ===                                                                                                                                                  │
│ 21001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4658, 4661, 4727, 9586, 21000]                                                                                                                              │
│ Average cumulative reward:       -7.161709003487929                                                                                                                      │
│ Average rollout reward:          -6.581005086002415                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K21/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.6%[0m Elapsed: [33m0:00:54[0m Remaining: [36m0:02:31[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 21000 ===                                                                                                                                                  │
│ 21001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4658, 4661, 4727, 9586, 21000]                                                                                                                              │
│ Average cumulative reward:       -7.161709003487929                                                                                                                      │
│ Average rollout reward:          -6.581005086002415                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K21/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.6%[0m Elapsed: [33m0:00:55[0m Remaining: [36m0:02:31[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 21000 ===                                                                                                                                                  │
│ 21001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4658, 4661, 4727, 9586, 21000]                                                                                                                              │
│ Average cumulative reward:       -7.161709003487929                                                                                                                      │
│ Average rollout reward:          -6.581005086002415                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K21/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.6%[0m Elapsed: [33m0:00:55[0m Remaining: [36m0:02:31[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 21000 ===                                                                                                                                                  │
│ 21001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4658, 4661, 4727, 9586, 21000]                                                                                                                              │
│ Average cumulative reward:       -7.161709003487929                                                                                                                      │
│ Average rollout reward:          -6.581005086002415                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K21/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.6%[0m Elapsed: [33m0:00:56[0m Remaining: [36m0:02:31[0m   2.68 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 21000 ===                                                                                                                                                  │
│ 21001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4658, 4661, 4727, 9586, 21000]                                                                                                                              │
│ Average cumulative reward:       -7.161709003487929                                                                                                                      │
│ Average rollout reward:          -6.581005086002415                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K22/79 [38;2;249;38;114m━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m27.8%[0m Elapsed: [33m0:00:56[0m Remaining: [36m0:02:28[0m   2.58 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 22000 ===                                                                                                                                                  │
│ 22001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4658, 4660, 20071, 20341, 22000]                                                                                                                            │
│ Average cumulative reward:       -7.231590745839872                                                                                                                      │
│ Average rollout reward:          -6.64926252951471                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K22/79 [38;2;249;38;114m━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m27.8%[0m Elapsed: [33m0:00:57[0m Remaining: [36m0:02:28[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 22000 ===                                                                                                                                                  │
│ 22001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4658, 4660, 20071, 20341, 22000]                                                                                                                            │
│ Average cumulative reward:       -7.231590745839872                                                                                                                      │
│ Average rollout reward:          -6.64926252951471                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K22/79 [38;2;249;38;114m━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m27.8%[0m Elapsed: [33m0:00:57[0m Remaining: [36m0:02:28[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 22000 ===                                                                                                                                                  │
│ 22001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4658, 4660, 20071, 20341, 22000]                                                                                                                            │
│ Average cumulative reward:       -7.231590745839872                                                                                                                      │
│ Average rollout reward:          -6.64926252951471                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K22/79 [38;2;249;38;114m━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m27.8%[0m Elapsed: [33m0:00:58[0m Remaining: [36m0:02:28[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 22000 ===                                                                                                                                                  │
│ 22001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4658, 4660, 20071, 20341, 22000]                                                                                                                            │
│ Average cumulative reward:       -7.231590745839872                                                                                                                      │
│ Average rollout reward:          -6.64926252951471                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K22/79 [38;2;249;38;114m━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m27.8%[0m Elapsed: [33m0:00:58[0m Remaining: [36m0:02:28[0m   2.67 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 22000 ===                                                                                                                                                  │
│ 22001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4658, 4660, 20071, 20341, 22000]                                                                                                                            │
│ Average cumulative reward:       -7.231590745839872                                                                                                                      │
│ Average rollout reward:          -6.64926252951471                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K23/79 [38;2;249;38;114m━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m29.1%[0m Elapsed: [33m0:00:59[0m Remaining: [36m0:02:25[0m   2.58 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 23000 ===                                                                                                                                                  │
│ 23001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2087, 2089, 21368, 21994, 23000]                                                                                                                            │
│ Average cumulative reward:       -7.317276297840099                                                                                                                      │
│ Average rollout reward:          -6.73041580265056                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K23/79 [38;2;249;38;114m━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m29.1%[0m Elapsed: [33m0:00:59[0m Remaining: [36m0:02:25[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 23000 ===                                                                                                                                                  │
│ 23001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2087, 2089, 21368, 21994, 23000]                                                                                                                            │
│ Average cumulative reward:       -7.317276297840099                                                                                                                      │
│ Average rollout reward:          -6.73041580265056                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K23/79 [38;2;249;38;114m━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m29.1%[0m Elapsed: [33m0:01:00[0m Remaining: [36m0:02:25[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 23000 ===                                                                                                                                                  │
│ 23001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2087, 2089, 21368, 21994, 23000]                                                                                                                            │
│ Average cumulative reward:       -7.317276297840099                                                                                                                      │
│ Average rollout reward:          -6.73041580265056                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K23/79 [38;2;249;38;114m━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m29.1%[0m Elapsed: [33m0:01:00[0m Remaining: [36m0:02:25[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 23000 ===                                                                                                                                                  │
│ 23001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2087, 2089, 21368, 21994, 23000]                                                                                                                            │
│ Average cumulative reward:       -7.317276297840099                                                                                                                      │
│ Average rollout reward:          -6.73041580265056                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K23/79 [38;2;249;38;114m━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m29.1%[0m Elapsed: [33m0:01:01[0m Remaining: [36m0:02:25[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 23000 ===                                                                                                                                                  │
│ 23001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2087, 2089, 21368, 21994, 23000]                                                                                                                            │
│ Average cumulative reward:       -7.317276297840099                                                                                                                      │
│ Average rollout reward:          -6.73041580265056                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K24/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m30.4%[0m Elapsed: [33m0:01:01[0m Remaining: [36m0:02:23[0m   2.57 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 24000 ===                                                                                                                                                  │
│ 24001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 319, 324, 9662, 10446, 24000]                                                                                                                               │
│ Average cumulative reward:       -7.290161252316094                                                                                                                      │
│ Average rollout reward:          -6.697652068473858                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━[0m [35m30.4%[0m Elapsed: [33m0:01:02[0m Remaining: [36m0:02:23[0m   2.59 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 24000 ===                                                                                                                                                  │
│ 24001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 319, 324, 9662, 10446, 24000]                                                                                                                               │
│ Average cumulative reward:       -7.290161252316094                                                                                                                      │
│ Average rollout reward:          -6.697652068473858                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K24/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m30.4%[0m Elapsed: [33m0:01:02[0m Remaining: [36m0:02:23[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 24000 ===                                                                                                                                                  │
│ 24001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 319, 324, 9662, 10446, 24000]                                                                                                                               │
│ Average cumulative reward:       -7.290161252316094                                                                                                                      │
│ Average rollout reward:          -6.697652068473858                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K24/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m30.4%[0m Elapsed: [33m0:01:03[0m Remaining: [36m0:02:23[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 24000 ===                                                                                                                                                  │
│ 24001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 319, 324, 9662, 10446, 24000]                                                                                                                               │
│ Average cumulative reward:       -7.290161252316094                                                                                                                      │
│ Average rollout reward:          -6.697652068473858                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K24/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m30.4%[0m Elapsed: [33m0:01:03[0m Remaining: [36m0:02:23[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 24000 ===                                                                                                                                                  │
│ 24001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 319, 324, 9662, 10446, 24000]                                                                                                                               │
│ Average cumulative reward:       -7.290161252316094                                                                                                                      │
│ Average rollout reward:          -6.697652068473858                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K25/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m31.6%[0m Elapsed: [33m0:01:04[0m Remaining: [36m0:02:21[0m   2.57 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 25000 ===                                                                                                                                                  │
│ 25001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 16252, 16254, 16369, 16373, 16376, 25000]                                                                                                                   │
│ Average cumulative reward:       -7.362084811897969                                                                                                                      │
│ Average rollout reward:          -6.786989889384885                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K25/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m31.6%[0m Elapsed: [33m0:01:04[0m Remaining: [36m0:02:21[0m   2.59 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 25000 ===                                                                                                                                                  │
│ 25001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 16252, 16254, 16369, 16373, 16376, 25000]                                                                                                                   │
│ Average cumulative reward:       -7.362084811897969                                                                                                                      │
│ Average rollout reward:          -6.786989889384885                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K25/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m31.6%[0m Elapsed: [33m0:01:05[0m Remaining: [36m0:02:21[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 25000 ===                                                                                                                                                  │
│ 25001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 16252, 16254, 16369, 16373, 16376, 25000]                                                                                                                   │
│ Average cumulative reward:       -7.362084811897969                                                                                                                      │
│ Average rollout reward:          -6.786989889384885                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K25/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m31.6%[0m Elapsed: [33m0:01:05[0m Remaining: [36m0:02:21[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 25000 ===                                                                                                                                                  │
│ 25001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 16252, 16254, 16369, 16373, 16376, 25000]                                                                                                                   │
│ Average cumulative reward:       -7.362084811897969                                                                                                                      │
│ Average rollout reward:          -6.786989889384885                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K25/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m31.6%[0m Elapsed: [33m0:01:06[0m Remaining: [36m0:02:21[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 25000 ===                                                                                                                                                  │
│ 25001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 16252, 16254, 16369, 16373, 16376, 25000]                                                                                                                   │
│ Average cumulative reward:       -7.362084811897969                                                                                                                      │
│ Average rollout reward:          -6.786989889384885                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K25/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m31.6%[0m Elapsed: [33m0:01:06[0m Remaining: [36m0:02:21[0m   2.67 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 25000 ===                                                                                                                                                  │
│ 25001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 16252, 16254, 16369, 16373, 16376, 25000]                                                                                                                   │
│ Average cumulative reward:       -7.362084811897969                                                                                                                      │
│ Average rollout reward:          -6.786989889384885                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K26/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m32.9%[0m Elapsed: [33m0:01:07[0m Remaining: [36m0:02:18[0m   2.59 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 26000 ===                                                                                                                                                  │
│ 26001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 25763, 25765, 25991, 25992, 26000]                                                                                                                          │
│ Average cumulative reward:       -7.032546293153299                                                                                                                      │
│ Average rollout reward:          -6.4659307825320536                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K26/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m32.9%[0m Elapsed: [33m0:01:07[0m Remaining: [36m0:02:18[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 26000 ===                                                                                                                                                  │
│ 26001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 25763, 25765, 25991, 25992, 26000]                                                                                                                          │
│ Average cumulative reward:       -7.032546293153299                                                                                                                      │
│ Average rollout reward:          -6.4659307825320536                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K26/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m32.9%[0m Elapsed: [33m0:01:08[0m Remaining: [36m0:02:18[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 26000 ===                                                                                                                                                  │
│ 26001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 25763, 25765, 25991, 25992, 26000]                                                                                                                          │
│ Average cumulative reward:       -7.032546293153299                                                                                                                      │
│ Average rollout reward:          -6.4659307825320536                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K26/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m32.9%[0m Elapsed: [33m0:01:08[0m Remaining: [36m0:02:18[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 26000 ===                                                                                                                                                  │
│ 26001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 25763, 25765, 25991, 25992, 26000]                                                                                                                          │
│ Average cumulative reward:       -7.032546293153299                                                                                                                      │
│ Average rollout reward:          -6.4659307825320536                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K27/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m34.2%[0m Elapsed: [33m0:01:09[0m Remaining: [36m0:02:15[0m   2.57 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 27000 ===                                                                                                                                                  │
│ 27001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 26862, 26864, 26991, 26996, 27000]                                                                                                                          │
│ Average cumulative reward:       -7.011722659526322                                                                                                                      │
│ Average rollout reward:          -6.473741890607465                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K27/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m34.2%[0m Elapsed: [33m0:01:09[0m Remaining: [36m0:02:15[0m   2.59 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 27000 ===                                                                                                                                                  │
│ 27001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 26862, 26864, 26991, 26996, 27000]                                                                                                                          │
│ Average cumulative reward:       -7.011722659526322                                                                                                                      │
│ Average rollout reward:          -6.473741890607465                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K27/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m34.2%[0m Elapsed: [33m0:01:10[0m Remaining: [36m0:02:15[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 27000 ===                                                                                                                                                  │
│ 27001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 26862, 26864, 26991, 26996, 27000]                                                                                                                          │
│ Average cumulative reward:       -7.011722659526322                                                                                                                      │
│ Average rollout reward:          -6.473741890607465                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K27/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m34.2%[0m Elapsed: [33m0:01:10[0m Remaining: [36m0:02:15[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 27000 ===                                                                                                                                                  │
│ 27001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 26862, 26864, 26991, 26996, 27000]                                                                                                                          │
│ Average cumulative reward:       -7.011722659526322                                                                                                                      │
│ Average rollout reward:          -6.473741890607465                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K27/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m34.2%[0m Elapsed: [33m0:01:11[0m Remaining: [36m0:02:15[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 27000 ===                                                                                                                                                  │
│ 27001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 26862, 26864, 26991, 26996, 27000]                                                                                                                          │
│ Average cumulative reward:       -7.011722659526322                                                                                                                      │
│ Average rollout reward:          -6.473741890607465                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K28/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m35.4%[0m Elapsed: [33m0:01:11[0m Remaining: [36m0:02:12[0m   2.57 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 28000 ===                                                                                                                                                  │
│ 28001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 27990, 27992, 28000]                                                                                                                                        │
│ Average cumulative reward:       -7.044680852935741                                                                                                                      │
│ Average rollout reward:          -6.43593688790209                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━[0m [35m35.4%[0m Elapsed: [33m0:01:12[0m Remaining: [36m0:02:12[0m   2.58 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 28000 ===                                                                                                                                                  │
│ 28001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 27990, 27992, 28000]                                                                                                                                        │
│ Average cumulative reward:       -7.044680852935741                                                                                                                      │
│ Average rollout reward:          -6.43593688790209                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K28/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m35.4%[0m Elapsed: [33m0:01:12[0m Remaining: [36m0:02:12[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 28000 ===                                                                                                                                                  │
│ 28001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 27990, 27992, 28000]                                                                                                                                        │
│ Average cumulative reward:       -7.044680852935741                                                                                                                      │
│ Average rollout reward:          -6.43593688790209                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K28/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m35.4%[0m Elapsed: [33m0:01:13[0m Remaining: [36m0:02:12[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 28000 ===                                                                                                                                                  │
│ 28001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 27990, 27992, 28000]                                                                                                                                        │
│ Average cumulative reward:       -7.044680852935741                                                                                                                      │
│ Average rollout reward:          -6.43593688790209                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K28/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m35.4%[0m Elapsed: [33m0:01:13[0m Remaining: [36m0:02:12[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 28000 ===                                                                                                                                                  │
│ 28001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 27990, 27992, 28000]                                                                                                                                        │
│ Average cumulative reward:       -7.044680852935741                                                                                                                      │
│ Average rollout reward:          -6.43593688790209                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K28/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m35.4%[0m Elapsed: [33m0:01:14[0m Remaining: [36m0:02:12[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 28000 ===                                                                                                                                                  │
│ 28001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 27990, 27992, 28000]                                                                                                                                        │
│ Average cumulative reward:       -7.044680852935741                                                                                                                      │
│ Average rollout reward:          -6.43593688790209                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K29/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m36.7%[0m Elapsed: [33m0:01:14[0m Remaining: [36m0:02:10[0m   2.58 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 29000 ===                                                                                                                                                  │
│ 29001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 47, 58, 19090, 21355, 29000]                                                                                                                                │
│ Average cumulative reward:       -7.249380417772526                                                                                                                      │
│ Average rollout reward:          -6.673069932757765                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K29/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m36.7%[0m Elapsed: [33m0:01:15[0m Remaining: [36m0:02:10[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 29000 ===                                                                                                                                                  │
│ 29001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 47, 58, 19090, 21355, 29000]                                                                                                                                │
│ Average cumulative reward:       -7.249380417772526                                                                                                                      │
│ Average rollout reward:          -6.673069932757765                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K29/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m36.7%[0m Elapsed: [33m0:01:15[0m Remaining: [36m0:02:10[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 29000 ===                                                                                                                                                  │
│ 29001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 47, 58, 19090, 21355, 29000]                                                                                                                                │
│ Average cumulative reward:       -7.249380417772526                                                                                                                      │
│ Average rollout reward:          -6.673069932757765                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K29/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m36.7%[0m Elapsed: [33m0:01:16[0m Remaining: [36m0:02:10[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 29000 ===                                                                                                                                                  │
│ 29001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 47, 58, 19090, 21355, 29000]                                                                                                                                │
│ Average cumulative reward:       -7.249380417772526                                                                                                                      │
│ Average rollout reward:          -6.673069932757765                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K29/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m36.7%[0m Elapsed: [33m0:01:16[0m Remaining: [36m0:02:10[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 29000 ===                                                                                                                                                  │
│ 29001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 47, 58, 19090, 21355, 29000]                                                                                                                                │
│ Average cumulative reward:       -7.249380417772526                                                                                                                      │
│ Average rollout reward:          -6.673069932757765                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K30/79 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m38.0%[0m Elapsed: [33m0:01:17[0m Remaining: [36m0:02:08[0m   2.58 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 30000 ===                                                                                                                                                  │
│ 30001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 29735, 29739, 29963, 30000]                                                                                                                                 │
│ Average cumulative reward:       -6.988844189957857                                                                                                                      │
│ Average rollout reward:          -6.432387543192548                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K30/79 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m38.0%[0m Elapsed: [33m0:01:17[0m Remaining: [36m0:02:08[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 30000 ===                                                                                                                                                  │
│ 30001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 29735, 29739, 29963, 30000]                                                                                                                                 │
│ Average cumulative reward:       -6.988844189957857                                                                                                                      │
│ Average rollout reward:          -6.432387543192548                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K30/79 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m38.0%[0m Elapsed: [33m0:01:18[0m Remaining: [36m0:02:08[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 30000 ===                                                                                                                                                  │
│ 30001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 29735, 29739, 29963, 30000]                                                                                                                                 │
│ Average cumulative reward:       -6.988844189957857                                                                                                                      │
│ Average rollout reward:          -6.432387543192548                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K30/79 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m38.0%[0m Elapsed: [33m0:01:18[0m Remaining: [36m0:02:08[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 30000 ===                                                                                                                                                  │
│ 30001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 29735, 29739, 29963, 30000]                                                                                                                                 │
│ Average cumulative reward:       -6.988844189957857                                                                                                                      │
│ Average rollout reward:          -6.432387543192548                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K30/79 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m38.0%[0m Elapsed: [33m0:01:19[0m Remaining: [36m0:02:08[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 30000 ===                                                                                                                                                  │
│ 30001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 29735, 29739, 29963, 30000]                                                                                                                                 │
│ Average cumulative reward:       -6.988844189957857                                                                                                                      │
│ Average rollout reward:          -6.432387543192548                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K31/79 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m39.2%[0m Elapsed: [33m0:01:19[0m Remaining: [36m0:02:05[0m   2.58 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 31000 ===                                                                                                                                                  │
│ 31001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 30934, 30938, 30997, 31000]                                                                                                                                 │
│ Average cumulative reward:       -6.851421350537587                                                                                                                      │
│ Average rollout reward:          -6.332745713827004                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K31/79 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m39.2%[0m Elapsed: [33m0:01:20[0m Remaining: [36m0:02:05[0m   2.59 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 31000 ===                                                                                                                                                  │
│ 31001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 30934, 30938, 30997, 31000]                                                                                                                                 │
│ Average cumulative reward:       -6.851421350537587                                                                                                                      │
│ Average rollout reward:          -6.332745713827004                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K31/79 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m39.2%[0m Elapsed: [33m0:01:20[0m Remaining: [36m0:02:05[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 31000 ===                                                                                                                                                  │
│ 31001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 30934, 30938, 30997, 31000]                                                                                                                                 │
│ Average cumulative reward:       -6.851421350537587                                                                                                                      │
│ Average rollout reward:          -6.332745713827004                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K31/79 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m39.2%[0m Elapsed: [33m0:01:21[0m Remaining: [36m0:02:05[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 31000 ===                                                                                                                                                  │
│ 31001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 30934, 30938, 30997, 31000]                                                                                                                                 │
│ Average cumulative reward:       -6.851421350537587                                                                                                                      │
│ Average rollout reward:          -6.332745713827004                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K31/79 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m39.2%[0m Elapsed: [33m0:01:21[0m Remaining: [36m0:02:05[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 31000 ===                                                                                                                                                  │
│ 31001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 30934, 30938, 30997, 31000]                                                                                                                                 │
│ Average cumulative reward:       -6.851421350537587                                                                                                                      │
│ Average rollout reward:          -6.332745713827004                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━[0m [35m40.5%[0m Elapsed: [33m0:01:22[0m Remaining: [36m0:02:02[0m   2.58 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 32000 ===                                                                                                                                                  │
│ 32001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21639, 21641, 28987, 29413, 32000]                                                                                                                          │
│ Average cumulative reward:       -7.592602962769266                                                                                                                      │
│ Average rollout reward:          -7.008582869598587                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K32/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m40.5%[0m Elapsed: [33m0:01:22[0m Remaining: [36m0:02:02[0m   2.59 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 32000 ===                                                                                                                                                  │
│ 32001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21639, 21641, 28987, 29413, 32000]                                                                                                                          │
│ Average cumulative reward:       -7.592602962769266                                                                                                                      │
│ Average rollout reward:          -7.008582869598587                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K32/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m40.5%[0m Elapsed: [33m0:01:23[0m Remaining: [36m0:02:02[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 32000 ===                                                                                                                                                  │
│ 32001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21639, 21641, 28987, 29413, 32000]                                                                                                                          │
│ Average cumulative reward:       -7.592602962769266                                                                                                                      │
│ Average rollout reward:          -7.008582869598587                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K32/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m40.5%[0m Elapsed: [33m0:01:23[0m Remaining: [36m0:02:02[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 32000 ===                                                                                                                                                  │
│ 32001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21639, 21641, 28987, 29413, 32000]                                                                                                                          │
│ Average cumulative reward:       -7.592602962769266                                                                                                                      │
│ Average rollout reward:          -7.008582869598587                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K32/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m40.5%[0m Elapsed: [33m0:01:24[0m Remaining: [36m0:02:02[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 32000 ===                                                                                                                                                  │
│ 32001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21639, 21641, 28987, 29413, 32000]                                                                                                                          │
│ Average cumulative reward:       -7.592602962769266                                                                                                                      │
│ Average rollout reward:          -7.008582869598587                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K32/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m40.5%[0m Elapsed: [33m0:01:24[0m Remaining: [36m0:02:02[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 32000 ===                                                                                                                                                  │
│ 32001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21639, 21641, 28987, 29413, 32000]                                                                                                                          │
│ Average cumulative reward:       -7.592602962769266                                                                                                                      │
│ Average rollout reward:          -7.008582869598587                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K33/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m41.8%[0m Elapsed: [33m0:01:25[0m Remaining: [36m0:02:00[0m   2.59 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 33000 ===                                                                                                                                                  │
│ 33001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 32788, 32790, 32903, 32913, 33000]                                                                                                                          │
│ Average cumulative reward:       -7.197292188881502                                                                                                                      │
│ Average rollout reward:          -6.625502294396546                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K33/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m41.8%[0m Elapsed: [33m0:01:25[0m Remaining: [36m0:02:00[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 33000 ===                                                                                                                                                  │
│ 33001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 32788, 32790, 32903, 32913, 33000]                                                                                                                          │
│ Average cumulative reward:       -7.197292188881502                                                                                                                      │
│ Average rollout reward:          -6.625502294396546                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K33/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m41.8%[0m Elapsed: [33m0:01:26[0m Remaining: [36m0:02:00[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 33000 ===                                                                                                                                                  │
│ 33001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 32788, 32790, 32903, 32913, 33000]                                                                                                                          │
│ Average cumulative reward:       -7.197292188881502                                                                                                                      │
│ Average rollout reward:          -6.625502294396546                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K33/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m41.8%[0m Elapsed: [33m0:01:26[0m Remaining: [36m0:02:00[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 33000 ===                                                                                                                                                  │
│ 33001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 32788, 32790, 32903, 32913, 33000]                                                                                                                          │
│ Average cumulative reward:       -7.197292188881502                                                                                                                      │
│ Average rollout reward:          -6.625502294396546                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K33/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m41.8%[0m Elapsed: [33m0:01:27[0m Remaining: [36m0:02:00[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 33000 ===                                                                                                                                                  │
│ 33001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 32788, 32790, 32903, 32913, 33000]                                                                                                                          │
│ Average cumulative reward:       -7.197292188881502                                                                                                                      │
│ Average rollout reward:          -6.625502294396546                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K34/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m43.0%[0m Elapsed: [33m0:01:27[0m Remaining: [36m0:01:58[0m   2.59 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 34000 ===                                                                                                                                                  │
│ 34001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15845, 15847, 15910, 15920, 34000]                                                                                                                          │
│ Average cumulative reward:       -7.286625444463797                                                                                                                      │
│ Average rollout reward:          -6.671393359948001                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K34/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m43.0%[0m Elapsed: [33m0:01:28[0m Remaining: [36m0:01:58[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 34000 ===                                                                                                                                                  │
│ 34001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15845, 15847, 15910, 15920, 34000]                                                                                                                          │
│ Average cumulative reward:       -7.286625444463797                                                                                                                      │
│ Average rollout reward:          -6.671393359948001                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K34/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m43.0%[0m Elapsed: [33m0:01:28[0m Remaining: [36m0:01:58[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 34000 ===                                                                                                                                                  │
│ 34001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15845, 15847, 15910, 15920, 34000]                                                                                                                          │
│ Average cumulative reward:       -7.286625444463797                                                                                                                      │
│ Average rollout reward:          -6.671393359948001                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K34/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m43.0%[0m Elapsed: [33m0:01:29[0m Remaining: [36m0:01:58[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 34000 ===                                                                                                                                                  │
│ 34001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15845, 15847, 15910, 15920, 34000]                                                                                                                          │
│ Average cumulative reward:       -7.286625444463797                                                                                                                      │
│ Average rollout reward:          -6.671393359948001                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K34/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m43.0%[0m Elapsed: [33m0:01:29[0m Remaining: [36m0:01:58[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 34000 ===                                                                                                                                                  │
│ 34001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 15845, 15847, 15910, 15920, 34000]                                                                                                                          │
│ Average cumulative reward:       -7.286625444463797                                                                                                                      │
│ Average rollout reward:          -6.671393359948001                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K35/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m44.3%[0m Elapsed: [33m0:01:30[0m Remaining: [36m0:01:55[0m   2.59 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 35000 ===                                                                                                                                                  │
│ 35001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 24166, 24168, 24209, 24213, 35000]                                                                                                                          │
│ Average cumulative reward:       -7.152386720072494                                                                                                                      │
│ Average rollout reward:          -6.506073448903366                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K35/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m44.3%[0m Elapsed: [33m0:01:30[0m Remaining: [36m0:01:55[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 35000 ===                                                                                                                                                  │
│ 35001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 24166, 24168, 24209, 24213, 35000]                                                                                                                          │
│ Average cumulative reward:       -7.152386720072494                                                                                                                      │
│ Average rollout reward:          -6.506073448903366                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K35/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m44.3%[0m Elapsed: [33m0:01:31[0m Remaining: [36m0:01:55[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 35000 ===                                                                                                                                                  │
│ 35001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 24166, 24168, 24209, 24213, 35000]                                                                                                                          │
│ Average cumulative reward:       -7.152386720072494                                                                                                                      │
│ Average rollout reward:          -6.506073448903366                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K35/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m44.3%[0m Elapsed: [33m0:01:31[0m Remaining: [36m0:01:55[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 35000 ===                                                                                                                                                  │
│ 35001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 24166, 24168, 24209, 24213, 35000]                                                                                                                          │
│ Average cumulative reward:       -7.152386720072494                                                                                                                      │
│ Average rollout reward:          -6.506073448903366                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━━━━━━━[0m [35m44.3%[0m Elapsed: [33m0:01:32[0m Remaining: [36m0:01:55[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 35000 ===                                                                                                                                                  │
│ 35001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 24166, 24168, 24209, 24213, 35000]                                                                                                                          │
│ Average cumulative reward:       -7.152386720072494                                                                                                                      │
│ Average rollout reward:          -6.506073448903366                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K35/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m44.3%[0m Elapsed: [33m0:01:33[0m Remaining: [36m0:01:55[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 35000 ===                                                                                                                                                  │
│ 35001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 24166, 24168, 24209, 24213, 35000]                                                                                                                          │
│ Average cumulative reward:       -7.152386720072494                                                                                                                      │
│ Average rollout reward:          -6.506073448903366                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K36/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m45.6%[0m Elapsed: [33m0:01:33[0m Remaining: [36m0:01:53[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 36000 ===                                                                                                                                                  │
│ 36001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 23135, 23136, 23253, 23259, 36000]                                                                                                                          │
│ Average cumulative reward:       -7.21563313757847                                                                                                                       │
│ Average rollout reward:          -6.6257683735913675                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K36/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m45.6%[0m Elapsed: [33m0:01:34[0m Remaining: [36m0:01:53[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 36000 ===                                                                                                                                                  │
│ 36001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 23135, 23136, 23253, 23259, 36000]                                                                                                                          │
│ Average cumulative reward:       -7.21563313757847                                                                                                                       │
│ Average rollout reward:          -6.6257683735913675                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K36/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m45.6%[0m Elapsed: [33m0:01:34[0m Remaining: [36m0:01:53[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 36000 ===                                                                                                                                                  │
│ 36001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 23135, 23136, 23253, 23259, 36000]                                                                                                                          │
│ Average cumulative reward:       -7.21563313757847                                                                                                                       │
│ Average rollout reward:          -6.6257683735913675                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K36/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m45.6%[0m Elapsed: [33m0:01:35[0m Remaining: [36m0:01:53[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 36000 ===                                                                                                                                                  │
│ 36001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 23135, 23136, 23253, 23259, 36000]                                                                                                                          │
│ Average cumulative reward:       -7.21563313757847                                                                                                                       │
│ Average rollout reward:          -6.6257683735913675                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K36/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m45.6%[0m Elapsed: [33m0:01:35[0m Remaining: [36m0:01:53[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 36000 ===                                                                                                                                                  │
│ 36001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 23135, 23136, 23253, 23259, 36000]                                                                                                                          │
│ Average cumulative reward:       -7.21563313757847                                                                                                                       │
│ Average rollout reward:          -6.6257683735913675                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K37/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m46.8%[0m Elapsed: [33m0:01:36[0m Remaining: [36m0:01:51[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 37000 ===                                                                                                                                                  │
│ 37001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 36692, 36694, 36959, 36967, 37000]                                                                                                                          │
│ Average cumulative reward:       -6.9828778106625204                                                                                                                     │
│ Average rollout reward:          -6.406908005528021                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K37/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m46.8%[0m Elapsed: [33m0:01:36[0m Remaining: [36m0:01:51[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 37000 ===                                                                                                                                                  │
│ 37001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 36692, 36694, 36959, 36967, 37000]                                                                                                                          │
│ Average cumulative reward:       -6.9828778106625204                                                                                                                     │
│ Average rollout reward:          -6.406908005528021                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K37/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m46.8%[0m Elapsed: [33m0:01:37[0m Remaining: [36m0:01:51[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 37000 ===                                                                                                                                                  │
│ 37001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 36692, 36694, 36959, 36967, 37000]                                                                                                                          │
│ Average cumulative reward:       -6.9828778106625204                                                                                                                     │
│ Average rollout reward:          -6.406908005528021                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K37/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m46.8%[0m Elapsed: [33m0:01:37[0m Remaining: [36m0:01:51[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 37000 ===                                                                                                                                                  │
│ 37001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 36692, 36694, 36959, 36967, 37000]                                                                                                                          │
│ Average cumulative reward:       -6.9828778106625204                                                                                                                     │
│ Average rollout reward:          -6.406908005528021                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K37/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m46.8%[0m Elapsed: [33m0:01:38[0m Remaining: [36m0:01:51[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 37000 ===                                                                                                                                                  │
│ 37001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 36692, 36694, 36959, 36967, 37000]                                                                                                                          │
│ Average cumulative reward:       -6.9828778106625204                                                                                                                     │
│ Average rollout reward:          -6.406908005528021                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K38/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m48.1%[0m Elapsed: [33m0:01:38[0m Remaining: [36m0:01:50[0m   2.59 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 38000 ===                                                                                                                                                  │
│ 38001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 18383, 18387, 35708, 38000]                                                                                                                                 │
│ Average cumulative reward:       -7.465846225513403                                                                                                                      │
│ Average rollout reward:          -6.858158397417137                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K38/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m48.1%[0m Elapsed: [33m0:01:39[0m Remaining: [36m0:01:50[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 38000 ===                                                                                                                                                  │
│ 38001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 18383, 18387, 35708, 38000]                                                                                                                                 │
│ Average cumulative reward:       -7.465846225513403                                                                                                                      │
│ Average rollout reward:          -6.858158397417137                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K38/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m48.1%[0m Elapsed: [33m0:01:39[0m Remaining: [36m0:01:50[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 38000 ===                                                                                                                                                  │
│ 38001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 18383, 18387, 35708, 38000]                                                                                                                                 │
│ Average cumulative reward:       -7.465846225513403                                                                                                                      │
│ Average rollout reward:          -6.858158397417137                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K38/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m48.1%[0m Elapsed: [33m0:01:40[0m Remaining: [36m0:01:50[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 38000 ===                                                                                                                                                  │
│ 38001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 18383, 18387, 35708, 38000]                                                                                                                                 │
│ Average cumulative reward:       -7.465846225513403                                                                                                                      │
│ Average rollout reward:          -6.858158397417137                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K38/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m48.1%[0m Elapsed: [33m0:01:40[0m Remaining: [36m0:01:50[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 38000 ===                                                                                                                                                  │
│ 38001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 18383, 18387, 35708, 38000]                                                                                                                                 │
│ Average cumulative reward:       -7.465846225513403                                                                                                                      │
│ Average rollout reward:          -6.858158397417137                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K38/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m48.1%[0m Elapsed: [33m0:01:41[0m Remaining: [36m0:01:50[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 38000 ===                                                                                                                                                  │
│ 38001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 18383, 18387, 35708, 38000]                                                                                                                                 │
│ Average cumulative reward:       -7.465846225513403                                                                                                                      │
│ Average rollout reward:          -6.858158397417137                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K39/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m49.4%[0m Elapsed: [33m0:01:41[0m Remaining: [36m0:01:47[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 39000 ===                                                                                                                                                  │
│ 39001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 452, 457, 906, 26242, 39000]                                                                                                                                │
│ Average cumulative reward:       -7.34952236609902                                                                                                                       │
│ Average rollout reward:          -6.723864484039646                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K39/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m49.4%[0m Elapsed: [33m0:01:42[0m Remaining: [36m0:01:47[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 39000 ===                                                                                                                                                  │
│ 39001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 452, 457, 906, 26242, 39000]                                                                                                                                │
│ Average cumulative reward:       -7.34952236609902                                                                                                                       │
│ Average rollout reward:          -6.723864484039646                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━━━━━━━[0m [35m49.4%[0m Elapsed: [33m0:01:42[0m Remaining: [36m0:01:47[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 39000 ===                                                                                                                                                  │
│ 39001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 452, 457, 906, 26242, 39000]                                                                                                                                │
│ Average cumulative reward:       -7.34952236609902                                                                                                                       │
│ Average rollout reward:          -6.723864484039646                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K39/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m49.4%[0m Elapsed: [33m0:01:43[0m Remaining: [36m0:01:47[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 39000 ===                                                                                                                                                  │
│ 39001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 452, 457, 906, 26242, 39000]                                                                                                                                │
│ Average cumulative reward:       -7.34952236609902                                                                                                                       │
│ Average rollout reward:          -6.723864484039646                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K39/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m49.4%[0m Elapsed: [33m0:01:43[0m Remaining: [36m0:01:47[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 39000 ===                                                                                                                                                  │
│ 39001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 452, 457, 906, 26242, 39000]                                                                                                                                │
│ Average cumulative reward:       -7.34952236609902                                                                                                                       │
│ Average rollout reward:          -6.723864484039646                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K40/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m50.6%[0m Elapsed: [33m0:01:44[0m Remaining: [36m0:01:44[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 40000 ===                                                                                                                                                  │
│ 40001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 798, 801, 4641, 5321, 27895, 40000]                                                                                                                         │
│ Average cumulative reward:       -7.100657629364958                                                                                                                      │
│ Average rollout reward:          -6.460957833915469                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K40/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m50.6%[0m Elapsed: [33m0:01:44[0m Remaining: [36m0:01:44[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 40000 ===                                                                                                                                                  │
│ 40001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 798, 801, 4641, 5321, 27895, 40000]                                                                                                                         │
│ Average cumulative reward:       -7.100657629364958                                                                                                                      │
│ Average rollout reward:          -6.460957833915469                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K40/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m50.6%[0m Elapsed: [33m0:01:45[0m Remaining: [36m0:01:44[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 40000 ===                                                                                                                                                  │
│ 40001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 798, 801, 4641, 5321, 27895, 40000]                                                                                                                         │
│ Average cumulative reward:       -7.100657629364958                                                                                                                      │
│ Average rollout reward:          -6.460957833915469                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K40/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m50.6%[0m Elapsed: [33m0:01:45[0m Remaining: [36m0:01:44[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 40000 ===                                                                                                                                                  │
│ 40001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 798, 801, 4641, 5321, 27895, 40000]                                                                                                                         │
│ Average cumulative reward:       -7.100657629364958                                                                                                                      │
│ Average rollout reward:          -6.460957833915469                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K40/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m50.6%[0m Elapsed: [33m0:01:46[0m Remaining: [36m0:01:44[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 40000 ===                                                                                                                                                  │
│ 40001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 798, 801, 4641, 5321, 27895, 40000]                                                                                                                         │
│ Average cumulative reward:       -7.100657629364958                                                                                                                      │
│ Average rollout reward:          -6.460957833915469                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K41/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m51.9%[0m Elapsed: [33m0:01:46[0m Remaining: [36m0:01:42[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 41000 ===                                                                                                                                                  │
│ 41001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 40869, 40873, 40992, 41000]                                                                                                                                 │
│ Average cumulative reward:       -7.13905972863932                                                                                                                       │
│ Average rollout reward:          -6.505188798994268                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K41/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m51.9%[0m Elapsed: [33m0:01:47[0m Remaining: [36m0:01:42[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 41000 ===                                                                                                                                                  │
│ 41001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 40869, 40873, 40992, 41000]                                                                                                                                 │
│ Average cumulative reward:       -7.13905972863932                                                                                                                       │
│ Average rollout reward:          -6.505188798994268                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K41/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m51.9%[0m Elapsed: [33m0:01:47[0m Remaining: [36m0:01:42[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 41000 ===                                                                                                                                                  │
│ 41001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 40869, 40873, 40992, 41000]                                                                                                                                 │
│ Average cumulative reward:       -7.13905972863932                                                                                                                       │
│ Average rollout reward:          -6.505188798994268                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K41/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m51.9%[0m Elapsed: [33m0:01:48[0m Remaining: [36m0:01:42[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 41000 ===                                                                                                                                                  │
│ 41001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 40869, 40873, 40992, 41000]                                                                                                                                 │
│ Average cumulative reward:       -7.13905972863932                                                                                                                       │
│ Average rollout reward:          -6.505188798994268                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K41/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m51.9%[0m Elapsed: [33m0:01:48[0m Remaining: [36m0:01:42[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 41000 ===                                                                                                                                                  │
│ 41001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 40869, 40873, 40992, 41000]                                                                                                                                 │
│ Average cumulative reward:       -7.13905972863932                                                                                                                       │
│ Average rollout reward:          -6.505188798994268                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K42/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m53.2%[0m Elapsed: [33m0:01:49[0m Remaining: [36m0:01:40[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 42000 ===                                                                                                                                                  │
│ 42001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 12110, 12112, 12115, 12124, 42000]                                                                                                                          │
│ Average cumulative reward:       -7.203420218355662                                                                                                                      │
│ Average rollout reward:          -6.580816765497444                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K42/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m53.2%[0m Elapsed: [33m0:01:49[0m Remaining: [36m0:01:40[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 42000 ===                                                                                                                                                  │
│ 42001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 12110, 12112, 12115, 12124, 42000]                                                                                                                          │
│ Average cumulative reward:       -7.203420218355662                                                                                                                      │
│ Average rollout reward:          -6.580816765497444                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K42/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m53.2%[0m Elapsed: [33m0:01:50[0m Remaining: [36m0:01:40[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 42000 ===                                                                                                                                                  │
│ 42001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 12110, 12112, 12115, 12124, 42000]                                                                                                                          │
│ Average cumulative reward:       -7.203420218355662                                                                                                                      │
│ Average rollout reward:          -6.580816765497444                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K42/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m53.2%[0m Elapsed: [33m0:01:50[0m Remaining: [36m0:01:40[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 42000 ===                                                                                                                                                  │
│ 42001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 12110, 12112, 12115, 12124, 42000]                                                                                                                          │
│ Average cumulative reward:       -7.203420218355662                                                                                                                      │
│ Average rollout reward:          -6.580816765497444                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K42/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m53.2%[0m Elapsed: [33m0:01:51[0m Remaining: [36m0:01:40[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 42000 ===                                                                                                                                                  │
│ 42001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 12110, 12112, 12115, 12124, 42000]                                                                                                                          │
│ Average cumulative reward:       -7.203420218355662                                                                                                                      │
│ Average rollout reward:          -6.580816765497444                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K43/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m54.4%[0m Elapsed: [33m0:01:51[0m Remaining: [36m0:01:37[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 43000 ===                                                                                                                                                  │
│ 43001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 3747, 3749, 3768, 3778, 8530, 43000]                                                                                                                        │
│ Average cumulative reward:       -6.698901170818394                                                                                                                      │
│ Average rollout reward:          -6.093416347458299                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K43/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m54.4%[0m Elapsed: [33m0:01:52[0m Remaining: [36m0:01:37[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 43000 ===                                                                                                                                                  │
│ 43001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 3747, 3749, 3768, 3778, 8530, 43000]                                                                                                                        │
│ Average cumulative reward:       -6.698901170818394                                                                                                                      │
│ Average rollout reward:          -6.093416347458299                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━━━━━━━[0m [35m54.4%[0m Elapsed: [33m0:01:52[0m Remaining: [36m0:01:37[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 43000 ===                                                                                                                                                  │
│ 43001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 3747, 3749, 3768, 3778, 8530, 43000]                                                                                                                        │
│ Average cumulative reward:       -6.698901170818394                                                                                                                      │
│ Average rollout reward:          -6.093416347458299                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K43/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m54.4%[0m Elapsed: [33m0:01:53[0m Remaining: [36m0:01:37[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 43000 ===                                                                                                                                                  │
│ 43001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 3747, 3749, 3768, 3778, 8530, 43000]                                                                                                                        │
│ Average cumulative reward:       -6.698901170818394                                                                                                                      │
│ Average rollout reward:          -6.093416347458299                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K43/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m54.4%[0m Elapsed: [33m0:01:53[0m Remaining: [36m0:01:37[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 43000 ===                                                                                                                                                  │
│ 43001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 3747, 3749, 3768, 3778, 8530, 43000]                                                                                                                        │
│ Average cumulative reward:       -6.698901170818394                                                                                                                      │
│ Average rollout reward:          -6.093416347458299                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K43/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m54.4%[0m Elapsed: [33m0:01:54[0m Remaining: [36m0:01:37[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 43000 ===                                                                                                                                                  │
│ 43001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 3747, 3749, 3768, 3778, 8530, 43000]                                                                                                                        │
│ Average cumulative reward:       -6.698901170818394                                                                                                                      │
│ Average rollout reward:          -6.093416347458299                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K44/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m55.7%[0m Elapsed: [33m0:01:54[0m Remaining: [36m0:01:34[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 44000 ===                                                                                                                                                  │
│ 44001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 43805, 43809, 43820, 43912, 44000]                                                                                                                          │
│ Average cumulative reward:       -7.317173186601474                                                                                                                      │
│ Average rollout reward:          -6.727593431385539                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K44/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m55.7%[0m Elapsed: [33m0:01:55[0m Remaining: [36m0:01:34[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 44000 ===                                                                                                                                                  │
│ 44001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 43805, 43809, 43820, 43912, 44000]                                                                                                                          │
│ Average cumulative reward:       -7.317173186601474                                                                                                                      │
│ Average rollout reward:          -6.727593431385539                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K44/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m55.7%[0m Elapsed: [33m0:01:55[0m Remaining: [36m0:01:34[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 44000 ===                                                                                                                                                  │
│ 44001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 43805, 43809, 43820, 43912, 44000]                                                                                                                          │
│ Average cumulative reward:       -7.317173186601474                                                                                                                      │
│ Average rollout reward:          -6.727593431385539                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K44/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m55.7%[0m Elapsed: [33m0:01:56[0m Remaining: [36m0:01:34[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 44000 ===                                                                                                                                                  │
│ 44001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 43805, 43809, 43820, 43912, 44000]                                                                                                                          │
│ Average cumulative reward:       -7.317173186601474                                                                                                                      │
│ Average rollout reward:          -6.727593431385539                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K44/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m55.7%[0m Elapsed: [33m0:01:56[0m Remaining: [36m0:01:34[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 44000 ===                                                                                                                                                  │
│ 44001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 43805, 43809, 43820, 43912, 44000]                                                                                                                          │
│ Average cumulative reward:       -7.317173186601474                                                                                                                      │
│ Average rollout reward:          -6.727593431385539                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K45/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m57.0%[0m Elapsed: [33m0:01:57[0m Remaining: [36m0:01:31[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 45000 ===                                                                                                                                                  │
│ 45001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 23646, 23647, 23753, 23774, 45000]                                                                                                                          │
│ Average cumulative reward:       -6.911673348883827                                                                                                                      │
│ Average rollout reward:          -6.370543588963339                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K45/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m57.0%[0m Elapsed: [33m0:01:57[0m Remaining: [36m0:01:31[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 45000 ===                                                                                                                                                  │
│ 45001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 23646, 23647, 23753, 23774, 45000]                                                                                                                          │
│ Average cumulative reward:       -6.911673348883827                                                                                                                      │
│ Average rollout reward:          -6.370543588963339                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K45/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m57.0%[0m Elapsed: [33m0:01:58[0m Remaining: [36m0:01:31[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 45000 ===                                                                                                                                                  │
│ 45001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 23646, 23647, 23753, 23774, 45000]                                                                                                                          │
│ Average cumulative reward:       -6.911673348883827                                                                                                                      │
│ Average rollout reward:          -6.370543588963339                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K45/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m57.0%[0m Elapsed: [33m0:01:58[0m Remaining: [36m0:01:31[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 45000 ===                                                                                                                                                  │
│ 45001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 23646, 23647, 23753, 23774, 45000]                                                                                                                          │
│ Average cumulative reward:       -6.911673348883827                                                                                                                      │
│ Average rollout reward:          -6.370543588963339                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K45/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m57.0%[0m Elapsed: [33m0:01:59[0m Remaining: [36m0:01:31[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 45000 ===                                                                                                                                                  │
│ 45001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 23646, 23647, 23753, 23774, 45000]                                                                                                                          │
│ Average cumulative reward:       -6.911673348883827                                                                                                                      │
│ Average rollout reward:          -6.370543588963339                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K46/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m58.2%[0m Elapsed: [33m0:01:59[0m Remaining: [36m0:01:28[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 46000 ===                                                                                                                                                  │
│ 46001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13167, 13168, 41291, 46000]                                                                                                                                 │
│ Average cumulative reward:       -7.238519567973501                                                                                                                      │
│ Average rollout reward:          -6.621393592301502                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K46/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m58.2%[0m Elapsed: [33m0:02:00[0m Remaining: [36m0:01:28[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 46000 ===                                                                                                                                                  │
│ 46001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13167, 13168, 41291, 46000]                                                                                                                                 │
│ Average cumulative reward:       -7.238519567973501                                                                                                                      │
│ Average rollout reward:          -6.621393592301502                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K46/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m58.2%[0m Elapsed: [33m0:02:00[0m Remaining: [36m0:01:28[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 46000 ===                                                                                                                                                  │
│ 46001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13167, 13168, 41291, 46000]                                                                                                                                 │
│ Average cumulative reward:       -7.238519567973501                                                                                                                      │
│ Average rollout reward:          -6.621393592301502                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K46/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m58.2%[0m Elapsed: [33m0:02:01[0m Remaining: [36m0:01:28[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 46000 ===                                                                                                                                                  │
│ 46001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13167, 13168, 41291, 46000]                                                                                                                                 │
│ Average cumulative reward:       -7.238519567973501                                                                                                                      │
│ Average rollout reward:          -6.621393592301502                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K46/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m58.2%[0m Elapsed: [33m0:02:01[0m Remaining: [36m0:01:28[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 46000 ===                                                                                                                                                  │
│ 46001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13167, 13168, 41291, 46000]                                                                                                                                 │
│ Average cumulative reward:       -7.238519567973501                                                                                                                      │
│ Average rollout reward:          -6.621393592301502                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K47/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m59.5%[0m Elapsed: [33m0:02:02[0m Remaining: [36m0:01:25[0m   2.60 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 47000 ===                                                                                                                                                  │
│ 47001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 46868, 46871, 46874, 46951, 47000]                                                                                                                          │
│ Average cumulative reward:       -7.263195894105154                                                                                                                      │
│ Average rollout reward:          -6.693488865412941                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━━━━━━━[0m [35m59.5%[0m Elapsed: [33m0:02:02[0m Remaining: [36m0:01:25[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 47000 ===                                                                                                                                                  │
│ 47001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 46868, 46871, 46874, 46951, 47000]                                                                                                                          │
│ Average cumulative reward:       -7.263195894105154                                                                                                                      │
│ Average rollout reward:          -6.693488865412941                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K47/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m59.5%[0m Elapsed: [33m0:02:03[0m Remaining: [36m0:01:25[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 47000 ===                                                                                                                                                  │
│ 47001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 46868, 46871, 46874, 46951, 47000]                                                                                                                          │
│ Average cumulative reward:       -7.263195894105154                                                                                                                      │
│ Average rollout reward:          -6.693488865412941                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K47/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m59.5%[0m Elapsed: [33m0:02:03[0m Remaining: [36m0:01:25[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 47000 ===                                                                                                                                                  │
│ 47001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 46868, 46871, 46874, 46951, 47000]                                                                                                                          │
│ Average cumulative reward:       -7.263195894105154                                                                                                                      │
│ Average rollout reward:          -6.693488865412941                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K47/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m59.5%[0m Elapsed: [33m0:02:04[0m Remaining: [36m0:01:25[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 47000 ===                                                                                                                                                  │
│ 47001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 46868, 46871, 46874, 46951, 47000]                                                                                                                          │
│ Average cumulative reward:       -7.263195894105154                                                                                                                      │
│ Average rollout reward:          -6.693488865412941                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K47/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m59.5%[0m Elapsed: [33m0:02:04[0m Remaining: [36m0:01:25[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 47000 ===                                                                                                                                                  │
│ 47001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 46868, 46871, 46874, 46951, 47000]                                                                                                                          │
│ Average cumulative reward:       -7.263195894105154                                                                                                                      │
│ Average rollout reward:          -6.693488865412941                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K48/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m60.8%[0m Elapsed: [33m0:02:05[0m Remaining: [36m0:01:23[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 48000 ===                                                                                                                                                  │
│ 48001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 47652, 47653, 47993, 48000]                                                                                                                                 │
│ Average cumulative reward:       -6.8431326412230655                                                                                                                     │
│ Average rollout reward:          -6.265116323077216                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K48/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m60.8%[0m Elapsed: [33m0:02:05[0m Remaining: [36m0:01:23[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 48000 ===                                                                                                                                                  │
│ 48001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 47652, 47653, 47993, 48000]                                                                                                                                 │
│ Average cumulative reward:       -6.8431326412230655                                                                                                                     │
│ Average rollout reward:          -6.265116323077216                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K48/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m60.8%[0m Elapsed: [33m0:02:06[0m Remaining: [36m0:01:23[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 48000 ===                                                                                                                                                  │
│ 48001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 47652, 47653, 47993, 48000]                                                                                                                                 │
│ Average cumulative reward:       -6.8431326412230655                                                                                                                     │
│ Average rollout reward:          -6.265116323077216                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K48/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m60.8%[0m Elapsed: [33m0:02:06[0m Remaining: [36m0:01:23[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 48000 ===                                                                                                                                                  │
│ 48001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 47652, 47653, 47993, 48000]                                                                                                                                 │
│ Average cumulative reward:       -6.8431326412230655                                                                                                                     │
│ Average rollout reward:          -6.265116323077216                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K48/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m60.8%[0m Elapsed: [33m0:02:07[0m Remaining: [36m0:01:23[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 48000 ===                                                                                                                                                  │
│ 48001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 47652, 47653, 47993, 48000]                                                                                                                                 │
│ Average cumulative reward:       -6.8431326412230655                                                                                                                     │
│ Average rollout reward:          -6.265116323077216                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K49/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m62.0%[0m Elapsed: [33m0:02:07[0m Remaining: [36m0:01:20[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 49000 ===                                                                                                                                                  │
│ 49001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 36692, 36694, 46523, 48067, 49000]                                                                                                                          │
│ Average cumulative reward:       -6.988694625918679                                                                                                                      │
│ Average rollout reward:          -6.380636411498254                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K49/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m62.0%[0m Elapsed: [33m0:02:08[0m Remaining: [36m0:01:20[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 49000 ===                                                                                                                                                  │
│ 49001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 36692, 36694, 46523, 48067, 49000]                                                                                                                          │
│ Average cumulative reward:       -6.988694625918679                                                                                                                      │
│ Average rollout reward:          -6.380636411498254                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K49/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m62.0%[0m Elapsed: [33m0:02:08[0m Remaining: [36m0:01:20[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 49000 ===                                                                                                                                                  │
│ 49001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 36692, 36694, 46523, 48067, 49000]                                                                                                                          │
│ Average cumulative reward:       -6.988694625918679                                                                                                                      │
│ Average rollout reward:          -6.380636411498254                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K49/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m62.0%[0m Elapsed: [33m0:02:09[0m Remaining: [36m0:01:20[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 49000 ===                                                                                                                                                  │
│ 49001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 36692, 36694, 46523, 48067, 49000]                                                                                                                          │
│ Average cumulative reward:       -6.988694625918679                                                                                                                      │
│ Average rollout reward:          -6.380636411498254                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K49/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m62.0%[0m Elapsed: [33m0:02:09[0m Remaining: [36m0:01:20[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 49000 ===                                                                                                                                                  │
│ 49001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 36692, 36694, 46523, 48067, 49000]                                                                                                                          │
│ Average cumulative reward:       -6.988694625918679                                                                                                                      │
│ Average rollout reward:          -6.380636411498254                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K50/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m63.3%[0m Elapsed: [33m0:02:10[0m Remaining: [36m0:01:17[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 50000 ===                                                                                                                                                  │
│ 50001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1857, 1859, 46076, 47387, 50000]                                                                                                                            │
│ Average cumulative reward:       -7.145270540760612                                                                                                                      │
│ Average rollout reward:          -6.496450543382473                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K50/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m63.3%[0m Elapsed: [33m0:02:10[0m Remaining: [36m0:01:17[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 50000 ===                                                                                                                                                  │
│ 50001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1857, 1859, 46076, 47387, 50000]                                                                                                                            │
│ Average cumulative reward:       -7.145270540760612                                                                                                                      │
│ Average rollout reward:          -6.496450543382473                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K50/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m63.3%[0m Elapsed: [33m0:02:11[0m Remaining: [36m0:01:17[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 50000 ===                                                                                                                                                  │
│ 50001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1857, 1859, 46076, 47387, 50000]                                                                                                                            │
│ Average cumulative reward:       -7.145270540760612                                                                                                                      │
│ Average rollout reward:          -6.496450543382473                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K50/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m63.3%[0m Elapsed: [33m0:02:11[0m Remaining: [36m0:01:17[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 50000 ===                                                                                                                                                  │
│ 50001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1857, 1859, 46076, 47387, 50000]                                                                                                                            │
│ Average cumulative reward:       -7.145270540760612                                                                                                                      │
│ Average rollout reward:          -6.496450543382473                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K50/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m63.3%[0m Elapsed: [33m0:02:12[0m Remaining: [36m0:01:17[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 50000 ===                                                                                                                                                  │
│ 50001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1857, 1859, 46076, 47387, 50000]                                                                                                                            │
│ Average cumulative reward:       -7.145270540760612                                                                                                                      │
│ Average rollout reward:          -6.496450543382473                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━[0m [35m63.3%[0m Elapsed: [33m0:02:12[0m Remaining: [36m0:01:17[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 50000 ===                                                                                                                                                  │
│ 50001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1857, 1859, 46076, 47387, 50000]                                                                                                                            │
│ Average cumulative reward:       -7.145270540760612                                                                                                                      │
│ Average rollout reward:          -6.496450543382473                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K51/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m64.6%[0m Elapsed: [33m0:02:13[0m Remaining: [36m0:01:14[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 51000 ===                                                                                                                                                  │
│ 51001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 50872, 50875, 51000]                                                                                                                                        │
│ Average cumulative reward:       -7.280065518577674                                                                                                                      │
│ Average rollout reward:          -6.675473242707381                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K51/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m64.6%[0m Elapsed: [33m0:02:13[0m Remaining: [36m0:01:14[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 51000 ===                                                                                                                                                  │
│ 51001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 50872, 50875, 51000]                                                                                                                                        │
│ Average cumulative reward:       -7.280065518577674                                                                                                                      │
│ Average rollout reward:          -6.675473242707381                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K51/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m64.6%[0m Elapsed: [33m0:02:14[0m Remaining: [36m0:01:14[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 51000 ===                                                                                                                                                  │
│ 51001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 50872, 50875, 51000]                                                                                                                                        │
│ Average cumulative reward:       -7.280065518577674                                                                                                                      │
│ Average rollout reward:          -6.675473242707381                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K51/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m64.6%[0m Elapsed: [33m0:02:14[0m Remaining: [36m0:01:14[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 51000 ===                                                                                                                                                  │
│ 51001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 50872, 50875, 51000]                                                                                                                                        │
│ Average cumulative reward:       -7.280065518577674                                                                                                                      │
│ Average rollout reward:          -6.675473242707381                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K51/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m64.6%[0m Elapsed: [33m0:02:15[0m Remaining: [36m0:01:14[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 51000 ===                                                                                                                                                  │
│ 51001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 50872, 50875, 51000]                                                                                                                                        │
│ Average cumulative reward:       -7.280065518577674                                                                                                                      │
│ Average rollout reward:          -6.675473242707381                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K52/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━[0m [35m65.8%[0m Elapsed: [33m0:02:15[0m Remaining: [36m0:01:12[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 52000 ===                                                                                                                                                  │
│ 52001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 51698, 51700, 51745, 51750, 52000]                                                                                                                          │
│ Average cumulative reward:       -7.272308456261206                                                                                                                      │
│ Average rollout reward:          -6.647231462070111                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K52/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━[0m [35m65.8%[0m Elapsed: [33m0:02:16[0m Remaining: [36m0:01:12[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 52000 ===                                                                                                                                                  │
│ 52001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 51698, 51700, 51745, 51750, 52000]                                                                                                                          │
│ Average cumulative reward:       -7.272308456261206                                                                                                                      │
│ Average rollout reward:          -6.647231462070111                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K52/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━[0m [35m65.8%[0m Elapsed: [33m0:02:16[0m Remaining: [36m0:01:12[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 52000 ===                                                                                                                                                  │
│ 52001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 51698, 51700, 51745, 51750, 52000]                                                                                                                          │
│ Average cumulative reward:       -7.272308456261206                                                                                                                      │
│ Average rollout reward:          -6.647231462070111                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K52/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━[0m [35m65.8%[0m Elapsed: [33m0:02:17[0m Remaining: [36m0:01:12[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 52000 ===                                                                                                                                                  │
│ 52001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 51698, 51700, 51745, 51750, 52000]                                                                                                                          │
│ Average cumulative reward:       -7.272308456261206                                                                                                                      │
│ Average rollout reward:          -6.647231462070111                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K52/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━[0m [35m65.8%[0m Elapsed: [33m0:02:17[0m Remaining: [36m0:01:12[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 52000 ===                                                                                                                                                  │
│ 52001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 51698, 51700, 51745, 51750, 52000]                                                                                                                          │
│ Average cumulative reward:       -7.272308456261206                                                                                                                      │
│ Average rollout reward:          -6.647231462070111                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K53/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━[0m [35m67.1%[0m Elapsed: [33m0:02:18[0m Remaining: [36m0:01:09[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 53000 ===                                                                                                                                                  │
│ 53001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1351, 1352, 49544, 50052, 53000]                                                                                                                            │
│ Average cumulative reward:       -7.3462323270876695                                                                                                                     │
│ Average rollout reward:          -6.6822074861477505                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K53/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━[0m [35m67.1%[0m Elapsed: [33m0:02:18[0m Remaining: [36m0:01:09[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 53000 ===                                                                                                                                                  │
│ 53001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1351, 1352, 49544, 50052, 53000]                                                                                                                            │
│ Average cumulative reward:       -7.3462323270876695                                                                                                                     │
│ Average rollout reward:          -6.6822074861477505                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K53/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━[0m [35m67.1%[0m Elapsed: [33m0:02:19[0m Remaining: [36m0:01:09[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 53000 ===                                                                                                                                                  │
│ 53001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1351, 1352, 49544, 50052, 53000]                                                                                                                            │
│ Average cumulative reward:       -7.3462323270876695                                                                                                                     │
│ Average rollout reward:          -6.6822074861477505                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K53/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━[0m [35m67.1%[0m Elapsed: [33m0:02:19[0m Remaining: [36m0:01:09[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 53000 ===                                                                                                                                                  │
│ 53001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1351, 1352, 49544, 50052, 53000]                                                                                                                            │
│ Average cumulative reward:       -7.3462323270876695                                                                                                                     │
│ Average rollout reward:          -6.6822074861477505                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K53/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━[0m [35m67.1%[0m Elapsed: [33m0:02:20[0m Remaining: [36m0:01:09[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 53000 ===                                                                                                                                                  │
│ 53001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 1351, 1352, 49544, 50052, 53000]                                                                                                                            │
│ Average cumulative reward:       -7.3462323270876695                                                                                                                     │
│ Average rollout reward:          -6.6822074861477505                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K54/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━[0m [35m68.4%[0m Elapsed: [33m0:02:20[0m Remaining: [36m0:01:07[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 54000 ===                                                                                                                                                  │
│ 54001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 3919, 3920, 48974, 53361, 54000]                                                                                                                            │
│ Average cumulative reward:       -7.490239167821813                                                                                                                      │
│ Average rollout reward:          -6.8466243966843905                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K54/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━[0m [35m68.4%[0m Elapsed: [33m0:02:21[0m Remaining: [36m0:01:07[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 54000 ===                                                                                                                                                  │
│ 54001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 3919, 3920, 48974, 53361, 54000]                                                                                                                            │
│ Average cumulative reward:       -7.490239167821813                                                                                                                      │
│ Average rollout reward:          -6.8466243966843905                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K54/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━[0m [35m68.4%[0m Elapsed: [33m0:02:21[0m Remaining: [36m0:01:07[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 54000 ===                                                                                                                                                  │
│ 54001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 3919, 3920, 48974, 53361, 54000]                                                                                                                            │
│ Average cumulative reward:       -7.490239167821813                                                                                                                      │
│ Average rollout reward:          -6.8466243966843905                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K54/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━[0m [35m68.4%[0m Elapsed: [33m0:02:22[0m Remaining: [36m0:01:07[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 54000 ===                                                                                                                                                  │
│ 54001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 3919, 3920, 48974, 53361, 54000]                                                                                                                            │
│ Average cumulative reward:       -7.490239167821813                                                                                                                      │
│ Average rollout reward:          -6.8466243966843905                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━[0m [35m68.4%[0m Elapsed: [33m0:02:22[0m Remaining: [36m0:01:07[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 54000 ===                                                                                                                                                  │
│ 54001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 3919, 3920, 48974, 53361, 54000]                                                                                                                            │
│ Average cumulative reward:       -7.490239167821813                                                                                                                      │
│ Average rollout reward:          -6.8466243966843905                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K54/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━[0m [35m68.4%[0m Elapsed: [33m0:02:23[0m Remaining: [36m0:01:07[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 54000 ===                                                                                                                                                  │
│ 54001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 3919, 3920, 48974, 53361, 54000]                                                                                                                            │
│ Average cumulative reward:       -7.490239167821813                                                                                                                      │
│ Average rollout reward:          -6.8466243966843905                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K55/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━[0m [35m69.6%[0m Elapsed: [33m0:02:23[0m Remaining: [36m0:01:04[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 55000 ===                                                                                                                                                  │
│ 55001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21639, 21641, 21719, 21727, 21767, 55000]                                                                                                                   │
│ Average cumulative reward:       -7.388593761553804                                                                                                                      │
│ Average rollout reward:          -6.738514004553648                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K55/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━[0m [35m69.6%[0m Elapsed: [33m0:02:24[0m Remaining: [36m0:01:04[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 55000 ===                                                                                                                                                  │
│ 55001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21639, 21641, 21719, 21727, 21767, 55000]                                                                                                                   │
│ Average cumulative reward:       -7.388593761553804                                                                                                                      │
│ Average rollout reward:          -6.738514004553648                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K55/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━[0m [35m69.6%[0m Elapsed: [33m0:02:24[0m Remaining: [36m0:01:04[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 55000 ===                                                                                                                                                  │
│ 55001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21639, 21641, 21719, 21727, 21767, 55000]                                                                                                                   │
│ Average cumulative reward:       -7.388593761553804                                                                                                                      │
│ Average rollout reward:          -6.738514004553648                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K55/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━[0m [35m69.6%[0m Elapsed: [33m0:02:25[0m Remaining: [36m0:01:04[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 55000 ===                                                                                                                                                  │
│ 55001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21639, 21641, 21719, 21727, 21767, 55000]                                                                                                                   │
│ Average cumulative reward:       -7.388593761553804                                                                                                                      │
│ Average rollout reward:          -6.738514004553648                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K55/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━[0m [35m69.6%[0m Elapsed: [33m0:02:25[0m Remaining: [36m0:01:04[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 55000 ===                                                                                                                                                  │
│ 55001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 21639, 21641, 21719, 21727, 21767, 55000]                                                                                                                   │
│ Average cumulative reward:       -7.388593761553804                                                                                                                      │
│ Average rollout reward:          -6.738514004553648                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K56/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━[0m [35m70.9%[0m Elapsed: [33m0:02:26[0m Remaining: [36m0:01:02[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 56000 ===                                                                                                                                                  │
│ 56001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 62, 65, 859, 1007, 56000]                                                                                                                                   │
│ Average cumulative reward:       -7.4219642798339756                                                                                                                     │
│ Average rollout reward:          -6.739979178092679                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K56/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━[0m [35m70.9%[0m Elapsed: [33m0:02:26[0m Remaining: [36m0:01:02[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 56000 ===                                                                                                                                                  │
│ 56001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 62, 65, 859, 1007, 56000]                                                                                                                                   │
│ Average cumulative reward:       -7.4219642798339756                                                                                                                     │
│ Average rollout reward:          -6.739979178092679                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K56/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━[0m [35m70.9%[0m Elapsed: [33m0:02:27[0m Remaining: [36m0:01:02[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 56000 ===                                                                                                                                                  │
│ 56001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 62, 65, 859, 1007, 56000]                                                                                                                                   │
│ Average cumulative reward:       -7.4219642798339756                                                                                                                     │
│ Average rollout reward:          -6.739979178092679                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K56/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━[0m [35m70.9%[0m Elapsed: [33m0:02:27[0m Remaining: [36m0:01:02[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 56000 ===                                                                                                                                                  │
│ 56001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 62, 65, 859, 1007, 56000]                                                                                                                                   │
│ Average cumulative reward:       -7.4219642798339756                                                                                                                     │
│ Average rollout reward:          -6.739979178092679                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K56/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━[0m [35m70.9%[0m Elapsed: [33m0:02:28[0m Remaining: [36m0:01:02[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 56000 ===                                                                                                                                                  │
│ 56001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 62, 65, 859, 1007, 56000]                                                                                                                                   │
│ Average cumulative reward:       -7.4219642798339756                                                                                                                     │
│ Average rollout reward:          -6.739979178092679                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K57/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━[0m [35m72.2%[0m Elapsed: [33m0:02:28[0m Remaining: [36m0:00:59[0m   2.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 57000 ===                                                                                                                                                  │
│ 57001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 56820, 56823, 56855, 56863, 57000]                                                                                                                          │
│ Average cumulative reward:       -7.2524381235765105                                                                                                                     │
│ Average rollout reward:          -6.5987784187719125                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K57/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━[0m [35m72.2%[0m Elapsed: [33m0:02:29[0m Remaining: [36m0:00:59[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 57000 ===                                                                                                                                                  │
│ 57001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 56820, 56823, 56855, 56863, 57000]                                                                                                                          │
│ Average cumulative reward:       -7.2524381235765105                                                                                                                     │
│ Average rollout reward:          -6.5987784187719125                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K57/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━[0m [35m72.2%[0m Elapsed: [33m0:02:29[0m Remaining: [36m0:00:59[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 57000 ===                                                                                                                                                  │
│ 57001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 56820, 56823, 56855, 56863, 57000]                                                                                                                          │
│ Average cumulative reward:       -7.2524381235765105                                                                                                                     │
│ Average rollout reward:          -6.5987784187719125                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K57/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━[0m [35m72.2%[0m Elapsed: [33m0:02:30[0m Remaining: [36m0:00:59[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 57000 ===                                                                                                                                                  │
│ 57001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 56820, 56823, 56855, 56863, 57000]                                                                                                                          │
│ Average cumulative reward:       -7.2524381235765105                                                                                                                     │
│ Average rollout reward:          -6.5987784187719125                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K57/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━[0m [35m72.2%[0m Elapsed: [33m0:02:30[0m Remaining: [36m0:00:59[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 57000 ===                                                                                                                                                  │
│ 57001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 56820, 56823, 56855, 56863, 57000]                                                                                                                          │
│ Average cumulative reward:       -7.2524381235765105                                                                                                                     │
│ Average rollout reward:          -6.5987784187719125                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K57/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━[0m [35m72.2%[0m Elapsed: [33m0:02:31[0m Remaining: [36m0:00:59[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 57000 ===                                                                                                                                                  │
│ 57001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 56820, 56823, 56855, 56863, 57000]                                                                                                                          │
│ Average cumulative reward:       -7.2524381235765105                                                                                                                     │
│ Average rollout reward:          -6.5987784187719125                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K58/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━[0m [35m73.4%[0m Elapsed: [33m0:02:31[0m Remaining: [36m0:00:57[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 58000 ===                                                                                                                                                  │
│ 58001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 57703, 57704, 57867, 57948, 58000]                                                                                                                          │
│ Average cumulative reward:       -7.308426853012629                                                                                                                      │
│ Average rollout reward:          -6.70568725434412                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K58/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━[0m [35m73.4%[0m Elapsed: [33m0:02:32[0m Remaining: [36m0:00:57[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 58000 ===                                                                                                                                                  │
│ 58001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 57703, 57704, 57867, 57948, 58000]                                                                                                                          │
│ Average cumulative reward:       -7.308426853012629                                                                                                                      │
│ Average rollout reward:          -6.70568725434412                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━[0m [35m73.4%[0m Elapsed: [33m0:02:32[0m Remaining: [36m0:00:57[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 58000 ===                                                                                                                                                  │
│ 58001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 57703, 57704, 57867, 57948, 58000]                                                                                                                          │
│ Average cumulative reward:       -7.308426853012629                                                                                                                      │
│ Average rollout reward:          -6.70568725434412                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K58/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━[0m [35m73.4%[0m Elapsed: [33m0:02:33[0m Remaining: [36m0:00:57[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 58000 ===                                                                                                                                                  │
│ 58001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 57703, 57704, 57867, 57948, 58000]                                                                                                                          │
│ Average cumulative reward:       -7.308426853012629                                                                                                                      │
│ Average rollout reward:          -6.70568725434412                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K58/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━[0m [35m73.4%[0m Elapsed: [33m0:02:33[0m Remaining: [36m0:00:57[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 58000 ===                                                                                                                                                  │
│ 58001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 57703, 57704, 57867, 57948, 58000]                                                                                                                          │
│ Average cumulative reward:       -7.308426853012629                                                                                                                      │
│ Average rollout reward:          -6.70568725434412                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K58/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━[0m [35m73.4%[0m Elapsed: [33m0:02:34[0m Remaining: [36m0:00:57[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 58000 ===                                                                                                                                                  │
│ 58001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 57703, 57704, 57867, 57948, 58000]                                                                                                                          │
│ Average cumulative reward:       -7.308426853012629                                                                                                                      │
│ Average rollout reward:          -6.70568725434412                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K59/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━[0m [35m74.7%[0m Elapsed: [33m0:02:34[0m Remaining: [36m0:00:54[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 59000 ===                                                                                                                                                  │
│ 59001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 58593, 58594, 59000]                                                                                                                                        │
│ Average cumulative reward:       -6.8470067811937305                                                                                                                     │
│ Average rollout reward:          -6.235436467693274                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K59/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━[0m [35m74.7%[0m Elapsed: [33m0:02:35[0m Remaining: [36m0:00:54[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 59000 ===                                                                                                                                                  │
│ 59001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 58593, 58594, 59000]                                                                                                                                        │
│ Average cumulative reward:       -6.8470067811937305                                                                                                                     │
│ Average rollout reward:          -6.235436467693274                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K59/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━[0m [35m74.7%[0m Elapsed: [33m0:02:35[0m Remaining: [36m0:00:54[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 59000 ===                                                                                                                                                  │
│ 59001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 58593, 58594, 59000]                                                                                                                                        │
│ Average cumulative reward:       -6.8470067811937305                                                                                                                     │
│ Average rollout reward:          -6.235436467693274                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K59/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━[0m [35m74.7%[0m Elapsed: [33m0:02:36[0m Remaining: [36m0:00:54[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 59000 ===                                                                                                                                                  │
│ 59001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 58593, 58594, 59000]                                                                                                                                        │
│ Average cumulative reward:       -6.8470067811937305                                                                                                                     │
│ Average rollout reward:          -6.235436467693274                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K59/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━[0m [35m74.7%[0m Elapsed: [33m0:02:36[0m Remaining: [36m0:00:54[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 59000 ===                                                                                                                                                  │
│ 59001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 58593, 58594, 59000]                                                                                                                                        │
│ Average cumulative reward:       -6.8470067811937305                                                                                                                     │
│ Average rollout reward:          -6.235436467693274                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K60/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━[0m [35m75.9%[0m Elapsed: [33m0:02:37[0m Remaining: [36m0:00:52[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 60000 ===                                                                                                                                                  │
│ 60001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2087, 2088, 2098, 31237, 60000]                                                                                                                             │
│ Average cumulative reward:       -7.210602416783267                                                                                                                      │
│ Average rollout reward:          -6.560800520770925                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K60/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━[0m [35m75.9%[0m Elapsed: [33m0:02:38[0m Remaining: [36m0:00:52[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 60000 ===                                                                                                                                                  │
│ 60001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2087, 2088, 2098, 31237, 60000]                                                                                                                             │
│ Average cumulative reward:       -7.210602416783267                                                                                                                      │
│ Average rollout reward:          -6.560800520770925                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K60/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━[0m [35m75.9%[0m Elapsed: [33m0:02:38[0m Remaining: [36m0:00:52[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 60000 ===                                                                                                                                                  │
│ 60001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2087, 2088, 2098, 31237, 60000]                                                                                                                             │
│ Average cumulative reward:       -7.210602416783267                                                                                                                      │
│ Average rollout reward:          -6.560800520770925                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K60/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━[0m [35m75.9%[0m Elapsed: [33m0:02:39[0m Remaining: [36m0:00:52[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 60000 ===                                                                                                                                                  │
│ 60001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2087, 2088, 2098, 31237, 60000]                                                                                                                             │
│ Average cumulative reward:       -7.210602416783267                                                                                                                      │
│ Average rollout reward:          -6.560800520770925                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K60/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━[0m [35m75.9%[0m Elapsed: [33m0:02:39[0m Remaining: [36m0:00:52[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 60000 ===                                                                                                                                                  │
│ 60001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 2087, 2088, 2098, 31237, 60000]                                                                                                                             │
│ Average cumulative reward:       -7.210602416783267                                                                                                                      │
│ Average rollout reward:          -6.560800520770925                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K61/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━[0m [35m77.2%[0m Elapsed: [33m0:02:40[0m Remaining: [36m0:00:49[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 61000 ===                                                                                                                                                  │
│ 61001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 670, 672, 15728, 16091, 32449, 38390, 61000]                                                                                                                │
│ Average cumulative reward:       -7.094382330791213                                                                                                                      │
│ Average rollout reward:          -6.562472793148968                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K61/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━[0m [35m77.2%[0m Elapsed: [33m0:02:40[0m Remaining: [36m0:00:49[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 61000 ===                                                                                                                                                  │
│ 61001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 670, 672, 15728, 16091, 32449, 38390, 61000]                                                                                                                │
│ Average cumulative reward:       -7.094382330791213                                                                                                                      │
│ Average rollout reward:          -6.562472793148968                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K61/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━[0m [35m77.2%[0m Elapsed: [33m0:02:41[0m Remaining: [36m0:00:49[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 61000 ===                                                                                                                                                  │
│ 61001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 670, 672, 15728, 16091, 32449, 38390, 61000]                                                                                                                │
│ Average cumulative reward:       -7.094382330791213                                                                                                                      │
│ Average rollout reward:          -6.562472793148968                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K61/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━[0m [35m77.2%[0m Elapsed: [33m0:02:41[0m Remaining: [36m0:00:49[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 61000 ===                                                                                                                                                  │
│ 61001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 670, 672, 15728, 16091, 32449, 38390, 61000]                                                                                                                │
│ Average cumulative reward:       -7.094382330791213                                                                                                                      │
│ Average rollout reward:          -6.562472793148968                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K61/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━[0m [35m77.2%[0m Elapsed: [33m0:02:42[0m Remaining: [36m0:00:49[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 61000 ===                                                                                                                                                  │
│ 61001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 670, 672, 15728, 16091, 32449, 38390, 61000]                                                                                                                │
│ Average cumulative reward:       -7.094382330791213                                                                                                                      │
│ Average rollout reward:          -6.562472793148968                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K62/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━[0m [35m78.5%[0m Elapsed: [33m0:02:42[0m Remaining: [36m0:00:46[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 62000 ===                                                                                                                                                  │
│ 62001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5472, 5473, 5489, 62000]                                                                                                                                    │
│ Average cumulative reward:       -7.197048426683949                                                                                                                      │
│ Average rollout reward:          -6.537555816668402                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━[0m [35m78.5%[0m Elapsed: [33m0:02:43[0m Remaining: [36m0:00:46[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 62000 ===                                                                                                                                                  │
│ 62001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5472, 5473, 5489, 62000]                                                                                                                                    │
│ Average cumulative reward:       -7.197048426683949                                                                                                                      │
│ Average rollout reward:          -6.537555816668402                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K62/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━[0m [35m78.5%[0m Elapsed: [33m0:02:43[0m Remaining: [36m0:00:46[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 62000 ===                                                                                                                                                  │
│ 62001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5472, 5473, 5489, 62000]                                                                                                                                    │
│ Average cumulative reward:       -7.197048426683949                                                                                                                      │
│ Average rollout reward:          -6.537555816668402                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K62/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━[0m [35m78.5%[0m Elapsed: [33m0:02:44[0m Remaining: [36m0:00:46[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 62000 ===                                                                                                                                                  │
│ 62001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5472, 5473, 5489, 62000]                                                                                                                                    │
│ Average cumulative reward:       -7.197048426683949                                                                                                                      │
│ Average rollout reward:          -6.537555816668402                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K62/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━[0m [35m78.5%[0m Elapsed: [33m0:02:44[0m Remaining: [36m0:00:46[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 62000 ===                                                                                                                                                  │
│ 62001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 5472, 5473, 5489, 62000]                                                                                                                                    │
│ Average cumulative reward:       -7.197048426683949                                                                                                                      │
│ Average rollout reward:          -6.537555816668402                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K63/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━[0m [35m79.7%[0m Elapsed: [33m0:02:45[0m Remaining: [36m0:00:44[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 63000 ===                                                                                                                                                  │
│ 63001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13902, 13904, 59334, 60072, 63000]                                                                                                                          │
│ Average cumulative reward:       -7.105984314024573                                                                                                                      │
│ Average rollout reward:          -6.463038600555488                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K63/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━[0m [35m79.7%[0m Elapsed: [33m0:02:45[0m Remaining: [36m0:00:44[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 63000 ===                                                                                                                                                  │
│ 63001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13902, 13904, 59334, 60072, 63000]                                                                                                                          │
│ Average cumulative reward:       -7.105984314024573                                                                                                                      │
│ Average rollout reward:          -6.463038600555488                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K63/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━[0m [35m79.7%[0m Elapsed: [33m0:02:46[0m Remaining: [36m0:00:44[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 63000 ===                                                                                                                                                  │
│ 63001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13902, 13904, 59334, 60072, 63000]                                                                                                                          │
│ Average cumulative reward:       -7.105984314024573                                                                                                                      │
│ Average rollout reward:          -6.463038600555488                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K63/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━[0m [35m79.7%[0m Elapsed: [33m0:02:46[0m Remaining: [36m0:00:44[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 63000 ===                                                                                                                                                  │
│ 63001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13902, 13904, 59334, 60072, 63000]                                                                                                                          │
│ Average cumulative reward:       -7.105984314024573                                                                                                                      │
│ Average rollout reward:          -6.463038600555488                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K63/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━[0m [35m79.7%[0m Elapsed: [33m0:02:47[0m Remaining: [36m0:00:44[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 63000 ===                                                                                                                                                  │
│ 63001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13902, 13904, 59334, 60072, 63000]                                                                                                                          │
│ Average cumulative reward:       -7.105984314024573                                                                                                                      │
│ Average rollout reward:          -6.463038600555488                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K63/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━[0m [35m79.7%[0m Elapsed: [33m0:02:47[0m Remaining: [36m0:00:44[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 63000 ===                                                                                                                                                  │
│ 63001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 13902, 13904, 59334, 60072, 63000]                                                                                                                          │
│ Average cumulative reward:       -7.105984314024573                                                                                                                      │
│ Average rollout reward:          -6.463038600555488                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K64/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━[0m [35m81.0%[0m Elapsed: [33m0:02:48[0m Remaining: [36m0:00:41[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 64000 ===                                                                                                                                                  │
│ 64001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 798, 800, 813, 64000]                                                                                                                                       │
│ Average cumulative reward:       -7.025551327180665                                                                                                                      │
│ Average rollout reward:          -6.4279210013459425                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K64/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━[0m [35m81.0%[0m Elapsed: [33m0:02:48[0m Remaining: [36m0:00:41[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 64000 ===                                                                                                                                                  │
│ 64001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 798, 800, 813, 64000]                                                                                                                                       │
│ Average cumulative reward:       -7.025551327180665                                                                                                                      │
│ Average rollout reward:          -6.4279210013459425                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K64/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━[0m [35m81.0%[0m Elapsed: [33m0:02:49[0m Remaining: [36m0:00:41[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 64000 ===                                                                                                                                                  │
│ 64001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 798, 800, 813, 64000]                                                                                                                                       │
│ Average cumulative reward:       -7.025551327180665                                                                                                                      │
│ Average rollout reward:          -6.4279210013459425                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K64/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━[0m [35m81.0%[0m Elapsed: [33m0:02:49[0m Remaining: [36m0:00:41[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 64000 ===                                                                                                                                                  │
│ 64001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 798, 800, 813, 64000]                                                                                                                                       │
│ Average cumulative reward:       -7.025551327180665                                                                                                                      │
│ Average rollout reward:          -6.4279210013459425                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K64/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━[0m [35m81.0%[0m Elapsed: [33m0:02:50[0m Remaining: [36m0:00:41[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 64000 ===                                                                                                                                                  │
│ 64001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 798, 800, 813, 64000]                                                                                                                                       │
│ Average cumulative reward:       -7.025551327180665                                                                                                                      │
│ Average rollout reward:          -6.4279210013459425                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K65/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━[0m [35m82.3%[0m Elapsed: [33m0:02:50[0m Remaining: [36m0:00:38[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 65000 ===                                                                                                                                                  │
│ 65001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 20676, 20678, 20742, 20744, 20887, 65000]                                                                                                                   │
│ Average cumulative reward:       -7.252186117112922                                                                                                                      │
│ Average rollout reward:          -6.614604110434585                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K65/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━[0m [35m82.3%[0m Elapsed: [33m0:02:51[0m Remaining: [36m0:00:38[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 65000 ===                                                                                                                                                  │
│ 65001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 20676, 20678, 20742, 20744, 20887, 65000]                                                                                                                   │
│ Average cumulative reward:       -7.252186117112922                                                                                                                      │
│ Average rollout reward:          -6.614604110434585                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K65/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━[0m [35m82.3%[0m Elapsed: [33m0:02:51[0m Remaining: [36m0:00:38[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 65000 ===                                                                                                                                                  │
│ 65001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 20676, 20678, 20742, 20744, 20887, 65000]                                                                                                                   │
│ Average cumulative reward:       -7.252186117112922                                                                                                                      │
│ Average rollout reward:          -6.614604110434585                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K65/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━[0m [35m82.3%[0m Elapsed: [33m0:02:52[0m Remaining: [36m0:00:38[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 65000 ===                                                                                                                                                  │
│ 65001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 20676, 20678, 20742, 20744, 20887, 65000]                                                                                                                   │
│ Average cumulative reward:       -7.252186117112922                                                                                                                      │
│ Average rollout reward:          -6.614604110434585                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K65/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━[0m [35m82.3%[0m Elapsed: [33m0:02:52[0m Remaining: [36m0:00:38[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 65000 ===                                                                                                                                                  │
│ 65001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 20676, 20678, 20742, 20744, 20887, 65000]                                                                                                                   │
│ Average cumulative reward:       -7.252186117112922                                                                                                                      │
│ Average rollout reward:          -6.614604110434585                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯5;237m━━━━━━━[0m [35m82.3%[0m Elapsed: [33m0:02:53[0m Remaining: [36m0:00:38[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 65000 ===                                                                                                                                                  │
│ 65001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 20676, 20678, 20742, 20744, 20887, 65000]                                                                                                                   │
│ Average cumulative reward:       -7.252186117112922                                                                                                                      │
│ Average rollout reward:          -6.614604110434585                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K66/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━[0m [35m83.5%[0m Elapsed: [33m0:02:53[0m Remaining: [36m0:00:36[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 66000 ===                                                                                                                                                  │
│ 66001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 35361, 35362, 58223, 60937, 66000]                                                                                                                          │
│ Average cumulative reward:       -6.984444382861398                                                                                                                      │
│ Average rollout reward:          -6.331198404693956                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K66/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━[0m [35m83.5%[0m Elapsed: [33m0:02:54[0m Remaining: [36m0:00:36[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 66000 ===                                                                                                                                                  │
│ 66001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 35361, 35362, 58223, 60937, 66000]                                                                                                                          │
│ Average cumulative reward:       -6.984444382861398                                                                                                                      │
│ Average rollout reward:          -6.331198404693956                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K66/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━[0m [35m83.5%[0m Elapsed: [33m0:02:54[0m Remaining: [36m0:00:36[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 66000 ===                                                                                                                                                  │
│ 66001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 35361, 35362, 58223, 60937, 66000]                                                                                                                          │
│ Average cumulative reward:       -6.984444382861398                                                                                                                      │
│ Average rollout reward:          -6.331198404693956                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K66/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━[0m [35m83.5%[0m Elapsed: [33m0:02:55[0m Remaining: [36m0:00:36[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 66000 ===                                                                                                                                                  │
│ 66001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 35361, 35362, 58223, 60937, 66000]                                                                                                                          │
│ Average cumulative reward:       -6.984444382861398                                                                                                                      │
│ Average rollout reward:          -6.331198404693956                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K67/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━[0m [35m84.8%[0m Elapsed: [33m0:02:55[0m Remaining: [36m0:00:33[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 67000 ===                                                                                                                                                  │
│ 67001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 66986, 66989, 66993, 67000]                                                                                                                                 │
│ Average cumulative reward:       -6.966808338157196                                                                                                                      │
│ Average rollout reward:          -6.2982861067489235                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K67/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━[0m [35m84.8%[0m Elapsed: [33m0:02:56[0m Remaining: [36m0:00:33[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 67000 ===                                                                                                                                                  │
│ 67001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 66986, 66989, 66993, 67000]                                                                                                                                 │
│ Average cumulative reward:       -6.966808338157196                                                                                                                      │
│ Average rollout reward:          -6.2982861067489235                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K67/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━[0m [35m84.8%[0m Elapsed: [33m0:02:56[0m Remaining: [36m0:00:33[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 67000 ===                                                                                                                                                  │
│ 67001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 66986, 66989, 66993, 67000]                                                                                                                                 │
│ Average cumulative reward:       -6.966808338157196                                                                                                                      │
│ Average rollout reward:          -6.2982861067489235                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K67/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━[0m [35m84.8%[0m Elapsed: [33m0:02:57[0m Remaining: [36m0:00:33[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 67000 ===                                                                                                                                                  │
│ 67001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 66986, 66989, 66993, 67000]                                                                                                                                 │
│ Average cumulative reward:       -6.966808338157196                                                                                                                      │
│ Average rollout reward:          -6.2982861067489235                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K67/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━[0m [35m84.8%[0m Elapsed: [33m0:02:57[0m Remaining: [36m0:00:33[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 67000 ===                                                                                                                                                  │
│ 67001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 66986, 66989, 66993, 67000]                                                                                                                                 │
│ Average cumulative reward:       -6.966808338157196                                                                                                                      │
│ Average rollout reward:          -6.2982861067489235                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K68/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━[0m [35m86.1%[0m Elapsed: [33m0:02:58[0m Remaining: [36m0:00:30[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 68000 ===                                                                                                                                                  │
│ 68001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6136, 6138, 6203, 6206, 8052, 68000]                                                                                                                        │
│ Average cumulative reward:       -7.2334965225317545                                                                                                                     │
│ Average rollout reward:          -6.651114175260165                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K68/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━[0m [35m86.1%[0m Elapsed: [33m0:02:58[0m Remaining: [36m0:00:30[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 68000 ===                                                                                                                                                  │
│ 68001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6136, 6138, 6203, 6206, 8052, 68000]                                                                                                                        │
│ Average cumulative reward:       -7.2334965225317545                                                                                                                     │
│ Average rollout reward:          -6.651114175260165                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K68/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━[0m [35m86.1%[0m Elapsed: [33m0:02:59[0m Remaining: [36m0:00:30[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 68000 ===                                                                                                                                                  │
│ 68001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6136, 6138, 6203, 6206, 8052, 68000]                                                                                                                        │
│ Average cumulative reward:       -7.2334965225317545                                                                                                                     │
│ Average rollout reward:          -6.651114175260165                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K68/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━[0m [35m86.1%[0m Elapsed: [33m0:02:59[0m Remaining: [36m0:00:30[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 68000 ===                                                                                                                                                  │
│ 68001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6136, 6138, 6203, 6206, 8052, 68000]                                                                                                                        │
│ Average cumulative reward:       -7.2334965225317545                                                                                                                     │
│ Average rollout reward:          -6.651114175260165                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K68/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━[0m [35m86.1%[0m Elapsed: [33m0:03:00[0m Remaining: [36m0:00:30[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 68000 ===                                                                                                                                                  │
│ 68001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6136, 6138, 6203, 6206, 8052, 68000]                                                                                                                        │
│ Average cumulative reward:       -7.2334965225317545                                                                                                                     │
│ Average rollout reward:          -6.651114175260165                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K68/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━[0m [35m86.1%[0m Elapsed: [33m0:03:00[0m Remaining: [36m0:00:30[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 68000 ===                                                                                                                                                  │
│ 68001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 6136, 6138, 6203, 6206, 8052, 68000]                                                                                                                        │
│ Average cumulative reward:       -7.2334965225317545                                                                                                                     │
│ Average rollout reward:          -6.651114175260165                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K69/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━[0m [35m87.3%[0m Elapsed: [33m0:03:01[0m Remaining: [36m0:00:27[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 69000 ===                                                                                                                                                  │
│ 69001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 68945, 68949, 68967, 69000]                                                                                                                                 │
│ Average cumulative reward:       -7.528428878632884                                                                                                                      │
│ Average rollout reward:          -6.877263658401603                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K69/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━[0m [35m87.3%[0m Elapsed: [33m0:03:01[0m Remaining: [36m0:00:27[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 69000 ===                                                                                                                                                  │
│ 69001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 68945, 68949, 68967, 69000]                                                                                                                                 │
│ Average cumulative reward:       -7.528428878632884                                                                                                                      │
│ Average rollout reward:          -6.877263658401603                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K69/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━[0m [35m87.3%[0m Elapsed: [33m0:03:02[0m Remaining: [36m0:00:27[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 69000 ===                                                                                                                                                  │
│ 69001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 68945, 68949, 68967, 69000]                                                                                                                                 │
│ Average cumulative reward:       -7.528428878632884                                                                                                                      │
│ Average rollout reward:          -6.877263658401603                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K69/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━[0m [35m87.3%[0m Elapsed: [33m0:03:02[0m Remaining: [36m0:00:27[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 69000 ===                                                                                                                                                  │
│ 69001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 68945, 68949, 68967, 69000]                                                                                                                                 │
│ Average cumulative reward:       -7.528428878632884                                                                                                                      │
│ Average rollout reward:          -6.877263658401603                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯8;5;237m━━━━━[0m [35m87.3%[0m Elapsed: [33m0:03:03[0m Remaining: [36m0:00:27[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 69000 ===                                                                                                                                                  │
│ 69001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 68945, 68949, 68967, 69000]                                                                                                                                 │
│ Average cumulative reward:       -7.528428878632884                                                                                                                      │
│ Average rollout reward:          -6.877263658401603                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K70/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━[0m [35m88.6%[0m Elapsed: [33m0:03:03[0m Remaining: [36m0:00:24[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 70000 ===                                                                                                                                                  │
│ 70001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 69937, 69938, 69957, 69962, 70000]                                                                                                                          │
│ Average cumulative reward:       -7.156633243896364                                                                                                                      │
│ Average rollout reward:          -6.491166721335291                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K70/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━[0m [35m88.6%[0m Elapsed: [33m0:03:04[0m Remaining: [36m0:00:24[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 70000 ===                                                                                                                                                  │
│ 70001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 69937, 69938, 69957, 69962, 70000]                                                                                                                          │
│ Average cumulative reward:       -7.156633243896364                                                                                                                      │
│ Average rollout reward:          -6.491166721335291                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K70/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━[0m [35m88.6%[0m Elapsed: [33m0:03:04[0m Remaining: [36m0:00:24[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 70000 ===                                                                                                                                                  │
│ 70001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 69937, 69938, 69957, 69962, 70000]                                                                                                                          │
│ Average cumulative reward:       -7.156633243896364                                                                                                                      │
│ Average rollout reward:          -6.491166721335291                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K70/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━[0m [35m88.6%[0m Elapsed: [33m0:03:05[0m Remaining: [36m0:00:24[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 70000 ===                                                                                                                                                  │
│ 70001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 69937, 69938, 69957, 69962, 70000]                                                                                                                          │
│ Average cumulative reward:       -7.156633243896364                                                                                                                      │
│ Average rollout reward:          -6.491166721335291                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K70/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━[0m [35m88.6%[0m Elapsed: [33m0:03:05[0m Remaining: [36m0:00:24[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 70000 ===                                                                                                                                                  │
│ 70001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 69937, 69938, 69957, 69962, 70000]                                                                                                                          │
│ Average cumulative reward:       -7.156633243896364                                                                                                                      │
│ Average rollout reward:          -6.491166721335291                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K71/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━[0m [35m89.9%[0m Elapsed: [33m0:03:06[0m Remaining: [36m0:00:22[0m   2.62 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 71000 ===                                                                                                                                                  │
│ 71001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4658, 4662, 60006, 62169, 71000]                                                                                                                            │
│ Average cumulative reward:       -7.318074946119104                                                                                                                      │
│ Average rollout reward:          -6.6638048278263815                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K71/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━[0m [35m89.9%[0m Elapsed: [33m0:03:06[0m Remaining: [36m0:00:22[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 71000 ===                                                                                                                                                  │
│ 71001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4658, 4662, 60006, 62169, 71000]                                                                                                                            │
│ Average cumulative reward:       -7.318074946119104                                                                                                                      │
│ Average rollout reward:          -6.6638048278263815                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K71/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━[0m [35m89.9%[0m Elapsed: [33m0:03:07[0m Remaining: [36m0:00:22[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 71000 ===                                                                                                                                                  │
│ 71001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4658, 4662, 60006, 62169, 71000]                                                                                                                            │
│ Average cumulative reward:       -7.318074946119104                                                                                                                      │
│ Average rollout reward:          -6.6638048278263815                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K71/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━[0m [35m89.9%[0m Elapsed: [33m0:03:07[0m Remaining: [36m0:00:22[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 71000 ===                                                                                                                                                  │
│ 71001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4658, 4662, 60006, 62169, 71000]                                                                                                                            │
│ Average cumulative reward:       -7.318074946119104                                                                                                                      │
│ Average rollout reward:          -6.6638048278263815                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K71/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━[0m [35m89.9%[0m Elapsed: [33m0:03:08[0m Remaining: [36m0:00:22[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 71000 ===                                                                                                                                                  │
│ 71001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4658, 4662, 60006, 62169, 71000]                                                                                                                            │
│ Average cumulative reward:       -7.318074946119104                                                                                                                      │
│ Average rollout reward:          -6.6638048278263815                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K71/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━[0m [35m89.9%[0m Elapsed: [33m0:03:08[0m Remaining: [36m0:00:22[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 71000 ===                                                                                                                                                  │
│ 71001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 4658, 4662, 60006, 62169, 71000]                                                                                                                            │
│ Average cumulative reward:       -7.318074946119104                                                                                                                      │
│ Average rollout reward:          -6.6638048278263815                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K72/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━[0m [35m91.1%[0m Elapsed: [33m0:03:09[0m Remaining: [36m0:00:19[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 72000 ===                                                                                                                                                  │
│ 72001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 71946, 71948, 71969, 72000]                                                                                                                                 │
│ Average cumulative reward:       -7.478269199036443                                                                                                                      │
│ Average rollout reward:          -6.811198216068614                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K72/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━[0m [35m91.1%[0m Elapsed: [33m0:03:09[0m Remaining: [36m0:00:19[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 72000 ===                                                                                                                                                  │
│ 72001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 71946, 71948, 71969, 72000]                                                                                                                                 │
│ Average cumulative reward:       -7.478269199036443                                                                                                                      │
│ Average rollout reward:          -6.811198216068614                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K72/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━[0m [35m91.1%[0m Elapsed: [33m0:03:10[0m Remaining: [36m0:00:19[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 72000 ===                                                                                                                                                  │
│ 72001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 71946, 71948, 71969, 72000]                                                                                                                                 │
│ Average cumulative reward:       -7.478269199036443                                                                                                                      │
│ Average rollout reward:          -6.811198216068614                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K72/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━[0m [35m91.1%[0m Elapsed: [33m0:03:10[0m Remaining: [36m0:00:19[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 72000 ===                                                                                                                                                  │
│ 72001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 71946, 71948, 71969, 72000]                                                                                                                                 │
│ Average cumulative reward:       -7.478269199036443                                                                                                                      │
│ Average rollout reward:          -6.811198216068614                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K72/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━[0m [35m91.1%[0m Elapsed: [33m0:03:11[0m Remaining: [36m0:00:19[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 72000 ===                                                                                                                                                  │
│ 72001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 71946, 71948, 71969, 72000]                                                                                                                                 │
│ Average cumulative reward:       -7.478269199036443                                                                                                                      │
│ Average rollout reward:          -6.811198216068614                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K73/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━[0m [35m92.4%[0m Elapsed: [33m0:03:11[0m Remaining: [36m0:00:16[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 73000 ===                                                                                                                                                  │
│ 73001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 72964, 72966, 72974, 73000]                                                                                                                                 │
│ Average cumulative reward:       -7.227709858174759                                                                                                                      │
│ Average rollout reward:          -6.559931276605887                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K73/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━[0m [35m92.4%[0m Elapsed: [33m0:03:12[0m Remaining: [36m0:00:16[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 73000 ===                                                                                                                                                  │
│ 73001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 72964, 72966, 72974, 73000]                                                                                                                                 │
│ Average cumulative reward:       -7.227709858174759                                                                                                                      │
│ Average rollout reward:          -6.559931276605887                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K73/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━[0m [35m92.4%[0m Elapsed: [33m0:03:12[0m Remaining: [36m0:00:16[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 73000 ===                                                                                                                                                  │
│ 73001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 72964, 72966, 72974, 73000]                                                                                                                                 │
│ Average cumulative reward:       -7.227709858174759                                                                                                                      │
│ Average rollout reward:          -6.559931276605887                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K73/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━[0m [35m92.4%[0m Elapsed: [33m0:03:13[0m Remaining: [36m0:00:16[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 73000 ===                                                                                                                                                  │
│ 73001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 72964, 72966, 72974, 73000]                                                                                                                                 │
│ Average cumulative reward:       -7.227709858174759                                                                                                                      │
│ Average rollout reward:          -6.559931276605887                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯[38;5;237m━━━[0m [35m92.4%[0m Elapsed: [33m0:03:13[0m Remaining: [36m0:00:16[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 73000 ===                                                                                                                                                  │
│ 73001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 72964, 72966, 72974, 73000]                                                                                                                                 │
│ Average cumulative reward:       -7.227709858174759                                                                                                                      │
│ Average rollout reward:          -6.559931276605887                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K73/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━[0m [35m92.4%[0m Elapsed: [33m0:03:14[0m Remaining: [36m0:00:16[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 73000 ===                                                                                                                                                  │
│ 73001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 72964, 72966, 72974, 73000]                                                                                                                                 │
│ Average cumulative reward:       -7.227709858174759                                                                                                                      │
│ Average rollout reward:          -6.559931276605887                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K74/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m93.7%[0m Elapsed: [33m0:03:14[0m Remaining: [36m0:00:14[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 74000 ===                                                                                                                                                  │
│ 74001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 73991, 73992, 74000]                                                                                                                                        │
│ Average cumulative reward:       -7.336082460501022                                                                                                                      │
│ Average rollout reward:          -6.698077703465421                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K74/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m93.7%[0m Elapsed: [33m0:03:15[0m Remaining: [36m0:00:14[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 74000 ===                                                                                                                                                  │
│ 74001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 73991, 73992, 74000]                                                                                                                                        │
│ Average cumulative reward:       -7.336082460501022                                                                                                                      │
│ Average rollout reward:          -6.698077703465421                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K74/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m93.7%[0m Elapsed: [33m0:03:15[0m Remaining: [36m0:00:14[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 74000 ===                                                                                                                                                  │
│ 74001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 73991, 73992, 74000]                                                                                                                                        │
│ Average cumulative reward:       -7.336082460501022                                                                                                                      │
│ Average rollout reward:          -6.698077703465421                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K74/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m93.7%[0m Elapsed: [33m0:03:16[0m Remaining: [36m0:00:14[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 74000 ===                                                                                                                                                  │
│ 74001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 73991, 73992, 74000]                                                                                                                                        │
│ Average cumulative reward:       -7.336082460501022                                                                                                                      │
│ Average rollout reward:          -6.698077703465421                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K74/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m93.7%[0m Elapsed: [33m0:03:16[0m Remaining: [36m0:00:14[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 74000 ===                                                                                                                                                  │
│ 74001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 73991, 73992, 74000]                                                                                                                                        │
│ Average cumulative reward:       -7.336082460501022                                                                                                                      │
│ Average rollout reward:          -6.698077703465421                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K74/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m93.7%[0m Elapsed: [33m0:03:17[0m Remaining: [36m0:00:14[0m   2.67 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 74000 ===                                                                                                                                                  │
│ 74001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 73991, 73992, 74000]                                                                                                                                        │
│ Average cumulative reward:       -7.336082460501022                                                                                                                      │
│ Average rollout reward:          -6.698077703465421                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K75/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━[0m [35m94.9%[0m Elapsed: [33m0:03:17[0m Remaining: [36m0:00:11[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 75000 ===                                                                                                                                                  │
│ 75001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 452, 457, 56048, 56406, 75000]                                                                                                                              │
│ Average cumulative reward:       -7.237750515894822                                                                                                                      │
│ Average rollout reward:          -6.612926585084997                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K75/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━[0m [35m94.9%[0m Elapsed: [33m0:03:18[0m Remaining: [36m0:00:11[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 75000 ===                                                                                                                                                  │
│ 75001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 452, 457, 56048, 56406, 75000]                                                                                                                              │
│ Average cumulative reward:       -7.237750515894822                                                                                                                      │
│ Average rollout reward:          -6.612926585084997                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K75/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━[0m [35m94.9%[0m Elapsed: [33m0:03:18[0m Remaining: [36m0:00:11[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 75000 ===                                                                                                                                                  │
│ 75001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 452, 457, 56048, 56406, 75000]                                                                                                                              │
│ Average cumulative reward:       -7.237750515894822                                                                                                                      │
│ Average rollout reward:          -6.612926585084997                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K75/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━[0m [35m94.9%[0m Elapsed: [33m0:03:19[0m Remaining: [36m0:00:11[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 75000 ===                                                                                                                                                  │
│ 75001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 452, 457, 56048, 56406, 75000]                                                                                                                              │
│ Average cumulative reward:       -7.237750515894822                                                                                                                      │
│ Average rollout reward:          -6.612926585084997                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K75/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━[0m [35m94.9%[0m Elapsed: [33m0:03:19[0m Remaining: [36m0:00:11[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 75000 ===                                                                                                                                                  │
│ 75001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 452, 457, 56048, 56406, 75000]                                                                                                                              │
│ Average cumulative reward:       -7.237750515894822                                                                                                                      │
│ Average rollout reward:          -6.612926585084997                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K76/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━[0m [35m96.2%[0m Elapsed: [33m0:03:20[0m Remaining: [36m0:00:09[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 76000 ===                                                                                                                                                  │
│ 76001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 14660, 14662, 73520, 73645, 76000]                                                                                                                          │
│ Average cumulative reward:       -7.337336341733629                                                                                                                      │
│ Average rollout reward:          -6.656581463165649                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K76/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━[0m [35m96.2%[0m Elapsed: [33m0:03:20[0m Remaining: [36m0:00:09[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 76000 ===                                                                                                                                                  │
│ 76001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 14660, 14662, 73520, 73645, 76000]                                                                                                                          │
│ Average cumulative reward:       -7.337336341733629                                                                                                                      │
│ Average rollout reward:          -6.656581463165649                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K76/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━[0m [35m96.2%[0m Elapsed: [33m0:03:21[0m Remaining: [36m0:00:09[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 76000 ===                                                                                                                                                  │
│ 76001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 14660, 14662, 73520, 73645, 76000]                                                                                                                          │
│ Average cumulative reward:       -7.337336341733629                                                                                                                      │
│ Average rollout reward:          -6.656581463165649                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K76/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━[0m [35m96.2%[0m Elapsed: [33m0:03:21[0m Remaining: [36m0:00:09[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 76000 ===                                                                                                                                                  │
│ 76001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 14660, 14662, 73520, 73645, 76000]                                                                                                                          │
│ Average cumulative reward:       -7.337336341733629                                                                                                                      │
│ Average rollout reward:          -6.656581463165649                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K76/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━[0m [35m96.2%[0m Elapsed: [33m0:03:22[0m Remaining: [36m0:00:09[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 76000 ===                                                                                                                                                  │
│ 76001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 14660, 14662, 73520, 73645, 76000]                                                                                                                          │
│ Average cumulative reward:       -7.337336341733629                                                                                                                      │
│ Average rollout reward:          -6.656581463165649                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K77/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m97.5%[0m Elapsed: [33m0:03:22[0m Remaining: [36m0:00:06[0m   2.63 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 77000 ===                                                                                                                                                  │
│ 77001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17512, 17513, 61574, 77000]                                                                                                                                 │
│ Average cumulative reward:       -7.550063288834438                                                                                                                      │
│ Average rollout reward:          -6.885810535360669                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K77/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m97.5%[0m Elapsed: [33m0:03:23[0m Remaining: [36m0:00:06[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 77000 ===                                                                                                                                                  │
│ 77001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17512, 17513, 61574, 77000]                                                                                                                                 │
│ Average cumulative reward:       -7.550063288834438                                                                                                                      │
│ Average rollout reward:          -6.885810535360669                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯m[38;5;237m━[0m [35m97.5%[0m Elapsed: [33m0:03:23[0m Remaining: [36m0:00:06[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 77000 ===                                                                                                                                                  │
│ 77001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17512, 17513, 61574, 77000]                                                                                                                                 │
│ Average cumulative reward:       -7.550063288834438                                                                                                                      │
│ Average rollout reward:          -6.885810535360669                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K77/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m97.5%[0m Elapsed: [33m0:03:24[0m Remaining: [36m0:00:06[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 77000 ===                                                                                                                                                  │
│ 77001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17512, 17513, 61574, 77000]                                                                                                                                 │
│ Average cumulative reward:       -7.550063288834438                                                                                                                      │
│ Average rollout reward:          -6.885810535360669                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K77/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m97.5%[0m Elapsed: [33m0:03:24[0m Remaining: [36m0:00:06[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 77000 ===                                                                                                                                                  │
│ 77001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17512, 17513, 61574, 77000]                                                                                                                                 │
│ Average cumulative reward:       -7.550063288834438                                                                                                                      │
│ Average rollout reward:          -6.885810535360669                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K77/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m97.5%[0m Elapsed: [33m0:03:25[0m Remaining: [36m0:00:06[0m   2.67 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 77000 ===                                                                                                                                                  │
│ 77001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 17512, 17513, 61574, 77000]                                                                                                                                 │
│ Average cumulative reward:       -7.550063288834438                                                                                                                      │
│ Average rollout reward:          -6.885810535360669                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K78/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m [35m98.7%[0m Elapsed: [33m0:03:25[0m Remaining: [36m0:00:03[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 78000 ===                                                                                                                                                  │
│ 78001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 56820, 56822, 56829, 56848, 57290, 78000]                                                                                                                   │
│ Average cumulative reward:       -6.7954900958141256                                                                                                                     │
│ Average rollout reward:          -6.15183974767384                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K78/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m [35m98.7%[0m Elapsed: [33m0:03:26[0m Remaining: [36m0:00:03[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 78000 ===                                                                                                                                                  │
│ 78001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 56820, 56822, 56829, 56848, 57290, 78000]                                                                                                                   │
│ Average cumulative reward:       -6.7954900958141256                                                                                                                     │
│ Average rollout reward:          -6.15183974767384                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K78/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m [35m98.7%[0m Elapsed: [33m0:03:26[0m Remaining: [36m0:00:03[0m   2.65 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 78000 ===                                                                                                                                                  │
│ 78001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 56820, 56822, 56829, 56848, 57290, 78000]                                                                                                                   │
│ Average cumulative reward:       -6.7954900958141256                                                                                                                     │
│ Average rollout reward:          -6.15183974767384                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K78/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m [35m98.7%[0m Elapsed: [33m0:03:27[0m Remaining: [36m0:00:03[0m   2.66 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 78000 ===                                                                                                                                                  │
│ 78001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 56820, 56822, 56829, 56848, 57290, 78000]                                                                                                                   │
│ Average cumulative reward:       -6.7954900958141256                                                                                                                     │
│ Average rollout reward:          -6.15183974767384                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K78/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m [35m98.7%[0m Elapsed: [33m0:03:27[0m Remaining: [36m0:00:03[0m   2.67 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 78000 ===                                                                                                                                                  │
│ 78001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 56820, 56822, 56829, 56848, 57290, 78000]                                                                                                                   │
│ Average cumulative reward:       -6.7954900958141256                                                                                                                     │
│ Average rollout reward:          -6.15183974767384                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K79/79 [38;2;114;156;31m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m100.0%[0m Elapsed: [33m0:03:28[0m Remaining: [36m0:00:00[0m   2.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 78000 ===                                                                                                                                                  │
│ 78001  nodes in tree                                                                                                                                                     │
│ Path: [0, 2, 56820, 56822, 56829, 56848, 57290, 78000]                                                                                                                   │
│ Average cumulative reward:       -6.7954900958141256                                                                                                                     │
│ Average rollout reward:          -6.15183974767384                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.8316279865309903                                                                                                                             │
│ Best path: [0, 2, 62, 65]                                                                                                                                                │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
[?25hNode 0 is not terminal. Continue.
Node 2 is not terminal. Continue.
Node 62 is not terminal. Continue.
Node 65 is not terminal. Continue.
Node 68359 is not terminal. Continue.
Node 68615 is not terminal. Continue.
Node 69668 is not terminal. Continue.
Node 71884 is not terminal. Continue.
No children found. Stop.
Node 0 is not terminal. Continue.
Node 2 is not terminal. Continue.
Node 245 is not terminal. Continue.
Node 250 is not terminal. Continue.
Node 443 is not terminal. Continue.
Node 5826 is not terminal. Continue.
Node 9657 is not terminal. Continue.
No children found. Stop.
Node 0 is not terminal. Continue.
Node 2 is not terminal. Continue.
Node 62 is not terminal. Continue.
Node 65 is not terminal. Continue.
Node 68359 is not terminal. Continue.
Node 68615 is not terminal. Continue.
Node 69668 is not terminal. Continue.
Node 71884 is not terminal. Continue.
No children found. Stop.
=== RESULT ===
By Visits: estimated reward: -2.87514071639876
sign_newton [29.691267]
sign_newton [0.5899706]
sign_newton [27.22576]
By Value: estimated reward: -2.859410553666163
sign_newton [36.631073]
By Best Value: estimated reward: 0
sign_newton [29.691267]
sign_newton [0.2435745 0.        0.        0.       ]
sign_ns [0.5, 0.616111106172134]
sign_ns [0.5, 1.1045728051352952]
sign_ns [0.5, 1.0085478335561096]
sign_ns [0.5, 1.000054957634437]
Best value of root node:
-1.8316279865309903
Best root policy:
sign_newton [29.691267]
sign_newton [0.2435745 0.        0.        0.       ]
sign_ns [0.5, 0.616111106172134]
sign_ns [0.5, 1.1045728051352952]
sign_ns [0.5, 1.0085478335561096]
sign_ns [0.5, 1.000054957634437]
=== END ===
Finished making algorithm
