Matrix distribution: unif
Matrix distribution config: {'c': 0.25, 'd': 3000, 'eps': 0.001}
Initial matrix shape: torch.Size([3000, 3000])
Algorithm name: mcts
Algorithm config: {'c_ucb': 5.0, 'alpha_pw': 0.4, 'epsilon': 1e-06, 'EXPLORE_K': 5, 'early_termination_epsilon': 1e-05, 'budget': 80000, 'print_every': 1000, 'max_termination_count': 10, 'tree_initial_capacity': 10000, 'device': 'cuda', 'actions': [['sign_ns', [[0, 0], [5, 5]]], ['sign_newton', [[0], [40]]], ['sign_quintic', [[0, 0, 0], [5, 5, 5]]], ['sign_halley', [[0, 0, 0], [40, 40, 40]]]], 'initialize_with_baselines': True}
Actions: ['sign_halley', 'sign_newton', 'sign_ns', 'sign_quintic']
Action sign_halley took 1.0 times longer than sign_halley
Action sign_newton took 0.44487534839033555 times longer than sign_halley
Action sign_ns took 0.10836957858187127 times longer than sign_halley
Action sign_quintic took 0.1594791937731375 times longer than sign_halley
Skipping sign_newton_variant because not all actions are in the tree
Skipping inv_ns because not all actions are in the tree
Skipping inv_ns_chebyshev because not all actions are in the tree
Skipping sqrt_db because not all actions are in the tree
Skipping sqrt_nsv because not all actions are in the tree
Skipping sqrt_visser because not all actions are in the tree
Skipping sqrt_newton because not all actions are in the tree
Skipping sqrt_visser_coupled because not all actions are in the tree
Skipping sqrt_newton_coupled because not all actions are in the tree
Skipping proot_newton because not all actions are in the tree
Skipping proot_visser because not all actions are in the tree
Skipping proot_iannazzo because not all actions are in the tree
0/79 0.0% Elapsed: 0:00:00 Remaining: -:--:-- 502440.86 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 0 ===                                                                                                                                                      │
│ 1  nodes in tree                                                                                                                                                         │
│ [-3.66925209 -3.66925209]                                                                                                                                                │
│ [-2.77762167 -2.77762167]                                                                                                                                                │
│ [-1.40880452 -1.40880452 -1.40880452]                                                                                                                                    │
│ [-1.30043494 -1.30043494 -1.30043494 -1.30043494]                                                                                                                        │
│ [-1.19206536 -1.19206536 -1.19206536 -1.19206536 -1.08369579]                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K0/79 [38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m0.0%[0m Elapsed: [33m0:00:01[0m Remaining: [36m-:--:--[0m 1006316.35 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 0 ===                                                                                                                                                      │
│ 1  nodes in tree                                                                                                                                                         │
│ [-3.66925209 -3.66925209]                                                                                                                                                │
│ [-2.77762167 -2.77762167]                                                                                                                                                │
│ [-1.40880452 -1.40880452 -1.40880452]                                                                                                                                    │
│ [-1.30043494 -1.30043494 -1.30043494 -1.30043494]                                                                                                                        │
│ [-1.19206536 -1.19206536 -1.19206536 -1.19206536 -1.08369579]                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K0/79 [38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m0.0%[0m Elapsed: [33m0:00:01[0m Remaining: [36m-:--:--[0m 1509518.99 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 0 ===                                                                                                                                                      │
│ 1  nodes in tree                                                                                                                                                         │
│ [-3.66925209 -3.66925209]                                                                                                                                                │
│ [-2.77762167 -2.77762167]                                                                                                                                                │
│ [-1.40880452 -1.40880452 -1.40880452]                                                                                                                                    │
│ [-1.30043494 -1.30043494 -1.30043494 -1.30043494]                                                                                                                        │
│ [-1.19206536 -1.19206536 -1.19206536 -1.19206536 -1.08369579]                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K1/79 [38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m1.3%[0m Elapsed: [33m0:00:02[0m Remaining: [36m-:--:--[0m   2.01 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 1000 ===                                                                                                                                                   │
│ 1001  nodes in tree                                                                                                                                                      │
│ Path: [0, 3, 5, 1000]                                                                                                                                                    │
│ Average cumulative reward:       -6.176181697948157                                                                                                                      │
│ Average rollout reward:          -6.1022242977698005                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K1/79 [38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m1.3%[0m Elapsed: [33m0:00:02[0m Remaining: [36m-:--:--[0m   2.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 1000 ===                                                                                                                                                   │
│ 1001  nodes in tree                                                                                                                                                      │
│ Path: [0, 3, 5, 1000]                                                                                                                                                    │
│ Average cumulative reward:       -6.176181697948157                                                                                                                      │
│ Average rollout reward:          -6.1022242977698005                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K1/79 [38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m1.3%[0m Elapsed: [33m0:00:03[0m Remaining: [36m-:--:--[0m   3.02 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 1000 ===                                                                                                                                                   │
│ 1001  nodes in tree                                                                                                                                                      │
│ Path: [0, 3, 5, 1000]                                                                                                                                                    │
│ Average cumulative reward:       -6.176181697948157                                                                                                                      │
│ Average rollout reward:          -6.1022242977698005                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K2/79 [38;2;249;38;114m━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m2.5%[0m Elapsed: [33m0:00:03[0m Remaining: [36m0:01:57[0m   1.76 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 2000 ===                                                                                                                                                   │
│ 2001  nodes in tree                                                                                                                                                      │
│ Path: [0, 3, 1981, 1998, 2000]                                                                                                                                           │
│ Average cumulative reward:       -6.2813351760355784                                                                                                                     │
│ Average rollout reward:          -6.183812386817529                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K2/79 [38;2;249;38;114m━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m2.5%[0m Elapsed: [33m0:00:04[0m Remaining: [36m0:01:57[0m   2.01 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 2000 ===                                                                                                                                                   │
│ 2001  nodes in tree                                                                                                                                                      │
│ Path: [0, 3, 1981, 1998, 2000]                                                                                                                                           │
│ Average cumulative reward:       -6.2813351760355784                                                                                                                     │
│ Average rollout reward:          -6.183812386817529                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K2/79 [38;2;249;38;114m━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m2.5%[0m Elapsed: [33m0:00:04[0m Remaining: [36m0:01:57[0m   2.27 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 2000 ===                                                                                                                                                   │
│ 2001  nodes in tree                                                                                                                                                      │
│ Path: [0, 3, 1981, 1998, 2000]                                                                                                                                           │
│ Average cumulative reward:       -6.2813351760355784                                                                                                                     │
│ Average rollout reward:          -6.183812386817529                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K3/79 [38;2;249;38;114m━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m3.8%[0m Elapsed: [33m0:00:05[0m Remaining: [36m0:01:55[0m   1.68 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 3000 ===                                                                                                                                                   │
│ 3001  nodes in tree                                                                                                                                                      │
│ Path: [0, 3, 2350, 2384, 2388, 3000]                                                                                                                                     │
│ Average cumulative reward:       -6.267442267703058                                                                                                                      │
│ Average rollout reward:          -6.163464810042853                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K3/79 [38;2;249;38;114m━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m3.8%[0m Elapsed: [33m0:00:05[0m Remaining: [36m0:01:55[0m   1.85 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 3000 ===                                                                                                                                                   │
│ 3001  nodes in tree                                                                                                                                                      │
│ Path: [0, 3, 2350, 2384, 2388, 3000]                                                                                                                                     │
│ Average cumulative reward:       -6.267442267703058                                                                                                                      │
│ Average rollout reward:          -6.163464810042853                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
4/79 5.1% Elapsed: 0:00:06 Remaining: 0:01:53 1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 4000 ===                                                                                                                                                   │
│ 4001  nodes in tree                                                                                                                                                      │
│ Path: [0, 3, 4, 3193, 3353, 4000]                                                                                                                                        │
│ Average cumulative reward:       -6.368615857909849                                                                                                                      │
│ Average rollout reward:          -6.262554704463819                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K4/79 [38;2;249;38;114m━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m5.1%[0m Elapsed: [33m0:00:06[0m Remaining: [36m0:01:53[0m   1.64 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 4000 ===                                                                                                                                                   │
│ 4001  nodes in tree                                                                                                                                                      │
│ Path: [0, 3, 4, 3193, 3353, 4000]                                                                                                                                        │
│ Average cumulative reward:       -6.368615857909849                                                                                                                      │
│ Average rollout reward:          -6.262554704463819                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K4/79 [38;2;249;38;114m━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m5.1%[0m Elapsed: [33m0:00:07[0m Remaining: [36m0:01:53[0m   1.76 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 4000 ===                                                                                                                                                   │
│ 4001  nodes in tree                                                                                                                                                      │
│ Path: [0, 3, 4, 3193, 3353, 4000]                                                                                                                                        │
│ Average cumulative reward:       -6.368615857909849                                                                                                                      │
│ Average rollout reward:          -6.262554704463819                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K5/79 [38;2;249;38;114m━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m6.3%[0m Elapsed: [33m0:00:07[0m Remaining: [36m0:01:52[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 5000 ===                                                                                                                                                   │
│ 5001  nodes in tree                                                                                                                                                      │
│ Path: [0, 3, 5000]                                                                                                                                                       │
│ Average cumulative reward:       -6.39079968733952                                                                                                                       │
│ Average rollout reward:          -6.272242036764358                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K5/79 [38;2;249;38;114m━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m6.3%[0m Elapsed: [33m0:00:08[0m Remaining: [36m0:01:52[0m   1.61 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 5000 ===                                                                                                                                                   │
│ 5001  nodes in tree                                                                                                                                                      │
│ Path: [0, 3, 5000]                                                                                                                                                       │
│ Average cumulative reward:       -6.39079968733952                                                                                                                       │
│ Average rollout reward:          -6.272242036764358                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K5/79 [38;2;249;38;114m━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m6.3%[0m Elapsed: [33m0:00:08[0m Remaining: [36m0:01:52[0m   1.71 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 5000 ===                                                                                                                                                   │
│ 5001  nodes in tree                                                                                                                                                      │
│ Path: [0, 3, 5000]                                                                                                                                                       │
│ Average cumulative reward:       -6.39079968733952                                                                                                                       │
│ Average rollout reward:          -6.272242036764358                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K6/79 [38;2;249;38;114m━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m7.6%[0m Elapsed: [33m0:00:09[0m Remaining: [36m0:01:50[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 6000 ===                                                                                                                                                   │
│ 6001  nodes in tree                                                                                                                                                      │
│ Path: [0, 3, 665, 666, 682, 697, 6000]                                                                                                                                   │
│ Average cumulative reward:       -5.845236035721494                                                                                                                      │
│ Average rollout reward:          -5.720877812311179                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K7/79 [38;2;249;38;114m━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m8.9%[0m Elapsed: [33m0:00:10[0m Remaining: [36m0:01:48[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 7000 ===                                                                                                                                                   │
│ 7001  nodes in tree                                                                                                                                                      │
│ Path: [0, 3, 605, 1008, 1068, 1438, 7000]                                                                                                                                │
│ Average cumulative reward:       -6.1456926385193285                                                                                                                     │
│ Average rollout reward:          -6.027817516793198                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K8/79 [38;2;249;38;114m━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m10.1%[0m Elapsed: [33m0:00:12[0m Remaining: [36m0:01:47[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 8000 ===                                                                                                                                                   │
│ 8001  nodes in tree                                                                                                                                                      │
│ Path: [0, 1, 68, 8000]                                                                                                                                                   │
│ Average cumulative reward:       -6.249788613844177                                                                                                                      │
│ Average rollout reward:          -6.094977402755736                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K9/79 [38;2;249;38;114m━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m11.4%[0m Elapsed: [33m0:00:13[0m Remaining: [36m0:01:46[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 9000 ===                                                                                                                                                   │
│ 9001  nodes in tree                                                                                                                                                      │
│ Path: [0, 3, 8939, 8995, 8996, 9000]                                                                                                                                     │
│ Average cumulative reward:       -6.50907292607383                                                                                                                       │
│ Average rollout reward:          -6.396597091559853                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K10/79 [38;2;249;38;114m━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m12.7%[0m Elapsed: [33m0:00:15[0m Remaining: [36m0:01:44[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 10000 ===                                                                                                                                                  │
│ 10001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 9835, 9991, 9994, 10000]                                                                                                                                    │
│ Average cumulative reward:       -5.882876217274146                                                                                                                      │
│ Average rollout reward:          -5.749745892682573                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K11/79 [38;2;249;38;114m━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m13.9%[0m Elapsed: [33m0:00:16[0m Remaining: [36m0:01:42[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 11000 ===                                                                                                                                                  │
│ 11001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 10146, 10217, 10218, 11000]                                                                                                                                 │
│ Average cumulative reward:       -6.245990520564675                                                                                                                      │
│ Average rollout reward:          -6.092971968942259                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K12/79 [38;2;249;38;114m━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m15.2%[0m Elapsed: [33m0:00:18[0m Remaining: [36m0:01:41[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 12000 ===                                                                                                                                                  │
│ 12001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 8939, 11628, 12000]                                                                                                                                         │
│ Average cumulative reward:       -5.712576989415339                                                                                                                      │
│ Average rollout reward:          -5.577857387481843                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K12/79 [38;2;249;38;114m━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m15.2%[0m Elapsed: [33m0:00:18[0m Remaining: [36m0:01:41[0m   1.55 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 12000 ===                                                                                                                                                  │
│ 12001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 8939, 11628, 12000]                                                                                                                                         │
│ Average cumulative reward:       -5.712576989415339                                                                                                                      │
│ Average rollout reward:          -5.577857387481843                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K12/79 [38;2;249;38;114m━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m15.2%[0m Elapsed: [33m0:00:19[0m Remaining: [36m0:01:41[0m   1.59 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 12000 ===                                                                                                                                                  │
│ 12001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 8939, 11628, 12000]                                                                                                                                         │
│ Average cumulative reward:       -5.712576989415339                                                                                                                      │
│ Average rollout reward:          -5.577857387481843                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K13/79 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m16.5%[0m Elapsed: [33m0:00:19[0m Remaining: [36m0:01:39[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 13000 ===                                                                                                                                                  │
│ 13001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 2100, 11427, 11733, 13000]                                                                                                                                  │
│ Average cumulative reward:       -6.047047103334291                                                                                                                      │
│ Average rollout reward:          -5.921850356537348                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K13/79 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m16.5%[0m Elapsed: [33m0:00:20[0m Remaining: [36m0:01:39[0m   1.55 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 13000 ===                                                                                                                                                  │
│ 13001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 2100, 11427, 11733, 13000]                                                                                                                                  │
│ Average cumulative reward:       -6.047047103334291                                                                                                                      │
│ Average rollout reward:          -5.921850356537348                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K13/79 [38;2;249;38;114m━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m16.5%[0m Elapsed: [33m0:00:20[0m Remaining: [36m0:01:39[0m   1.59 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 13000 ===                                                                                                                                                  │
│ 13001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 2100, 11427, 11733, 13000]                                                                                                                                  │
│ Average cumulative reward:       -6.047047103334291                                                                                                                      │
│ Average rollout reward:          -5.921850356537348                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1920653644005839                                                                                                                             │
│ Best path: [0, 3, 5, 64, 76]                                                                                                                                             │
│ [-1.1348054  -1.1348054  -1.1348054  -1.1348054  -1.02643582 -1.02643582]                                                                                                │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K14/79 [38;2;249;38;114m━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m17.7%[0m Elapsed: [33m0:00:21[0m Remaining: [36m0:01:37[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 14000 ===                                                                                                                                                  │
│ 14001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 13952, 13995, 13997, 14000]                                                                                                                                 │
│ Average cumulative reward:       -6.149701185255338                                                                                                                      │
│ Average rollout reward:          -5.988026539615041                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K14/79 [38;2;249;38;114m━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m17.7%[0m Elapsed: [33m0:00:21[0m Remaining: [36m0:01:37[0m   1.55 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 14000 ===                                                                                                                                                  │
│ 14001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 13952, 13995, 13997, 14000]                                                                                                                                 │
│ Average cumulative reward:       -6.149701185255338                                                                                                                      │
│ Average rollout reward:          -5.988026539615041                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K14/79 [38;2;249;38;114m━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m17.7%[0m Elapsed: [33m0:00:22[0m Remaining: [36m0:01:37[0m   1.58 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 14000 ===                                                                                                                                                  │
│ 14001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 13952, 13995, 13997, 14000]                                                                                                                                 │
│ Average cumulative reward:       -6.149701185255338                                                                                                                      │
│ Average rollout reward:          -5.988026539615041                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K15/79 [38;2;249;38;114m━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m19.0%[0m Elapsed: [33m0:00:22[0m Remaining: [36m0:01:36[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 15000 ===                                                                                                                                                  │
│ 15001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 8939, 15000]                                                                                                                                                │
│ Average cumulative reward:       -6.114597561027108                                                                                                                      │
│ Average rollout reward:          -5.955284252047489                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K15/79 [38;2;249;38;114m━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m19.0%[0m Elapsed: [33m0:00:23[0m Remaining: [36m0:01:36[0m   1.54 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 15000 ===                                                                                                                                                  │
│ 15001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 8939, 15000]                                                                                                                                                │
│ Average cumulative reward:       -6.114597561027108                                                                                                                      │
│ Average rollout reward:          -5.955284252047489                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K15/79 [38;2;249;38;114m━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m19.0%[0m Elapsed: [33m0:00:23[0m Remaining: [36m0:01:36[0m   1.58 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 15000 ===                                                                                                                                                  │
│ 15001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 8939, 15000]                                                                                                                                                │
│ Average cumulative reward:       -6.114597561027108                                                                                                                      │
│ Average rollout reward:          -5.955284252047489                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K16/79 [38;2;249;38;114m━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m20.3%[0m Elapsed: [33m0:00:24[0m Remaining: [36m0:01:34[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 16000 ===                                                                                                                                                  │
│ 16001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 15918, 15950, 15954, 16000]                                                                                                                                 │
│ Average cumulative reward:       -6.751305058076089                                                                                                                      │
│ Average rollout reward:          -6.612935081573                                                                                                                         │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K16/79 [38;2;249;38;114m━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m20.3%[0m Elapsed: [33m0:00:24[0m Remaining: [36m0:01:34[0m   1.54 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 16000 ===                                                                                                                                                  │
│ 16001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 15918, 15950, 15954, 16000]                                                                                                                                 │
│ Average cumulative reward:       -6.751305058076089                                                                                                                      │
│ Average rollout reward:          -6.612935081573                                                                                                                         │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K16/79 [38;2;249;38;114m━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m20.3%[0m Elapsed: [33m0:00:25[0m Remaining: [36m0:01:34[0m   1.57 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 16000 ===                                                                                                                                                  │
│ 16001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 15918, 15950, 15954, 16000]                                                                                                                                 │
│ Average cumulative reward:       -6.751305058076089                                                                                                                      │
│ Average rollout reward:          -6.612935081573                                                                                                                         │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K17/79 [38;2;249;38;114m━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m21.5%[0m Elapsed: [33m0:00:25[0m Remaining: [36m0:01:33[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 17000 ===                                                                                                                                                  │
│ 17001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 2350, 14984, 15767, 17000]                                                                                                                                  │
│ Average cumulative reward:       -5.748894464990076                                                                                                                      │
│ Average rollout reward:          -5.593949934745939                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K18/79 [38;2;249;38;114m━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m22.8%[0m Elapsed: [33m0:00:27[0m Remaining: [36m0:01:31[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 18000 ===                                                                                                                                                  │
│ 18001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 11786, 11917, 11920, 11926, 18000]                                                                                                                          │
│ Average cumulative reward:       -5.9174466437415765                                                                                                                     │
│ Average rollout reward:          -5.766015528148321                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K18/79 [38;2;249;38;114m━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m22.8%[0m Elapsed: [33m0:00:27[0m Remaining: [36m0:01:31[0m   1.54 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 18000 ===                                                                                                                                                  │
│ 18001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 11786, 11917, 11920, 11926, 18000]                                                                                                                          │
│ Average cumulative reward:       -5.9174466437415765                                                                                                                     │
│ Average rollout reward:          -5.766015528148321                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K18/79 [38;2;249;38;114m━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m22.8%[0m Elapsed: [33m0:00:28[0m Remaining: [36m0:01:31[0m   1.57 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 18000 ===                                                                                                                                                  │
│ 18001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 11786, 11917, 11920, 11926, 18000]                                                                                                                          │
│ Average cumulative reward:       -5.9174466437415765                                                                                                                     │
│ Average rollout reward:          -5.766015528148321                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K19/79 [38;2;249;38;114m━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m24.1%[0m Elapsed: [33m0:00:28[0m Remaining: [36m0:01:30[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 19000 ===                                                                                                                                                  │
│ 19001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 18942, 18998, 19000]                                                                                                                                        │
│ Average cumulative reward:       -6.529295107977537                                                                                                                      │
│ Average rollout reward:          -6.367403723180069                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K19/79 [38;2;249;38;114m━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m24.1%[0m Elapsed: [33m0:00:29[0m Remaining: [36m0:01:30[0m   1.54 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 19000 ===                                                                                                                                                  │
│ 19001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 18942, 18998, 19000]                                                                                                                                        │
│ Average cumulative reward:       -6.529295107977537                                                                                                                      │
│ Average rollout reward:          -6.367403723180069                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K19/79 [38;2;249;38;114m━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m24.1%[0m Elapsed: [33m0:00:29[0m Remaining: [36m0:01:30[0m   1.56 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 19000 ===                                                                                                                                                  │
│ 19001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 18942, 18998, 19000]                                                                                                                                        │
│ Average cumulative reward:       -6.529295107977537                                                                                                                      │
│ Average rollout reward:          -6.367403723180069                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K20/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m25.3%[0m Elapsed: [33m0:00:30[0m Remaining: [36m0:01:29[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 20000 ===                                                                                                                                                  │
│ 20001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 19866, 19996, 20000]                                                                                                                                        │
│ Average cumulative reward:       -6.226243792983772                                                                                                                      │
│ Average rollout reward:          -6.073198726255821                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K20/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m25.3%[0m Elapsed: [33m0:00:30[0m Remaining: [36m0:01:29[0m   1.54 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 20000 ===                                                                                                                                                  │
│ 20001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 19866, 19996, 20000]                                                                                                                                        │
│ Average cumulative reward:       -6.226243792983772                                                                                                                      │
│ Average rollout reward:          -6.073198726255821                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K20/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m25.3%[0m Elapsed: [33m0:00:31[0m Remaining: [36m0:01:29[0m   1.56 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 20000 ===                                                                                                                                                  │
│ 20001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 19866, 19996, 20000]                                                                                                                                        │
│ Average cumulative reward:       -6.226243792983772                                                                                                                      │
│ Average rollout reward:          -6.073198726255821                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K21/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.6%[0m Elapsed: [33m0:00:31[0m Remaining: [36m0:01:27[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 21000 ===                                                                                                                                                  │
│ 21001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 20817, 20840, 20841, 21000]                                                                                                                                 │
│ Average cumulative reward:       -6.033381457634027                                                                                                                      │
│ Average rollout reward:          -5.902579962815617                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K21/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.6%[0m Elapsed: [33m0:00:32[0m Remaining: [36m0:01:27[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 21000 ===                                                                                                                                                  │
│ 21001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 20817, 20840, 20841, 21000]                                                                                                                                 │
│ Average cumulative reward:       -6.033381457634027                                                                                                                      │
│ Average rollout reward:          -5.902579962815617                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K21/79 [38;2;249;38;114m━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m26.6%[0m Elapsed: [33m0:00:32[0m Remaining: [36m0:01:27[0m   1.56 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 21000 ===                                                                                                                                                  │
│ 21001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 20817, 20840, 20841, 21000]                                                                                                                                 │
│ Average cumulative reward:       -6.033381457634027                                                                                                                      │
│ Average rollout reward:          -5.902579962815617                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K22/79 [38;2;249;38;114m━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m27.8%[0m Elapsed: [33m0:00:33[0m Remaining: [36m0:01:26[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 22000 ===                                                                                                                                                  │
│ 22001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 6797, 19781, 20180, 22000]                                                                                                                                  │
│ Average cumulative reward:       -6.0082145452751705                                                                                                                     │
│ Average rollout reward:          -5.844252741450599                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K22/79 [38;2;249;38;114m━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m27.8%[0m Elapsed: [33m0:00:33[0m Remaining: [36m0:01:26[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 22000 ===                                                                                                                                                  │
│ 22001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 6797, 19781, 20180, 22000]                                                                                                                                  │
│ Average cumulative reward:       -6.0082145452751705                                                                                                                     │
│ Average rollout reward:          -5.844252741450599                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K22/79 [38;2;249;38;114m━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m27.8%[0m Elapsed: [33m0:00:34[0m Remaining: [36m0:01:26[0m   1.56 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 22000 ===                                                                                                                                                  │
│ 22001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 6797, 19781, 20180, 22000]                                                                                                                                  │
│ Average cumulative reward:       -6.0082145452751705                                                                                                                     │
│ Average rollout reward:          -5.844252741450599                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K23/79 [38;2;249;38;114m━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m29.1%[0m Elapsed: [33m0:00:34[0m Remaining: [36m0:01:24[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 23000 ===                                                                                                                                                  │
│ 23001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 22802, 22984, 22987, 22989, 23000]                                                                                                                          │
│ Average cumulative reward:       -6.211936131145768                                                                                                                      │
│ Average rollout reward:          -6.044284160887351                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K24/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m30.4%[0m Elapsed: [33m0:00:36[0m Remaining: [36m0:01:23[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 24000 ===                                                                                                                                                  │
│ 24001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 23835, 23990, 23992, 24000]                                                                                                                                 │
│ Average cumulative reward:       -6.4166590211525945                                                                                                                     │
│ Average rollout reward:          -6.243126698820587                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K24/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m30.4%[0m Elapsed: [33m0:00:36[0m Remaining: [36m0:01:23[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 24000 ===                                                                                                                                                  │
│ 24001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 23835, 23990, 23992, 24000]                                                                                                                                 │
│ Average cumulative reward:       -6.4166590211525945                                                                                                                     │
│ Average rollout reward:          -6.243126698820587                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K24/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m30.4%[0m Elapsed: [33m0:00:37[0m Remaining: [36m0:01:23[0m   1.55 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 24000 ===                                                                                                                                                  │
│ 24001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 23835, 23990, 23992, 24000]                                                                                                                                 │
│ Average cumulative reward:       -6.4166590211525945                                                                                                                     │
│ Average rollout reward:          -6.243126698820587                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K25/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m31.6%[0m Elapsed: [33m0:00:37[0m Remaining: [36m0:01:21[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 25000 ===                                                                                                                                                  │
│ 25001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 19401, 19583, 19586, 25000]                                                                                                                                 │
│ Average cumulative reward:       -6.175101683919711                                                                                                                      │
│ Average rollout reward:          -6.006976443482748                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K25/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m31.6%[0m Elapsed: [33m0:00:38[0m Remaining: [36m0:01:21[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 25000 ===                                                                                                                                                  │
│ 25001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 19401, 19583, 19586, 25000]                                                                                                                                 │
│ Average cumulative reward:       -6.175101683919711                                                                                                                      │
│ Average rollout reward:          -6.006976443482748                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K25/79 [38;2;249;38;114m━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m31.6%[0m Elapsed: [33m0:00:38[0m Remaining: [36m0:01:21[0m   1.55 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 25000 ===                                                                                                                                                  │
│ 25001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 19401, 19583, 19586, 25000]                                                                                                                                 │
│ Average cumulative reward:       -6.175101683919711                                                                                                                      │
│ Average rollout reward:          -6.006976443482748                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K26/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m32.9%[0m Elapsed: [33m0:00:39[0m Remaining: [36m0:01:20[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 26000 ===                                                                                                                                                  │
│ 26001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 25990, 26000]                                                                                                                                               │
│ Average cumulative reward:       -6.384904009348728                                                                                                                      │
│ Average rollout reward:          -6.238024220545933                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K26/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m32.9%[0m Elapsed: [33m0:00:39[0m Remaining: [36m0:01:20[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 26000 ===                                                                                                                                                  │
│ 26001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 25990, 26000]                                                                                                                                               │
│ Average cumulative reward:       -6.384904009348728                                                                                                                      │
│ Average rollout reward:          -6.238024220545933                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K26/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m32.9%[0m Elapsed: [33m0:00:40[0m Remaining: [36m0:01:20[0m   1.55 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 26000 ===                                                                                                                                                  │
│ 26001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 25990, 26000]                                                                                                                                               │
│ Average cumulative reward:       -6.384904009348728                                                                                                                      │
│ Average rollout reward:          -6.238024220545933                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K27/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m34.2%[0m Elapsed: [33m0:00:40[0m Remaining: [36m0:01:18[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 27000 ===                                                                                                                                                  │
│ 27001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 4603, 4626, 4628, 4633, 27000]                                                                                                                              │
│ Average cumulative reward:       -5.8850558165420015                                                                                                                     │
│ Average rollout reward:          -5.725086493818864                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K27/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m34.2%[0m Elapsed: [33m0:00:41[0m Remaining: [36m0:01:18[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 27000 ===                                                                                                                                                  │
│ 27001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 4603, 4626, 4628, 4633, 27000]                                                                                                                              │
│ Average cumulative reward:       -5.8850558165420015                                                                                                                     │
│ Average rollout reward:          -5.725086493818864                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K27/79 [38;2;249;38;114m━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m34.2%[0m Elapsed: [33m0:00:41[0m Remaining: [36m0:01:18[0m   1.55 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 27000 ===                                                                                                                                                  │
│ 27001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 4603, 4626, 4628, 4633, 27000]                                                                                                                              │
│ Average cumulative reward:       -5.8850558165420015                                                                                                                     │
│ Average rollout reward:          -5.725086493818864                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K28/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m35.4%[0m Elapsed: [33m0:00:42[0m Remaining: [36m0:01:17[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 28000 ===                                                                                                                                                  │
│ 28001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 209, 229, 230, 2826, 28000]                                                                                                                                 │
│ Average cumulative reward:       -6.023278021976929                                                                                                                      │
│ Average rollout reward:          -5.8523902779425                                                                                                                        │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K28/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m35.4%[0m Elapsed: [33m0:00:42[0m Remaining: [36m0:01:17[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 28000 ===                                                                                                                                                  │
│ 28001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 209, 229, 230, 2826, 28000]                                                                                                                                 │
│ Average cumulative reward:       -6.023278021976929                                                                                                                      │
│ Average rollout reward:          -5.8523902779425                                                                                                                        │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K28/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m35.4%[0m Elapsed: [33m0:00:43[0m Remaining: [36m0:01:17[0m   1.55 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 28000 ===                                                                                                                                                  │
│ 28001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 209, 229, 230, 2826, 28000]                                                                                                                                 │
│ Average cumulative reward:       -6.023278021976929                                                                                                                      │
│ Average rollout reward:          -5.8523902779425                                                                                                                        │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K29/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m36.7%[0m Elapsed: [33m0:00:43[0m Remaining: [36m0:01:15[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 29000 ===                                                                                                                                                  │
│ 29001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 28849, 28872, 28875, 29000]                                                                                                                                 │
│ Average cumulative reward:       -5.991751035682034                                                                                                                      │
│ Average rollout reward:          -5.825661827660015                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K29/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m36.7%[0m Elapsed: [33m0:00:44[0m Remaining: [36m0:01:15[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 29000 ===                                                                                                                                                  │
│ 29001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 28849, 28872, 28875, 29000]                                                                                                                                 │
│ Average cumulative reward:       -5.991751035682034                                                                                                                      │
│ Average rollout reward:          -5.825661827660015                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K29/79 [38;2;249;38;114m━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m36.7%[0m Elapsed: [33m0:00:44[0m Remaining: [36m0:01:15[0m   1.55 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 29000 ===                                                                                                                                                  │
│ 29001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 28849, 28872, 28875, 29000]                                                                                                                                 │
│ Average cumulative reward:       -5.991751035682034                                                                                                                      │
│ Average rollout reward:          -5.825661827660015                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K30/79 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m38.0%[0m Elapsed: [33m0:00:45[0m Remaining: [36m0:01:14[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 30000 ===                                                                                                                                                  │
│ 30001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 25990, 26000, 26003, 26277, 30000]                                                                                                                          │
│ Average cumulative reward:       -6.298186408453933                                                                                                                      │
│ Average rollout reward:          -6.119011387548986                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
30/79 38.0% Elapsed: 0:00:45 Remaining: 0:01:14   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 31000 ===                                                                                                                                                  │
│ 31001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 30650, 30707, 30711, 30716, 31000]                                                                                                                          │
│ Average cumulative reward:       -6.364469297941478                                                                                                                      │
│ Average rollout reward:          -6.225599497742425                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K31/79 [38;2;249;38;114m━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m39.2%[0m Elapsed: [33m0:00:47[0m Remaining: [36m0:01:12[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 32000 ===                                                                                                                                                  │
│ 32001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 31888, 31997, 32000]                                                                                                                                        │
│ Average cumulative reward:       -6.239406985791491                                                                                                                      │
│ Average rollout reward:          -6.10068331021251                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K32/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m40.5%[0m Elapsed: [33m0:00:48[0m Remaining: [36m0:01:11[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 33000 ===                                                                                                                                                  │
│ 33001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 28849, 30605, 31671, 33000]                                                                                                                                 │
│ Average cumulative reward:       -6.536121373441183                                                                                                                      │
│ Average rollout reward:          -6.384496312758885                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K33/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━━[0m [35m41.8%[0m Elapsed: [33m0:00:50[0m Remaining: [36m0:01:09[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 34000 ===                                                                                                                                                  │
│ 34001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 27684, 27773, 27775, 34000]                                                                                                                                 │
│ Average cumulative reward:       -6.273944013054852                                                                                                                      │
│ Average rollout reward:          -6.118976688742594                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K34/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━━[0m [35m43.0%[0m Elapsed: [33m0:00:51[0m Remaining: [36m0:01:08[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 35000 ===                                                                                                                                                  │
│ 35001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 7046, 7069, 7071, 12385, 35000]                                                                                                                             │
│ Average cumulative reward:       -5.839853255847982                                                                                                                      │
│ Average rollout reward:          -5.652455619239478                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K36/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━━[0m [35m45.6%[0m Elapsed: [33m0:00:54[0m Remaining: [36m0:01:05[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 36000 ===                                                                                                                                                  │
│ 36001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 35783, 35994, 35996, 36000]                                                                                                                                 │
│ Average cumulative reward:       -6.174768488459984                                                                                                                      │
│ Average rollout reward:          -6.0062248877801565                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
37/79  46.8% Elapsed: 0:00:55 Remaining: 0:01:04   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 37000 ===                                                                                                                                                  │
│ 37001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 32519, 32535, 32536, 32723, 37000]                                                                                                                          │
│ Average cumulative reward:       -6.147829781705554                                                                                                                      │
│ Average rollout reward:          -5.964879018856317                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K38/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m48.1%[0m Elapsed: [33m0:00:57[0m Remaining: [36m0:01:02[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 38000 ===                                                                                                                                                  │
│ 38001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 37832, 37988, 37989, 37994, 38000]                                                                                                                          │
│ Average cumulative reward:       -6.210979078730588                                                                                                                      │
│ Average rollout reward:          -6.0426426998985                                                                                                                        │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K39/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━━[0m [35m49.4%[0m Elapsed: [33m0:00:58[0m Remaining: [36m0:01:01[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 39000 ===                                                                                                                                                  │
│ 39001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 605, 38583, 39000]                                                                                                                                          │
│ Average cumulative reward:       -6.311278276579285                                                                                                                      │
│ Average rollout reward:          -6.157364373498043                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K40/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m50.6%[0m Elapsed: [33m0:01:00[0m Remaining: [36m0:00:59[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 40000 ===                                                                                                                                                  │
│ 40001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 39950, 39953, 39961, 40000]                                                                                                                                 │
│ Average cumulative reward:       -6.195430454502011                                                                                                                      │
│ Average rollout reward:          -6.063163169991736                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K41/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m51.9%[0m Elapsed: [33m0:01:01[0m Remaining: [36m0:00:58[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 41000 ===                                                                                                                                                  │
│ 41001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 40673, 40990, 40991, 41000]                                                                                                                                 │
│ Average cumulative reward:       -6.188527473406653                                                                                                                      │
│ Average rollout reward:          -6.035698950325897                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K41/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━━[0m [35m51.9%[0m Elapsed: [33m0:01:02[0m Remaining: [36m0:00:58[0m   1.54 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 41000 ===                                                                                                                                                  │
│ 41001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 40673, 40990, 40991, 41000]                                                                                                                                 │
│ Average cumulative reward:       -6.188527473406653                                                                                                                      │
│ Average rollout reward:          -6.035698950325897                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K42/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m53.2%[0m Elapsed: [33m0:01:03[0m Remaining: [36m0:00:56[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 42000 ===                                                                                                                                                  │
│ 42001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 15112, 38629, 38937, 42000]                                                                                                                                 │
│ Average cumulative reward:       -5.9063362048707875                                                                                                                     │
│ Average rollout reward:          -5.726284710027764                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K42/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m53.2%[0m Elapsed: [33m0:01:03[0m Remaining: [36m0:00:56[0m   1.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 42000 ===                                                                                                                                                  │
│ 42001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 15112, 38629, 38937, 42000]                                                                                                                                 │
│ Average cumulative reward:       -5.9063362048707875                                                                                                                     │
│ Average rollout reward:          -5.726284710027764                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K42/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m53.2%[0m Elapsed: [33m0:01:04[0m Remaining: [36m0:00:56[0m   1.54 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 42000 ===                                                                                                                                                  │
│ 42001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 15112, 38629, 38937, 42000]                                                                                                                                 │
│ Average cumulative reward:       -5.9063362048707875                                                                                                                     │
│ Average rollout reward:          -5.726284710027764                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K43/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m54.4%[0m Elapsed: [33m0:01:05[0m Remaining: [36m0:00:55[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 43000 ===                                                                                                                                                  │
│ 43001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 42885, 42993, 42995, 43000]                                                                                                                                 │
│ Average cumulative reward:       -6.151532037576997                                                                                                                      │
│ Average rollout reward:          -5.96260582983429                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K43/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━━[0m [35m54.4%[0m Elapsed: [33m0:01:05[0m Remaining: [36m0:00:55[0m   1.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 43000 ===                                                                                                                                                  │
│ 43001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 42885, 42993, 42995, 43000]                                                                                                                                 │
│ Average cumulative reward:       -6.151532037576997                                                                                                                      │
│ Average rollout reward:          -5.96260582983429                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯━━━━━━━━━━━━━[0m [35m54.4%[0m Elapsed: [33m0:01:06[0m Remaining: [36m0:00:55[0m   1.54 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 43000 ===                                                                                                                                                  │
│ 43001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 42885, 42993, 42995, 43000]                                                                                                                                 │
│ Average cumulative reward:       -6.151532037576997                                                                                                                      │
│ Average rollout reward:          -5.96260582983429                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K44/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m55.7%[0m Elapsed: [33m0:01:06[0m Remaining: [36m0:00:54[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 44000 ===                                                                                                                                                  │
│ 44001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 7559, 7569, 7571, 36400, 44000]                                                                                                                             │
│ Average cumulative reward:       -5.78269335024673                                                                                                                       │
│ Average rollout reward:          -5.5993109888078125                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K44/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m55.7%[0m Elapsed: [33m0:01:07[0m Remaining: [36m0:00:54[0m   1.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 44000 ===                                                                                                                                                  │
│ 44001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 7559, 7569, 7571, 36400, 44000]                                                                                                                             │
│ Average cumulative reward:       -5.78269335024673                                                                                                                       │
│ Average rollout reward:          -5.5993109888078125                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K44/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m55.7%[0m Elapsed: [33m0:01:07[0m Remaining: [36m0:00:54[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 44000 ===                                                                                                                                                  │
│ 44001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 7559, 7569, 7571, 36400, 44000]                                                                                                                             │
│ Average cumulative reward:       -5.78269335024673                                                                                                                       │
│ Average rollout reward:          -5.5993109888078125                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K45/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m57.0%[0m Elapsed: [33m0:01:08[0m Remaining: [36m0:00:52[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 45000 ===                                                                                                                                                  │
│ 45001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 10783, 10892, 10894, 45000]                                                                                                                                 │
│ Average cumulative reward:       -6.5842526804819235                                                                                                                     │
│ Average rollout reward:          -6.399079420615367                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K45/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m57.0%[0m Elapsed: [33m0:01:08[0m Remaining: [36m0:00:52[0m   1.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 45000 ===                                                                                                                                                  │
│ 45001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 10783, 10892, 10894, 45000]                                                                                                                                 │
│ Average cumulative reward:       -6.5842526804819235                                                                                                                     │
│ Average rollout reward:          -6.399079420615367                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K45/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━━[0m [35m57.0%[0m Elapsed: [33m0:01:09[0m Remaining: [36m0:00:52[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 45000 ===                                                                                                                                                  │
│ 45001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 10783, 10892, 10894, 45000]                                                                                                                                 │
│ Average cumulative reward:       -6.5842526804819235                                                                                                                     │
│ Average rollout reward:          -6.399079420615367                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K46/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m58.2%[0m Elapsed: [33m0:01:09[0m Remaining: [36m0:00:51[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 46000 ===                                                                                                                                                  │
│ 46001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 45944, 46000]                                                                                                                                               │
│ Average cumulative reward:       -7.18308721537339                                                                                                                       │
│ Average rollout reward:          -7.012024599280647                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K46/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m58.2%[0m Elapsed: [33m0:01:10[0m Remaining: [36m0:00:51[0m   1.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 46000 ===                                                                                                                                                  │
│ 46001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 45944, 46000]                                                                                                                                               │
│ Average cumulative reward:       -7.18308721537339                                                                                                                       │
│ Average rollout reward:          -7.012024599280647                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K46/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m58.2%[0m Elapsed: [33m0:01:10[0m Remaining: [36m0:00:51[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 46000 ===                                                                                                                                                  │
│ 46001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 45944, 46000]                                                                                                                                               │
│ Average cumulative reward:       -7.18308721537339                                                                                                                       │
│ Average rollout reward:          -7.012024599280647                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K46/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m58.2%[0m Elapsed: [33m0:01:11[0m Remaining: [36m0:00:51[0m   1.54 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 46000 ===                                                                                                                                                  │
│ 46001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 45944, 46000]                                                                                                                                               │
│ Average cumulative reward:       -7.18308721537339                                                                                                                       │
│ Average rollout reward:          -7.012024599280647                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K47/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m59.5%[0m Elapsed: [33m0:01:11[0m Remaining: [36m0:00:50[0m   1.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 47000 ===                                                                                                                                                  │
│ 47001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 46729, 46911, 46915, 46970, 47000]                                                                                                                          │
│ Average cumulative reward:       -6.463171838413669                                                                                                                      │
│ Average rollout reward:          -6.324950218863366                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K47/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m59.5%[0m Elapsed: [33m0:01:12[0m Remaining: [36m0:00:50[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 47000 ===                                                                                                                                                  │
│ 47001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 46729, 46911, 46915, 46970, 47000]                                                                                                                          │
│ Average cumulative reward:       -6.463171838413669                                                                                                                      │
│ Average rollout reward:          -6.324950218863366                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K47/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━━[0m [35m59.5%[0m Elapsed: [33m0:01:12[0m Remaining: [36m0:00:50[0m   1.54 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 47000 ===                                                                                                                                                  │
│ 47001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 46729, 46911, 46915, 46970, 47000]                                                                                                                          │
│ Average cumulative reward:       -6.463171838413669                                                                                                                      │
│ Average rollout reward:          -6.324950218863366                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K48/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m60.8%[0m Elapsed: [33m0:01:13[0m Remaining: [36m0:00:48[0m   1.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 48000 ===                                                                                                                                                  │
│ 48001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 277, 45919, 46458, 48000]                                                                                                                                   │
│ Average cumulative reward:       -6.386891780655793                                                                                                                      │
│ Average rollout reward:          -6.240508251609656                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K48/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m60.8%[0m Elapsed: [33m0:01:13[0m Remaining: [36m0:00:48[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 48000 ===                                                                                                                                                  │
│ 48001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 277, 45919, 46458, 48000]                                                                                                                                   │
│ Average cumulative reward:       -6.386891780655793                                                                                                                      │
│ Average rollout reward:          -6.240508251609656                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K49/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m62.0%[0m Elapsed: [33m0:01:14[0m Remaining: [36m0:00:47[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 49000 ===                                                                                                                                                  │
│ 49001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 37142, 37144, 37156, 37207, 49000]                                                                                                                          │
│ Average cumulative reward:       -6.0178619522599055                                                                                                                     │
│ Average rollout reward:          -5.836304680350794                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K49/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m62.0%[0m Elapsed: [33m0:01:14[0m Remaining: [36m0:00:47[0m   1.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 49000 ===                                                                                                                                                  │
│ 49001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 37142, 37144, 37156, 37207, 49000]                                                                                                                          │
│ Average cumulative reward:       -6.0178619522599055                                                                                                                     │
│ Average rollout reward:          -5.836304680350794                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K49/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━━[0m [35m62.0%[0m Elapsed: [33m0:01:15[0m Remaining: [36m0:00:47[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 49000 ===                                                                                                                                                  │
│ 49001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 37142, 37144, 37156, 37207, 49000]                                                                                                                          │
│ Average cumulative reward:       -6.0178619522599055                                                                                                                     │
│ Average rollout reward:          -5.836304680350794                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K50/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m63.3%[0m Elapsed: [33m0:01:15[0m Remaining: [36m0:00:45[0m   1.51 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 50000 ===                                                                                                                                                  │
│ 50001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 12483, 46280, 46394, 47937, 48853, 50000]                                                                                                                   │
│ Average cumulative reward:       -6.0288595259410585                                                                                                                     │
│ Average rollout reward:          -5.8250794009559135                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K51/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m64.6%[0m Elapsed: [33m0:01:17[0m Remaining: [36m0:00:44[0m   1.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 51000 ===                                                                                                                                                  │
│ 51001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 50775, 50986, 50989, 51000]                                                                                                                                 │
│ Average cumulative reward:       -6.2470909123599885                                                                                                                     │
│ Average rollout reward:          -6.05337756563894                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K51/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m64.6%[0m Elapsed: [33m0:01:18[0m Remaining: [36m0:00:44[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 51000 ===                                                                                                                                                  │
│ 51001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 50775, 50986, 50989, 51000]                                                                                                                                 │
│ Average cumulative reward:       -6.2470909123599885                                                                                                                     │
│ Average rollout reward:          -6.05337756563894                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K51/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━━[0m [35m64.6%[0m Elapsed: [33m0:01:18[0m Remaining: [36m0:00:44[0m   1.54 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 51000 ===                                                                                                                                                  │
│ 51001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 50775, 50986, 50989, 51000]                                                                                                                                 │
│ Average cumulative reward:       -6.2470909123599885                                                                                                                     │
│ Average rollout reward:          -6.05337756563894                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K52/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━[0m [35m65.8%[0m Elapsed: [33m0:01:19[0m Remaining: [36m0:00:42[0m   1.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 52000 ===                                                                                                                                                  │
│ 52001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 42885, 43202, 43205, 43230, 52000]                                                                                                                          │
│ Average cumulative reward:       -6.416959096685997                                                                                                                      │
│ Average rollout reward:          -6.194644443745882                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K52/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━[0m [35m65.8%[0m Elapsed: [33m0:01:19[0m Remaining: [36m0:00:42[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 52000 ===                                                                                                                                                  │
│ 52001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 42885, 43202, 43205, 43230, 52000]                                                                                                                          │
│ Average cumulative reward:       -6.416959096685997                                                                                                                      │
│ Average rollout reward:          -6.194644443745882                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K52/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━━━━[0m [35m65.8%[0m Elapsed: [33m0:01:20[0m Remaining: [36m0:00:42[0m   1.54 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 52000 ===                                                                                                                                                  │
│ 52001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 42885, 43202, 43205, 43230, 52000]                                                                                                                          │
│ Average cumulative reward:       -6.416959096685997                                                                                                                      │
│ Average rollout reward:          -6.194644443745882                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K53/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━━━━[0m [35m67.1%[0m Elapsed: [33m0:01:20[0m Remaining: [36m0:00:41[0m   1.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 53000 ===                                                                                                                                                  │
│ 53001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 10462, 10478, 10480, 10615, 53000]                                                                                                                          │
│ Average cumulative reward:       -6.013418422359543                                                                                                                      │
│ Average rollout reward:          -5.796847161574308                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
53/79 ━━━━━━━━━━━━━━━━━━━━━━━━━━╸━━━━━━━━━━━━━ 67.1% Elapsed: 0:01:21 Remaining: 0:00:41   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 54000 ===                                                                                                                                                  │
│ 54001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 2223, 53084, 54000]                                                                                                                                         │
│ Average cumulative reward:       -6.1119629343454775                                                                                                                     │
│ Average rollout reward:          -5.980996280288136                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
54/79 ━━━━━━━━━━━━━━━━━━━━━━━━━━━╺━━━━━━━━━━━━ 68.4% Elapsed: 0:01:22 Remaining: 0:00:39   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 55000 ===                                                                                                                                                  │
│ 55001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 33156, 53146, 53830, 55000]                                                                                                                                 │
│ Average cumulative reward:       -6.339308868302813                                                                                                                      │
│ Average rollout reward:          -6.189647698381818                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
55/79 ━━━━━━━━━━━━━━━━━━━━━━━━━━━╸━━━━━━━━━━━━ 69.6% Elapsed: 0:01:24 Remaining: 0:00:38   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 56000 ===                                                                                                                                                  │
│ 56001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 55899, 55901, 55912, 56000]                                                                                                                                 │
│ Average cumulative reward:       -6.165745913549666                                                                                                                      │
│ Average rollout reward:          -5.981040323099632                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
56/79 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━╺━━━━━━━━━━━ 70.9% Elapsed: 0:01:25 Remaining: 0:00:36   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 57000 ===                                                                                                                                                  │
│ 57001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 56782, 56993, 56996, 57000]                                                                                                                                 │
│ Average cumulative reward:       -6.5047694517291355                                                                                                                     │
│ Average rollout reward:          -6.321689404957834                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
57/79 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━╸━━━━━━━━━━━ 72.2% Elapsed: 0:01:27 Remaining: 0:00:35   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 58000 ===                                                                                                                                                  │
│ 58001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 2617, 53087, 53892, 58000]                                                                                                                                  │
│ Average cumulative reward:       -6.158885496857324                                                                                                                      │
│ Average rollout reward:          -5.961974326911712                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K58/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━[0m [35m73.4%[0m Elapsed: [33m0:01:28[0m Remaining: [36m0:00:33[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 58000 ===                                                                                                                                                  │
│ 58001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 2617, 53087, 53892, 58000]                                                                                                                                  │
│ Average cumulative reward:       -6.158885496857324                                                                                                                      │
│ Average rollout reward:          -5.961974326911712                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K58/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━━[0m [35m73.4%[0m Elapsed: [33m0:01:29[0m Remaining: [36m0:00:33[0m   1.54 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 58000 ===                                                                                                                                                  │
│ 58001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 2617, 53087, 53892, 58000]                                                                                                                                  │
│ Average cumulative reward:       -6.158885496857324                                                                                                                      │
│ Average rollout reward:          -5.961974326911712                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K59/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━[0m [35m74.7%[0m Elapsed: [33m0:01:29[0m Remaining: [36m0:00:31[0m   1.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 59000 ===                                                                                                                                                  │
│ 59001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 58572, 58573, 58587, 58641, 59000]                                                                                                                          │
│ Average cumulative reward:       -5.942651225992859                                                                                                                      │
│ Average rollout reward:          -5.7526927068726                                                                                                                        │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K59/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━[0m [35m74.7%[0m Elapsed: [33m0:01:30[0m Remaining: [36m0:00:31[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 59000 ===                                                                                                                                                  │
│ 59001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 58572, 58573, 58587, 58641, 59000]                                                                                                                          │
│ Average cumulative reward:       -5.942651225992859                                                                                                                      │
│ Average rollout reward:          -5.7526927068726                                                                                                                        │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K59/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━━[0m [35m74.7%[0m Elapsed: [33m0:01:30[0m Remaining: [36m0:00:31[0m   1.54 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 59000 ===                                                                                                                                                  │
│ 59001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 58572, 58573, 58587, 58641, 59000]                                                                                                                          │
│ Average cumulative reward:       -5.942651225992859                                                                                                                      │
│ Average rollout reward:          -5.7526927068726                                                                                                                        │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K60/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━[0m [35m75.9%[0m Elapsed: [33m0:01:31[0m Remaining: [36m0:00:30[0m   1.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 60000 ===                                                                                                                                                  │
│ 60001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 39950, 54559, 55262, 58561, 60000]                                                                                                                          │
│ Average cumulative reward:       -5.570414261097864                                                                                                                      │
│ Average rollout reward:          -5.360634017426486                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K60/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━[0m [35m75.9%[0m Elapsed: [33m0:01:31[0m Remaining: [36m0:00:30[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 60000 ===                                                                                                                                                  │
│ 60001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 39950, 54559, 55262, 58561, 60000]                                                                                                                          │
│ Average cumulative reward:       -5.570414261097864                                                                                                                      │
│ Average rollout reward:          -5.360634017426486                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K60/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━━[0m [35m75.9%[0m Elapsed: [33m0:01:32[0m Remaining: [36m0:00:30[0m   1.54 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 60000 ===                                                                                                                                                  │
│ 60001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 39950, 54559, 55262, 58561, 60000]                                                                                                                          │
│ Average cumulative reward:       -5.570414261097864                                                                                                                      │
│ Average rollout reward:          -5.360634017426486                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K61/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━[0m [35m77.2%[0m Elapsed: [33m0:01:32[0m Remaining: [36m0:00:28[0m   1.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 61000 ===                                                                                                                                                  │
│ 61001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 9232, 60774, 61000]                                                                                                                                         │
│ Average cumulative reward:       -6.140853914746727                                                                                                                      │
│ Average rollout reward:          -5.9585649619130265                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K61/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━[0m [35m77.2%[0m Elapsed: [33m0:01:33[0m Remaining: [36m0:00:28[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 61000 ===                                                                                                                                                  │
│ 61001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 9232, 60774, 61000]                                                                                                                                         │
│ Average cumulative reward:       -6.140853914746727                                                                                                                      │
│ Average rollout reward:          -5.9585649619130265                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K61/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━━[0m [35m77.2%[0m Elapsed: [33m0:01:33[0m Remaining: [36m0:00:28[0m   1.54 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 61000 ===                                                                                                                                                  │
│ 61001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 9232, 60774, 61000]                                                                                                                                         │
│ Average cumulative reward:       -6.140853914746727                                                                                                                      │
│ Average rollout reward:          -5.9585649619130265                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K62/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━[0m [35m78.5%[0m Elapsed: [33m0:01:34[0m Remaining: [36m0:00:27[0m   1.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 62000 ===                                                                                                                                                  │
│ 62001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 15918, 60790, 61016, 62000]                                                                                                                                 │
│ Average cumulative reward:       -6.423397993509458                                                                                                                      │
│ Average rollout reward:          -6.27014550942378                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K62/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━[0m [35m78.5%[0m Elapsed: [33m0:01:34[0m Remaining: [36m0:00:27[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 62000 ===                                                                                                                                                  │
│ 62001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 15918, 60790, 61016, 62000]                                                                                                                                 │
│ Average cumulative reward:       -6.423397993509458                                                                                                                      │
│ Average rollout reward:          -6.27014550942378                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K62/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━━[0m [35m78.5%[0m Elapsed: [33m0:01:35[0m Remaining: [36m0:00:27[0m   1.54 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 62000 ===                                                                                                                                                  │
│ 62001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 15918, 60790, 61016, 62000]                                                                                                                                 │
│ Average cumulative reward:       -6.423397993509458                                                                                                                      │
│ Average rollout reward:          -6.27014550942378                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K63/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━━━━━[0m [35m79.7%[0m Elapsed: [33m0:01:35[0m Remaining: [36m0:00:25[0m   1.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 63000 ===                                                                                                                                                  │
│ 63001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 25990, 26006, 26008, 59609, 63000]                                                                                                                          │
│ Average cumulative reward:       -6.171158114565694                                                                                                                      │
│ Average rollout reward:          -6.0245673636718315                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K64/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━━━━[0m [35m81.0%[0m Elapsed: [33m0:01:37[0m Remaining: [36m0:00:24[0m   1.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 64000 ===                                                                                                                                                  │
│ 64001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 53300, 53332, 53339, 53604, 64000]                                                                                                                          │
│ Average cumulative reward:       -6.222928933852659                                                                                                                      │
│ Average rollout reward:          -6.021472037878619                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
65/79 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╸━━━━━━━ 82.3% Elapsed: 0:01:38 Remaining: 0:00:22   1.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 65000 ===                                                                                                                                                  │
│ 65001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 84, 60717, 61083, 63058, 65000]                                                                                                                             │
│ Average cumulative reward:       -5.738983506526109                                                                                                                      │
│ Average rollout reward:          -5.554133475670676                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
66/79 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╺━━━━━━ 83.5% Elapsed: 0:01:40 Remaining: 0:00:20   1.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 66000 ===                                                                                                                                                  │
│ 66001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 3205, 3206, 3212, 60466, 66000]                                                                                                                             │
│ Average cumulative reward:       -6.125718299748682                                                                                                                      │
│ Average rollout reward:          -5.94565540787159                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
67/79 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╸━━━━━━ 84.8% Elapsed: 0:01:41 Remaining: 0:00:19   1.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 67000 ===                                                                                                                                                  │
│ 67001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 51608, 51697, 51701, 51705, 67000]                                                                                                                          │
│ Average cumulative reward:       -6.046455527412                                                                                                                         │
│ Average rollout reward:          -5.838051776884632                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
68/79 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╺━━━━━ 86.1% Elapsed: 0:01:43 Remaining: 0:00:17   1.52 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 68000 ===                                                                                                                                                  │
│ 68001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 6084, 67990, 67991, 67995, 68000]                                                                                                                           │
│ Average cumulative reward:       -6.225090705066294                                                                                                                      │
│ Average rollout reward:          -6.041466930677434                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
69/79 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╸━━━━━ 87.3% Elapsed: 0:01:45 Remaining: 0:00:16   1.53 s/iter
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K70/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━━[0m [35m88.6%[0m Elapsed: [33m0:01:46[0m Remaining: [36m0:00:14[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 70000 ===                                                                                                                                                  │
│ 70001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 51608, 68951, 70000]                                                                                                                                        │
│ Average cumulative reward:       -6.465021392571439                                                                                                                      │
│ Average rollout reward:          -6.280640910857146                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K71/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━━[0m [35m89.9%[0m Elapsed: [33m0:01:48[0m Remaining: [36m0:00:13[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 71000 ===                                                                                                                                                  │
│ 71001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 27112, 27268, 27269, 27275, 71000]                                                                                                                          │
│ Average cumulative reward:       -6.2609045412955515                                                                                                                     │
│ Average rollout reward:          -6.069406054007011                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K72/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━━[0m [35m91.1%[0m Elapsed: [33m0:01:49[0m Remaining: [36m0:00:11[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 72000 ===                                                                                                                                                  │
│ 72001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 863, 53076, 53760, 72000]                                                                                                                                   │
│ Average cumulative reward:       -6.257732437371661                                                                                                                      │
│ Average rollout reward:          -6.076572572950046                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K73/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━━[0m [35m92.4%[0m Elapsed: [33m0:01:51[0m Remaining: [36m0:00:10[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 73000 ===                                                                                                                                                  │
│ 73001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 28849, 28865, 28869, 28913, 73000]                                                                                                                          │
│ Average cumulative reward:       -6.071120589530218                                                                                                                      │
│ Average rollout reward:          -5.8974038394186055                                                                                                                     │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K74/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━━[0m [35m93.7%[0m Elapsed: [33m0:01:52[0m Remaining: [36m0:00:08[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 74000 ===                                                                                                                                                  │
│ 74001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 1092, 73607, 74000]                                                                                                                                         │
│ Average cumulative reward:       -6.392405181399134                                                                                                                      │
│ Average rollout reward:          -6.241177723714308                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K75/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━━[0m [35m94.9%[0m Elapsed: [33m0:01:54[0m Remaining: [36m0:00:07[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 75000 ===                                                                                                                                                  │
│ 75001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 22295, 73660, 73916, 75000]                                                                                                                                 │
│ Average cumulative reward:       -6.348409743628418                                                                                                                      │
│ Average rollout reward:          -6.193257991536554                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K76/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m[38;5;237m━[0m [35m96.2%[0m Elapsed: [33m0:01:55[0m Remaining: [36m0:00:05[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 76000 ===                                                                                                                                                  │
│ 76001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 84, 73593, 73726, 76000]                                                                                                                                    │
│ Average cumulative reward:       -6.132259780808532                                                                                                                      │
│ Average rollout reward:          -5.970315131878162                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K77/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;2;249;38;114m╸[0m[38;5;237m━[0m [35m97.5%[0m Elapsed: [33m0:01:57[0m Remaining: [36m0:00:04[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 77000 ===                                                                                                                                                  │
│ 77001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 22295, 73660, 73916, 77000]                                                                                                                                 │
│ Average cumulative reward:       -6.1465488078200154                                                                                                                     │
│ Average rollout reward:          -5.97559828234325                                                                                                                       │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K78/79 [38;2;249;38;114m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m[38;5;237m╺[0m [35m98.7%[0m Elapsed: [33m0:01:58[0m Remaining: [36m0:00:02[0m   1.53 s/iter
[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K[1A[2K79/79 [38;2;114;156;31m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [35m100.0%[0m Elapsed: [33m0:02:00[0m Remaining: [36m0:00:00[0m   1.53 s/iter
╭────────────────────────────────────────────────────────────────────────────────── MCTS ──────────────────────────────────────────────────────────────────────────────────╮
│ === Iteration 78000 ===                                                                                                                                                  │
│ 78001  nodes in tree                                                                                                                                                     │
│ Path: [0, 3, 62256, 73694, 74772, 75170, 78000]                                                                                                                          │
│ Average cumulative reward:       -6.198524889953435                                                                                                                      │
│ Average rollout reward:          -6.041798978591099                                                                                                                      │
│ Termination count: 0                                                                                                                                                     │
│ Best value of root node: -1.1348054010099788                                                                                                                             │
│ Best path: [0, 3, 13576, 13577, 13588, 13691]                                                                                                                            │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
Node 0 is not terminal. Continue.
Node 3 is not terminal. Continue.
Node 4798 is not terminal. Continue.
Node 4842 is not terminal. Continue.
Node 4843 is not terminal. Continue.
Node 4885 is not terminal. Continue.
Node 10756 is not terminal. Continue.
No children found. Stop.
Node 0 is not terminal. Continue.
Node 3 is not terminal. Continue.
Node 27112 is not terminal. Continue.
Node 60805 is not terminal. Continue.
Node 61031 is not terminal. Continue.
Node 62891 is not terminal. Continue.
Node 64708 is not terminal. Continue.
No children found. Stop.
Node 0 is not terminal. Continue.
Node 3 is not terminal. Continue.
Node 4798 is not terminal. Continue.
Node 4842 is not terminal. Continue.
Node 4845 is not terminal. Continue.
Node 19197 is not terminal. Continue.
Node 19203 is not terminal. Continue.
No children found. Stop.
=== RESULT ===
By Visits: estimated reward: -1.8422828358918113
sign_ns [2.0989943 1.3736744]
By Value: estimated reward: -1.3118319770453057
sign_ns [1.8582718 0.7825565]
sign_newton [29.289265]
By Best Value: estimated reward: 0
sign_ns [2.1798525 1.0003409]
sign_quintic [1.1357034 1.5       0.5       0.       ]
sign_ns [0.5, 4.307957215691435]
sign_ns [0.5, 1.711727771246605]
sign_ns [0.5, 1.6795547754084654]
sign_ns [0.5, 1.5992181837543653]
sign_ns [0.5, 1.424207590584095]
sign_ns [0.5, 1.1756212807669717]
sign_ns [0.5, 1.024993150896671]
sign_ns [0.5, 1.0004725821924616]
Best value of root node:
-1.1348054010099788
Best root policy:
sign_ns [2.1798525 1.0003409]
sign_quintic [1.1357034 1.5       0.5       0.       ]
sign_ns [0.5, 4.307957215691435]
sign_ns [0.5, 1.711727771246605]
sign_ns [0.5, 1.6795547754084654]
sign_ns [0.5, 1.5992181837543653]
sign_ns [0.5, 1.424207590584095]
sign_ns [0.5, 1.1756212807669717]
sign_ns [0.5, 1.024993150896671]
sign_ns [0.5, 1.0004725821924616]
=== END ===
Finished making algorithm
