Using DST 2.0.2 (8739bdc51aa9e2f5710897d3ef40f69e2b870e8d)

Hello! Conciliator steering has started.

Profile 0: [0.1 0.2 0.7]
Preference: [25.80775684 -4.55894461 -0.26852819]

Baseline: [57.8  -5.32 -7.2 ]
Actions: [[0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0], [0, 0]]

Difference between the preferred and received rewards and Pareto optimal policies:
          received  preferred  difference
treasure       0.0     25.808     -25.808
time         -50.0     -4.559     -45.441
fuel           0.0     -0.269       0.269

Difference to Pareto optimal solutions
    treasure   time  fuel  Pareto policy ratio
0        1.0   49.0  -1.0                 0.00
1      124.0   40.0  -4.0                 0.12
2       74.0   41.0  -4.0                 0.10
3       50.0   43.0  -4.0                 0.06
4       50.0   41.0  -3.0                 0.12
5       74.0   39.0  -3.0                 0.16
6      124.0   38.0  -3.0                 0.18
7       16.0   45.0  -2.0                 0.06
8       50.0   45.0  -6.0                 0.00
9      124.0   43.0  -6.0                 0.02
10      74.0   44.0  -7.0                 0.00
11      74.0   45.0 -11.0                 0.00
12       2.0   48.0  -3.0                 0.00
13       3.0   47.0  -3.0                 0.02
14       8.0   46.0  -2.0                 0.06
15      16.0   46.0  -3.0                 0.04
16      16.0   47.0  -4.0                 0.00
17     124.0   44.0  -8.0                 0.00
18     124.0   45.0 -13.0                 0.00
19       3.0   48.0 -10.0                 0.00
20      50.0   46.0  -8.0                 0.02
21     124.0   46.0 -22.0                 0.00
22      16.0   48.0 -15.0                 0.00
23      74.0   46.0 -17.0                 0.00
24      50.0   47.0 -18.0                 0.00
25       0.0 -950.0   0.0                 1.00

Conciliator steering has ended. Bye!


Emissions: 0.09952748021320684 g of CO2


====================

Using DST 2.0.2 (8739bdc51aa9e2f5710897d3ef40f69e2b870e8d)

Hello! Conciliator steering has started.

Profile 1: [0.98 0.01 0.01]
Preference: [115.6       -28.9274384 -28.9274384]

Baseline: [57.8  -5.32 -7.2 ]
Actions: [[3, 3], [1, -2], [-2, 3], [-2, -2]]

Difference between the preferred and received rewards and Pareto optimal policies:
          received  preferred  difference
treasure     124.0    115.600       8.400
time          -4.0    -28.927      24.927
fuel         -44.0    -28.927     -15.073

Difference to Pareto optimal solutions
    treasure   time  fuel  Pareto policy ratio
0     -123.0    3.0  43.0                  0.0
1        0.0   -6.0  40.0                  0.0
2      -50.0   -5.0  40.0                  0.0
3      -74.0   -3.0  40.0                  0.0
4      -74.0   -5.0  41.0                  0.0
5      -50.0   -7.0  41.0                  0.0
6        0.0   -8.0  41.0                  0.0
7     -108.0   -1.0  42.0                  0.0
8      -74.0   -1.0  38.0                  0.0
9        0.0   -3.0  38.0                  0.0
10     -50.0   -2.0  37.0                  0.0
11     -50.0   -1.0  33.0                  0.0
12    -122.0    2.0  41.0                  0.0
13    -121.0    1.0  41.0                  0.0
14    -116.0    0.0  42.0                  0.0
15    -108.0    0.0  41.0                  0.0
16    -108.0    1.0  40.0                  0.0
17       0.0   -2.0  36.0                  0.0
18       0.0   -1.0  31.0                  0.0
19    -121.0    2.0  34.0                  0.0
20     -74.0    0.0  36.0                  0.0
21       0.0    0.0  22.0                  0.0
22    -108.0    2.0  29.0                  0.0
23     -50.0    0.0  27.0                  0.0
24     -74.0    1.0  26.0                  0.0
25    -124.0 -996.0  44.0                  0.0

Conciliator steering has ended. Bye!


Emissions: 0.011057515877750975 g of CO2


====================

Using DST 2.0.2 (8739bdc51aa9e2f5710897d3ef40f69e2b870e8d)

Hello! Conciliator steering has started.

Profile 2: [0.2 0.4 0.4]
Preference: [24.45017431 -2.04560037 -2.04560037]

Baseline: [57.8  -5.32 -7.2 ]
Actions: [[1, 1], [0, 0], [0, 0], [0, 0]]

Difference between the preferred and received rewards and Pareto optimal policies:
          received  preferred  difference
treasure       8.0     24.450     -16.450
time          -4.0     -2.046      -1.954
fuel          -2.0     -2.046       0.046

Difference to Pareto optimal solutions
    treasure   time  fuel  Pareto policy ratio
0       -7.0    3.0   1.0                 0.00
1      116.0   -6.0  -2.0                 0.75
2       66.0   -5.0  -2.0                 0.75
3       42.0   -3.0  -2.0                 0.50
4       42.0   -5.0  -1.0                 0.50
5       66.0   -7.0  -1.0                 0.50
6      116.0   -8.0  -1.0                 0.50
7        8.0   -1.0   0.0                 0.50
8       42.0   -1.0  -4.0                 0.00
9      116.0   -3.0  -4.0                 0.00
10      66.0   -2.0  -5.0                 0.00
11      66.0   -1.0  -9.0                 0.00
12      -6.0    2.0  -1.0                 0.25
13      -5.0    1.0  -1.0                 0.50
14       0.0    0.0   0.0                 1.00
15       8.0    0.0  -1.0                 0.75
16       8.0    1.0  -2.0                 0.25
17     116.0   -2.0  -6.0                 0.25
18     116.0   -1.0 -11.0                 0.25
19      -5.0    2.0  -8.0                 0.00
20      42.0    0.0  -6.0                 0.25
21     116.0    0.0 -20.0                 0.00
22       8.0    2.0 -13.0                 0.00
23      66.0    0.0 -15.0                 0.00
24      42.0    1.0 -16.0                 0.00
25      -8.0 -996.0   2.0                 0.75

Conciliator steering has ended. Bye!


Emissions: 0.011093397428153138 g of CO2


====================

