ReSHAP: Experimental Code¶

Leveraging the Ames Housing Dataset: https://www.kaggle.com/datasets/shashanknecrothapa/ames-housing-dataset

In [3]:
import pandas as pd
from kagglehub import KaggleDatasetAdapter
import kagglehub

dataset_dir = kagglehub.dataset_download("prevek18/ames-housing-dataset")
print("Files available:", list(__import__("pathlib").Path(dataset_dir).rglob("*")))

df_ames = kagglehub.dataset_load(
    KaggleDatasetAdapter.PANDAS,
    handle="prevek18/ames-housing-dataset",
    path="AmesHousing.csv",  
    pandas_kwargs={"low_memory": False},  
)
Files available: [PosixPath('/Users/jensreil/.cache/kagglehub/datasets/prevek18/ames-housing-dataset/versions/1/AmesHousing.csv')]

Chapter 1: Analysis Dataset¶

First, let's do a descriptive overview of the dataset we are dealing with:

In [6]:
pd.set_option('display.max_columns', None)
df_ames.describe()
Out[6]:
Order PID MS SubClass Lot Frontage Lot Area Overall Qual Overall Cond Year Built Year Remod/Add Mas Vnr Area BsmtFin SF 1 BsmtFin SF 2 Bsmt Unf SF Total Bsmt SF 1st Flr SF 2nd Flr SF Low Qual Fin SF Gr Liv Area Bsmt Full Bath Bsmt Half Bath Full Bath Half Bath Bedroom AbvGr Kitchen AbvGr TotRms AbvGrd Fireplaces Garage Yr Blt Garage Cars Garage Area Wood Deck SF Open Porch SF Enclosed Porch 3Ssn Porch Screen Porch Pool Area Misc Val Mo Sold Yr Sold SalePrice
count 2930.00000 2.930000e+03 2930.000000 2440.000000 2930.000000 2930.000000 2930.000000 2930.000000 2930.000000 2907.000000 2929.000000 2929.000000 2929.000000 2929.000000 2930.000000 2930.000000 2930.000000 2930.000000 2928.000000 2928.000000 2930.000000 2930.000000 2930.000000 2930.000000 2930.000000 2930.000000 2771.000000 2929.000000 2929.000000 2930.000000 2930.000000 2930.000000 2930.000000 2930.000000 2930.000000 2930.000000 2930.000000 2930.000000 2930.000000
mean 1465.50000 7.144645e+08 57.387372 69.224590 10147.921843 6.094881 5.563140 1971.356314 1984.266553 101.896801 442.629566 49.722431 559.262547 1051.614544 1159.557679 335.455973 4.676792 1499.690444 0.431352 0.061134 1.566553 0.379522 2.854266 1.044369 6.443003 0.599317 1978.132443 1.766815 472.819734 93.751877 47.533447 23.011604 2.592491 16.002048 2.243345 50.635154 6.216041 2007.790444 180796.060068
std 845.96247 1.887308e+08 42.638025 23.365335 7880.017759 1.411026 1.111537 30.245361 20.860286 179.112611 455.590839 169.168476 439.494153 440.615067 391.890885 428.395715 46.310510 505.508887 0.524820 0.245254 0.552941 0.502629 0.827731 0.214076 1.572964 0.647921 25.528411 0.760566 215.046549 126.361562 67.483400 64.139059 25.141331 56.087370 35.597181 566.344288 2.714492 1.316613 79886.692357
min 1.00000 5.263011e+08 20.000000 21.000000 1300.000000 1.000000 1.000000 1872.000000 1950.000000 0.000000 0.000000 0.000000 0.000000 0.000000 334.000000 0.000000 0.000000 334.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 2.000000 0.000000 1895.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 2006.000000 12789.000000
25% 733.25000 5.284770e+08 20.000000 58.000000 7440.250000 5.000000 5.000000 1954.000000 1965.000000 0.000000 0.000000 0.000000 219.000000 793.000000 876.250000 0.000000 0.000000 1126.000000 0.000000 0.000000 1.000000 0.000000 2.000000 1.000000 5.000000 0.000000 1960.000000 1.000000 320.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 4.000000 2007.000000 129500.000000
50% 1465.50000 5.354536e+08 50.000000 68.000000 9436.500000 6.000000 5.000000 1973.000000 1993.000000 0.000000 370.000000 0.000000 466.000000 990.000000 1084.000000 0.000000 0.000000 1442.000000 0.000000 0.000000 2.000000 0.000000 3.000000 1.000000 6.000000 1.000000 1979.000000 2.000000 480.000000 0.000000 27.000000 0.000000 0.000000 0.000000 0.000000 0.000000 6.000000 2008.000000 160000.000000
75% 2197.75000 9.071811e+08 70.000000 80.000000 11555.250000 7.000000 6.000000 2001.000000 2004.000000 164.000000 734.000000 0.000000 802.000000 1302.000000 1384.000000 703.750000 0.000000 1742.750000 1.000000 0.000000 2.000000 1.000000 3.000000 1.000000 7.000000 1.000000 2002.000000 2.000000 576.000000 168.000000 70.000000 0.000000 0.000000 0.000000 0.000000 0.000000 8.000000 2009.000000 213500.000000
max 2930.00000 1.007100e+09 190.000000 313.000000 215245.000000 10.000000 9.000000 2010.000000 2010.000000 1600.000000 5644.000000 1526.000000 2336.000000 6110.000000 5095.000000 2065.000000 1064.000000 5642.000000 3.000000 2.000000 4.000000 2.000000 8.000000 3.000000 15.000000 4.000000 2207.000000 5.000000 1488.000000 1424.000000 742.000000 1012.000000 508.000000 576.000000 800.000000 17000.000000 12.000000 2010.000000 755000.000000

Now, let's look at the correlations between the features.

In [8]:
import seaborn as sns
import matplotlib.pyplot as plt
import numpy as np

# Numeric-only features
num_df = df_ames.select_dtypes(include=[np.number]).copy()

# Ensure SalePrice is included
if "SalePrice" not in num_df.columns:
    num_df["SalePrice"] = pd.to_numeric(df_ames["SalePrice"], errors="coerce")

# Compute correlation matrix
corr = num_df.corr()

# Big heatmap
plt.figure(figsize=(16, 14))
sns.heatmap(corr, cmap="coolwarm", center=0,
            annot=False, fmt=".2f", square=True,
            cbar_kws={"shrink": 0.75})
plt.title("Full correlation matrix (numeric features including SalePrice)")
plt.show()

# Optional: list strongest correlations with SalePrice
saleprice_corr = corr["SalePrice"].sort_values(ascending=False)
print("=== Top correlations with SalePrice ===")
display(saleprice_corr.head(15))
print("=== Lowest correlations with SalePrice ===")
display(saleprice_corr.tail(15))
No description has been provided for this image
=== Top correlations with SalePrice ===
SalePrice         1.000000
Overall Qual      0.799262
Gr Liv Area       0.706780
Garage Cars       0.647877
Garage Area       0.640401
Total Bsmt SF     0.632280
1st Flr SF        0.621676
Year Built        0.558426
Full Bath         0.545604
Year Remod/Add    0.532974
Garage Yr Blt     0.526965
Mas Vnr Area      0.508285
TotRms AbvGrd     0.495474
Fireplaces        0.474558
BsmtFin SF 1      0.432914
Name: SalePrice, dtype: float64
=== Lowest correlations with SalePrice ===
Screen Porch       0.112151
Pool Area          0.068403
Mo Sold            0.035259
3Ssn Porch         0.032225
BsmtFin SF 2       0.005891
Misc Val          -0.015691
Yr Sold           -0.030569
Order             -0.031408
Bsmt Half Bath    -0.035835
Low Qual Fin SF   -0.037660
MS SubClass       -0.085092
Overall Cond      -0.101697
Kitchen AbvGr     -0.119814
Enclosed Porch    -0.128787
PID               -0.246521
Name: SalePrice, dtype: float64

For this investigation, we are aiming to only use a small set of features for the sake of explanation purposes. Hence, we make a smaller matrix so assess some features a bit more in-depth.

In [148]:
import seaborn as sns
import matplotlib.pyplot as plt

# Ensure numeric only
num_df = df_ames.select_dtypes(include=[np.number]).copy()

# Make sure SalePrice is included
if "SalePrice" not in num_df.columns:
    num_df["SalePrice"] = pd.to_numeric(df_ames["SalePrice"], errors="coerce")

# Compute full correlation matrix
corr = num_df.corr()

# Focus on Gr Liv Area, Overall Qual, and SalePrice correlations
focus_features = ["Gr Liv Area", "Overall Qual", "SalePrice"]
corr_focus = corr[focus_features].sort_values(by="SalePrice", ascending=False)

print("=== Correlations with Gr Liv Area, Overall Qual & SalePrice ===")
display(corr_focus.head(15))  # top 15 correlations with SalePrice

# Collect top correlated features with the three focus features
top_feats = set()
for f in focus_features:
    top_feats.update(
        corr[f].abs().sort_values(ascending=False).head(10).index.tolist()
    )
top_feats = list(top_feats)

# Heatmap for focus + top correlated features
plt.figure(figsize=(12, 10))
sns.heatmap(
    corr.loc[top_feats, top_feats], 
    annot=True, fmt=".2f", cmap="coolwarm", center=0
)
plt.title("Correlation matrix: Gr Liv Area, Overall Qual, SalePrice & related features")

# Save directly to PDF
plt.savefig("correlation_matrix.pdf", format="pdf", bbox_inches="tight")

plt.show()
=== Correlations with Gr Liv Area, Overall Qual & SalePrice ===
Gr Liv Area Overall Qual SalePrice
SalePrice 0.706780 0.799262 1.000000
Overall Qual 0.570556 1.000000 0.799262
Gr Liv Area 1.000000 0.570556 0.706780
Garage Cars 0.488829 0.599545 0.647877
Garage Area 0.484892 0.563503 0.640401
Total Bsmt SF 0.444675 0.547294 0.632280
1st Flr SF 0.562166 0.477837 0.621676
Year Built 0.241726 0.597027 0.558426
Full Bath 0.630321 0.522263 0.545604
Year Remod/Add 0.316855 0.569609 0.532974
Garage Yr Blt 0.272848 0.570569 0.526965
Mas Vnr Area 0.403611 0.429418 0.508285
TotRms AbvGrd 0.807772 0.380693 0.495474
Fireplaces 0.454924 0.393007 0.474558
BsmtFin SF 1 0.209633 0.284118 0.432914
No description has been provided for this image

The features we are interested in are:

  • Gr Liv Area
  • Overall Quality

Based on these two features, a first assessment will be made. From these features, either a duplicate of one of the two, or a highly correlated feature such as TotRms AbvGrd will be used.

Functions Definitions¶

In [13]:
import numpy as np
import pandas as pd
import random
import itertools
from itertools import permutations
import warnings

from sklearn.exceptions import ConvergenceWarning
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LinearRegression
from sklearn.neural_network import MLPRegressor
from sklearn.metrics import mean_squared_error, r2_score

# Optional libraries (may or may not be installed)
try:
    import lightgbm as lgb
    HAS_LGB = True
except Exception:
    HAS_LGB = False

try:
    import xgboost as xgb
    HAS_XGB = True
except Exception:
    HAS_XGB = False
In [14]:
def all_subsets_indices(m, letters):
    """Return [(label, comb), ...] where label uses letters (e.g., 'AB'), comb are integer indices."""
    out = []
    for k in range(0, m+1):
        for comb in itertools.combinations(range(m), k):
            label = "nothing" if k == 0 else "".join(letters[j] for j in comb)
            out.append((label, comb))
    return sorted(out, key=lambda t: (len(t[1]), t[1]))

def predict_array(model, arr):
    return np.asarray(model.predict(arr)).reshape(-1)

def shap_for_subset_model(model, x_point_S, x0_S):
    """
    Exact per-point SHAP with mean baseline for k in {0,1,2,3}.
    Returns: (phi_S (len k), f_xS, f_x0S)
    """
    x = x_point_S.reshape(1, -1)
    X0 = x0_S.reshape(1, -1)
    f_x0 = predict_array(model, X0)[0]
    f_x  = predict_array(model, x)[0]
    k = x.shape[1]

    if k == 0:
        return np.array([]), f_x0, f_x0

    if k == 1:
        return np.array([f_x - f_x0]), f_x, f_x0

    if k == 2:
        x1x2 = f_x
        x1x0 = predict_array(model, np.array([[x[0,0], x0_S[1]]]))[0]
        x0x2 = predict_array(model, np.array([[x0_S[0], x[0,1]]]))[0]
        x0x0 = f_x0
        phi1 = 0.5 * ((x1x2 - x0x2) + (x1x0 - x0x0))
        phi2 = 0.5 * ((x1x2 - x1x0) + (x0x2 - x0x0))
        return np.array([phi1, phi2]), f_x, f_x0

    # k == 3
    contrib = np.zeros(3, dtype=float)
    for pi in permutations(range(3)):  # 6 perms
        z = X0.copy(); f_prev = f_x0
        for j in pi:
            z[0, j] = x[0, j]
            f_new = predict_array(model, z)[0]
            contrib[j] += (f_new - f_prev)
            f_prev = f_new
    phi = contrib / 6.0
    return phi, f_x, f_x0
In [15]:
def lattice_predictions_for_row(model, comb, letters, x_point_full, x0_S):
    """
    Build f(∅) and f(subset) for all non-empty subsets ⊆ comb.
    Keys look like: f(∅), f(A), f(B), f(AB), ... using global letters.
    """
    out = {}
    f_empty = float(predict_array(model, x0_S.reshape(1, -1))[0])
    out["f(∅)"] = f_empty

    k = len(comb)
    if k == 0:
        return out

    # Compute f for all non-empty sub-combinations inside comb
    for kk in range(1, k+1):
        for sub in itertools.combinations(comb, kk):
            z = x0_S.copy()
            for j_global in sub:
                pos_in_S = list(comb).index(j_global)
                z[pos_in_S] = x_point_full.iloc[j_global]
            yhat_sub = float(predict_array(model, z.reshape(1, -1))[0])
            label_sub = "f(" + "".join(letters[j] for j in sub) + ")"
            out[label_sub] = yhat_sub

    # Ensure required keys exist (dynamic per subset)
    sub_letters = "".join(letters[j] for j in comb)  # e.g., "AB", "AC", "B", ...
    need = {"f(∅)"}
    if k == 1:
        need |= {f"f({sub_letters})"}
    elif k == 2:
        L1, L2 = sub_letters[0], sub_letters[1]
        need |= {f"f({L1})", f"f({L2})", f"f({L1+L2})"}
    elif k == 3:
        # If the active set is size 3, assume globals are A,B,C
        need |= {"f(A)","f(B)","f(C)","f(AB)","f(AC)","f(BC)","f(ABC)"}

    for kkey in need:
        out.setdefault(kkey, None)

    return out
In [16]:
def mu_from_f_upper_2f(f_dict, pair_letters):
    """
    2-feature μ for an arbitrary pair (e.g., 'AB', 'AC', or 'BC').
    """
    L1, L2 = pair_letters[0], pair_letters[1]
    f0    = float(f_dict["f(∅)"])
    fL1   = float(f_dict[f"f({L1})"])
    fL2   = float(f_dict[f"f({L2})"])
    fL1L2 = float(f_dict[f"f({L1+L2})"])

    nuL1, nuL2, nuL1L2 = (fL1 - f0), (fL2 - f0), (fL1L2 - f0)

    muL1   = nuL1
    muL2   = nuL2
    muL1L2 = nuL1L2 - nuL1 - nuL2

    return {f"μ({L1})": muL1, f"μ({L2})": muL2, f"μ({L1+L2})": muL1L2}

def mu_from_f_upper_3f(f_dict):
    """
    Standard 3-feature μ (assuming letters A,B,C).
    """
    f0   = float(f_dict["f(∅)"])
    fA   = float(f_dict["f(A)"])
    fB   = float(f_dict["f(B)"])
    fC   = float(f_dict["f(C)"])
    fAB  = float(f_dict["f(AB)"])
    fAC  = float(f_dict["f(AC)"])
    fBC  = float(f_dict["f(BC)"])
    fABC = float(f_dict["f(ABC)"])

    nuA, nuB, nuC    = (fA - f0), (fB - f0), (fC - f0)
    nuAB, nuAC, nuBC = (fAB - f0), (fAC - f0), (fBC - f0)
    nuABC = (fABC - f0)

    muA = nuABC - nuBC
    muB = nuABC - nuAC
    muC = nuABC - nuAB
    muAB = -nuC + nuBC + nuAC - nuABC
    muAC = -nuB + nuAB + nuBC - nuABC
    muBC = -nuA + nuAB + nuAC - nuABC
    muABC = nuA+nuB+nuC-nuAB-nuAC-nuBC +nuABC


    return {
        "μ(A)": muA, "μ(B)": muB, "μ(C)": muC,
        "μ(AB)": muAB, "μ(AC)": muAC, "μ(BC)": muBC, "μ(ABC)": muABC
    }
In [17]:
def _subset_strings(letters_str):
    """All non-empty subsets of letters_str (e.g., 'AB' → ['A','B','AB'])"""
    outs = []
    L = list(letters_str)
    for k in range(1, len(L)+1):
        for comb in itertools.combinations(L, k):
            outs.append("".join(comb))
    return outs

def _mu_get(mu_dict, subset_str):
    return float(mu_dict.get(f"μ({subset_str})", 0.0))

def _redistribute_weights(mu_dict, T):
    """
    Recursive weights for coalition T (string like 'AB', 'ABC'), summing to 1 over singletons in T.
    """
    if len(T) == 1:
        return {T: 1.0}

    # all non-empty proper subsets U of T
    proper_subsets = []
    for k in range(1, len(T)):
        for comb in itertools.combinations(T, k):
            proper_subsets.append("".join(comb))

    denom = sum(_mu_get(mu_dict, U) for U in proper_subsets)
    weights = {c: 0.0 for c in T}

    if abs(denom) < 1e-12:
        # equal split to singletons
        equal = 1.0 / len(T)
        for c in T:
            weights[c] += equal
        return weights

    # push mass to U proportional to μ(U), then recurse
    for U in proper_subsets:
        frac = _mu_get(mu_dict, U) / denom
        sub_w = _redistribute_weights(mu_dict, U)
        for c, w in sub_w.items():
            weights[c] += frac * w

    return weights

def reshAP_from_mu_recursive(mu_dict, active_letters):
    """
    ReSHAP_i = sum_{T ⊆ active_letters, i∈T} w_i(T) * μ(T).
    Returns dict: {"ReSHAP A": ..., ...} for active letters only.
    """
    if active_letters == "nothing" or len(active_letters) == 0:
        return {}

    coalitions = _subset_strings(active_letters)
    coalition_weights = {T: _redistribute_weights(mu_dict, T) for T in coalitions}

    res = {f"ReSHAP {c}": 0.0 for c in active_letters}
    for T in coalitions:
        mu_T = _mu_get(mu_dict, T)
        if abs(mu_T) < 1e-12:
            continue
        for c, w in coalition_weights[T].items():
            res[f"ReSHAP {c}"] += w * mu_T
    return res
In [18]:
def make_model_factory(settings, random_state):
    """Return a factory function build_models_for_subset(k) honoring SETTINGS & available libs."""
    def build_models_for_subset(n_features):
        models = {}
        if settings["USE_LINEAR"]:
            models["Linear Regression"] = Pipeline([("model", LinearRegression())])
        if settings["USE_MLP"]:
            models["MLP"] = Pipeline([
                ("scaler", StandardScaler()),
                ("model", MLPRegressor(
                    hidden_layer_sizes=settings["MLP_hidden"],
                    activation="relu", solver="adam",
                    max_iter=settings["MLP_max_iter"],
                    random_state=random_state,
                    early_stopping=settings["MLP_early_stopping"],
                    n_iter_no_change=settings["MLP_n_iter_no_change"],
                    tol=settings["MLP_tol"],
                    learning_rate_init=settings["MLP_lr"],
                    alpha=settings["MLP_alpha"]
                ))
            ])
        if settings["USE_LGB"] and HAS_LGB:
            models["LightGBM"] = lgb.LGBMRegressor(
                n_estimators=500, learning_rate=0.05, num_leaves=31,
                random_state=random_state, verbose=-1
            )
        if settings["USE_XGB"] and HAS_XGB:
            models["XGBoost"] = xgb.XGBRegressor(
                n_estimators=500, learning_rate=0.05, max_depth=6,
                subsample=0.8, colsample_bytree=0.8,
                objective="reg:squarederror",
                random_state=random_state, n_jobs=-1
            )
        return models
    return build_models_for_subset

def run_pipeline(settings):
    """
    End-to-end:
    - slice data by MODE
    - split train/test
    - pick one or many test rows (per settings)
    - train each subset model per chosen algorithms (once)
    - for each chosen test row: compute SHAP, lattice f, μ, and ReSHAP
    - display per-model tables per test row
    """
    # Warnings
    if settings["SILENCE_MLP_WARNINGS"]:
        warnings.filterwarnings("ignore", category=ConvergenceWarning)
    else:
        warnings.filterwarnings("default", category=ConvergenceWarning)

    # ---- Data slice
    MODE = settings["MODE"]
    RANDOM_STATE = settings["RANDOM_STATE"]

    df = df_ames.copy()
    df.columns = df.columns.str.strip()

    need = ["Gr Liv Area", "Overall Qual", "SalePrice"]
    if MODE == "totrms":
        need += ["TotRms AbvGrd"]
    base_df = df[need].apply(pd.to_numeric, errors="coerce").dropna()

    if MODE == "two":
        X_full = base_df[["Gr Liv Area", "Overall Qual"]].copy()
        feature_names_full = ["Gr Liv Area", "Overall Qual"]
    elif MODE == "dup_grliv":
        X_full = base_df[["Gr Liv Area", "Overall Qual"]].copy()
        X_full["Gr Liv Area (DUP)"] = X_full["Gr Liv Area"]
        feature_names_full = ["Gr Liv Area", "Overall Qual", "Gr Liv Area (DUP)"]
    elif MODE == "dup_qual":
        X_full = base_df[["Gr Liv Area", "Overall Qual"]].copy()
        X_full["Overall Qual (DUP)"] = X_full["Overall Qual"]
        feature_names_full = ["Gr Liv Area", "Overall Qual", "Overall Qual (DUP)"]
    elif MODE == "totrms":
        X_full = base_df[["Gr Liv Area", "Overall Qual", "TotRms AbvGrd"]].copy()
        feature_names_full = ["Gr Liv Area", "Overall Qual", "TotRms AbvGrd"]
    else:
        raise ValueError("Invalid MODE")

    y_full = base_df["SalePrice"]
    m = len(feature_names_full)
    letters = "ABC"[:m]

    X_tr_full, X_te_full, y_tr, y_te = train_test_split(
        X_full, y_full, test_size=0.2, random_state=RANDOM_STATE
    )

    # ---- Pick one or many test rows
    if settings["PICK_RANDOM_POINT"]:
        random.seed(RANDOM_STATE)
        n = min(int(settings.get("N_RANDOM_POINTS", 1)), len(X_te_full))
        test_indices = random.sample(list(X_te_full.index), n)
    else:
        if settings["CUSTOM_TEST_INDEX"] is not None:
            idx = settings["CUSTOM_TEST_INDEX"]
            if idx not in X_te_full.index:
                raise ValueError(f"CUSTOM_TEST_INDEX {idx} not in test indices.")
            test_indices = [idx]
        else:
            test_indices = [X_te_full.index[0]]

    print(f"Mode: {MODE} | Features (A,B{',C' if m==3 else ''}): {list(zip(list(letters), feature_names_full))}")
    print(f"Evaluating {len(test_indices)} test point(s): {test_indices}")

    # ---- Build subsets & models
    build_models_for_subset = make_model_factory(settings, RANDOM_STATE)
    subsets = all_subsets_indices(m, letters)

    # Determine model order (respect settings + availability)
    model_names = list(build_models_for_subset(1).keys())
    if settings["USE_LGB"] and HAS_LGB and "LightGBM" not in model_names:
        model_names.append("LightGBM")
    if settings["USE_XGB"] and HAS_XGB and "XGBoost" not in model_names:
        model_names.append("XGBoost")

    # ---- Train per subset (once)
    fitted = {}  # (model_name, label) -> dict(model, r2, rmse, x0_S)
    for label, comb in subsets:
        k = len(comb)
        if k == 0:
            # Constant model
            y_hat_test = np.full_like(y_te.values, fill_value=y_tr.mean(), dtype=float)
            r2  = r2_score(y_te.values, y_hat_test)
            rmse = float(np.sqrt(mean_squared_error(y_te.values, y_hat_test)))
            for mn in model_names:
                fitted[(mn, label)] = {"model": None, "r2": r2, "rmse": rmse, "x0_S": None}
            continue

        X_tr_S = X_tr_full.iloc[:, list(comb)].values
        X_te_S = X_te_full.iloc[:, list(comb)].values

        # Baseline
        bl = settings["BASELINE"]
        if bl == "mean":
            x0_S = X_tr_full.iloc[:, list(comb)].mean(axis=0).values.astype(float)
        elif bl == "median":
            x0_S = X_tr_full.iloc[:, list(comb)].median(axis=0).values.astype(float)
        elif bl == "zero":
            x0_S = np.zeros(len(comb), dtype=float)
        else:
            raise ValueError("BASELINE must be one of {'mean','median','zero'}")

        for mn, mdl in build_models_for_subset(k).items():
            mdl.fit(X_tr_S, y_tr.values)
            y_hat_test = mdl.predict(X_te_S)
            r2  = r2_score(y_te.values, y_hat_test)
            rmse = float(np.sqrt(mean_squared_error(y_te.values, y_hat_test)))
            fitted[(mn, label)] = {"model": mdl, "r2": r2, "rmse": rmse, "x0_S": x0_S}

    # ---- Evaluate each chosen point
    for idx in test_indices:
        x_point_full = X_te_full.loc[idx]
        y_true = y_te.loc[idx]

        print(f"\n──────── Test point index: {idx} | True SalePrice: {y_true} ────────")
        display(pd.DataFrame(x_point_full).T)

        for mn in model_names:
            rows = []
            for label, comb in subsets:
                k = len(comb)
                entry = fitted[(mn, label)]
                r2, rmse = entry["r2"], entry["rmse"]

                # defaults
                phi_cols = [""]*m
                mu_cols  = {}
                resh_cols = {}
                lattice_dict = {}
                baseline = float(y_tr.mean()) if k==0 else None
                prediction = baseline

                if k == 0:
                    lattice_dict = {"f(∅)": baseline}
                else:
                    model = entry["model"]
                    x0_S = entry["x0_S"]
                    x_point_S = x_point_full.iloc[list(comb)].values

                    # φ for this point
                    phi_S, f_xS, f_x0S = shap_for_subset_model(model, x_point_S, x0_S)
                    baseline  = float(f_x0S)
                    prediction = float(f_xS)

                    # Align φ into global A,B,(C)
                    for j in range(m):
                        if j in comb:
                            pos = list(comb).index(j)
                            phi_cols[j] = float(phi_S[pos])
                        else:
                            phi_cols[j] = ""

                    # Lattice
                    lattice_dict = lattice_predictions_for_row(model, comb, letters, x_point_full, x0_S)

                    # μ and ReSHAP
                    if k == 2:
                        sub_letters = "".join(letters[j] for j in comb)  # "AB"/"AC"/"BC"
                        mu_cols = mu_from_f_upper_2f(lattice_dict, sub_letters)
                        resh_cols = reshAP_from_mu_recursive(mu_cols, sub_letters)
                    elif k == 3:
                        mu_cols = mu_from_f_upper_3f(lattice_dict)
                        resh_cols = reshAP_from_mu_recursive(mu_cols, "ABC")
                    elif k == 1:
                        sl = "".join(letters[j] for j in comb)  # "A"/"B"/"C"
                        f0 = float(lattice_dict["f(∅)"])
                        fA = float(lattice_dict[f"f({sl})"])
                        mu_cols = {f"μ({sl})": (fA - f0)}
                        resh_cols = reshAP_from_mu_recursive(mu_cols, sl)

                # row
                row = {
                    "Feature subset": label,
                    "n_features": k,
                    "R2 (subset test)": r2,
                    "RMSE (subset test)": rmse,
                    "Baseline f(∅)": baseline,
                    "Prediction f(S)": prediction,
                    "φ A": phi_cols[0] if m>=1 else "",
                    "φ B": phi_cols[1] if m>=2 else "",
                    "φ C": phi_cols[2] if m>=3 else "",
                    "f(∅)": lattice_dict.get("f(∅)", ""),
                    "f(A)": lattice_dict.get("f(A)", ""),
                    "f(B)": lattice_dict.get("f(B)", ""),
                    "f(AB)": lattice_dict.get("f(AB)", ""),
                }
                if m == 3:
                    row.update({
                        "f(C)": lattice_dict.get("f(C)", ""),
                        "f(AC)": lattice_dict.get("f(AC)", ""),
                        "f(BC)": lattice_dict.get("f(BC)", ""),
                        "f(ABC)": lattice_dict.get("f(ABC)", ""),
                    })
                for key in ["μ(A)","μ(B)","μ(C)","μ(AB)","μ(AC)","μ(BC)","μ(ABC)"]:
                    row[key] = mu_cols.get(key, "")
                for key in ["ReSHAP A","ReSHAP B","ReSHAP C"]:
                    row[key] = resh_cols.get(key, "")

                rows.append(row)

            # display table for this point + model
            base_cols = ["Feature subset","n_features","R2 (subset test)","RMSE (subset test)",
                         "Baseline f(∅)","Prediction f(S)","φ A","φ B"]
            if m==3: base_cols += ["φ C"]
            lattice_cols = ["f(∅)","f(A)","f(B)","f(AB)"]
            if m==3: lattice_cols += ["f(C)","f(AC)","f(BC)","f(ABC)"]
            mu_cols_all = ["μ(A)","μ(B)","μ(AB)"]
            if m==3: mu_cols_all = ["μ(A)","μ(B)","μ(C)","μ(AB)","μ(AC)","μ(BC)","μ(ABC)"]
            resh_cols_all = ["ReSHAP A","ReSHAP B"]
            if m==3: resh_cols_all += ["ReSHAP C"]

            ordered_cols = base_cols + lattice_cols + mu_cols_all + resh_cols_all
            tbl = pd.DataFrame(rows)[ordered_cols]
            print(f"\n=== Subset predictions + SHAP + μ + ReSHAP — {mn} | Test idx {idx} ===")
            display(tbl)

Investigate 2 Features¶

In [20]:
SETTINGS = dict(
    # Data slice / feature modes:
    MODE="two",           # 'two' | 'dup_grliv' | 'dup_qual' | 'totrms'

    # Baseline used inside each subset for f(∅):
    BASELINE="mean",           # 'mean' | 'median' | 'zero'

    # Randomness & test row:
    RANDOM_STATE=42,
    PICK_RANDOM_POINT=True,    # True → draw a random test row (with the seed)
    N_RANDOM_POINTS=10,         # ← NEW: number of distinct random test points to assess
    CUSTOM_TEST_INDEX=None,    # if PICK_RANDOM_POINT=False, use this test index (must be in X_te)

    # Which models to run:
    USE_LINEAR=True,
    USE_MLP=True,
    USE_LGB=True,              # will only run if LightGBM is installed
    USE_XGB=True,              # will only run if XGBoost is installed

    # MLP hyperparameters (used if USE_MLP=True):
    MLP_hidden=(64, 32),
    MLP_max_iter=5000,
    MLP_early_stopping=True,
    MLP_n_iter_no_change=20,
    MLP_tol=1e-4,
    MLP_lr=1e-3,
    MLP_alpha=1e-4,

    # Warnings:
    SILENCE_MLP_WARNINGS=True
)

# ---- Go! (expects df_ames to already be in memory) ----
run_pipeline(SETTINGS)
Mode: two | Features (A,B): [('A', 'Gr Liv Area'), ('B', 'Overall Qual')]
Evaluating 10 test point(s): [1093, 1032, 2783, 411, 1385, 1577, 1427, 518, 2181, 1880]

──────── Test point index: 1093 | True SalePrice: 226750 ────────
Gr Liv Area Overall Qual
1093 1892 7
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1093 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 221182.120972 42599.913207 178582.207765 221182.120972 42599.913207 42599.913207
2 B 1 0.651246 52878.678358 178582.207765 219911.155344 41328.947579 178582.207765 219911.155344 41328.947579 41328.947579
3 AB 2 0.743635 45336.691057 178582.207765 231715.622003 22524.873238 30608.541001 178582.207765 201107.081002 209190.748766 231715.622003 22524.873238 30608.541001 -0.0 22524.873238 30608.541001
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1093 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 222692.229667 44084.509025 178607.720643 222692.229667 44084.509025 44084.509025
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 AB 2 0.749226 44839.624931 170073.333123 226477.066877 21867.743726 34535.990028 170073.333123 191941.076849 204609.323151 226477.066877 21867.743726 34535.990028 0.0 21867.743726 34535.990028
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1093 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 198246.330716 -9343.799823 207590.130539 198246.330716 -9343.799823 -9343.799823
2 B 1 0.725528 46910.500909 162512.760008 204048.490253 41535.730245 162512.760008 204048.490253 41535.730245 41535.730245
3 AB 2 0.809248 39107.119385 182789.632430 200534.205454 -13995.525233 31740.098257 182789.632430 183753.304113 229488.927603 200534.205454 963.671683 46699.295173 -29918.393833 358.767901 17385.805123
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1093 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 187762.781250 -7323.453125 195086.234375 187762.78125 -7323.453125 -7323.453125
2 B 1 0.725640 46900.883370 162447.125000 204150.109375 41702.984375 162447.125000 204150.109375 41702.984375 41702.984375
3 AB 2 0.790517 40982.155774 172155.718750 201395.015625 657.195312 28582.101562 172155.718750 172812.921875 200737.828125 201395.015625 657.203125 28582.109375 -0.015625 657.202774 28582.094101
──────── Test point index: 1032 | True SalePrice: 306000 ────────
Gr Liv Area Overall Qual
1032 2340 8
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1032 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 268996.782449 90414.574684 178582.207765 268996.782449 90414.574684 90414.574684
2 B 1 0.651246 52878.678358 178582.207765 264085.826172 85503.618407 178582.207765 264085.826172 85503.618407 85503.618407
3 AB 2 0.743635 45336.691057 178582.207765 289713.919932 47807.065326 63324.646841 178582.207765 226389.273091 241906.854606 289713.919932 47807.065326 63324.646841 -0.0 47807.065326 63324.646841
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1032 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 272173.219573 93565.49893 178607.720643 272173.219573 93565.49893 93565.49893
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 AB 2 0.749226 44839.624931 170073.333123 293320.093832 49104.577597 74142.183112 170073.333123 216485.69877 241523.304285 293320.093832 46412.365646 71449.971161 5384.423901 48532.668516 74714.092193
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1032 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 337899.406371 130309.275832 207590.130539 337899.406371 130309.275832 130309.275832
2 B 1 0.725528 46910.500909 162512.760008 269359.021972 106846.261963 162512.760008 269359.021972 106846.261963 106846.261963
3 AB 2 0.809248 39107.119385 182789.632430 293900.114096 34828.136694 76282.344972 182789.632430 205582.297796 247036.506074 293900.114096 22792.665366 64246.873644 24070.942656 29096.018385 82014.463281
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1032 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 294881.656250 99795.421875 195086.234375 294881.65625 99795.421875 99795.421875
2 B 1 0.725640 46900.883370 162447.125000 269223.312500 106776.1875 162447.125000 269223.3125 106776.1875 106776.1875
3 AB 2 0.790517 40982.155774 172155.718750 321153.718750 67974.367188 81023.632812 172155.718750 240129.8125 253179.078125 321153.71875 67974.09375 81023.359375 0.546875 67974.34324 81023.65676
──────── Test point index: 2783 | True SalePrice: 174000 ────────
Gr Liv Area Overall Qual
2783 1282 6
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 2783 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 156077.336371 -22504.871393 178582.207765 156077.336371 -22504.871393 -22504.871393
2 B 1 0.651246 52878.678358 178582.207765 175736.484516 -2845.723249 178582.207765 175736.484516 -2845.723249 -2845.723249
3 AB 2 0.743635 45336.691057 178582.207765 164575.102828 -11899.540097 -2107.564839 178582.207765 166682.667667 176474.642926 164575.102828 -11899.540097 -2107.564839 -0.0 -11899.540097 -2107.564839
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2783 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 155318.560376 -23289.160267 178607.720643 155318.560376 -23289.160267 -23289.160267
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 AB 2 0.749226 44839.624931 170073.333123 156142.953218 -11552.3888 -2377.991105 170073.333123 158520.944324 167695.342018 156142.953218 -11552.3888 -2377.991105 -0.0 -11552.3888 -2377.991105
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 2783 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 155854.168999 -51735.961539 207590.130539 155854.168999 -51735.961539 -51735.961539
2 B 1 0.725528 46910.500909 162512.760008 162512.760008 0.0 162512.760008 162512.760008 0.0 0.0
3 AB 2 0.809248 39107.119385 182789.632430 154342.145951 -28447.486479 0.0 182789.632430 154342.145951 182789.63243 154342.145951 -28447.486479 0.0 0.0 -28447.486479 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 2783 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 154598.546875 -40487.6875 195086.234375 154598.546875 -40487.6875 -40487.6875
2 B 1 0.725640 46900.883370 162447.125000 162447.125000 0.0 162447.125000 162447.125 0.0 0.0
3 AB 2 0.790517 40982.155774 172155.718750 150816.812500 -21338.90625 0.0 172155.718750 150816.8125 172155.71875 150816.8125 -21338.90625 0.0 0.0 -21338.90625 0.0
──────── Test point index: 411 | True SalePrice: 123000 ────────
Gr Liv Area Overall Qual
411 1055 7
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 411 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 131849.818168 -46732.389597 178582.207765 131849.818168 -46732.389597 -46732.389597
2 B 1 0.651246 52878.678358 178582.207765 219911.155344 41328.947579 178582.207765 219911.155344 41328.947579 41328.947579
3 AB 2 0.743635 45336.691057 178582.207765 184480.812231 -24709.936535 30608.541001 178582.207765 153872.271229 209190.748766 184480.812231 -24709.936535 30608.541001 -0.0 -24709.936535 30608.541001
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 411 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 130246.719509 -48361.001134 178607.720643 130246.719509 -48361.001134 -48361.001134
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 AB 2 0.749226 44839.624931 170073.333123 180620.262084 -23989.061067 34535.990028 170073.333123 146084.272056 204609.323151 180620.262084 -23989.061067 34535.990028 0.0 -23989.061067 34535.990028
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 411 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 136692.996600 -70897.133939 207590.130539 136692.9966 -70897.133939 -70897.133939
2 B 1 0.725528 46910.500909 162512.760008 204048.490253 41535.730245 162512.760008 204048.490253 41535.730245 41535.730245
3 AB 2 0.809248 39107.119385 182789.632430 146990.418258 -61184.814123 25385.599951 182789.632430 142918.51353 229488.927603 146990.418258 -39871.1189 46699.295173 -42627.390445 209038.939205 -244838.153377
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 411 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 134324.750000 -60761.484375 195086.234375 134324.75 -60761.484375 -60761.484375
2 B 1 0.725640 46900.883370 162447.125000 204150.109375 41702.984375 162447.125000 204150.109375 41702.984375 41702.984375
3 AB 2 0.790517 40982.155774 172155.718750 169153.296875 -31584.53125 28582.109375 172155.718750 140571.1875 200737.828125 169153.296875 -31584.53125 28582.109375 0.0 -31584.53125 28582.109375
──────── Test point index: 1385 | True SalePrice: 108959 ────────
Gr Liv Area Overall Qual
1385 1258 4
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1385 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 153515.836649 -25066.371115 178582.207765 153515.836649 -25066.371115 -25066.371115
2 B 1 0.651246 52878.678358 178582.207765 87387.142860 -91195.064905 178582.207765 87387.14286 -91195.064905 -91195.064905
3 AB 2 0.743635 45336.691057 178582.207765 97788.488001 -13253.943245 -67539.776519 178582.207765 165328.264519 111042.431246 97788.488001 -13253.943245 -67539.776519 -0.0 -13253.943245 -67539.776519
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1385 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 152667.793060 -25939.927583 178607.720643 152667.79306 -25939.927583 -25939.927583
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 AB 2 0.749226 44839.624931 170073.333123 102256.393189 -3900.939754 -63916.00018 170073.333123 157206.053864 97190.993438 102256.393189 -12867.27926 -72882.339686 17932.679011 -10176.365976 -57640.573958
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1385 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 165029.481902 -42560.648637 207590.130539 165029.481902 -42560.648637 -42560.648637
2 B 1 0.725528 46910.500909 162512.760008 105779.163578 -56733.59643 162512.760008 105779.163578 -56733.59643 -56733.59643
3 AB 2 0.809248 39107.119385 182789.632430 120456.636131 -15829.524888 -46503.471411 182789.632430 159205.404641 128531.458118 120456.636131 -23584.227789 -54258.174312 15509.405802 -18885.280307 -43447.715991
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1385 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 163633.953125 -31452.28125 195086.234375 163633.953125 -31452.28125 -31452.28125
2 B 1 0.725640 46900.883370 162447.125000 105912.015625 -56535.109375 162447.125000 105912.015625 -56535.109375 -56535.109375
3 AB 2 0.790517 40982.155774 172155.718750 117276.554688 -13997.238281 -40881.925781 172155.718750 158158.5625 131273.875 117276.554688 -13997.15625 -40881.84375 -0.164062 -13997.198095 -40881.965968
──────── Test point index: 1577 | True SalePrice: 167500 ────────
Gr Liv Area Overall Qual
1577 1394 5
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1577 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 168031.001741 -10551.206024 178582.207765 168031.001741 -10551.206024 -10551.206024
2 B 1 0.651246 52878.678358 178582.207765 131561.813688 -47020.394077 178582.207765 131561.813688 -47020.394077 -47020.394077
3 AB 2 0.743635 45336.691057 178582.207765 138179.545011 -5578.992075 -34823.670679 178582.207765 173003.215689 143758.537086 138179.545011 -5578.992075 -34823.670679 -0.0 -5578.992075 -34823.670679
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1577 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 167688.807853 -10918.91279 178607.720643 167688.807853 -10918.91279 -10918.91279
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 AB 2 0.749226 44839.624931 170073.333123 125365.127565 -5416.23332 -39291.972239 170073.333123 164657.099804 130781.360885 125365.127565 -5416.23332 -39291.972239 -0.0 -5416.23332 -39291.972239
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1577 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 158197.633277 -49392.497261 207590.130539 158197.633277 -49392.497261 -49392.497261
2 B 1 0.725528 46910.500909 162512.760008 135047.645999 -27465.11401 162512.760008 135047.645999 -27465.11401 -27465.11401
3 AB 2 0.809248 39107.119385 182789.632430 143978.553894 -20139.459509 -18671.619027 182789.632430 158731.955162 160199.795645 143978.553894 -24057.677268 -22589.836785 7836.435517 -20016.166366 -18794.91217
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1577 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 161489.765625 -33596.46875 195086.234375 161489.765625 -33596.46875 -33596.46875
2 B 1 0.725640 46900.883370 162447.125000 135150.843750 -27296.28125 162447.125000 135150.84375 -27296.28125 -27296.28125
3 AB 2 0.790517 40982.155774 172155.718750 140205.796875 -14059.90625 -17890.015625 172155.718750 158095.8125 154265.703125 140205.796875 -14059.90625 -17890.015625 0.0 -14059.90625 -17890.015625
──────── Test point index: 1427 | True SalePrice: 228000 ────────
Gr Liv Area Overall Qual
1427 2097 7
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1427 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 243061.597764 64479.389999 178582.207765 243061.597764 64479.389999 64479.389999
2 B 1 0.651246 52878.678358 178582.207765 219911.155344 41328.947579 178582.207765 219911.155344 41328.947579 41328.947579
3 AB 2 0.743635 45336.691057 178582.207765 243284.482223 34093.733457 30608.541001 178582.207765 212675.941221 209190.748766 243284.482223 34093.733457 30608.541001 -0.0 34093.733457 30608.541001
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1427 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 245334.200495 66726.479852 178607.720643 245334.200495 66726.479852 66726.479852
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 AB 2 0.749226 44839.624931 170073.333123 237708.422890 33099.099739 34535.990028 170073.333123 203172.432862 204609.323151 237708.42289 33099.099739 34535.990028 0.0 33099.099739 34535.990028
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1427 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 252701.449543 45111.319005 207590.130539 252701.449543 45111.319005 45111.319005
2 B 1 0.725528 46910.500909 162512.760008 204048.490253 41535.730245 162512.760008 204048.490253 41535.730245 41535.730245
3 AB 2 0.809248 39107.119385 182789.632430 231949.983960 9957.177125 39203.174405 182789.632430 200242.930323 229488.927603 231949.98396 17453.297893 46699.295173 -14992.241537 13374.521882 35785.829647
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1427 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 263538.906250 68452.671875 195086.234375 263538.90625 68452.671875 68452.671875
2 B 1 0.725640 46900.883370 162447.125000 204150.109375 41702.984375 162447.125000 204150.109375 41702.984375 41702.984375
3 AB 2 0.790517 40982.155774 172155.718750 243396.140625 42658.3125 28582.109375 172155.718750 214814.03125 200737.828125 243396.140625 42658.3125 28582.109375 0.0 42658.3125 28582.109375
──────── Test point index: 518 | True SalePrice: 229456 ────────
Gr Liv Area Overall Qual
518 1314 8
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 518 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 159492.669334 -19089.53843 178582.207765 159492.669334 -19089.53843 -19089.53843
2 B 1 0.651246 52878.678358 178582.207765 264085.826172 85503.618407 178582.207765 264085.826172 85503.618407 85503.618407
3 AB 2 0.743635 45336.691057 178582.207765 231813.185372 -10093.669234 63324.646841 178582.207765 168488.538531 241906.854606 231813.185372 -10093.669234 63324.646841 -0.0 -10093.669234 63324.646841
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 518 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 158852.916798 -19754.803845 178607.720643 158852.916798 -19754.803845 -19754.803845
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 AB 2 0.749226 44839.624931 170073.333123 231724.102765 -9799.20152 71449.971161 170073.333123 160274.131604 241523.304285 231724.102765 -9799.20152 71449.971161 0.0 -9799.20152 71449.971161
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 518 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 171220.025307 -36370.105231 207590.130539 171220.025307 -36370.105231 -36370.105231
2 B 1 0.725528 46910.500909 162512.760008 269359.021972 106846.261963 162512.760008 269359.021972 106846.261963 106846.261963
3 AB 2 0.809248 39107.119385 182789.632430 206902.567063 -32568.12444 56681.059072 182789.632430 157787.322563 247036.506074 206902.567063 -25002.309868 64246.873644 -15131.629144 -15362.10383 39475.038463
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 518 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 168911.546875 -26174.6875 195086.234375 168911.546875 -26174.6875 -26174.6875
2 B 1 0.725640 46900.883370 162447.125000 269223.312500 106776.1875 162447.125000 269223.3125 106776.1875 106776.1875
3 AB 2 0.790517 40982.155774 172155.718750 237431.468750 -15747.609375 81023.359375 172155.718750 156408.109375 253179.078125 237431.46875 -15747.609375 81023.359375 0.0 -15747.609375 81023.359375
──────── Test point index: 2181 | True SalePrice: 184750 ────────
Gr Liv Area Overall Qual
2181 4676 10
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 2181 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 518316.088722 339733.880958 178582.207765 518316.088722 339733.880958 339733.880958
2 B 1 0.651246 52878.678358 178582.207765 352435.167828 173852.960063 178582.207765 352435.167828 173852.960063 173852.960063
3 AB 2 0.743635 45336.691057 178582.207765 486974.704646 179635.63836 128756.858521 178582.207765 358217.846125 307339.066286 486974.704646 179635.63836 128756.858521 -0.0 179635.63836 128756.858521
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2181 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 539859.565336 361251.844693 178607.720643 539859.565336 361251.844693 361251.844693
2 B 1 0.653707 52691.751258 167830.652805 357997.585228 190166.932423 167830.652805 357997.585228 190166.932423 190166.932423
3 AB 2 0.749226 44839.624931 170073.333123 627703.358242 283815.021212 173815.003907 170073.333123 427650.70996 317650.692654 627703.358242 257577.376836 147577.359531 52475.288752 290938.574453 166691.450666
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 2181 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 386181.566891 178591.436353 207590.130539 386181.566891 178591.436353 178591.436353
2 B 1 0.725528 46910.500909 162512.760008 449028.228831 286515.468823 162512.760008 449028.228831 286515.468823 286515.468823
3 AB 2 0.809248 39107.119385 182789.632430 484943.892199 153184.769918 148969.489851 182789.632430 309526.773946 305311.493879 484943.892199 126737.141516 122521.861449 52895.256804 153632.032241 148522.227527
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 2181 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 459439.187500 264352.9375 195086.234375 459439.1875 264352.953125 264352.953125
2 B 1 0.725640 46900.883370 162447.125000 450958.187500 288511.0625 162447.125000 450958.1875 288511.0625 288511.0625
3 AB 2 0.790517 40982.155774 172155.718750 542171.750000 151095.578125 218920.453125 172155.718750 323251.0 391075.875 542171.75 151095.28125 218920.15625 0.59375 151095.523707 218920.507543
──────── Test point index: 1880 | True SalePrice: 139900 ────────
Gr Liv Area Overall Qual
1880 1422 5
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1880 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 171019.418083 -7562.789682 178582.207765 171019.418083 -7562.789682 -7562.789682
2 B 1 0.651246 52878.678358 178582.207765 131561.813688 -47020.394077 178582.207765 131561.813688 -47020.394077 -47020.394077
3 AB 2 0.743635 45336.691057 178582.207765 139759.682016 -3998.85507 -34823.670679 178582.207765 174583.352695 143758.537086 139759.682016 -3998.85507 -34823.670679 -0.0 -3998.85507 -34823.670679
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1880 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 170781.369722 -7826.350921 178607.720643 170781.369722 -7826.350921 -7826.350921
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 AB 2 0.749226 44839.624931 170073.333123 126899.166435 -3882.19445 -39291.972239 170073.333123 166191.138674 130781.360885 126899.166435 -3882.19445 -39291.972239 -0.0 -3882.19445 -39291.972239
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1880 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 166761.015241 -40829.115298 207590.130539 166761.015241 -40829.115298 -40829.115298
2 B 1 0.725528 46910.500909 162512.760008 135047.645999 -27465.11401 162512.760008 135047.645999 -27465.11401 -27465.11401
3 AB 2 0.809248 39107.119385 182789.632430 144744.578437 -16805.361797 -21239.692196 182789.632430 164634.126044 160199.795645 144744.578437 -18155.506386 -22589.836785 2700.289179 -16952.29852 -21092.755473
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1880 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B f(∅) f(A) f(B) f(AB) μ(A) μ(B) μ(AB) ReSHAP A ReSHAP B
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 165546.140625 -29540.09375 195086.234375 165546.140625 -29540.09375 -29540.09375
2 B 1 0.725640 46900.883370 162447.125000 135150.843750 -27296.28125 162447.125000 135150.84375 -27296.28125 -27296.28125
3 AB 2 0.790517 40982.155774 172155.718750 141969.359375 -12296.34375 -17890.015625 172155.718750 159859.375 154265.703125 141969.359375 -12296.34375 -17890.015625 0.0 -12296.34375 -17890.015625

Duplicate "Overall Quality (B == C)"¶

In [22]:
SETTINGS = dict(
    # Data slice / feature modes:
    MODE="dup_qual",           # 'two' | 'dup_grliv' | 'dup_qual' | 'totrms'

    # Baseline used inside each subset for f(∅):
    BASELINE="mean",           # 'mean' | 'median' | 'zero'

    # Randomness & test row:
    RANDOM_STATE=42,
    PICK_RANDOM_POINT=True,    # True → draw a random test row (with the seed)
    N_RANDOM_POINTS=10,         # ← NEW: number of distinct random test points to assess
    CUSTOM_TEST_INDEX=None,    # if PICK_RANDOM_POINT=False, use this test index (must be in X_te)

    # Which models to run:
    USE_LINEAR=True,
    USE_MLP=True,
    USE_LGB=True,              # will only run if LightGBM is installed
    USE_XGB=True,              # will only run if XGBoost is installed

    # MLP hyperparameters (used if USE_MLP=True):
    MLP_hidden=(64, 32),
    MLP_max_iter=5000,
    MLP_early_stopping=True,
    MLP_n_iter_no_change=20,
    MLP_tol=1e-4,
    MLP_lr=1e-3,
    MLP_alpha=1e-4,

    # Warnings:
    SILENCE_MLP_WARNINGS=True
)

# ---- Go! (expects df_ames to already be in memory) ----
run_pipeline(SETTINGS)
Mode: dup_qual | Features (A,B,C): [('A', 'Gr Liv Area'), ('B', 'Overall Qual'), ('C', 'Overall Qual (DUP)')]
Evaluating 10 test point(s): [1093, 1032, 2783, 411, 1385, 1577, 1427, 518, 2181, 1880]

──────── Test point index: 1093 | True SalePrice: 226750 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1093 1892 7 7
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1093 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 221182.120972 42599.913207 178582.207765 221182.120972 42599.913207 42599.913207
2 B 1 0.651246 52878.678358 178582.207765 219911.155344 41328.947579 178582.207765 219911.155344 41328.947579 41328.947579
3 C 1 0.651246 52878.678358 178582.207765 219911.155344 41328.947579 178582.207765 219911.155344 41328.947579 41328.947579
4 AB 2 0.743635 45336.691057 178582.207765 231715.622003 22524.873238 30608.541001 178582.207765 201107.081002 209190.748766 231715.622003 22524.873238 30608.541001 -0.0 22524.873238 30608.541001
5 AC 2 0.743635 45336.691057 178582.207765 231715.622003 22524.873238 30608.541001 178582.207765 201107.081002 209190.748766 231715.622003 22524.873238 30608.541001 -0.0 22524.873238 30608.541001
6 BC 2 0.653450 52711.313325 178582.207765 220898.848464 -227849515686375680.0 227849515686417984.0 178582.207765 -227849515686197088.0 227849515686596576.0 220898.848464 -227849515686375680.0 227849515686417984.0 0.0 -227849515686375680.0 227849515686417984.0
7 ABC 3 0.743635 45336.691057 178582.207765 231715.622003 22524.873238 15304.270501 15304.270501 178582.207765 201107.081002 193886.478265 216411.351503 193886.478265 216411.351503 209190.748766 231715.622003 22524.873238 15304.270501 15304.270501 0.0 0.0 0.0 -0.0 22524.873238 15304.270501 15304.270501
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1093 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 222692.229667 44084.509025 178607.720643 222692.229667 44084.509025 44084.509025
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 226477.066877 21867.743726 34535.990028 170073.333123 191941.076849 204609.323151 226477.066877 21867.743726 34535.990028 0.0 21867.743726 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 226477.066877 21867.743726 34535.990028 170073.333123 191941.076849 204609.323151 226477.066877 21867.743726 34535.990028 0.0 21867.743726 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 225058.589689 24436.113321 17063.785283 17053.563352 166505.127733 190864.068295 183491.465006 207850.405569 183481.243075 207840.183638 200468.13085 225058.589689 24590.458838 17218.40605 17208.18412 -231.518276 -231.518276 -232.068777 231.518276 24414.618744 17074.501554 17064.341658
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1093 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 198246.330716 -9343.799823 207590.130539 198246.330716 -9343.799823 -9343.799823
2 B 1 0.725528 46910.500909 162512.760008 204048.490253 41535.730245 162512.760008 204048.490253 41535.730245 41535.730245
3 C 1 0.725528 46910.500909 162512.760008 204048.490253 41535.730245 162512.760008 204048.490253 41535.730245 41535.730245
4 AB 2 0.809248 39107.119385 182789.632430 200534.205454 -13995.525233 31740.098257 182789.632430 183753.304113 229488.927603 200534.205454 963.671683 46699.295173 -29918.393833 358.767901 17385.805123
5 AC 2 0.809248 39107.119385 182789.632430 200534.205454 -13995.525233 31740.098257 182789.632430 183753.304113 229488.927603 200534.205454 963.671683 46699.295173 -29918.393833 358.767901 17385.805123
6 BC 2 0.725528 46910.500909 162512.760008 204048.490253 41535.730245 0.0 162512.760008 204048.490253 162512.760008 204048.490253 41535.730245 0.0 0.0 41535.730245 0.0
7 ABC 3 0.809248 39107.119385 182789.632430 200534.205454 -13995.525233 31740.098257 0.0 182789.632430 183753.304113 229488.927603 200534.205454 182789.63243 183753.304113 229488.927603 200534.205454 -28954.72215 16780.90134 0.0 29918.393833 0.0 0.0 0.0 42204.431099 -24459.858076 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1093 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 187762.781250 -7323.453125 195086.234375 187762.78125 -7323.453125 -7323.453125
2 B 1 0.725640 46900.883370 162447.125000 204150.109375 41702.984375 162447.125000 204150.109375 41702.984375 41702.984375
3 C 1 0.725640 46900.883370 162447.125000 204150.109375 41702.984375 162447.125000 204150.109375 41702.984375 41702.984375
4 AB 2 0.790517 40982.155774 172155.718750 201395.015625 657.195312 28582.101562 172155.718750 172812.921875 200737.828125 201395.015625 657.203125 28582.109375 -0.015625 657.202774 28582.094101
5 AC 2 0.790517 40982.155774 172155.718750 201395.015625 657.195312 28582.101562 172155.718750 172812.921875 200737.828125 201395.015625 657.203125 28582.109375 -0.015625 657.202774 28582.094101
6 BC 2 0.725640 46900.883370 162447.125000 204150.109375 23600.09375 18102.890625 162447.125000 186047.21875 180550.015625 204150.109375 23600.09375 18102.890625 0.0 23600.09375 18102.890625
7 ABC 3 0.801046 39939.015065 169186.234375 195527.140625 -3297.367188 22207.6875 7430.585938 169186.234375 172347.234375 195464.28125 190484.5625 179004.828125 177389.8125 205282.875 195527.140625 -9755.734375 18137.328125 5042.578125 8140.71875 4776.015625 0.0 0.0 -9345.256291 35753.415417 -67.252875
──────── Test point index: 1032 | True SalePrice: 306000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1032 2340 8 8
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1032 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 268996.782449 90414.574684 178582.207765 268996.782449 90414.574684 90414.574684
2 B 1 0.651246 52878.678358 178582.207765 264085.826172 85503.618407 178582.207765 264085.826172 85503.618407 85503.618407
3 C 1 0.651246 52878.678358 178582.207765 264085.826172 85503.618407 178582.207765 264085.826172 85503.618407 85503.618407
4 AB 2 0.743635 45336.691057 178582.207765 289713.919932 47807.065326 63324.646841 178582.207765 226389.273091 241906.854606 289713.919932 47807.065326 63324.646841 -0.0 47807.065326 63324.646841
5 AC 2 0.743635 45336.691057 178582.207765 289713.919932 47807.065326 63324.646841 178582.207765 226389.273091 241906.854606 289713.919932 47807.065326 63324.646841 -0.0 47807.065326 63324.646841
6 BC 2 0.653450 52711.313325 178582.207765 266050.848464 -471387712115406592.0 471387712115494080.0 178582.207765 -471387712115228032.0 471387712115672640.0 266050.848464 -471387712115406592.0 471387712115494080.0 0.0 -471387712115406592.0 471387712115494080.0
7 ABC 3 0.743635 45336.691057 178582.207765 289713.919932 47807.065326 31662.323421 31662.323421 178582.207765 226389.273091 210244.531185 258051.596511 210244.531185 258051.596511 241906.854606 289713.919932 47807.065326 31662.323421 31662.323421 0.0 0.0 0.0 -0.0 47807.065326 31662.323421 31662.323421
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1032 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 272173.219573 93565.49893 178607.720643 272173.219573 93565.49893 93565.49893
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
4 AB 2 0.749226 44839.624931 170073.333123 293320.093832 49104.577597 74142.183112 170073.333123 216485.69877 241523.304285 293320.093832 46412.365646 71449.971161 5384.423901 48532.668516 74714.092193
5 AC 2 0.749226 44839.624931 170073.333123 293320.093832 49104.577597 74142.183112 170073.333123 216485.69877 241523.304285 293320.093832 46412.365646 71449.971161 5384.423901 48532.668516 74714.092193
6 BC 2 0.697364 49258.516562 161570.634319 276370.891790 57370.593657 57429.663814 161570.634319 218830.444431 218889.514587 276370.89179 57259.810111 57318.880268 221.567092 57370.536544 57429.720927
7 ABC 3 0.797024 40340.721518 166505.127733 303870.471190 52128.012189 42686.830311 42550.500957 166505.127733 218204.839889 202112.051547 254454.213752 201975.722193 254317.884398 251528.308985 303870.47119 52342.162205 49552.586792 49416.257438 0.0 0.0 -13303.212929 -642.450048 52098.500425 42692.148956 42574.694076
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1032 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 337899.406371 130309.275832 207590.130539 337899.406371 130309.275832 130309.275832
2 B 1 0.725528 46910.500909 162512.760008 269359.021972 106846.261963 162512.760008 269359.021972 106846.261963 106846.261963
3 C 1 0.725528 46910.500909 162512.760008 269359.021972 106846.261963 162512.760008 269359.021972 106846.261963 106846.261963
4 AB 2 0.809248 39107.119385 182789.632430 293900.114096 34828.136694 76282.344972 182789.632430 205582.297796 247036.506074 293900.114096 22792.665366 64246.873644 24070.942656 29096.018385 82014.463281
5 AC 2 0.809248 39107.119385 182789.632430 293900.114096 34828.136694 76282.344972 182789.632430 205582.297796 247036.506074 293900.114096 22792.665366 64246.873644 24070.942656 29096.018385 82014.463281
6 BC 2 0.725528 46910.500909 162512.760008 269359.021972 106846.261963 0.0 162512.760008 269359.021972 162512.760008 269359.021972 106846.261963 0.0 0.0 106846.261963 0.0
7 ABC 3 0.809248 39107.119385 182789.632430 293900.114096 34828.136694 76282.344972 0.0 182789.632430 205582.297796 247036.506074 293900.114096 182789.63243 205582.297796 247036.506074 293900.114096 46863.608022 88317.8163 0.0 -24070.942656 0.0 0.0 0.0 38518.887384 72591.594282 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1032 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 294881.656250 99795.421875 195086.234375 294881.65625 99795.421875 99795.421875
2 B 1 0.725640 46900.883370 162447.125000 269223.312500 106776.1875 162447.125000 269223.3125 106776.1875 106776.1875
3 C 1 0.725640 46900.883370 162447.125000 269223.312500 106776.1875 162447.125000 269223.3125 106776.1875 106776.1875
4 AB 2 0.790517 40982.155774 172155.718750 321153.718750 67974.367188 81023.632812 172155.718750 240129.8125 253179.078125 321153.71875 67974.09375 81023.359375 0.546875 67974.34324 81023.65676
5 AC 2 0.790517 40982.155774 172155.718750 321153.718750 67974.367188 81023.632812 172155.718750 240129.8125 253179.078125 321153.71875 67974.09375 81023.359375 0.546875 67974.34324 81023.65676
6 BC 2 0.725640 46900.883370 162447.125000 269223.312500 61835.289062 44940.898438 162447.125000 224282.328125 207387.9375 269223.3125 61835.203125 44940.8125 0.171875 61835.30266 44940.88484
7 ABC 3 0.801046 39939.015065 169186.234375 350891.437500 81759.984375 69537.96875 30407.25 169186.234375 216994.671875 221542.546875 303714.3125 182823.609375 264171.8125 235179.90625 350891.4375 115711.53125 86719.625 47177.125 -34363.328125 -33539.765625 0.015625 0.0 72243.424883 71998.704884 37463.073359
──────── Test point index: 2783 | True SalePrice: 174000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2783 1282 6 6
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 2783 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 156077.336371 -22504.871393 178582.207765 156077.336371 -22504.871393 -22504.871393
2 B 1 0.651246 52878.678358 178582.207765 175736.484516 -2845.723249 178582.207765 175736.484516 -2845.723249 -2845.723249
3 C 1 0.651246 52878.678358 178582.207765 175736.484516 -2845.723249 178582.207765 175736.484516 -2845.723249 -2845.723249
4 AB 2 0.743635 45336.691057 178582.207765 164575.102828 -11899.540097 -2107.564839 178582.207765 166682.667667 176474.642926 164575.102828 -11899.540097 -2107.564839 -0.0 -11899.540097 -2107.564839
5 AC 2 0.743635 45336.691057 178582.207765 164575.102828 -11899.540097 -2107.564839 178582.207765 166682.667667 176474.642926 164575.102828 -11899.540097 -2107.564839 -0.0 -11899.540097 -2107.564839
6 BC 2 0.653450 52711.313325 178582.207765 175746.848464 15688680742655232.0 -15688680742658068.0 178582.207765 15688680742833814.0 -15688680742479486.0 175746.848464 15688680742655232.0 -15688680742658068.0 0.0 15688680742655232.0 -15688680742658068.0
7 ABC 3 0.743635 45336.691057 178582.207765 164575.102828 -11899.540097 -1053.782419 -1053.782419 178582.207765 166682.667667 177528.425345 165628.885248 177528.425345 165628.885248 176474.642926 164575.102828 -11899.540097 -1053.782419 -1053.782419 0.0 0.0 0.0 -0.0 -11899.540097 -1053.782419 -1053.782419
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2783 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 155318.560376 -23289.160267 178607.720643 155318.560376 -23289.160267 -23289.160267
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
4 AB 2 0.749226 44839.624931 170073.333123 156142.953218 -11552.3888 -2377.991105 170073.333123 158520.944324 167695.342018 156142.953218 -11552.3888 -2377.991105 -0.0 -11552.3888 -2377.991105
5 AC 2 0.749226 44839.624931 170073.333123 156142.953218 -11552.3888 -2377.991105 170073.333123 158520.944324 167695.342018 156142.953218 -11552.3888 -2377.991105 -0.0 -11552.3888 -2377.991105
6 BC 2 0.697364 49258.516562 161570.634319 157757.239664 -1905.715523 -1907.679133 161570.634319 159664.918797 159662.955187 157757.239664 -1905.715523 -1907.679133 0.0 -1905.715523 -1907.679133
7 ABC 3 0.797024 40340.721518 166505.127733 151298.178472 -12868.449331 -1169.601883 -1168.898047 166505.127733 153636.678401 165335.52585 152467.076519 165336.229686 152467.780354 164166.627803 151298.178472 -12868.449331 -1169.601883 -1168.898047 0.0 -0.0 0.0 0.0 -12868.449331 -1169.601883 -1168.898047
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 2783 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 155854.168999 -51735.961539 207590.130539 155854.168999 -51735.961539 -51735.961539
2 B 1 0.725528 46910.500909 162512.760008 162512.760008 0.0 162512.760008 162512.760008 0.0 0.0
3 C 1 0.725528 46910.500909 162512.760008 162512.760008 0.0 162512.760008 162512.760008 0.0 0.0
4 AB 2 0.809248 39107.119385 182789.632430 154342.145951 -28447.486479 0.0 182789.632430 154342.145951 182789.63243 154342.145951 -28447.486479 0.0 0.0 -28447.486479 0.0
5 AC 2 0.809248 39107.119385 182789.632430 154342.145951 -28447.486479 0.0 182789.632430 154342.145951 182789.63243 154342.145951 -28447.486479 0.0 0.0 -28447.486479 0.0
6 BC 2 0.725528 46910.500909 162512.760008 162512.760008 0.0 0.0 162512.760008 162512.760008 162512.760008 162512.760008 0.0 0.0 0.0 0.0 0.0
7 ABC 3 0.809248 39107.119385 182789.632430 154342.145951 -28447.486479 0.0 0.0 182789.632430 154342.145951 182789.63243 154342.145951 182789.63243 154342.145951 182789.63243 154342.145951 -28447.486479 0.0 0.0 0.0 0.0 0.0 0.0 -28447.486479 0.0 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 2783 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 154598.546875 -40487.6875 195086.234375 154598.546875 -40487.6875 -40487.6875
2 B 1 0.725640 46900.883370 162447.125000 162447.125000 0.0 162447.125000 162447.125 0.0 0.0
3 C 1 0.725640 46900.883370 162447.125000 162447.125000 0.0 162447.125000 162447.125 0.0 0.0
4 AB 2 0.790517 40982.155774 172155.718750 150816.812500 -21338.90625 0.0 172155.718750 150816.8125 172155.71875 150816.8125 -21338.90625 0.0 0.0 -21338.90625 0.0
5 AC 2 0.790517 40982.155774 172155.718750 150816.812500 -21338.90625 0.0 172155.718750 150816.8125 172155.71875 150816.8125 -21338.90625 0.0 0.0 -21338.90625 0.0
6 BC 2 0.725640 46900.883370 162447.125000 162447.125000 0.0 0.0 162447.125000 162447.125 162447.125 162447.125 0.0 0.0 0.0 0.0 0.0
7 ABC 3 0.801046 39939.015065 169186.234375 153720.234375 -15466.0 0.0 0.0 169186.234375 153720.234375 169186.234375 153720.234375 169186.234375 153720.234375 169186.234375 153720.234375 -15466.0 0.0 0.0 0.0 0.0 0.0 0.0 -15466.0 0.0 0.0
──────── Test point index: 411 | True SalePrice: 123000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
411 1055 7 7
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 411 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 131849.818168 -46732.389597 178582.207765 131849.818168 -46732.389597 -46732.389597
2 B 1 0.651246 52878.678358 178582.207765 219911.155344 41328.947579 178582.207765 219911.155344 41328.947579 41328.947579
3 C 1 0.651246 52878.678358 178582.207765 219911.155344 41328.947579 178582.207765 219911.155344 41328.947579 41328.947579
4 AB 2 0.743635 45336.691057 178582.207765 184480.812231 -24709.936535 30608.541001 178582.207765 153872.271229 209190.748766 184480.812231 -24709.936535 30608.541001 -0.0 -24709.936535 30608.541001
5 AC 2 0.743635 45336.691057 178582.207765 184480.812231 -24709.936535 30608.541001 178582.207765 153872.271229 209190.748766 184480.812231 -24709.936535 30608.541001 -0.0 -24709.936535 30608.541001
6 BC 2 0.653450 52711.313325 178582.207765 220898.848464 -227849515686375680.0 227849515686417984.0 178582.207765 -227849515686197088.0 227849515686596576.0 220898.848464 -227849515686375680.0 227849515686417984.0 0.0 -227849515686375680.0 227849515686417984.0
7 ABC 3 0.743635 45336.691057 178582.207765 184480.812231 -24709.936535 15304.270501 15304.270501 178582.207765 153872.271229 193886.478265 169176.54173 193886.478265 169176.54173 209190.748766 184480.812231 -24709.936535 15304.270501 15304.270501 0.0 0.0 -0.0 -0.0 -24709.936535 15304.270501 15304.270501
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 411 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 130246.719509 -48361.001134 178607.720643 130246.719509 -48361.001134 -48361.001134
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 180620.262084 -23989.061067 34535.990028 170073.333123 146084.272056 204609.323151 180620.262084 -23989.061067 34535.990028 0.0 -23989.061067 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 180620.262084 -23989.061067 34535.990028 170073.333123 146084.272056 204609.323151 180620.262084 -23989.061067 34535.990028 0.0 -23989.061067 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 173745.697243 -26578.654253 16914.722847 16904.500917 166505.127733 140213.519233 183491.465006 156769.544354 183481.243075 156759.322423 200468.13085 173745.697243 -26722.433607 16986.374821 16976.15289 0.512954 0.512954 -430.3497 429.799199 -28405.782589 17828.539913 17817.812187
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 411 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 136692.996600 -70897.133939 207590.130539 136692.9966 -70897.133939 -70897.133939
2 B 1 0.725528 46910.500909 162512.760008 204048.490253 41535.730245 162512.760008 204048.490253 41535.730245 41535.730245
3 C 1 0.725528 46910.500909 162512.760008 204048.490253 41535.730245 162512.760008 204048.490253 41535.730245 41535.730245
4 AB 2 0.809248 39107.119385 182789.632430 146990.418258 -61184.814123 25385.599951 182789.632430 142918.51353 229488.927603 146990.418258 -39871.1189 46699.295173 -42627.390445 209038.939205 -244838.153377
5 AC 2 0.809248 39107.119385 182789.632430 146990.418258 -61184.814123 25385.599951 182789.632430 142918.51353 229488.927603 146990.418258 -39871.1189 46699.295173 -42627.390445 209038.939205 -244838.153377
6 BC 2 0.725528 46910.500909 162512.760008 204048.490253 41535.730245 0.0 162512.760008 204048.490253 162512.760008 204048.490253 41535.730245 0.0 0.0 41535.730245 0.0
7 ABC 3 0.809248 39107.119385 182789.632430 146990.418258 -61184.814123 25385.599951 0.0 182789.632430 142918.51353 229488.927603 146990.418258 182789.63243 142918.51353 229488.927603 146990.418258 -82498.509345 4071.904729 0.0 42627.390445 0.0 0.0 0.0 -37657.90728 1858.693108 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 411 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 134324.750000 -60761.484375 195086.234375 134324.75 -60761.484375 -60761.484375
2 B 1 0.725640 46900.883370 162447.125000 204150.109375 41702.984375 162447.125000 204150.109375 41702.984375 41702.984375
3 C 1 0.725640 46900.883370 162447.125000 204150.109375 41702.984375 162447.125000 204150.109375 41702.984375 41702.984375
4 AB 2 0.790517 40982.155774 172155.718750 169153.296875 -31584.53125 28582.109375 172155.718750 140571.1875 200737.828125 169153.296875 -31584.53125 28582.109375 0.0 -31584.53125 28582.109375
5 AC 2 0.790517 40982.155774 172155.718750 169153.296875 -31584.53125 28582.109375 172155.718750 140571.1875 200737.828125 169153.296875 -31584.53125 28582.109375 0.0 -31584.53125 28582.109375
6 BC 2 0.725640 46900.883370 162447.125000 204150.109375 23600.09375 18102.890625 162447.125000 186047.21875 180550.015625 204150.109375 23600.09375 18102.890625 0.0 23600.09375 18102.890625
7 ABC 3 0.801046 39939.015065 169186.234375 148364.906250 -43307.166667 18072.973958 4412.864583 169186.234375 139489.859375 195464.28125 149357.78125 179004.828125 138497.015625 205282.875 148364.90625 -56917.96875 9867.890625 -992.875 16410.15625 10811.46875 0.03125 -0.03125 -26440.015682 6426.205831 -807.518274
──────── Test point index: 1385 | True SalePrice: 108959 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1385 1258 4 4
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1385 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 153515.836649 -25066.371115 178582.207765 153515.836649 -25066.371115 -25066.371115
2 B 1 0.651246 52878.678358 178582.207765 87387.142860 -91195.064905 178582.207765 87387.14286 -91195.064905 -91195.064905
3 C 1 0.651246 52878.678358 178582.207765 87387.142860 -91195.064905 178582.207765 87387.14286 -91195.064905 -91195.064905
4 AB 2 0.743635 45336.691057 178582.207765 97788.488001 -13253.943245 -67539.776519 178582.207765 165328.264519 111042.431246 97788.488001 -13253.943245 -67539.776519 -0.0 -13253.943245 -67539.776519
5 AC 2 0.743635 45336.691057 178582.207765 97788.488001 -13253.943245 -67539.776519 178582.207765 165328.264519 111042.431246 97788.488001 -13253.943245 -67539.776519 -0.0 -13253.943245 -67539.776519
6 BC 2 0.653450 52711.313325 178582.207765 85186.848464 502765073600716800.0 -502765073600810176.0 178582.207765 502765073600895360.0 -502765073600631616.0 85186.848464 502765073600716800.0 -502765073600810176.0 0.0 502765073600716800.0 -502765073600810176.0
7 ABC 3 0.743635 45336.691057 178582.207765 97788.488001 -13253.943245 -33769.888259 -33769.888259 178582.207765 165328.264519 144812.319505 131558.37626 144812.319505 131558.37626 111042.431246 97788.488001 -13253.943245 -33769.888259 -33769.888259 -0.0 0.0 0.0 -0.0 -13253.943245 -33769.888259 -33769.888259
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1385 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 152667.793060 -25939.927583 178607.720643 152667.79306 -25939.927583 -25939.927583
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
4 AB 2 0.749226 44839.624931 170073.333123 102256.393189 -3900.939754 -63916.00018 170073.333123 157206.053864 97190.993438 102256.393189 -12867.27926 -72882.339686 17932.679011 -10176.365976 -57640.573958
5 AC 2 0.749226 44839.624931 170073.333123 102256.393189 -3900.939754 -63916.00018 170073.333123 157206.053864 97190.993438 102256.393189 -12867.27926 -72882.339686 17932.679011 -10176.365976 -57640.573958
6 BC 2 0.697364 49258.516562 161570.634319 123838.112811 -18897.059703 -18835.461806 161570.634319 120942.918792 121004.516689 123838.112811 -40627.715527 -40566.117631 43461.311649 -18880.573688 -18851.94782
7 ABC 3 0.797024 40340.721518 166505.127733 112880.829149 -9482.786534 -21858.224524 -22283.287525 166505.127733 152171.994209 136767.390763 128548.419598 136359.403222 128106.281139 118760.008604 112880.829149 -5879.179455 -15225.45199 -15667.590449 -2373.942628 -2339.791709 -8398.122621 -3740.219732 -7717.186325 -22657.362436 -23249.749823
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1385 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 165029.481902 -42560.648637 207590.130539 165029.481902 -42560.648637 -42560.648637
2 B 1 0.725528 46910.500909 162512.760008 105779.163578 -56733.59643 162512.760008 105779.163578 -56733.59643 -56733.59643
3 C 1 0.725528 46910.500909 162512.760008 105779.163578 -56733.59643 162512.760008 105779.163578 -56733.59643 -56733.59643
4 AB 2 0.809248 39107.119385 182789.632430 120456.636131 -15829.524888 -46503.471411 182789.632430 159205.404641 128531.458118 120456.636131 -23584.227789 -54258.174312 15509.405802 -18885.280307 -43447.715991
5 AC 2 0.809248 39107.119385 182789.632430 120456.636131 -15829.524888 -46503.471411 182789.632430 159205.404641 128531.458118 120456.636131 -23584.227789 -54258.174312 15509.405802 -18885.280307 -43447.715991
6 BC 2 0.725528 46910.500909 162512.760008 105779.163578 -56733.59643 0.0 162512.760008 105779.163578 162512.760008 105779.163578 -56733.59643 0.0 0.0 -56733.59643 0.0
7 ABC 3 0.809248 39107.119385 182789.632430 120456.636131 -15829.524888 -46503.471411 0.0 182789.632430 159205.404641 128531.458118 120456.636131 182789.63243 159205.404641 128531.458118 120456.636131 -8074.821987 -38748.76851 0.0 -15509.405802 0.0 0.0 0.0 -10749.450089 -51583.54621 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1385 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 163633.953125 -31452.28125 195086.234375 163633.953125 -31452.28125 -31452.28125
2 B 1 0.725640 46900.883370 162447.125000 105912.015625 -56535.109375 162447.125000 105912.015625 -56535.109375 -56535.109375
3 C 1 0.725640 46900.883370 162447.125000 105912.015625 -56535.109375 162447.125000 105912.015625 -56535.109375 -56535.109375
4 AB 2 0.790517 40982.155774 172155.718750 117276.554688 -13997.238281 -40881.925781 172155.718750 158158.5625 131273.875 117276.554688 -13997.15625 -40881.84375 -0.164062 -13997.198095 -40881.965968
5 AC 2 0.790517 40982.155774 172155.718750 117276.554688 -13997.238281 -40881.925781 172155.718750 158158.5625 131273.875 117276.554688 -13997.15625 -40881.84375 -0.164062 -13997.198095 -40881.965968
6 BC 2 0.725640 46900.883370 162447.125000 105912.015625 -32301.214844 -24233.894531 162447.125000 130145.945312 138213.265625 105912.015625 -32301.179688 -24233.859375 -0.070312 -32301.21986 -24233.889515
7 ABC 3 0.801046 39939.015065 169186.234375 120505.265625 -4228.384115 -33885.919271 -10566.665365 169186.234375 159775.90625 132885.09375 128305.101562 155852.703125 151976.0 119551.742188 120505.265625 953.523438 -31470.734375 -7799.835938 -4830.226562 -5533.515625 -0.070312 -0.109375 1875.133193 -36452.021699 -14104.080243
──────── Test point index: 1577 | True SalePrice: 167500 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1577 1394 5 5
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1577 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 168031.001741 -10551.206024 178582.207765 168031.001741 -10551.206024 -10551.206024
2 B 1 0.651246 52878.678358 178582.207765 131561.813688 -47020.394077 178582.207765 131561.813688 -47020.394077 -47020.394077
3 C 1 0.651246 52878.678358 178582.207765 131561.813688 -47020.394077 178582.207765 131561.813688 -47020.394077 -47020.394077
4 AB 2 0.743635 45336.691057 178582.207765 138179.545011 -5578.992075 -34823.670679 178582.207765 173003.215689 143758.537086 138179.545011 -5578.992075 -34823.670679 -0.0 -5578.992075 -34823.670679
5 AC 2 0.743635 45336.691057 178582.207765 138179.545011 -5578.992075 -34823.670679 178582.207765 173003.215689 143758.537086 138179.545011 -5578.992075 -34823.670679 -0.0 -5578.992075 -34823.670679
6 BC 2 0.653450 52711.313325 178582.207765 130338.848464 259226877171685888.0 -259226877171734144.0 178582.207765 259226877171864480.0 -259226877171555552.0 130338.848464 259226877171685888.0 -259226877171734144.0 0.0 259226877171685888.0 -259226877171734144.0
7 ABC 3 0.743635 45336.691057 178582.207765 138179.545011 -5578.992075 -17411.835339 -17411.835339 178582.207765 173003.215689 161170.372425 155591.38035 161170.372425 155591.38035 143758.537086 138179.545011 -5578.992075 -17411.835339 -17411.835339 0.0 0.0 0.0 0.0 -5578.992075 -17411.835339 -17411.835339
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1577 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 167688.807853 -10918.91279 178607.720643 167688.807853 -10918.91279 -10918.91279
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 125365.127565 -5416.23332 -39291.972239 170073.333123 164657.099804 130781.360885 125365.127565 -5416.23332 -39291.972239 -0.0 -5416.23332 -39291.972239
5 AC 2 0.749226 44839.624931 170073.333123 125365.127565 -5416.23332 -39291.972239 170073.333123 164657.099804 130781.360885 125365.127565 -5416.23332 -39291.972239 -0.0 -5416.23332 -39291.972239
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 132295.257274 -4840.536576 -14615.114407 -14754.219476 166505.127733 160471.8713 147859.950307 143065.757178 147746.497458 142900.999889 135963.76522 132295.257274 -3668.507946 -10605.742616 -10770.499904 -1176.989622 -1125.685183 -6800.371506 -62.073681 -4264.736729 -14881.224561 -15063.909168
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1577 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 158197.633277 -49392.497261 207590.130539 158197.633277 -49392.497261 -49392.497261
2 B 1 0.725528 46910.500909 162512.760008 135047.645999 -27465.11401 162512.760008 135047.645999 -27465.11401 -27465.11401
3 C 1 0.725528 46910.500909 162512.760008 135047.645999 -27465.11401 162512.760008 135047.645999 -27465.11401 -27465.11401
4 AB 2 0.809248 39107.119385 182789.632430 143978.553894 -20139.459509 -18671.619027 182789.632430 158731.955162 160199.795645 143978.553894 -24057.677268 -22589.836785 7836.435517 -20016.166366 -18794.91217
5 AC 2 0.809248 39107.119385 182789.632430 143978.553894 -20139.459509 -18671.619027 182789.632430 158731.955162 160199.795645 143978.553894 -24057.677268 -22589.836785 7836.435517 -20016.166366 -18794.91217
6 BC 2 0.725528 46910.500909 162512.760008 135047.645999 -27465.11401 0.0 162512.760008 135047.645999 162512.760008 135047.645999 -27465.11401 0.0 0.0 -27465.11401 0.0
7 ABC 3 0.809248 39107.119385 182789.632430 143978.553894 -20139.459509 -18671.619027 0.0 182789.632430 158731.955162 160199.795645 143978.553894 182789.63243 158731.955162 160199.795645 143978.553894 -16221.241751 -14753.401269 0.0 -7836.435517 0.0 0.0 0.0 -20325.137796 -18485.94074 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1577 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 161489.765625 -33596.46875 195086.234375 161489.765625 -33596.46875 -33596.46875
2 B 1 0.725640 46900.883370 162447.125000 135150.843750 -27296.28125 162447.125000 135150.84375 -27296.28125 -27296.28125
3 C 1 0.725640 46900.883370 162447.125000 135150.843750 -27296.28125 162447.125000 135150.84375 -27296.28125 -27296.28125
4 AB 2 0.790517 40982.155774 172155.718750 140205.796875 -14059.90625 -17890.015625 172155.718750 158095.8125 154265.703125 140205.796875 -14059.90625 -17890.015625 0.0 -14059.90625 -17890.015625
5 AC 2 0.790517 40982.155774 172155.718750 140205.796875 -14059.90625 -17890.015625 172155.718750 158095.8125 154265.703125 140205.796875 -14059.90625 -17890.015625 0.0 -14059.90625 -17890.015625
6 BC 2 0.725640 46900.883370 162447.125000 135150.843750 -15966.984375 -11329.296875 162447.125000 146480.140625 151117.828125 135150.84375 -15966.984375 -11329.296875 0.0 -15966.984375 -11329.296875
7 ABC 3 0.801046 39939.015065 169186.234375 143742.968750 -6270.984375 -14660.046875 -4512.234375 169186.234375 161057.984375 153236.15625 147687.96875 164106.765625 157112.984375 148156.6875 143742.96875 -4413.71875 -13370.015625 -3945.0 -2580.0625 -1134.46875 0.0 0.0 -5653.102935 -15309.736226 -4480.426463
──────── Test point index: 1427 | True SalePrice: 228000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1427 2097 7 7
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1427 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 243061.597764 64479.389999 178582.207765 243061.597764 64479.389999 64479.389999
2 B 1 0.651246 52878.678358 178582.207765 219911.155344 41328.947579 178582.207765 219911.155344 41328.947579 41328.947579
3 C 1 0.651246 52878.678358 178582.207765 219911.155344 41328.947579 178582.207765 219911.155344 41328.947579 41328.947579
4 AB 2 0.743635 45336.691057 178582.207765 243284.482223 34093.733457 30608.541001 178582.207765 212675.941221 209190.748766 243284.482223 34093.733457 30608.541001 -0.0 34093.733457 30608.541001
5 AC 2 0.743635 45336.691057 178582.207765 243284.482223 34093.733457 30608.541001 178582.207765 212675.941221 209190.748766 243284.482223 34093.733457 30608.541001 -0.0 34093.733457 30608.541001
6 BC 2 0.653450 52711.313325 178582.207765 220898.848464 -227849515686375680.0 227849515686417984.0 178582.207765 -227849515686197088.0 227849515686596576.0 220898.848464 -227849515686375680.0 227849515686417984.0 0.0 -227849515686375680.0 227849515686417984.0
7 ABC 3 0.743635 45336.691057 178582.207765 243284.482223 34093.733457 15304.270501 15304.270501 178582.207765 212675.941221 193886.478265 227980.211722 193886.478265 227980.211722 209190.748766 243284.482223 34093.733457 15304.270501 15304.270501 0.0 0.0 0.0 -0.0 34093.733457 15304.270501 15304.270501
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1427 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 245334.200495 66726.479852 178607.720643 245334.200495 66726.479852 66726.479852
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 237708.422890 33099.099739 34535.990028 170073.333123 203172.432862 204609.323151 237708.42289 33099.099739 34535.990028 0.0 33099.099739 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 237708.422890 33099.099739 34535.990028 170073.333123 203172.432862 204609.323151 237708.42289 33099.099739 34535.990028 0.0 33099.099739 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 237724.900703 36998.779756 17115.607573 17105.385642 166505.127733 203374.91244 183491.465006 220361.249714 183481.243075 220351.027783 200468.13085 237724.900703 37256.769853 17373.87292 17363.650989 -386.985146 -386.985146 -387.535647 386.985146 36929.554768 17150.16583 17140.052373
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1427 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 252701.449543 45111.319005 207590.130539 252701.449543 45111.319005 45111.319005
2 B 1 0.725528 46910.500909 162512.760008 204048.490253 41535.730245 162512.760008 204048.490253 41535.730245 41535.730245
3 C 1 0.725528 46910.500909 162512.760008 204048.490253 41535.730245 162512.760008 204048.490253 41535.730245 41535.730245
4 AB 2 0.809248 39107.119385 182789.632430 231949.983960 9957.177125 39203.174405 182789.632430 200242.930323 229488.927603 231949.98396 17453.297893 46699.295173 -14992.241537 13374.521882 35785.829647
5 AC 2 0.809248 39107.119385 182789.632430 231949.983960 9957.177125 39203.174405 182789.632430 200242.930323 229488.927603 231949.98396 17453.297893 46699.295173 -14992.241537 13374.521882 35785.829647
6 BC 2 0.725528 46910.500909 162512.760008 204048.490253 41535.730245 0.0 162512.760008 204048.490253 162512.760008 204048.490253 41535.730245 0.0 0.0 41535.730245 0.0
7 ABC 3 0.809248 39107.119385 182789.632430 231949.983960 9957.177125 39203.174405 0.0 182789.632430 200242.930323 229488.927603 231949.98396 182789.63243 200242.930323 229488.927603 231949.98396 2461.056357 31707.053637 0.0 14992.241537 0.0 0.0 0.0 3540.915656 45619.435874 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1427 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 263538.906250 68452.671875 195086.234375 263538.90625 68452.671875 68452.671875
2 B 1 0.725640 46900.883370 162447.125000 204150.109375 41702.984375 162447.125000 204150.109375 41702.984375 41702.984375
3 C 1 0.725640 46900.883370 162447.125000 204150.109375 41702.984375 162447.125000 204150.109375 41702.984375 41702.984375
4 AB 2 0.790517 40982.155774 172155.718750 243396.140625 42658.3125 28582.109375 172155.718750 214814.03125 200737.828125 243396.140625 42658.3125 28582.109375 0.0 42658.3125 28582.109375
5 AC 2 0.790517 40982.155774 172155.718750 243396.140625 42658.3125 28582.109375 172155.718750 214814.03125 200737.828125 243396.140625 42658.3125 28582.109375 0.0 42658.3125 28582.109375
6 BC 2 0.725640 46900.883370 162447.125000 204150.109375 23600.09375 18102.890625 162447.125000 186047.21875 180550.015625 204150.109375 23600.09375 18102.890625 0.0 23600.09375 18102.890625
7 ABC 3 0.801046 39939.015065 169186.234375 233924.484375 28483.625 26414.140625 9840.484375 169186.234375 197511.875 195464.28125 224062.109375 179004.828125 207374.25 205282.875 233924.484375 28641.609375 26550.234375 9862.375 -272.1875 -43.78125 0.0 0.0 28467.791487 26419.297602 9851.160911
──────── Test point index: 518 | True SalePrice: 229456 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
518 1314 8 8
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 518 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 159492.669334 -19089.53843 178582.207765 159492.669334 -19089.53843 -19089.53843
2 B 1 0.651246 52878.678358 178582.207765 264085.826172 85503.618407 178582.207765 264085.826172 85503.618407 85503.618407
3 C 1 0.651246 52878.678358 178582.207765 264085.826172 85503.618407 178582.207765 264085.826172 85503.618407 85503.618407
4 AB 2 0.743635 45336.691057 178582.207765 231813.185372 -10093.669234 63324.646841 178582.207765 168488.538531 241906.854606 231813.185372 -10093.669234 63324.646841 -0.0 -10093.669234 63324.646841
5 AC 2 0.743635 45336.691057 178582.207765 231813.185372 -10093.669234 63324.646841 178582.207765 168488.538531 241906.854606 231813.185372 -10093.669234 63324.646841 -0.0 -10093.669234 63324.646841
6 BC 2 0.653450 52711.313325 178582.207765 266050.848464 -471387712115406592.0 471387712115494080.0 178582.207765 -471387712115228032.0 471387712115672640.0 266050.848464 -471387712115406592.0 471387712115494080.0 0.0 -471387712115406592.0 471387712115494080.0
7 ABC 3 0.743635 45336.691057 178582.207765 231813.185372 -10093.669234 31662.323421 31662.323421 178582.207765 168488.538531 210244.531185 200150.861951 210244.531185 200150.861951 241906.854606 231813.185372 -10093.669234 31662.323421 31662.323421 0.0 0.0 0.0 0.0 -10093.669234 31662.323421 31662.323421
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 518 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 158852.916798 -19754.803845 178607.720643 158852.916798 -19754.803845 -19754.803845
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
4 AB 2 0.749226 44839.624931 170073.333123 231724.102765 -9799.20152 71449.971161 170073.333123 160274.131604 241523.304285 231724.102765 -9799.20152 71449.971161 0.0 -9799.20152 71449.971161
5 AC 2 0.749226 44839.624931 170073.333123 231724.102765 -9799.20152 71449.971161 170073.333123 160274.131604 241523.304285 231724.102765 -9799.20152 71449.971161 0.0 -9799.20152 71449.971161
6 BC 2 0.697364 49258.516562 161570.634319 276370.891790 57370.593657 57429.663814 161570.634319 218830.444431 218889.514587 276370.89179 57259.810111 57318.880268 221.567092 57370.536544 57429.720927
7 ABC 3 0.797024 40340.721518 166505.127733 240477.129234 -10994.046018 42533.309176 42432.738344 166505.127733 155589.590658 202112.051547 191060.871796 201975.722193 190996.059486 251528.308985 240477.129234 -11051.179751 49481.069747 49416.257438 71.517045 0.0 -14009.788609 64.125632 -11081.352034 42600.597855 42452.755681
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 518 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 171220.025307 -36370.105231 207590.130539 171220.025307 -36370.105231 -36370.105231
2 B 1 0.725528 46910.500909 162512.760008 269359.021972 106846.261963 162512.760008 269359.021972 106846.261963 106846.261963
3 C 1 0.725528 46910.500909 162512.760008 269359.021972 106846.261963 162512.760008 269359.021972 106846.261963 106846.261963
4 AB 2 0.809248 39107.119385 182789.632430 206902.567063 -32568.12444 56681.059072 182789.632430 157787.322563 247036.506074 206902.567063 -25002.309868 64246.873644 -15131.629144 -15362.10383 39475.038463
5 AC 2 0.809248 39107.119385 182789.632430 206902.567063 -32568.12444 56681.059072 182789.632430 157787.322563 247036.506074 206902.567063 -25002.309868 64246.873644 -15131.629144 -15362.10383 39475.038463
6 BC 2 0.725528 46910.500909 162512.760008 269359.021972 106846.261963 0.0 162512.760008 269359.021972 162512.760008 269359.021972 106846.261963 0.0 0.0 106846.261963 0.0
7 ABC 3 0.809248 39107.119385 182789.632430 206902.567063 -32568.12444 56681.059072 0.0 182789.632430 157787.322563 247036.506074 206902.567063 182789.63243 157787.322563 247036.506074 206902.567063 -40133.939012 49115.2445 0.0 15131.629144 0.0 0.0 0.0 -107751.267247 131864.20188 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 518 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 168911.546875 -26174.6875 195086.234375 168911.546875 -26174.6875 -26174.6875
2 B 1 0.725640 46900.883370 162447.125000 269223.312500 106776.1875 162447.125000 269223.3125 106776.1875 106776.1875
3 C 1 0.725640 46900.883370 162447.125000 269223.312500 106776.1875 162447.125000 269223.3125 106776.1875 106776.1875
4 AB 2 0.790517 40982.155774 172155.718750 237431.468750 -15747.609375 81023.359375 172155.718750 156408.109375 253179.078125 237431.46875 -15747.609375 81023.359375 0.0 -15747.609375 81023.359375
5 AC 2 0.790517 40982.155774 172155.718750 237431.468750 -15747.609375 81023.359375 172155.718750 156408.109375 253179.078125 237431.46875 -15747.609375 81023.359375 0.0 -15747.609375 81023.359375
6 BC 2 0.725640 46900.883370 162447.125000 269223.312500 61835.289062 44940.898438 162447.125000 224282.328125 207387.9375 269223.3125 61835.203125 44940.8125 0.171875 61835.30266 44940.88484
7 ABC 3 0.801046 39939.015065 169186.234375 215800.250000 -14412.565104 50329.223958 10697.356771 169186.234375 159740.765625 221542.546875 208042.90625 182823.609375 167498.109375 235179.90625 215800.25 -19379.65625 48302.140625 7757.34375 4054.15625 5880.015625 0.0 0.015625 -12291.520299 55072.823139 3832.712785
──────── Test point index: 2181 | True SalePrice: 184750 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2181 4676 10 10
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 2181 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 518316.088722 339733.880958 178582.207765 518316.088722 339733.880958 339733.880958
2 B 1 0.651246 52878.678358 178582.207765 352435.167828 173852.960063 178582.207765 352435.167828 173852.960063 173852.960063
3 C 1 0.651246 52878.678358 178582.207765 352435.167828 173852.960063 178582.207765 352435.167828 173852.960063 173852.960063
4 AB 2 0.743635 45336.691057 178582.207765 486974.704646 179635.63836 128756.858521 178582.207765 358217.846125 307339.066286 486974.704646 179635.63836 128756.858521 -0.0 179635.63836 128756.858521
5 AC 2 0.743635 45336.691057 178582.207765 486974.704646 179635.63836 128756.858521 178582.207765 358217.846125 307339.066286 486974.704646 179635.63836 128756.858521 -0.0 179635.63836 128756.858521
6 BC 2 0.653450 52711.313325 178582.207765 356354.848464 -958464104973468416.0 958464104973646208.0 178582.207765 -958464104973289728.0 958464104973824896.0 356354.848464 -958464104973468288.0 958464104973646336.0 -256.0 -957086011364592000.0 957086011364769792.0
7 ABC 3 0.743635 45336.691057 178582.207765 486974.704646 179635.63836 64378.429261 64378.429261 178582.207765 358217.846125 242960.637025 422596.275385 242960.637025 422596.275385 307339.066286 486974.704646 179635.63836 64378.429261 64378.429261 0.0 0.0 0.0 0.0 179635.63836 64378.429261 64378.429261
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2181 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 539859.565336 361251.844693 178607.720643 539859.565336 361251.844693 361251.844693
2 B 1 0.653707 52691.751258 167830.652805 357997.585228 190166.932423 167830.652805 357997.585228 190166.932423 190166.932423
3 C 1 0.653707 52691.751258 167830.652805 357997.585228 190166.932423 167830.652805 357997.585228 190166.932423 190166.932423
4 AB 2 0.749226 44839.624931 170073.333123 627703.358242 283815.021212 173815.003907 170073.333123 427650.70996 317650.692654 627703.358242 257577.376836 147577.359531 52475.288752 290938.574453 166691.450666
5 AC 2 0.749226 44839.624931 170073.333123 627703.358242 283815.021212 173815.003907 170073.333123 427650.70996 317650.692654 627703.358242 257577.376836 147577.359531 52475.288752 290938.574453 166691.450666
6 BC 2 0.697364 49258.516562 161570.634319 408111.076369 122989.130608 123551.311442 161570.634319 278211.79659 278773.977424 408111.076369 116641.16227 117203.343104 12695.936675 122973.869586 123566.572464
7 ABC 3 0.797024 40340.721518 166505.127733 550268.367511 195700.818212 94169.666582 93892.754985 166505.127733 360595.490204 253313.842793 449848.307854 253036.646783 449571.68067 353791.024821 550268.367511 196477.34269 100696.686841 100420.059658 57.691197 57.122371 -11443.869191 -2501.793787 195280.234326 94371.214413 94111.79104
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 2181 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 386181.566891 178591.436353 207590.130539 386181.566891 178591.436353 178591.436353
2 B 1 0.725528 46910.500909 162512.760008 449028.228831 286515.468823 162512.760008 449028.228831 286515.468823 286515.468823
3 C 1 0.725528 46910.500909 162512.760008 449028.228831 286515.468823 162512.760008 449028.228831 286515.468823 286515.468823
4 AB 2 0.809248 39107.119385 182789.632430 484943.892199 153184.769918 148969.489851 182789.632430 309526.773946 305311.493879 484943.892199 126737.141516 122521.861449 52895.256804 153632.032241 148522.227527
5 AC 2 0.809248 39107.119385 182789.632430 484943.892199 153184.769918 148969.489851 182789.632430 309526.773946 305311.493879 484943.892199 126737.141516 122521.861449 52895.256804 153632.032241 148522.227527
6 BC 2 0.725528 46910.500909 162512.760008 449028.228831 286515.468823 0.0 162512.760008 449028.228831 162512.760008 449028.228831 286515.468823 0.0 0.0 286515.468823 0.0
7 ABC 3 0.809248 39107.119385 182789.632430 484943.892199 153184.769918 148969.489851 0.0 182789.632430 309526.773946 305311.493879 484943.892199 182789.63243 309526.773946 305311.493879 484943.892199 179632.39832 175417.118253 0.0 -52895.256804 0.0 0.0 0.0 152870.773825 149283.485943 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 2181 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 459439.187500 264352.9375 195086.234375 459439.1875 264352.953125 264352.953125
2 B 1 0.725640 46900.883370 162447.125000 450958.187500 288511.0625 162447.125000 450958.1875 288511.0625 288511.0625
3 C 1 0.725640 46900.883370 162447.125000 450958.187500 288511.0625 162447.125000 450958.1875 288511.0625 288511.0625
4 AB 2 0.790517 40982.155774 172155.718750 542171.750000 151095.578125 218920.453125 172155.718750 323251.0 391075.875 542171.75 151095.28125 218920.15625 0.59375 151095.523707 218920.507543
5 AC 2 0.790517 40982.155774 172155.718750 542171.750000 151095.578125 218920.453125 172155.718750 323251.0 391075.875 542171.75 151095.28125 218920.15625 0.59375 151095.523707 218920.507543
6 BC 2 0.725640 46900.883370 162447.125000 450958.187500 155306.234375 133204.828125 162447.125000 317753.46875 295652.0625 450958.1875 155306.34375 133204.9375 -0.21875 155306.225996 133204.836504
7 ABC 3 0.801046 39939.015065 169186.234375 549030.937500 212473.679688 147899.520833 19471.497396 169186.234375 301055.1875 261168.65625 504871.59375 163970.046875 345214.15625 255952.359375 549030.9375 293078.578125 203816.78125 44159.34375 -111834.46875 -49375.640625 -0.375 0.484375 184206.473536 157944.357414 37693.872175
──────── Test point index: 1880 | True SalePrice: 139900 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1880 1422 5 5
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1880 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 171019.418083 -7562.789682 178582.207765 171019.418083 -7562.789682 -7562.789682
2 B 1 0.651246 52878.678358 178582.207765 131561.813688 -47020.394077 178582.207765 131561.813688 -47020.394077 -47020.394077
3 C 1 0.651246 52878.678358 178582.207765 131561.813688 -47020.394077 178582.207765 131561.813688 -47020.394077 -47020.394077
4 AB 2 0.743635 45336.691057 178582.207765 139759.682016 -3998.85507 -34823.670679 178582.207765 174583.352695 143758.537086 139759.682016 -3998.85507 -34823.670679 -0.0 -3998.85507 -34823.670679
5 AC 2 0.743635 45336.691057 178582.207765 139759.682016 -3998.85507 -34823.670679 178582.207765 174583.352695 143758.537086 139759.682016 -3998.85507 -34823.670679 -0.0 -3998.85507 -34823.670679
6 BC 2 0.653450 52711.313325 178582.207765 130338.848464 259226877171685888.0 -259226877171734144.0 178582.207765 259226877171864480.0 -259226877171555552.0 130338.848464 259226877171685888.0 -259226877171734144.0 0.0 259226877171685888.0 -259226877171734144.0
7 ABC 3 0.743635 45336.691057 178582.207765 139759.682016 -3998.85507 -17411.835339 -17411.835339 178582.207765 174583.352695 161170.372425 157171.517355 161170.372425 157171.517355 143758.537086 139759.682016 -3998.85507 -17411.835339 -17411.835339 0.0 0.0 0.0 -0.0 -3998.85507 -17411.835339 -17411.835339
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1880 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 170781.369722 -7826.350921 178607.720643 170781.369722 -7826.350921 -7826.350921
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 126899.166435 -3882.19445 -39291.972239 170073.333123 166191.138674 130781.360885 126899.166435 -3882.19445 -39291.972239 -0.0 -3882.19445 -39291.972239
5 AC 2 0.749226 44839.624931 170073.333123 126899.166435 -3882.19445 -39291.972239 170073.333123 166191.138674 130781.360885 126899.166435 -3882.19445 -39291.972239 -0.0 -3882.19445 -39291.972239
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 133334.956280 -3493.070816 -14768.033664 -14909.066973 166505.127733 162180.669524 147859.950307 144361.585467 147746.497458 144192.971698 135963.76522 133334.95628 -2628.80894 -10858.015418 -11026.629187 -924.716819 -869.5559 -6961.068639 98.623451 -2967.626528 -15011.574086 -15190.970839
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1880 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 166761.015241 -40829.115298 207590.130539 166761.015241 -40829.115298 -40829.115298
2 B 1 0.725528 46910.500909 162512.760008 135047.645999 -27465.11401 162512.760008 135047.645999 -27465.11401 -27465.11401
3 C 1 0.725528 46910.500909 162512.760008 135047.645999 -27465.11401 162512.760008 135047.645999 -27465.11401 -27465.11401
4 AB 2 0.809248 39107.119385 182789.632430 144744.578437 -16805.361797 -21239.692196 182789.632430 164634.126044 160199.795645 144744.578437 -18155.506386 -22589.836785 2700.289179 -16952.29852 -21092.755473
5 AC 2 0.809248 39107.119385 182789.632430 144744.578437 -16805.361797 -21239.692196 182789.632430 164634.126044 160199.795645 144744.578437 -18155.506386 -22589.836785 2700.289179 -16952.29852 -21092.755473
6 BC 2 0.725528 46910.500909 162512.760008 135047.645999 -27465.11401 0.0 162512.760008 135047.645999 162512.760008 135047.645999 -27465.11401 0.0 0.0 -27465.11401 0.0
7 ABC 3 0.809248 39107.119385 182789.632430 144744.578437 -16805.361797 -21239.692196 0.0 182789.632430 164634.126044 160199.795645 144744.578437 182789.63243 164634.126044 160199.795645 144744.578437 -15455.217208 -19889.547607 0.0 -2700.289179 0.0 0.0 0.0 -16635.973566 -21409.080427 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1880 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 165546.140625 -29540.09375 195086.234375 165546.140625 -29540.09375 -29540.09375
2 B 1 0.725640 46900.883370 162447.125000 135150.843750 -27296.28125 162447.125000 135150.84375 -27296.28125 -27296.28125
3 C 1 0.725640 46900.883370 162447.125000 135150.843750 -27296.28125 162447.125000 135150.84375 -27296.28125 -27296.28125
4 AB 2 0.790517 40982.155774 172155.718750 141969.359375 -12296.34375 -17890.015625 172155.718750 159859.375 154265.703125 141969.359375 -12296.34375 -17890.015625 0.0 -12296.34375 -17890.015625
5 AC 2 0.790517 40982.155774 172155.718750 141969.359375 -12296.34375 -17890.015625 172155.718750 159859.375 154265.703125 141969.359375 -12296.34375 -17890.015625 0.0 -12296.34375 -17890.015625
6 BC 2 0.725640 46900.883370 162447.125000 135150.843750 -15966.984375 -11329.296875 162447.125000 146480.140625 151117.828125 135150.84375 -15966.984375 -11329.296875 0.0 -15966.984375 -11329.296875
7 ABC 3 0.801046 39939.015065 169186.234375 146273.656250 -3770.567708 -14776.153646 -4365.856771 169186.234375 163528.140625 153236.15625 149925.890625 164106.765625 159875.875 148156.6875 146273.65625 -1883.03125 -13602.21875 -3652.234375 -2347.859375 -1427.234375 -0.03125 0.03125 -2654.059185 -15664.577928 -4593.941012

Duplicate "Gr Liv Area (A == C)"¶

In [24]:
# ============================================================
# 6) Settings & Run (edit here, then run this cell)
# ============================================================

SETTINGS = dict(
    # Data slice / feature modes:
    MODE="dup_grliv",           # 'two' | 'dup_grliv' | 'dup_qual' | 'totrms'

    # Baseline used inside each subset for f(∅):
    BASELINE="mean",           # 'mean' | 'median' | 'zero'

    # Randomness & test row:
    RANDOM_STATE=42,
    PICK_RANDOM_POINT=True,    # True → draw a random test row (with the seed)
    N_RANDOM_POINTS=10,         # ← NEW: number of distinct random test points to assess
    CUSTOM_TEST_INDEX=None,    # if PICK_RANDOM_POINT=False, use this test index (must be in X_te)

    # Which models to run:
    USE_LINEAR=True,
    USE_MLP=True,
    USE_LGB=True,              # will only run if LightGBM is installed
    USE_XGB=True,              # will only run if XGBoost is installed

    # MLP hyperparameters (used if USE_MLP=True):
    MLP_hidden=(64, 32),
    MLP_max_iter=5000,
    MLP_early_stopping=True,
    MLP_n_iter_no_change=20,
    MLP_tol=1e-4,
    MLP_lr=1e-3,
    MLP_alpha=1e-4,

    # Warnings:
    SILENCE_MLP_WARNINGS=True
)

# ---- Go! (expects df_ames to already be in memory) ----
run_pipeline(SETTINGS)
Mode: dup_grliv | Features (A,B,C): [('A', 'Gr Liv Area'), ('B', 'Overall Qual'), ('C', 'Gr Liv Area (DUP)')]
Evaluating 10 test point(s): [1093, 1032, 2783, 411, 1385, 1577, 1427, 518, 2181, 1880]

──────── Test point index: 1093 | True SalePrice: 226750 ────────
Gr Liv Area Overall Qual Gr Liv Area (DUP)
1093 1892 7 1892
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1093 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 221182.120972 42599.913207 178582.207765 221182.120972 42599.913207 42599.913207
2 B 1 0.651246 52878.678358 178582.207765 219911.155344 41328.947579 178582.207765 219911.155344 41328.947579 41328.947579
3 C 1 0.523397 61815.726690 178582.207765 221182.120972 42599.913207 178582.207765 221182.120972 42599.913207 42599.913207
4 AB 2 0.743635 45336.691057 178582.207765 231715.622003 22524.873238 30608.541001 178582.207765 201107.081002 209190.748766 231715.622003 22524.873238 30608.541001 -0.0 22524.873238 30608.541001
5 AC 2 0.522521 61872.550411 178582.207765 220949.980443 -127274544189851776.0 127274544189894144.0 178582.207765 -127274544189673200.0 127274544190072720.0 220949.980443 -127274544189851776.0 127274544189894144.0 0.0 -127274544189851776.0 127274544189894144.0
6 BC 2 0.743635 45336.691057 178582.207765 231715.622003 30608.541001 22524.873238 178582.207765 209190.748766 201107.081002 231715.622003 30608.541001 22524.873238 0.0 30608.541001 22524.873238
7 ABC 3 0.741124 45558.189247 178582.207765 231205.183161 -520172682010594752.0 30720.0 520172682010616768.0 178582.207765 -520172682010416192.0 209302.207765 -520172682010385472.0 520172682010795328.0 200485.183161 520172682010826048.0 231205.183161 -520172682010594880.0 30720.0 520172682010616640.0 0.0 113.024603 0.0 14.975397 -523023418420314752.0 30728.747247 523023418420336640.0
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1093 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 222692.229667 44084.509025 178607.720643 222692.229667 44084.509025 44084.509025
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.526650 61604.437299 178607.720643 222692.229667 44084.509025 178607.720643 222692.229667 44084.509025 44084.509025
4 AB 2 0.749226 44839.624931 170073.333123 226477.066877 21867.743726 34535.990028 170073.333123 191941.076849 204609.323151 226477.066877 21867.743726 34535.990028 0.0 21867.743726 34535.990028
5 AC 2 0.522846 61851.496117 176231.472744 222390.517319 23107.922124 23051.12245 176231.472744 199339.412023 199282.612349 222390.517319 23107.939279 23051.139604 -0.034309 23107.922103 23051.122471
6 BC 2 0.744977 45217.933898 166739.753397 225798.134939 36073.387633 22984.993909 166739.753397 202813.14103 189724.747306 225798.134939 36073.387633 22984.993909 0.0 36073.387633 22984.993909
7 ABC 3 0.766638 43254.975157 168240.807805 233456.332720 13826.241949 37793.057986 13596.22498 168240.807805 182066.895552 206033.869758 219859.957506 181836.890486 195663.270766 219629.928636 233456.33272 13826.404084 37793.061954 13596.375214 -0.023803 -0.316336 -0.0 0.023803 13826.24326 37793.05832 13596.223335
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1093 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 198246.330716 -9343.799823 207590.130539 198246.330716 -9343.799823 -9343.799823
2 B 1 0.725528 46910.500909 162512.760008 204048.490253 41535.730245 162512.760008 204048.490253 41535.730245 41535.730245
3 C 1 0.494836 63641.022042 207590.130539 198246.330716 -9343.799823 207590.130539 198246.330716 -9343.799823 -9343.799823
4 AB 2 0.809248 39107.119385 182789.632430 200534.205454 -13995.525233 31740.098257 182789.632430 183753.304113 229488.927603 200534.205454 963.671683 46699.295173 -29918.393833 358.767901 17385.805123
5 AC 2 0.494836 63641.022042 207590.130539 198246.330716 -9343.799823 0.0 207590.130539 198246.330716 207590.130539 198246.330716 -9343.799823 0.0 0.0 -9343.799823 0.0
6 BC 2 0.809248 39107.119385 182789.632430 200534.205454 31740.098257 -13995.525233 182789.632430 229488.927603 183753.304113 200534.205454 46699.295173 963.671683 -29918.393833 17385.805123 358.767901
7 ABC 3 0.809248 39107.119385 182789.632430 200534.205454 -13995.525233 31740.098257 0.0 182789.632430 183753.304113 229488.927603 200534.205454 182789.63243 183753.304113 229488.927603 200534.205454 -28954.72215 16780.90134 0.0 29918.393833 0.0 0.0 0.0 42204.431099 -24459.858076 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1093 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 187762.781250 -7323.453125 195086.234375 187762.78125 -7323.453125 -7323.453125
2 B 1 0.725640 46900.883370 162447.125000 204150.109375 41702.984375 162447.125000 204150.109375 41702.984375 41702.984375
3 C 1 0.462856 65624.546618 195086.234375 187762.781250 -7323.453125 195086.234375 187762.78125 -7323.453125 -7323.453125
4 AB 2 0.790517 40982.155774 172155.718750 201395.015625 657.195312 28582.101562 172155.718750 172812.921875 200737.828125 201395.015625 657.203125 28582.109375 -0.015625 657.202774 28582.094101
5 AC 2 0.462856 65624.546618 195086.234375 187762.781250 -2695.28125 -4628.171875 195086.234375 192390.953125 190458.0625 187762.78125 -2695.28125 -4628.171875 0.0 -2695.28125 -4628.171875
6 BC 2 0.791406 40895.160920 170816.093750 201466.625000 28739.71875 1910.8125 170816.093750 199555.8125 172726.90625 201466.625 28739.71875 1910.8125 0.0 28739.71875 1910.8125
7 ABC 3 0.793780 40661.764880 171115.687500 194230.406250 2595.53125 23867.132812 -3347.945312 171115.687500 177160.484375 203239.34375 202385.609375 172575.0 178619.796875 195084.140625 194230.40625 -853.734375 15610.609375 -8155.203125 6898.53125 0.0 9614.515625 0.0 -1252.837387 43039.735164 -18672.179027
──────── Test point index: 1032 | True SalePrice: 306000 ────────
Gr Liv Area Overall Qual Gr Liv Area (DUP)
1032 2340 8 2340
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1032 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 268996.782449 90414.574684 178582.207765 268996.782449 90414.574684 90414.574684
2 B 1 0.651246 52878.678358 178582.207765 264085.826172 85503.618407 178582.207765 264085.826172 85503.618407 85503.618407
3 C 1 0.523397 61815.726690 178582.207765 268996.782449 90414.574684 178582.207765 268996.782449 90414.574684 90414.574684
4 AB 2 0.743635 45336.691057 178582.207765 289713.919932 47807.065326 63324.646841 178582.207765 226389.273091 241906.854606 289713.919932 47807.065326 63324.646841 -0.0 47807.065326 63324.646841
5 AC 2 0.522521 61872.550411 178582.207765 268601.980443 -270129042870670208.0 270129042870760224.0 178582.207765 -270129042870491616.0 270129042870938816.0 268601.980443 -270129042870670208.0 270129042870760224.0 0.0 -270129042870670208.0 270129042870760224.0
6 BC 2 0.743635 45336.691057 178582.207765 289713.919932 63324.646841 47807.065326 178582.207765 241906.854606 226389.273091 289713.919932 63324.646841 47807.065326 0.0 63324.646841 47807.065326
7 ABC 3 0.741124 45558.189247 178582.207765 288021.183161 -1104020836322079104.0 63232.0 1104020836322125184.0 178582.207765 -1104020836321900160.0 242070.207765 -1104020836321837184.0 1104020836322303744.0 225045.183161 1104020836322367232.0 288021.183161 -1104020836322079232.0 62976.0 1104020836322125184.0 512.0 1.024603 0.0 -1.024603 -1104035116713449600.0 62975.410409 1104035116713496064.0
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1032 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 272173.219573 93565.49893 178607.720643 272173.219573 93565.49893 93565.49893
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.526650 61604.437299 178607.720643 272173.219573 93565.49893 178607.720643 272173.219573 93565.49893 93565.49893
4 AB 2 0.749226 44839.624931 170073.333123 293320.093832 49104.577597 74142.183112 170073.333123 216485.69877 241523.304285 293320.093832 46412.365646 71449.971161 5384.423901 48532.668516 74714.092193
5 AC 2 0.522846 61851.496117 176231.472744 274199.991194 49044.644115 48923.874335 176231.472744 225276.330489 225155.560708 274199.991194 49044.857745 48924.087964 -0.427259 49044.643852 48923.874598
6 BC 2 0.744977 45217.933898 166739.753397 296194.036282 77650.638208 51803.644677 166739.753397 241370.377971 215523.38444 296194.036282 74630.624575 48783.631044 6040.027267 78283.128129 51171.154756
7 ABC 3 0.766638 43254.975157 168240.807805 304630.959380 29345.169704 78188.373956 28856.607916 168240.807805 197585.464851 246429.189968 275773.847014 197096.927686 226442.577216 275285.260603 304630.95938 29345.698776 78188.382163 28857.112366 -0.049246 -1.04173 -0.0 0.049246 29345.170695 78188.374588 28856.606292
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1032 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 337899.406371 130309.275832 207590.130539 337899.406371 130309.275832 130309.275832
2 B 1 0.725528 46910.500909 162512.760008 269359.021972 106846.261963 162512.760008 269359.021972 106846.261963 106846.261963
3 C 1 0.494836 63641.022042 207590.130539 337899.406371 130309.275832 207590.130539 337899.406371 130309.275832 130309.275832
4 AB 2 0.809248 39107.119385 182789.632430 293900.114096 34828.136694 76282.344972 182789.632430 205582.297796 247036.506074 293900.114096 22792.665366 64246.873644 24070.942656 29096.018385 82014.463281
5 AC 2 0.494836 63641.022042 207590.130539 337899.406371 130309.275832 0.0 207590.130539 337899.406371 207590.130539 337899.406371 130309.275832 0.0 0.0 130309.275832 0.0
6 BC 2 0.809248 39107.119385 182789.632430 293900.114096 76282.344972 34828.136694 182789.632430 247036.506074 205582.297796 293900.114096 64246.873644 22792.665366 24070.942656 82014.463281 29096.018385
7 ABC 3 0.809248 39107.119385 182789.632430 293900.114096 34828.136694 76282.344972 0.0 182789.632430 205582.297796 247036.506074 293900.114096 182789.63243 205582.297796 247036.506074 293900.114096 46863.608022 88317.8163 0.0 -24070.942656 0.0 0.0 0.0 38518.887384 72591.594282 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1032 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 294881.656250 99795.421875 195086.234375 294881.65625 99795.421875 99795.421875
2 B 1 0.725640 46900.883370 162447.125000 269223.312500 106776.1875 162447.125000 269223.3125 106776.1875 106776.1875
3 C 1 0.462856 65624.546618 195086.234375 294881.656250 99795.421875 195086.234375 294881.65625 99795.421875 99795.421875
4 AB 2 0.790517 40982.155774 172155.718750 321153.718750 67974.367188 81023.632812 172155.718750 240129.8125 253179.078125 321153.71875 67974.09375 81023.359375 0.546875 67974.34324 81023.65676
5 AC 2 0.462856 65624.546618 195086.234375 294881.656250 59756.21875 40039.203125 195086.234375 254842.359375 235125.34375 294881.65625 59756.125 40039.109375 0.1875 59756.237273 40039.184602
6 BC 2 0.791406 40895.160920 170816.093750 317310.062500 81836.898438 64657.070312 170816.093750 252652.96875 235473.140625 317310.0625 81836.875 64657.046875 0.046875 81836.901186 64657.067564
7 ABC 3 0.793780 40661.764880 171115.687500 348892.375000 54579.140625 101389.085938 21808.460938 171115.687500 202994.46875 238082.90625 315362.40625 181202.640625 213081.421875 271612.875 348892.375 77279.5 135810.953125 33529.96875 -45400.71875 0.0 -23443.015625 0.0 60814.450566 88074.04982 28888.187114
──────── Test point index: 2783 | True SalePrice: 174000 ────────
Gr Liv Area Overall Qual Gr Liv Area (DUP)
2783 1282 6 1282
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 2783 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 156077.336371 -22504.871393 178582.207765 156077.336371 -22504.871393 -22504.871393
2 B 1 0.651246 52878.678358 178582.207765 175736.484516 -2845.723249 178582.207765 175736.484516 -2845.723249 -2845.723249
3 C 1 0.523397 61815.726690 178582.207765 156077.336371 -22504.871393 178582.207765 156077.336371 -22504.871393 -22504.871393
4 AB 2 0.743635 45336.691057 178582.207765 164575.102828 -11899.540097 -2107.564839 178582.207765 166682.667667 176474.642926 164575.102828 -11899.540097 -2107.564839 -0.0 -11899.540097 -2107.564839
5 AC 2 0.522521 61872.550411 178582.207765 156182.105443 67237161603227008.0 -67237161603249408.0 178582.207765 67237161603405592.0 -67237161603070824.0 156182.105443 67237161603227008.0 -67237161603249408.0 0.0 67237161603227008.0 -67237161603249408.0
6 BC 2 0.743635 45336.691057 178582.207765 164575.102828 -2107.564839 -11899.540097 178582.207765 176474.642926 166682.667667 164575.102828 -2107.564839 -11899.540097 0.0 -2107.564839 -11899.540097
7 ABC 3 0.741124 45558.189247 178582.207765 165176.683161 274799135243881216.0 -2048.0 -274799135243892576.0 178582.207765 274799135244059808.0 176534.207765 274799135244057760.0 -274799135243713984.0 167224.683161 -274799135243716032.0 165176.683161 274799135243881216.0 -2048.0 -274799135243892576.0 0.0 -2.475397 0.0 2.475397 274808279904904544.0 -2047.621966 -274808279904915904.0
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2783 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 155318.560376 -23289.160267 178607.720643 155318.560376 -23289.160267 -23289.160267
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.526650 61604.437299 178607.720643 155318.560376 -23289.160267 178607.720643 155318.560376 -23289.160267 -23289.160267
4 AB 2 0.749226 44839.624931 170073.333123 156142.953218 -11552.3888 -2377.991105 170073.333123 158520.944324 167695.342018 156142.953218 -11552.3888 -2377.991105 -0.0 -11552.3888 -2377.991105
5 AC 2 0.522846 61851.496117 176231.472744 151846.367622 -12207.564816 -12177.540307 176231.472744 164023.907929 164053.932438 151846.367622 -12207.564816 -12177.540307 -0.0 -12207.564816 -12177.540307
6 BC 2 0.744977 45217.933898 166739.753397 152113.289399 -2483.849308 -12142.61469 166739.753397 164255.904088 154597.138707 152113.289399 -2483.849308 -12142.61469 0.0 -2483.849308 -12142.61469
7 ABC 3 0.766638 43254.975157 168240.807805 151151.688995 -7304.155474 -2602.258225 -7182.70511 168240.807805 160936.60385 165638.549549 158334.34578 161058.054307 153753.947251 158455.796051 151151.688995 -7304.107056 -2602.258256 -7182.656785 0.0 -0.096713 0.000186 -0.000186 -7304.155897 -2602.258235 -7182.704678
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 2783 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 155854.168999 -51735.961539 207590.130539 155854.168999 -51735.961539 -51735.961539
2 B 1 0.725528 46910.500909 162512.760008 162512.760008 0.0 162512.760008 162512.760008 0.0 0.0
3 C 1 0.494836 63641.022042 207590.130539 155854.168999 -51735.961539 207590.130539 155854.168999 -51735.961539 -51735.961539
4 AB 2 0.809248 39107.119385 182789.632430 154342.145951 -28447.486479 0.0 182789.632430 154342.145951 182789.63243 154342.145951 -28447.486479 0.0 0.0 -28447.486479 0.0
5 AC 2 0.494836 63641.022042 207590.130539 155854.168999 -51735.961539 0.0 207590.130539 155854.168999 207590.130539 155854.168999 -51735.961539 0.0 0.0 -51735.961539 0.0
6 BC 2 0.809248 39107.119385 182789.632430 154342.145951 0.0 -28447.486479 182789.632430 182789.63243 154342.145951 154342.145951 0.0 -28447.486479 0.0 0.0 -28447.486479
7 ABC 3 0.809248 39107.119385 182789.632430 154342.145951 -28447.486479 0.0 0.0 182789.632430 154342.145951 182789.63243 154342.145951 182789.63243 154342.145951 182789.63243 154342.145951 -28447.486479 0.0 0.0 0.0 0.0 0.0 0.0 -28447.486479 0.0 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 2783 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 154598.546875 -40487.6875 195086.234375 154598.546875 -40487.6875 -40487.6875
2 B 1 0.725640 46900.883370 162447.125000 162447.125000 0.0 162447.125000 162447.125 0.0 0.0
3 C 1 0.462856 65624.546618 195086.234375 154598.546875 -40487.6875 195086.234375 154598.546875 -40487.6875 -40487.6875
4 AB 2 0.790517 40982.155774 172155.718750 150816.812500 -21338.90625 0.0 172155.718750 150816.8125 172155.71875 150816.8125 -21338.90625 0.0 0.0 -21338.90625 0.0
5 AC 2 0.462856 65624.546618 195086.234375 154598.546875 -23842.09375 -16645.59375 195086.234375 171244.140625 178440.640625 154598.546875 -23842.09375 -16645.59375 0.0 -23842.09375 -16645.59375
6 BC 2 0.791406 40895.160920 170816.093750 149501.250000 0.0 -21314.84375 170816.093750 170816.09375 149501.25 149501.25 0.0 -21314.84375 0.0 0.0 -21314.84375
7 ABC 3 0.793780 40661.764880 171115.687500 154711.796875 -14439.953125 0.0 -1963.9375 171115.687500 156675.734375 171115.6875 156675.734375 169151.75 154711.796875 169151.75 154711.796875 -14439.953125 0.0 -1963.9375 0.0 0.0 0.0 0.0 -14439.953125 0.0 -1963.9375
──────── Test point index: 411 | True SalePrice: 123000 ────────
Gr Liv Area Overall Qual Gr Liv Area (DUP)
411 1055 7 1055
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 411 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 131849.818168 -46732.389597 178582.207765 131849.818168 -46732.389597 -46732.389597
2 B 1 0.651246 52878.678358 178582.207765 219911.155344 41328.947579 178582.207765 219911.155344 41328.947579 41328.947579
3 C 1 0.523397 61815.726690 178582.207765 131849.818168 -46732.389597 178582.207765 131849.818168 -46732.389597 -46732.389597
4 AB 2 0.743635 45336.691057 178582.207765 184480.812231 -24709.936535 30608.541001 178582.207765 153872.271229 209190.748766 184480.812231 -24709.936535 30608.541001 -0.0 -24709.936535 30608.541001
5 AC 2 0.522521 61872.550411 178582.207765 132091.292943 139621025890159616.0 -139621025890206096.0 178582.207765 139621025890338192.0 -139621025890027520.0 132091.292943 139621025890159616.0 -139621025890206096.0 -16.0 139669088205440224.0 -139669088205486720.0
6 BC 2 0.743635 45336.691057 178582.207765 184480.812231 30608.541001 -24709.936535 178582.207765 209190.748766 153872.271229 184480.812231 30608.541001 -24709.936535 -0.0 30608.541001 -24709.936535
7 ABC 3 0.741124 45558.189247 178582.207765 185697.933161 570632909861530624.0 30720.0 -570632909861554176.0 178582.207765 570632909861709248.0 209302.207765 570632909861739968.0 -570632909861375616.0 154977.933161 -570632909861344896.0 185697.933161 570632909861530560.0 30720.0 -570632909861554304.0 0.0 116.274603 0.0 11.725397 568777301371733440.0 30770.788246 -568777301371757120.0
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 411 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 130246.719509 -48361.001134 178607.720643 130246.719509 -48361.001134 -48361.001134
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.526650 61604.437299 178607.720643 130246.719509 -48361.001134 178607.720643 130246.719509 -48361.001134 -48361.001134
4 AB 2 0.749226 44839.624931 170073.333123 180620.262084 -23989.061067 34535.990028 170073.333123 146084.272056 204609.323151 180620.262084 -23989.061067 34535.990028 0.0 -23989.061067 34535.990028
5 AC 2 0.522846 61851.496117 176231.472744 125594.692243 -25349.545236 -25287.235265 176231.472744 150881.946152 150944.256123 125594.692243 -25349.526592 -25287.216621 -0.037288 -25349.545259 -25287.235242
6 BC 2 0.744977 45217.933898 166739.753397 177598.449042 36073.387633 -25214.691988 166739.753397 202813.14103 141525.061409 177598.449042 36073.387633 -25214.691988 0.0 36073.387633 -25214.691988
7 ABC 3 0.766638 43254.975157 168240.807805 175951.244552 -15167.546149 37793.057986 -14915.075089 168240.807805 153073.034136 206033.869758 190866.072287 153325.493295 138158.182599 191118.555248 175951.244552 -15167.310696 37793.061954 -14914.827734 -0.0 -0.486776 -0.023803 0.023803 -15167.602951 37793.139306 -14915.099607
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 411 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 136692.996600 -70897.133939 207590.130539 136692.9966 -70897.133939 -70897.133939
2 B 1 0.725528 46910.500909 162512.760008 204048.490253 41535.730245 162512.760008 204048.490253 41535.730245 41535.730245
3 C 1 0.494836 63641.022042 207590.130539 136692.996600 -70897.133939 207590.130539 136692.9966 -70897.133939 -70897.133939
4 AB 2 0.809248 39107.119385 182789.632430 146990.418258 -61184.814123 25385.599951 182789.632430 142918.51353 229488.927603 146990.418258 -39871.1189 46699.295173 -42627.390445 209038.939205 -244838.153377
5 AC 2 0.494836 63641.022042 207590.130539 136692.996600 -70897.133939 0.0 207590.130539 136692.9966 207590.130539 136692.9966 -70897.133939 0.0 0.0 -70897.133939 0.0
6 BC 2 0.809248 39107.119385 182789.632430 146990.418258 25385.599951 -61184.814123 182789.632430 229488.927603 142918.51353 146990.418258 46699.295173 -39871.1189 -42627.390445 -244838.153377 209038.939205
7 ABC 3 0.809248 39107.119385 182789.632430 146990.418258 -61184.814123 25385.599951 0.0 182789.632430 142918.51353 229488.927603 146990.418258 182789.63243 142918.51353 229488.927603 146990.418258 -82498.509345 4071.904729 0.0 42627.390445 0.0 0.0 0.0 -37657.90728 1858.693108 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 411 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 134324.750000 -60761.484375 195086.234375 134324.75 -60761.484375 -60761.484375
2 B 1 0.725640 46900.883370 162447.125000 204150.109375 41702.984375 162447.125000 204150.109375 41702.984375 41702.984375
3 C 1 0.462856 65624.546618 195086.234375 134324.750000 -60761.484375 195086.234375 134324.75 -60761.484375 -60761.484375
4 AB 2 0.790517 40982.155774 172155.718750 169153.296875 -31584.53125 28582.109375 172155.718750 140571.1875 200737.828125 169153.296875 -31584.53125 28582.109375 0.0 -31584.53125 28582.109375
5 AC 2 0.462856 65624.546618 195086.234375 134324.750000 -36551.984375 -24209.5 195086.234375 158534.25 170876.734375 134324.75 -36551.984375 -24209.5 0.0 -36551.984375 -24209.5
6 BC 2 0.791406 40895.160920 170816.093750 171431.125000 28739.71875 -28124.6875 170816.093750 199555.8125 142691.40625 171431.125 28739.71875 -28124.6875 0.0 28739.71875 -28124.6875
7 ABC 3 0.793780 40661.764880 171115.687500 148306.625000 -30665.90625 19673.1875 -11816.34375 171115.687500 146659.671875 203239.34375 166363.546875 165539.921875 141083.90625 185182.421875 148306.625 -36875.796875 7222.71875 -18056.921875 12419.78125 0.0 12481.15625 0.0 -21430.879869 -4123.091838 2744.909207
──────── Test point index: 1385 | True SalePrice: 108959 ────────
Gr Liv Area Overall Qual Gr Liv Area (DUP)
1385 1258 4 1258
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1385 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 153515.836649 -25066.371115 178582.207765 153515.836649 -25066.371115 -25066.371115
2 B 1 0.651246 52878.678358 178582.207765 87387.142860 -91195.064905 178582.207765 87387.14286 -91195.064905 -91195.064905
3 C 1 0.523397 61815.726690 178582.207765 153515.836649 -25066.371115 178582.207765 153515.836649 -25066.371115 -25066.371115
4 AB 2 0.743635 45336.691057 178582.207765 97788.488001 -13253.943245 -67539.776519 178582.207765 165328.264519 111042.431246 97788.488001 -13253.943245 -67539.776519 -0.0 -13253.943245 -67539.776519
5 AC 2 0.522521 61872.550411 178582.207765 153679.605443 74890081175413760.0 -74890081175438656.0 178582.207765 74890081175592336.0 -74890081175260080.0 153679.605443 74890081175413760.0 -74890081175438656.0 0.0 74890081175413760.0 -74890081175438656.0
6 BC 2 0.743635 45336.691057 178582.207765 97788.488001 -67539.776519 -13253.943245 178582.207765 111042.431246 165328.264519 97788.488001 -67539.776519 -13253.943245 -0.0 -67539.776519 -13253.943245
7 ABC 3 0.741124 45558.189247 178582.207765 98370.683161 306076714939139392.0 -67584.0 -306076714939151936.0 178582.207765 306076714939317952.0 110998.207765 306076714939250368.0 -306076714938973376.0 165954.683161 -306076714939040960.0 98370.683161 306076714939139328.0 -67584.0 -306076714939152000.0 0.0 83.524603 0.0 -19.524603 304133328264798464.0 -67600.4589 -304133328264811008.0
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1385 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 152667.793060 -25939.927583 178607.720643 152667.79306 -25939.927583 -25939.927583
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.526650 61604.437299 178607.720643 152667.793060 -25939.927583 178607.720643 152667.79306 -25939.927583 -25939.927583
4 AB 2 0.749226 44839.624931 170073.333123 102256.393189 -3900.939754 -63916.00018 170073.333123 157206.053864 97190.993438 102256.393189 -12867.27926 -72882.339686 17932.679011 -10176.365976 -57640.573958
5 AC 2 0.522846 61851.496117 176231.472744 149070.860093 -13597.027272 -13563.58538 176231.472744 162634.445472 162667.887365 149070.860093 -13597.027272 -13563.58538 -0.0 -13597.027272 -13563.58538
6 BC 2 0.744977 45217.933898 166739.753397 106963.358040 -58365.699474 -1410.695883 166739.753397 96260.065269 153215.06886 106963.35804 -70479.688128 -13524.684536 24227.977307 -50152.409553 -9623.985803
7 ABC 3 0.766638 43254.975157 168240.807805 95740.685743 -2491.610107 -67530.939278 -2477.572677 168240.807805 160105.238912 90329.310042 91909.527405 160240.514817 152105.055834 91802.326362 95740.685743 3938.359381 -56364.370091 3831.158337 -12073.818364 -2358.142017 -11831.341415 2358.032107 3535.056228 -79455.141083 3419.962794
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1385 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 165029.481902 -42560.648637 207590.130539 165029.481902 -42560.648637 -42560.648637
2 B 1 0.725528 46910.500909 162512.760008 105779.163578 -56733.59643 162512.760008 105779.163578 -56733.59643 -56733.59643
3 C 1 0.494836 63641.022042 207590.130539 165029.481902 -42560.648637 207590.130539 165029.481902 -42560.648637 -42560.648637
4 AB 2 0.809248 39107.119385 182789.632430 120456.636131 -15829.524888 -46503.471411 182789.632430 159205.404641 128531.458118 120456.636131 -23584.227789 -54258.174312 15509.405802 -18885.280307 -43447.715991
5 AC 2 0.494836 63641.022042 207590.130539 165029.481902 -42560.648637 0.0 207590.130539 165029.481902 207590.130539 165029.481902 -42560.648637 0.0 0.0 -42560.648637 0.0
6 BC 2 0.809248 39107.119385 182789.632430 120456.636131 -46503.471411 -15829.524888 182789.632430 128531.458118 159205.404641 120456.636131 -54258.174312 -23584.227789 15509.405802 -43447.715991 -18885.280307
7 ABC 3 0.809248 39107.119385 182789.632430 120456.636131 -15829.524888 -46503.471411 0.0 182789.632430 159205.404641 128531.458118 120456.636131 182789.63243 159205.404641 128531.458118 120456.636131 -8074.821987 -38748.76851 0.0 -15509.405802 0.0 0.0 0.0 -10749.450089 -51583.54621 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1385 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 163633.953125 -31452.28125 195086.234375 163633.953125 -31452.28125 -31452.28125
2 B 1 0.725640 46900.883370 162447.125000 105912.015625 -56535.109375 162447.125000 105912.015625 -56535.109375 -56535.109375
3 C 1 0.462856 65624.546618 195086.234375 163633.953125 -31452.28125 195086.234375 163633.953125 -31452.28125 -31452.28125
4 AB 2 0.790517 40982.155774 172155.718750 117276.554688 -13997.238281 -40881.925781 172155.718750 158158.5625 131273.875 117276.554688 -13997.15625 -40881.84375 -0.164062 -13997.198095 -40881.965968
5 AC 2 0.462856 65624.546618 195086.234375 163633.953125 -19664.679688 -11787.601562 195086.234375 175421.546875 183298.625 163633.953125 -19664.6875 -11787.609375 0.015625 -19664.677731 -11787.603519
6 BC 2 0.791406 40895.160920 170816.093750 115746.328125 -41246.035156 -13823.730469 170816.093750 129570.132812 156992.4375 115746.328125 -41245.960938 -13823.65625 -0.148438 -41246.072114 -13823.693511
7 ABC 3 0.793780 40661.764880 171115.687500 121708.531250 -5207.950521 -45658.634115 1459.428385 171115.687500 162308.140625 118690.140625 117081.78125 169407.796875 160600.25 123316.882812 121708.53125 -1608.351562 -38891.71875 4626.75 -7199.195312 -0.007812 -6334.640625 0.007812 -1894.243814 -52995.007108 5482.094672
──────── Test point index: 1577 | True SalePrice: 167500 ────────
Gr Liv Area Overall Qual Gr Liv Area (DUP)
1577 1394 5 1394
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1577 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 168031.001741 -10551.206024 178582.207765 168031.001741 -10551.206024 -10551.206024
2 B 1 0.651246 52878.678358 178582.207765 131561.813688 -47020.394077 178582.207765 131561.813688 -47020.394077 -47020.394077
3 C 1 0.523397 61815.726690 178582.207765 168031.001741 -10551.206024 178582.207765 168031.001741 -10551.206024 -10551.206024
4 AB 2 0.743635 45336.691057 178582.207765 138179.545011 -5578.992075 -34823.670679 178582.207765 173003.215689 143758.537086 138179.545011 -5578.992075 -34823.670679 -0.0 -5578.992075 -34823.670679
5 AC 2 0.522521 61872.550411 178582.207765 168095.105443 31523536933022400.0 -31523536933032888.0 178582.207765 31523536933200980.0 -31523536932854304.0 168095.105443 31523536933022396.0 -31523536933032888.0 4.0 31511518809906488.0 -31511518809916976.0
6 BC 2 0.743635 45336.691057 178582.207765 138179.545011 -34823.670679 -5578.992075 178582.207765 143758.537086 173003.215689 138179.545011 -34823.670679 -5578.992075 0.0 -34823.670679 -5578.992075
7 ABC 3 0.741124 45558.189247 178582.207765 138420.683161 128837096666010112.0 -34816.0 -128837096666015472.0 178582.207765 128837096666188688.0 143766.207765 128837096666153872.0 -128837096665836880.0 173236.683161 -128837096665871696.0 138420.683161 128837096666010112.0 -34816.0 -128837096666015456.0 0.0 1.524603 0.0 -1.524603 128805230266201104.0 -34817.321778 -128805230266206448.0
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1577 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 167688.807853 -10918.91279 178607.720643 167688.807853 -10918.91279 -10918.91279
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.526650 61604.437299 178607.720643 167688.807853 -10918.91279 178607.720643 167688.807853 -10918.91279 -10918.91279
4 AB 2 0.749226 44839.624931 170073.333123 125365.127565 -5416.23332 -39291.972239 170073.333123 164657.099804 130781.360885 125365.127565 -5416.23332 -39291.972239 -0.0 -5416.23332 -39291.972239
5 AC 2 0.522846 61851.496117 176231.472744 164798.736091 -5723.406687 -5709.329967 176231.472744 170508.066057 170522.142778 164798.736091 -5723.406687 -5709.329967 -0.0 -5723.406687 -5709.329967
6 BC 2 0.744977 45217.933898 166739.753397 120005.711741 -41041.086249 -5692.955406 166739.753397 125698.667147 161046.797991 120005.711741 -41041.086249 -5692.955406 0.0 -41041.086249 -5692.955406
7 ABC 3 0.766638 43254.975157 168240.807805 118451.195398 -3424.482308 -42997.577953 -3367.552146 168240.807805 164816.306896 125243.229339 121818.731504 164873.238595 161448.773864 121875.66013 118451.195398 -3424.464731 -42997.578465 -3367.536106 -0.0 -0.033104 0.003073 -0.003073 -3424.481633 -42997.578269 -3367.552504
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1577 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 158197.633277 -49392.497261 207590.130539 158197.633277 -49392.497261 -49392.497261
2 B 1 0.725528 46910.500909 162512.760008 135047.645999 -27465.11401 162512.760008 135047.645999 -27465.11401 -27465.11401
3 C 1 0.494836 63641.022042 207590.130539 158197.633277 -49392.497261 207590.130539 158197.633277 -49392.497261 -49392.497261
4 AB 2 0.809248 39107.119385 182789.632430 143978.553894 -20139.459509 -18671.619027 182789.632430 158731.955162 160199.795645 143978.553894 -24057.677268 -22589.836785 7836.435517 -20016.166366 -18794.91217
5 AC 2 0.494836 63641.022042 207590.130539 158197.633277 -49392.497261 0.0 207590.130539 158197.633277 207590.130539 158197.633277 -49392.497261 0.0 0.0 -49392.497261 0.0
6 BC 2 0.809248 39107.119385 182789.632430 143978.553894 -18671.619027 -20139.459509 182789.632430 160199.795645 158731.955162 143978.553894 -22589.836785 -24057.677268 7836.435517 -18794.91217 -20016.166366
7 ABC 3 0.809248 39107.119385 182789.632430 143978.553894 -20139.459509 -18671.619027 0.0 182789.632430 158731.955162 160199.795645 143978.553894 182789.63243 158731.955162 160199.795645 143978.553894 -16221.241751 -14753.401269 0.0 -7836.435517 0.0 0.0 0.0 -20325.137796 -18485.94074 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1577 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 161489.765625 -33596.46875 195086.234375 161489.765625 -33596.46875 -33596.46875
2 B 1 0.725640 46900.883370 162447.125000 135150.843750 -27296.28125 162447.125000 135150.84375 -27296.28125 -27296.28125
3 C 1 0.462856 65624.546618 195086.234375 161489.765625 -33596.46875 195086.234375 161489.765625 -33596.46875 -33596.46875
4 AB 2 0.790517 40982.155774 172155.718750 140205.796875 -14059.90625 -17890.015625 172155.718750 158095.8125 154265.703125 140205.796875 -14059.90625 -17890.015625 0.0 -14059.90625 -17890.015625
5 AC 2 0.462856 65624.546618 195086.234375 161489.765625 -20913.453125 -12683.015625 195086.234375 174172.78125 182403.21875 161489.765625 -20913.453125 -12683.015625 0.0 -20913.453125 -12683.015625
6 BC 2 0.791406 40895.160920 170816.093750 139818.953125 -18009.984375 -12987.15625 170816.093750 152806.109375 157828.9375 139818.953125 -18009.984375 -12987.15625 0.0 -18009.984375 -12987.15625
7 ABC 3 0.793780 40661.764880 171115.687500 140088.125000 -10157.671875 -21482.171875 612.28125 171115.687500 160067.453125 148987.484375 139720.375 171972.5 160924.265625 149355.234375 140088.125 -9267.109375 -20836.140625 367.75 -1781.125 0.0 489.0625 0.0 -9815.418285 -21571.107363 358.963147
──────── Test point index: 1427 | True SalePrice: 228000 ────────
Gr Liv Area Overall Qual Gr Liv Area (DUP)
1427 2097 7 2097
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1427 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 243061.597764 64479.389999 178582.207765 243061.597764 64479.389999 64479.389999
2 B 1 0.651246 52878.678358 178582.207765 219911.155344 41328.947579 178582.207765 219911.155344 41328.947579 41328.947579
3 C 1 0.523397 61815.726690 178582.207765 243061.597764 64479.389999 178582.207765 243061.597764 64479.389999 64479.389999
4 AB 2 0.743635 45336.691057 178582.207765 243284.482223 34093.733457 30608.541001 178582.207765 212675.941221 209190.748766 243284.482223 34093.733457 30608.541001 -0.0 34093.733457 30608.541001
5 AC 2 0.522521 61872.550411 178582.207765 242800.167943 -192643232202279808.0 192643232202344000.0 178582.207765 -192643232202101216.0 192643232202522592.0 242800.167943 -192643232202279808.0 192643232202344000.0 32.0 -192739265717834272.0 192739265717898496.0
6 BC 2 0.743635 45336.691057 178582.207765 243284.482223 30608.541001 34093.733457 178582.207765 209190.748766 212675.941221 243284.482223 30608.541001 34093.733457 0.0 30608.541001 34093.733457
7 ABC 3 0.741124 45558.189247 178582.207765 242586.433161 -787335341907590656.0 30720.0 787335341907623936.0 178582.207765 -787335341907412096.0 209302.207765 -787335341907381376.0 787335341907802496.0 211866.433161 787335341907833216.0 242586.433161 -787335341907590656.0 30720.0 787335341907623936.0 0.0 -4.225397 0.0 4.225397 -787287355987554304.0 30722.028324 787287355987587584.0
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1427 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 245334.200495 66726.479852 178607.720643 245334.200495 66726.479852 66726.479852
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.526650 61604.437299 178607.720643 245334.200495 66726.479852 178607.720643 245334.200495 66726.479852 66726.479852
4 AB 2 0.749226 44839.624931 170073.333123 237708.422890 33099.099739 34535.990028 170073.333123 203172.432862 204609.323151 237708.42289 33099.099739 34535.990028 0.0 33099.099739 34535.990028
5 AC 2 0.522846 61851.496117 176231.472744 246097.977463 34976.250692 34890.254026 176231.472744 211207.792974 211121.796308 246097.977463 34976.32023 34890.323563 -0.139075 34976.250607 34890.254112
6 BC 2 0.744977 45217.933898 166739.753397 237603.314878 36073.387633 34790.173848 166739.753397 202813.14103 201529.927244 237603.314878 36073.387633 34790.173848 -0.0 36073.387633 34790.173848
7 ABC 3 0.766638 43254.975157 168240.807805 247540.613573 20927.533134 37793.057986 20579.214648 168240.807805 189168.026593 206033.869758 226961.088547 188819.720008 209747.551619 226612.758158 247540.613573 20927.855415 37793.061954 20579.525026 -0.023803 -0.636626 0.0 0.023803 20927.532229 37793.057978 20579.215562
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1427 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 252701.449543 45111.319005 207590.130539 252701.449543 45111.319005 45111.319005
2 B 1 0.725528 46910.500909 162512.760008 204048.490253 41535.730245 162512.760008 204048.490253 41535.730245 41535.730245
3 C 1 0.494836 63641.022042 207590.130539 252701.449543 45111.319005 207590.130539 252701.449543 45111.319005 45111.319005
4 AB 2 0.809248 39107.119385 182789.632430 231949.983960 9957.177125 39203.174405 182789.632430 200242.930323 229488.927603 231949.98396 17453.297893 46699.295173 -14992.241537 13374.521882 35785.829647
5 AC 2 0.494836 63641.022042 207590.130539 252701.449543 45111.319005 0.0 207590.130539 252701.449543 207590.130539 252701.449543 45111.319005 0.0 0.0 45111.319005 0.0
6 BC 2 0.809248 39107.119385 182789.632430 231949.983960 39203.174405 9957.177125 182789.632430 229488.927603 200242.930323 231949.98396 46699.295173 17453.297893 -14992.241537 35785.829647 13374.521882
7 ABC 3 0.809248 39107.119385 182789.632430 231949.983960 9957.177125 39203.174405 0.0 182789.632430 200242.930323 229488.927603 231949.98396 182789.63243 200242.930323 229488.927603 231949.98396 2461.056357 31707.053637 0.0 14992.241537 0.0 0.0 0.0 3540.915656 45619.435874 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1427 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 263538.906250 68452.671875 195086.234375 263538.90625 68452.671875 68452.671875
2 B 1 0.725640 46900.883370 162447.125000 204150.109375 41702.984375 162447.125000 204150.109375 41702.984375 41702.984375
3 C 1 0.462856 65624.546618 195086.234375 263538.906250 68452.671875 195086.234375 263538.90625 68452.671875 68452.671875
4 AB 2 0.790517 40982.155774 172155.718750 243396.140625 42658.3125 28582.109375 172155.718750 214814.03125 200737.828125 243396.140625 42658.3125 28582.109375 0.0 42658.3125 28582.109375
5 AC 2 0.462856 65624.546618 195086.234375 263538.906250 33856.664062 34596.007812 195086.234375 228943.0625 229682.40625 263538.90625 33856.828125 34596.171875 -0.328125 33856.665834 34596.006041
6 BC 2 0.791406 40895.160920 170816.093750 243465.359375 28739.71875 43909.546875 170816.093750 199555.8125 214725.640625 243465.359375 28739.71875 43909.546875 0.0 28739.71875 43909.546875
7 ABC 3 0.793780 40661.764880 171115.687500 231925.500000 26411.606771 34129.778646 268.427083 171115.687500 194516.578125 203239.34375 232661.65625 172388.703125 195789.59375 202503.171875 231925.5 29422.328125 36135.90625 -736.15625 -6021.4375 -0.015625 2009.171875 0.015625 26719.91687 34867.833455 -777.937825
──────── Test point index: 518 | True SalePrice: 229456 ────────
Gr Liv Area Overall Qual Gr Liv Area (DUP)
518 1314 8 1314
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 518 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 159492.669334 -19089.53843 178582.207765 159492.669334 -19089.53843 -19089.53843
2 B 1 0.651246 52878.678358 178582.207765 264085.826172 85503.618407 178582.207765 264085.826172 85503.618407 85503.618407
3 C 1 0.523397 61815.726690 178582.207765 159492.669334 -19089.53843 178582.207765 159492.669334 -19089.53843 -19089.53843
4 AB 2 0.743635 45336.691057 178582.207765 231813.185372 -10093.669234 63324.646841 178582.207765 168488.538531 241906.854606 231813.185372 -10093.669234 63324.646841 -0.0 -10093.669234 63324.646841
5 AC 2 0.522521 61872.550411 178582.207765 159604.105443 57033268840311424.0 -57033268840330400.0 178582.207765 57033268840490008.0 -57033268840151824.0 159604.105443 57033268840311424.0 -57033268840330408.0 8.0 57009234593012512.0 -57009234593031488.0
6 BC 2 0.743635 45336.691057 178582.207765 231813.185372 63324.646841 -10093.669234 178582.207765 241906.854606 168488.538531 231813.185372 63324.646841 -10093.669234 0.0 63324.646841 -10093.669234
7 ABC 3 0.741124 45558.189247 178582.207765 232320.683161 233095695650203648.0 63488.0 -233095695650213408.0 178582.207765 233095695650382240.0 242070.207765 233095695650445728.0 -233095695650034816.0 168832.683161 -233095695649971328.0 232320.683161 233095695650203648.0 63488.0 -233095695650213408.0 0.0 -10.475397 0.0 10.475397 233391381518173568.0 63500.380727 -233391381518183328.0
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 518 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 158852.916798 -19754.803845 178607.720643 158852.916798 -19754.803845 -19754.803845
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.526650 61604.437299 178607.720643 158852.916798 -19754.803845 178607.720643 158852.916798 -19754.803845 -19754.803845
4 AB 2 0.749226 44839.624931 170073.333123 231724.102765 -9799.20152 71449.971161 170073.333123 160274.131604 241523.304285 231724.102765 -9799.20152 71449.971161 0.0 -9799.20152 71449.971161
5 AC 2 0.522846 61851.496117 176231.472744 155547.044327 -10354.948208 -10329.48021 176231.472744 165876.524537 165901.992535 155547.044327 -10354.948208 -10329.48021 -0.0 -10354.948208 -10329.48021
6 BC 2 0.744977 45217.933898 166739.753397 231070.523077 74630.624575 -10299.854894 166739.753397 241370.377971 156439.898502 231070.523077 74630.624575 -10299.854894 0.0 74630.624575 -10299.854894
7 ABC 3 0.766638 43254.975157 168240.807805 234140.851304 -6195.678526 78188.381232 -6092.659206 168240.807805 162045.090435 246429.189968 240233.467009 162148.106961 155952.46914 240336.489124 234140.851304 -6195.63782 78188.382163 -6092.615706 0.0 -0.085139 -0.005589 0.005589 -6195.681272 78188.382733 -6092.657962
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 518 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 171220.025307 -36370.105231 207590.130539 171220.025307 -36370.105231 -36370.105231
2 B 1 0.725528 46910.500909 162512.760008 269359.021972 106846.261963 162512.760008 269359.021972 106846.261963 106846.261963
3 C 1 0.494836 63641.022042 207590.130539 171220.025307 -36370.105231 207590.130539 171220.025307 -36370.105231 -36370.105231
4 AB 2 0.809248 39107.119385 182789.632430 206902.567063 -32568.12444 56681.059072 182789.632430 157787.322563 247036.506074 206902.567063 -25002.309868 64246.873644 -15131.629144 -15362.10383 39475.038463
5 AC 2 0.494836 63641.022042 207590.130539 171220.025307 -36370.105231 0.0 207590.130539 171220.025307 207590.130539 171220.025307 -36370.105231 0.0 0.0 -36370.105231 0.0
6 BC 2 0.809248 39107.119385 182789.632430 206902.567063 56681.059072 -32568.12444 182789.632430 247036.506074 157787.322563 206902.567063 64246.873644 -25002.309868 -15131.629144 39475.038463 -15362.10383
7 ABC 3 0.809248 39107.119385 182789.632430 206902.567063 -32568.12444 56681.059072 0.0 182789.632430 157787.322563 247036.506074 206902.567063 182789.63243 157787.322563 247036.506074 206902.567063 -40133.939012 49115.2445 0.0 15131.629144 0.0 0.0 0.0 -107751.267247 131864.20188 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 518 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 168911.546875 -26174.6875 195086.234375 168911.546875 -26174.6875 -26174.6875
2 B 1 0.725640 46900.883370 162447.125000 269223.312500 106776.1875 162447.125000 269223.3125 106776.1875 106776.1875
3 C 1 0.462856 65624.546618 195086.234375 168911.546875 -26174.6875 195086.234375 168911.546875 -26174.6875 -26174.6875
4 AB 2 0.790517 40982.155774 172155.718750 237431.468750 -15747.609375 81023.359375 172155.718750 156408.109375 253179.078125 237431.46875 -15747.609375 81023.359375 0.0 -15747.609375 81023.359375
5 AC 2 0.462856 65624.546618 195086.234375 168911.546875 -22281.765625 -3892.921875 195086.234375 172804.46875 191193.3125 168911.546875 -22281.765625 -3892.921875 0.0 -22281.765625 -3892.921875
6 BC 2 0.791406 40895.160920 170816.093750 238151.015625 81836.867188 -14501.945312 170816.093750 252652.96875 156314.15625 238151.015625 81836.875 -14501.9375 -0.015625 81836.85601 -14501.934135
7 ABC 3 0.793780 40661.764880 171115.687500 214613.937500 -12660.046875 60856.4375 -4698.140625 171115.687500 160330.875 238082.90625 223547.625 170653.09375 159868.28125 229149.21875 214613.9375 -14535.28125 54745.65625 -8933.6875 3750.46875 0.0 8471.09375 0.0 -15891.003948 69974.870113 -10585.616164
──────── Test point index: 2181 | True SalePrice: 184750 ────────
Gr Liv Area Overall Qual Gr Liv Area (DUP)
2181 4676 10 4676
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 2181 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 518316.088722 339733.880958 178582.207765 518316.088722 339733.880958 339733.880958
2 B 1 0.651246 52878.678358 178582.207765 352435.167828 173852.960063 178582.207765 352435.167828 173852.960063 173852.960063
3 C 1 0.523397 61815.726690 178582.207765 518316.088722 339733.880958 178582.207765 518316.088722 339733.880958 339733.880958
4 AB 2 0.743635 45336.691057 178582.207765 486974.704646 179635.63836 128756.858521 178582.207765 358217.846125 307339.066286 486974.704646 179635.63836 128756.858521 -0.0 179635.63836 128756.858521
5 AC 2 0.522521 61872.550411 178582.207765 516615.980443 -1015013214563509632.0 1015013214563847680.0 178582.207765 -1015013214563331072.0 1015013214564026240.0 516615.980443 -1015013214563509632.0 1015013214563847680.0 0.0 -1015013214563509632.0 1015013214563847680.0
6 BC 2 0.743635 45336.691057 178582.207765 486974.704646 128756.858521 179635.63836 178582.207765 307339.066286 358217.846125 486974.704646 128756.858521 179635.63836 0.0 128756.858521 179635.63836
7 ABC 3 0.741124 45558.189247 178582.207765 480925.183161 -4148371926660531712.0 129024.0 4148371926660704768.0 178582.207765 -4148371926660353024.0 307606.207765 -4148371926660224000.0 4148371926660883456.0 351901.183161 4148371926661012480.0 480925.183161 -4148371926660531200.0 129024.0 4148371926660705280.0 0.0 -262.975397 0.0 -249.024603 -4138699122604449280.0 128917.904234 4138699122604622848.0
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2181 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 539859.565336 361251.844693 178607.720643 539859.565336 361251.844693 361251.844693
2 B 1 0.653707 52691.751258 167830.652805 357997.585228 190166.932423 167830.652805 357997.585228 190166.932423 190166.932423
3 C 1 0.526650 61604.437299 178607.720643 539859.565336 361251.844693 178607.720643 539859.565336 361251.844693 361251.844693
4 AB 2 0.749226 44839.624931 170073.333123 627703.358242 283815.021212 173815.003907 170073.333123 427650.70996 317650.692654 627703.358242 257577.376836 147577.359531 52475.288752 290938.574453 166691.450666
5 AC 2 0.522846 61851.496117 176231.472744 548994.651062 186609.113958 186154.06436 176231.472744 360519.555324 360064.505726 548994.651062 184288.08258 183833.032982 4642.062756 186611.98308 186151.195238
6 BC 2 0.744977 45217.933898 166739.753397 655906.235308 178831.233241 310335.24867 166739.753397 318746.009854 450250.025282 655906.235308 152006.256457 283510.271886 53649.953569 170731.443839 318435.038073
7 ABC 3 0.766638 43254.975157 168240.807805 576909.704467 130173.352453 150272.25686 128223.28735 168240.807805 285945.311118 327219.830387 440319.392649 283995.699141 439445.584875 438368.874421 576909.704467 138540.830046 137464.119593 136590.311818 16909.055687 -25441.267784 16909.961939 -12304.114637 130294.64092 149855.779718 128518.476025
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 2181 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 386181.566891 178591.436353 207590.130539 386181.566891 178591.436353 178591.436353
2 B 1 0.725528 46910.500909 162512.760008 449028.228831 286515.468823 162512.760008 449028.228831 286515.468823 286515.468823
3 C 1 0.494836 63641.022042 207590.130539 386181.566891 178591.436353 207590.130539 386181.566891 178591.436353 178591.436353
4 AB 2 0.809248 39107.119385 182789.632430 484943.892199 153184.769918 148969.489851 182789.632430 309526.773946 305311.493879 484943.892199 126737.141516 122521.861449 52895.256804 153632.032241 148522.227527
5 AC 2 0.494836 63641.022042 207590.130539 386181.566891 178591.436353 0.0 207590.130539 386181.566891 207590.130539 386181.566891 178591.436353 0.0 0.0 178591.436353 0.0
6 BC 2 0.809248 39107.119385 182789.632430 484943.892199 148969.489851 153184.769918 182789.632430 305311.493879 309526.773946 484943.892199 122521.861449 126737.141516 52895.256804 148522.227527 153632.032241
7 ABC 3 0.809248 39107.119385 182789.632430 484943.892199 153184.769918 148969.489851 0.0 182789.632430 309526.773946 305311.493879 484943.892199 182789.63243 309526.773946 305311.493879 484943.892199 179632.39832 175417.118253 0.0 -52895.256804 0.0 0.0 0.0 152870.773825 149283.485943 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 2181 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 459439.187500 264352.9375 195086.234375 459439.1875 264352.953125 264352.953125
2 B 1 0.725640 46900.883370 162447.125000 450958.187500 288511.0625 162447.125000 450958.1875 288511.0625 288511.0625
3 C 1 0.462856 65624.546618 195086.234375 459439.187500 264352.9375 195086.234375 459439.1875 264352.953125 264352.953125
4 AB 2 0.790517 40982.155774 172155.718750 542171.750000 151095.578125 218920.453125 172155.718750 323251.0 391075.875 542171.75 151095.28125 218920.15625 0.59375 151095.523707 218920.507543
5 AC 2 0.462856 65624.546618 195086.234375 459439.187500 133504.40625 130848.539062 195086.234375 328590.5625 325934.6875 459439.1875 133504.328125 130848.453125 0.171875 133504.414926 130848.538199
6 BC 2 0.791406 40895.160920 170816.093750 542190.375000 214688.4375 156685.84375 170816.093750 385504.53125 327501.9375 542190.375 214688.4375 156685.84375 0.0 214688.4375 156685.84375
7 ABC 3 0.793780 40661.764880 171115.687500 551924.375000 150144.304688 164694.039062 65970.34375 171115.687500 270253.5625 251601.28125 452751.84375 203884.109375 303021.625 350773.375 551924.375 201151.0 248902.75 99172.53125 -102013.484375 -0.4375 -66404.46875 0.796875 155556.22432 144999.664067 80252.799113
──────── Test point index: 1880 | True SalePrice: 139900 ────────
Gr Liv Area Overall Qual Gr Liv Area (DUP)
1880 1422 5 1422
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1880 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 171019.418083 -7562.789682 178582.207765 171019.418083 -7562.789682 -7562.789682
2 B 1 0.651246 52878.678358 178582.207765 131561.813688 -47020.394077 178582.207765 131561.813688 -47020.394077 -47020.394077
3 C 1 0.523397 61815.726690 178582.207765 171019.418083 -7562.789682 178582.207765 171019.418083 -7562.789682 -7562.789682
4 AB 2 0.743635 45336.691057 178582.207765 139759.682016 -3998.85507 -34823.670679 178582.207765 174583.352695 143758.537086 139759.682016 -3998.85507 -34823.670679 -0.0 -3998.85507 -34823.670679
5 AC 2 0.522521 61872.550411 178582.207765 171057.355443 22595130765471232.0 -22595130765478760.0 178582.207765 22595130765649812.0 -22595130765300176.0 171057.355443 22595130765471228.0 -22595130765478760.0 8.0 22571131688715548.0 -22571131688723072.0
6 BC 2 0.743635 45336.691057 178582.207765 139759.682016 -34823.670679 -3998.85507 178582.207765 143758.537086 174583.352695 139759.682016 -34823.670679 -3998.85507 0.0 -34823.670679 -3998.85507
7 ABC 3 0.741124 45558.189247 178582.207765 139987.683161 92346587021542400.0 -34816.0 -92346587021546176.0 178582.207765 92346587021720976.0 143766.207765 92346587021686160.0 -92346587021367600.0 174803.683161 -92346587021402416.0 139987.683161 92346587021542400.0 -34816.0 -92346587021546176.0 0.0 2.524603 0.0 -2.524603 92290882299286064.0 -34818.277735 -92290882299289840.0
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1880 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 170781.369722 -7826.350921 178607.720643 170781.369722 -7826.350921 -7826.350921
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.526650 61604.437299 178607.720643 170781.369722 -7826.350921 178607.720643 170781.369722 -7826.350921 -7826.350921
4 AB 2 0.749226 44839.624931 170073.333123 126899.166435 -3882.19445 -39291.972239 170073.333123 166191.138674 130781.360885 126899.166435 -3882.19445 -39291.972239 -0.0 -3882.19445 -39291.972239
5 AC 2 0.522846 61851.496117 176231.472744 168036.828208 -4102.367155 -4092.277382 176231.472744 172129.10559 172139.195363 168036.828208 -4102.367155 -4092.277382 -0.0 -4102.367155 -4092.277382
6 BC 2 0.744977 45217.933898 166739.753397 121618.126562 -41041.086249 -4080.540585 166739.753397 125698.667147 162659.212811 121618.126562 -41041.086249 -4080.540585 -0.0 -41041.086249 -4080.540585
7 ABC 3 0.766638 43254.975157 168240.807805 120374.902051 -2454.564136 -42997.577953 -2413.763664 168240.807805 165786.232657 125243.229339 122788.657266 165827.034667 163372.480517 122829.456202 120374.902051 -2454.55415 -42997.578465 -2413.755214 -0.0 -0.017924 0.003073 -0.003073 -2454.563345 -42997.578316 -2413.764092
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1880 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 166761.015241 -40829.115298 207590.130539 166761.015241 -40829.115298 -40829.115298
2 B 1 0.725528 46910.500909 162512.760008 135047.645999 -27465.11401 162512.760008 135047.645999 -27465.11401 -27465.11401
3 C 1 0.494836 63641.022042 207590.130539 166761.015241 -40829.115298 207590.130539 166761.015241 -40829.115298 -40829.115298
4 AB 2 0.809248 39107.119385 182789.632430 144744.578437 -16805.361797 -21239.692196 182789.632430 164634.126044 160199.795645 144744.578437 -18155.506386 -22589.836785 2700.289179 -16952.29852 -21092.755473
5 AC 2 0.494836 63641.022042 207590.130539 166761.015241 -40829.115298 0.0 207590.130539 166761.015241 207590.130539 166761.015241 -40829.115298 0.0 0.0 -40829.115298 0.0
6 BC 2 0.809248 39107.119385 182789.632430 144744.578437 -21239.692196 -16805.361797 182789.632430 160199.795645 164634.126044 144744.578437 -22589.836785 -18155.506386 2700.289179 -21092.755473 -16952.29852
7 ABC 3 0.809248 39107.119385 182789.632430 144744.578437 -16805.361797 -21239.692196 0.0 182789.632430 164634.126044 160199.795645 144744.578437 182789.63243 164634.126044 160199.795645 144744.578437 -15455.217208 -19889.547607 0.0 -2700.289179 0.0 0.0 0.0 -16635.973566 -21409.080427 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1880 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 165546.140625 -29540.09375 195086.234375 165546.140625 -29540.09375 -29540.09375
2 B 1 0.725640 46900.883370 162447.125000 135150.843750 -27296.28125 162447.125000 135150.84375 -27296.28125 -27296.28125
3 C 1 0.462856 65624.546618 195086.234375 165546.140625 -29540.09375 195086.234375 165546.140625 -29540.09375 -29540.09375
4 AB 2 0.790517 40982.155774 172155.718750 141969.359375 -12296.34375 -17890.015625 172155.718750 159859.375 154265.703125 141969.359375 -12296.34375 -17890.015625 0.0 -12296.34375 -17890.015625
5 AC 2 0.462856 65624.546618 195086.234375 165546.140625 -19308.46875 -10231.625 195086.234375 175777.765625 184854.609375 165546.140625 -19308.46875 -10231.625 0.0 -19308.46875 -10231.625
6 BC 2 0.791406 40895.160920 170816.093750 143328.031250 -18009.984375 -9478.078125 170816.093750 152806.109375 161338.015625 143328.03125 -18009.984375 -9478.078125 0.0 -18009.984375 -9478.078125
7 ABC 3 0.793780 40661.764880 171115.687500 144691.859375 -4817.625 -20962.679688 -643.523438 171115.687500 165026.75 148987.484375 145441.171875 170577.953125 164489.015625 148238.171875 144691.859375 -3546.3125 -19797.15625 -749.3125 -2542.625 0.0 211.578125 0.0 -3932.585119 -21749.646583 -741.596423

Insert TotRms (Add correlated C)"¶

In [26]:
# ============================================================
# 6) Settings & Run (edit here, then run this cell)
# ============================================================

SETTINGS = dict(
    # Data slice / feature modes:
    MODE="totrms",           # 'two' | 'dup_grliv' | 'dup_qual' | 'totrms'

    # Baseline used inside each subset for f(∅):
    BASELINE="mean",           # 'mean' | 'median' | 'zero'

    # Randomness & test row:
    RANDOM_STATE=42,
    PICK_RANDOM_POINT=True,    # True → draw a random test row (with the seed)
    N_RANDOM_POINTS=10,         # ← NEW: number of distinct random test points to assess
    CUSTOM_TEST_INDEX=None,    # if PICK_RANDOM_POINT=False, use this test index (must be in X_te)

    # Which models to run:
    USE_LINEAR=True,
    USE_MLP=True,
    USE_LGB=True,              # will only run if LightGBM is installed
    USE_XGB=True,              # will only run if XGBoost is installed

    # MLP hyperparameters (used if USE_MLP=True):
    MLP_hidden=(64, 32),
    MLP_max_iter=5000,
    MLP_early_stopping=True,
    MLP_n_iter_no_change=20,
    MLP_tol=1e-4,
    MLP_lr=1e-3,
    MLP_alpha=1e-4,

    # Warnings:
    SILENCE_MLP_WARNINGS=True
)

# ---- Go! (expects df_ames to already be in memory) ----
run_pipeline(SETTINGS)
Mode: totrms | Features (A,B,C): [('A', 'Gr Liv Area'), ('B', 'Overall Qual'), ('C', 'TotRms AbvGrd')]
Evaluating 10 test point(s): [1093, 1032, 2783, 411, 1385, 1577, 1427, 518, 2181, 1880]

──────── Test point index: 1093 | True SalePrice: 226750 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1093 1892 7 8
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1093 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 221182.120972 42599.913207 178582.207765 221182.120972 42599.913207 42599.913207
2 B 1 0.651246 52878.678358 178582.207765 219911.155344 41328.947579 178582.207765 219911.155344 41328.947579 41328.947579
3 C 1 0.288147 75546.762088 178582.207765 215890.060207 37307.852442 178582.207765 215890.060207 37307.852442 37307.852442
4 AB 2 0.743635 45336.691057 178582.207765 231715.622003 22524.873238 30608.541001 178582.207765 201107.081002 209190.748766 231715.622003 22524.873238 30608.541001 -0.0 22524.873238 30608.541001
5 AC 2 0.521656 61928.537197 178582.207765 213841.190891 54937.291689 -19678.308563 178582.207765 233519.499454 158903.899201 213841.190891 54937.291689 -19678.308563 -0.0 54937.291689 -19678.308563
6 BC 2 0.704846 48645.794104 178582.207765 232541.703154 37281.130089 16678.3653 178582.207765 215863.337854 195260.573065 232541.703154 37281.130089 16678.3653 0.0 37281.130089 16678.3653
7 ABC 3 0.739246 45723.129778 178582.207765 228178.509907 28524.805077 29894.21804 -8822.720975 178582.207765 207107.012842 208476.425805 237001.230882 169759.48679 198284.291867 199653.70483 228178.509907 28524.805077 29894.21804 -8822.720975 0.0 0.0 0.0 0.0 28524.805077 29894.21804 -8822.720975
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1093 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 222692.229667 44084.509025 178607.720643 222692.229667 44084.509025 44084.509025
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 217665.882703 39829.924028 177835.958675 217665.882703 39829.924028 39829.924028
4 AB 2 0.749226 44839.624931 170073.333123 226477.066877 21867.743726 34535.990028 170073.333123 191941.076849 204609.323151 226477.066877 21867.743726 34535.990028 0.0 21867.743726 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 215711.861930 50861.384877 -13711.683054 178562.160107 229423.544984 164850.477053 215711.86193 50861.384877 -13711.683054 -0.0 50861.384877 -13711.683054
6 BC 2 0.753073 44494.356934 172248.490907 224013.701667 37021.836446 14743.374314 172248.490907 208639.482439 186361.020308 224013.701667 36390.991532 14112.529401 1261.689827 37300.119113 14465.091647
7 ABC 3 0.786106 41411.391808 166398.662065 227390.717398 29113.368662 37189.86674 -5311.18007 166398.662065 195540.60821 203189.349614 232274.140795 160659.725322 189801.671466 198305.926217 227390.717398 29084.79118 37589.045932 -4883.423397 57.154964 0.0 -855.513347 0.0 29109.723602 36638.014587 -4755.682857
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1093 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 198246.330716 -9343.799823 207590.130539 198246.330716 -9343.799823 -9343.799823
2 B 1 0.725528 46910.500909 162512.760008 204048.490253 41535.730245 162512.760008 204048.490253 41535.730245 41535.730245
3 C 1 0.281669 75889.745660 163151.986814 213537.060561 50385.073747 163151.986814 213537.060561 50385.073747 50385.073747
4 AB 2 0.809248 39107.119385 182789.632430 200534.205454 -13995.525233 31740.098257 182789.632430 183753.304113 229488.927603 200534.205454 963.671683 46699.295173 -29918.393833 358.767901 17385.805123
5 AC 2 0.495569 63594.818608 198958.451050 200154.066817 25667.640244 -24472.024477 198958.451050 226149.405319 176009.740598 200154.066817 27190.954269 -22948.710452 -3046.628051 7663.381698 -6467.765932
6 BC 2 0.775553 42420.682377 157543.014898 219488.528310 37156.840157 24788.673256 157543.014898 191916.250153 179548.083253 219488.52831 34373.235256 22005.068355 5567.209802 37767.502198 24178.011214
7 ABC 3 0.814589 38555.644091 180774.726888 201250.840713 -1071.154354 34137.336719 -12590.06854 180774.726888 193720.619101 222445.468198 222204.243328 182828.102606 168366.798401 210058.931451 201250.840713 -8808.090738 32884.042312 -20953.402615 -5653.213467 8566.865869 -4400.418084 18840.330549 -52629.96282 163153.297937 -90047.221292
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1093 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 187762.781250 -7323.453125 195086.234375 187762.78125 -7323.453125 -7323.453125
2 B 1 0.725640 46900.883370 162447.125000 204150.109375 41702.984375 162447.125000 204150.109375 41702.984375 41702.984375
3 C 1 0.287263 75593.665172 163281.500000 214163.328125 50881.828125 163281.500000 214163.328125 50881.828125 50881.828125
4 AB 2 0.790517 40982.155774 172155.718750 201395.015625 657.195312 28582.101562 172155.718750 172812.921875 200737.828125 201395.015625 657.203125 28582.109375 -0.015625 657.202774 28582.094101
5 AC 2 0.450673 66364.557401 202322.750000 185604.328125 6500.40625 -23218.828125 202322.750000 208823.15625 179103.921875 185604.328125 6500.40625 -23218.828125 0.0 6500.40625 -23218.828125
6 BC 2 0.765065 43400.491297 158229.531250 216725.250000 36260.53125 22235.1875 158229.531250 194490.0625 180464.71875 216725.25 36260.53125 22235.1875 0.0 36260.53125 22235.1875
7 ABC 3 0.763958 43502.578558 174092.593750 192289.828125 7664.817708 24869.559896 -14337.143229 174092.593750 194514.9375 204259.75 211829.40625 164957.890625 172717.859375 197382.53125 192289.828125 -5092.703125 19571.96875 -19539.578125 12852.671875 12662.359375 -2257.5 0.015625 -6995.368439 -1327145.654368 1352338.257181
──────── Test point index: 1032 | True SalePrice: 306000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1032 2340 8 9
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1032 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 268996.782449 90414.574684 178582.207765 268996.782449 90414.574684 90414.574684
2 B 1 0.651246 52878.678358 178582.207765 264085.826172 85503.618407 178582.207765 264085.826172 85503.618407 85503.618407
3 C 1 0.288147 75546.762088 178582.207765 239341.282015 60759.074251 178582.207765 239341.282015 60759.074251 60759.074251
4 AB 2 0.743635 45336.691057 178582.207765 289713.919932 47807.065326 63324.646841 178582.207765 226389.273091 241906.854606 289713.919932 47807.065326 63324.646841 -0.0 47807.065326 63324.646841
5 AC 2 0.521656 61928.537197 178582.207765 263133.952179 116599.577052 -32047.832637 178582.207765 295181.784816 146534.375127 263133.952179 116599.577052 -32047.832637 -0.0 116599.577052 -32047.832637
6 BC 2 0.704846 48645.794104 178582.207765 282873.641681 77129.269135 27162.164781 178582.207765 255711.4769 205744.372546 282873.641681 77129.269135 27162.164781 0.0 77129.269135 27162.164781
7 ABC 3 0.739246 45723.129778 178582.207765 286601.850901 60541.393746 61846.815891 -14368.5665 178582.207765 239123.60151 240429.023655 300970.417401 164213.641264 224755.03501 226060.457155 286601.850901 60541.393746 61846.815891 -14368.5665 0.0 0.0 0.0 0.0 60541.393746 61846.815891 -14368.5665
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1032 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 272173.219573 93565.49893 178607.720643 272173.219573 93565.49893 93565.49893
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.292963 75290.806037 177835.958675 242702.445299 64866.486624 177835.958675 242702.445299 64866.486624 64866.486624
4 AB 2 0.749226 44839.624931 170073.333123 293320.093832 49104.577597 74142.183112 170073.333123 216485.69877 241523.304285 293320.093832 46412.365646 71449.971161 5384.423901 48532.668516 74714.092193
5 AC 2 0.520030 62033.710779 178562.160107 264180.317516 107948.822785 -22330.665376 178562.160107 286510.982892 156231.494731 264180.317516 107948.822785 -22330.665376 -0.0 107948.822785 -22330.665376
6 BC 2 0.753073 44494.356934 172248.490907 297600.953652 91592.161818 33760.300928 172248.490907 253063.830223 195231.969333 297600.953652 80815.339316 22983.478426 21553.645004 97596.50477 27755.957976
7 ABC 3 0.786106 41411.391808 166398.662065 300504.077931 61790.575612 80964.561429 -8649.721175 166398.662065 228249.883529 246727.241511 308457.156582 157052.305711 218903.527175 238774.140826 300504.077931 61729.937105 81600.550757 -7953.078651 121.284358 -0.022033 -1393.277703 0.022033 61782.157018 80125.877579 -7802.618732
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1032 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 337899.406371 130309.275832 207590.130539 337899.406371 130309.275832 130309.275832
2 B 1 0.725528 46910.500909 162512.760008 269359.021972 106846.261963 162512.760008 269359.021972 106846.261963 106846.261963
3 C 1 0.281669 75889.745660 163151.986814 244759.355467 81607.368653 163151.986814 244759.355467 81607.368653 81607.368653
4 AB 2 0.809248 39107.119385 182789.632430 293900.114096 34828.136694 76282.344972 182789.632430 205582.297796 247036.506074 293900.114096 22792.665366 64246.873644 24070.942656 29096.018385 82014.463281
5 AC 2 0.495569 63594.818608 198958.451050 363590.552038 194958.770035 -30326.669047 198958.451050 365882.952756 140597.513674 363590.552038 166924.501706 -58360.937376 56068.536657 253134.019611 -88501.918624
6 BC 2 0.775553 42420.682377 157543.014898 301167.927023 92898.012201 50726.899924 157543.014898 247791.796952 205620.684674 301167.927023 90248.782054 48077.669777 5298.460294 93705.673936 49919.238189
7 ABC 3 0.814589 38555.644091 180774.726888 309019.451320 64126.723122 71820.931915 -7702.930606 180774.726888 214564.462994 240801.598683 302252.95355 162349.67956 221657.14763 210808.429527 309019.45132 98211.021793 87362.30369 6766.49777 -38903.553723 -36759.666926 326.186866 11241.934962 47385.647738 76013.798068 4845.278626
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1032 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 294881.656250 99795.421875 195086.234375 294881.65625 99795.421875 99795.421875
2 B 1 0.725640 46900.883370 162447.125000 269223.312500 106776.1875 162447.125000 269223.3125 106776.1875 106776.1875
3 C 1 0.287263 75593.665172 163281.500000 244796.203125 81514.703125 163281.500000 244796.203125 81514.703125 81514.703125
4 AB 2 0.790517 40982.155774 172155.718750 321153.718750 67974.367188 81023.632812 172155.718750 240129.8125 253179.078125 321153.71875 67974.09375 81023.359375 0.546875 67974.34324 81023.65676
5 AC 2 0.450673 66364.557401 202322.750000 283340.843750 119144.429688 -38126.335938 202322.750000 321467.0625 164196.296875 283340.84375 119144.3125 -38126.453125 0.234375 119144.65717 -38126.56342
6 BC 2 0.765065 43400.491297 158229.531250 291481.093750 97929.578125 35321.984375 158229.531250 256159.0625 193551.46875 291481.09375 97929.53125 35321.9375 0.09375 97929.600149 35321.962351
7 ABC 3 0.763958 43502.578558 174092.593750 348993.718750 89934.713542 86307.736979 -1341.325521 174092.593750 221620.296875 239778.671875 335516.125 157932.390625 242064.28125 216651.953125 348993.71875 132341.765625 106929.4375 13477.59375 -48209.875 -36604.3125 6966.390625 0.125 72455.662821 91571.301095 10874.161084
──────── Test point index: 2783 | True SalePrice: 174000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2783 1282 6 6
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 2783 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 156077.336371 -22504.871393 178582.207765 156077.336371 -22504.871393 -22504.871393
2 B 1 0.651246 52878.678358 178582.207765 175736.484516 -2845.723249 178582.207765 175736.484516 -2845.723249 -2845.723249
3 C 1 0.288147 75546.762088 178582.207765 168987.616589 -9594.591175 178582.207765 168987.616589 -9594.591175 -9594.591175
4 AB 2 0.743635 45336.691057 178582.207765 164575.102828 -11899.540097 -2107.564839 178582.207765 166682.667667 176474.642926 164575.102828 -11899.540097 -2107.564839 -0.0 -11899.540097 -2107.564839
5 AC 2 0.521656 61928.537197 178582.207765 154620.430845 -29022.516505 5060.739585 178582.207765 149559.69126 183642.947349 154620.430845 -29022.516505 5060.739585 -0.0 -29022.516505 5060.739585
6 BC 2 0.704846 48645.794104 178582.207765 171725.965146 -2567.008957 -4289.233661 178582.207765 176015.198807 174292.974103 171725.965146 -2567.008957 -4289.233661 0.0 -2567.008957 -4289.233661
7 ABC 3 0.739246 45723.129778 178582.207765 163723.587287 -15069.210743 -2058.37981 2268.970076 178582.207765 163512.997021 176523.827954 161454.617211 180851.177841 165781.967097 178792.798031 163723.587287 -15069.210743 -2058.37981 2268.970076 0.0 0.0 0.0 0.0 -15069.210743 -2058.37981 2268.970076
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2783 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 155318.560376 -23289.160267 178607.720643 155318.560376 -23289.160267 -23289.160267
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 156142.953218 -11552.3888 -2377.991105 170073.333123 158520.944324 167695.342018 156142.953218 -11552.3888 -2377.991105 -0.0 -11552.3888 -2377.991105
5 AC 2 0.520030 62033.710779 178562.160107 155219.163350 -26869.278347 3526.28159 178562.160107 151692.88176 182088.441697 155219.16335 -26869.278347 3526.28159 -0.0 -26869.278347 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 166113.404190 -2505.718067 -3629.36865 172248.490907 169742.77284 168619.122257 166113.40419 -2505.718067 -3629.36865 0.0 -2505.718067 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 150684.756136 -14985.381285 -2307.323692 1578.799048 166398.662065 151134.985759 163880.012288 148967.132883 167874.564543 152744.82611 165355.914765 150684.756136 -14671.158629 -2060.069973 1717.623253 -458.579804 -241.720776 -107.782902 107.782902 -15242.497951 -2746.091752 2274.683774
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 2783 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 155854.168999 -51735.961539 207590.130539 155854.168999 -51735.961539 -51735.961539
2 B 1 0.725528 46910.500909 162512.760008 162512.760008 0.0 162512.760008 162512.760008 0.0 0.0
3 C 1 0.281669 75889.745660 163151.986814 163151.986814 0.0 163151.986814 163151.986814 0.0 0.0
4 AB 2 0.809248 39107.119385 182789.632430 154342.145951 -28447.486479 0.0 182789.632430 154342.145951 182789.63243 154342.145951 -28447.486479 0.0 0.0 -28447.486479 0.0
5 AC 2 0.495569 63594.818608 198958.451050 155681.496172 -43276.954878 0.0 198958.451050 155681.496172 198958.45105 155681.496172 -43276.954878 0.0 0.0 -43276.954878 0.0
6 BC 2 0.775553 42420.682377 157543.014898 157543.014898 0.0 0.0 157543.014898 157543.014898 157543.014898 157543.014898 0.0 0.0 0.0 0.0 0.0
7 ABC 3 0.814589 38555.644091 180774.726888 154741.353423 -26033.373465 0.0 0.0 180774.726888 154741.353423 180774.726888 154741.353423 180774.726888 154741.353423 180774.726888 154741.353423 -26033.373465 0.0 0.0 0.0 0.0 0.0 0.0 -26033.373465 0.0 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 2783 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 154598.546875 -40487.6875 195086.234375 154598.546875 -40487.6875 -40487.6875
2 B 1 0.725640 46900.883370 162447.125000 162447.125000 0.0 162447.125000 162447.125 0.0 0.0
3 C 1 0.287263 75593.665172 163281.500000 163281.500000 0.0 163281.500000 163281.5 0.0 0.0
4 AB 2 0.790517 40982.155774 172155.718750 150816.812500 -21338.90625 0.0 172155.718750 150816.8125 172155.71875 150816.8125 -21338.90625 0.0 0.0 -21338.90625 0.0
5 AC 2 0.450673 66364.557401 202322.750000 151381.078125 -50941.671875 0.0 202322.750000 151381.078125 202322.75 151381.078125 -50941.671875 0.0 0.0 -50941.671875 0.0
6 BC 2 0.765065 43400.491297 158229.531250 158229.531250 0.0 0.0 158229.531250 158229.53125 158229.53125 158229.53125 0.0 0.0 0.0 0.0 0.0
7 ABC 3 0.763958 43502.578558 174092.593750 154135.343750 -19957.25 0.0 0.0 174092.593750 154135.34375 174092.59375 154135.34375 174092.59375 154135.34375 174092.59375 154135.34375 -19957.25 0.0 0.0 0.0 0.0 0.0 0.0 -19957.25 0.0 0.0
──────── Test point index: 411 | True SalePrice: 123000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
411 1055 7 4
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 411 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 131849.818168 -46732.389597 178582.207765 131849.818168 -46732.389597 -46732.389597
2 B 1 0.651246 52878.678358 178582.207765 219911.155344 41328.947579 178582.207765 219911.155344 41328.947579 41328.947579
3 C 1 0.288147 75546.762088 178582.207765 122085.172972 -56497.034792 178582.207765 122085.172972 -56497.034792 -56497.034792
4 AB 2 0.743635 45336.691057 178582.207765 184480.812231 -24709.936535 30608.541001 178582.207765 153872.271229 209190.748766 184480.812231 -24709.936535 30608.541001 -0.0 -24709.936535 30608.541001
5 AC 2 0.521656 61928.537197 178582.207765 148115.419222 -60266.576275 29799.787733 178582.207765 118315.631489 208381.995497 148115.419222 -60266.576275 29799.787733 -0.0 -60266.576275 29799.787733
6 BC 2 0.704846 48645.794104 178582.207765 190606.505231 37281.130089 -25256.832623 178582.207765 215863.337854 153325.375142 190606.505231 37281.130089 -25256.832623 0.0 37281.130089 -25256.832623
7 ABC 3 0.739246 45723.129778 178582.207765 190545.185056 -31291.901877 29894.21804 13360.661128 178582.207765 147290.305888 208476.425805 177184.523928 191942.868892 160650.967015 221837.086932 190545.185056 -31291.901877 29894.21804 13360.661128 0.0 0.0 -0.0 0.0 -31291.901877 29894.21804 13360.661128
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 411 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 130246.719509 -48361.001134 178607.720643 130246.719509 -48361.001134 -48361.001134
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 121774.215974 -56061.742701 177835.958675 121774.215974 -56061.742701 -56061.742701
4 AB 2 0.749226 44839.624931 170073.333123 180620.262084 -23989.061067 34535.990028 170073.333123 146084.272056 204609.323151 180620.262084 -23989.061067 34535.990028 0.0 -23989.061067 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 143531.127090 -55795.279251 20764.246234 178562.160107 122766.880856 199326.406341 143531.12709 -55795.279251 20764.246234 -0.0 -55795.279251 20764.246234
6 BC 2 0.753073 44494.356934 172248.490907 187268.215739 36390.991532 -21371.266701 172248.490907 208639.482439 150877.224206 187268.215739 36390.991532 -21371.266701 0.0 36390.991532 -21371.266701
7 ABC 3 0.786106 41411.391808 166398.662065 179699.295082 -29266.33181 33361.450634 9205.514191 166398.662065 138530.777358 203189.349614 171283.139239 175089.403763 151071.212204 211668.204835 179699.295082 -31968.909753 28628.082878 8416.155843 7950.718193 62.699378 4124.279002 -3912.392525 34152.042926 -28061.353523 7209.943613
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 411 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 136692.996600 -70897.133939 207590.130539 136692.9966 -70897.133939 -70897.133939
2 B 1 0.725528 46910.500909 162512.760008 204048.490253 41535.730245 162512.760008 204048.490253 41535.730245 41535.730245
3 C 1 0.281669 75889.745660 163151.986814 126696.577136 -36455.409678 163151.986814 126696.577136 -36455.409678 -36455.409678
4 AB 2 0.809248 39107.119385 182789.632430 146990.418258 -61184.814123 25385.599951 182789.632430 142918.51353 229488.927603 146990.418258 -39871.1189 46699.295173 -42627.390445 209038.939205 -244838.153377
5 AC 2 0.495569 63594.818608 198958.451050 134366.713704 -83774.953883 19183.216537 198958.451050 135404.40328 238362.5737 134366.713704 -63554.04777 39404.12265 -40441.812226 -169982.570976 105390.83363
6 BC 2 0.775553 42420.682377 157543.014898 167789.375484 27500.420547 -17254.059961 157543.014898 191916.250153 147161.769645 167789.375484 34373.235256 -10381.245252 -13745.629417 14679.922878 -4433.562292
7 ABC 3 0.814589 38555.644091 180774.726888 143555.424943 -74646.827429 25535.385073 11892.14041 180774.726888 144061.871111 222445.468198 150030.635724 204668.500026 140521.114572 262501.942488 143555.424943 -118946.517545 3034.310371 -6475.210782 54799.132091 46531.685071 2934.454242 -19097.155395 -38166.834885 -2028.972661 2976.505601
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 411 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 134324.750000 -60761.484375 195086.234375 134324.75 -60761.484375 -60761.484375
2 B 1 0.725640 46900.883370 162447.125000 204150.109375 41702.984375 162447.125000 204150.109375 41702.984375 41702.984375
3 C 1 0.287263 75593.665172 163281.500000 126777.679688 -36503.820312 163281.500000 126777.679688 -36503.820312 -36503.820312
4 AB 2 0.790517 40982.155774 172155.718750 169153.296875 -31584.53125 28582.109375 172155.718750 140571.1875 200737.828125 169153.296875 -31584.53125 28582.109375 0.0 -31584.53125 28582.109375
5 AC 2 0.450673 66364.557401 202322.750000 139237.562500 -68314.015625 5228.828125 202322.750000 134008.71875 207551.5625 139237.5625 -68314.03125 5228.8125 0.03125 -68313.99741 5228.80991
6 BC 2 0.765065 43400.491297 158229.531250 176934.625000 36260.53125 -17555.4375 158229.531250 194490.0625 140674.09375 176934.625 36260.53125 -17555.4375 0.0 36260.53125 -17555.4375
7 ABC 3 0.763958 43502.578558 174092.593750 148468.562500 -51338.692708 17662.971354 8051.690104 174092.593750 142292.296875 204259.75 151277.46875 193004.890625 143309.765625 219345.640625 148468.5625 -70877.078125 5158.796875 -2808.90625 21181.953125 17894.796875 3826.375 0.03125 -30819.684461 11896.202013 -6700.548802
──────── Test point index: 1385 | True SalePrice: 108959 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1385 1258 4 6
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1385 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 153515.836649 -25066.371115 178582.207765 153515.836649 -25066.371115 -25066.371115
2 B 1 0.651246 52878.678358 178582.207765 87387.142860 -91195.064905 178582.207765 87387.14286 -91195.064905 -91195.064905
3 C 1 0.288147 75546.762088 178582.207765 168987.616589 -9594.591175 178582.207765 168987.616589 -9594.591175 -9594.591175
4 AB 2 0.743635 45336.691057 178582.207765 97788.488001 -13253.943245 -67539.776519 178582.207765 165328.264519 111042.431246 97788.488001 -13253.943245 -67539.776519 -0.0 -13253.943245 -67539.776519
5 AC 2 0.521656 61928.537197 178582.207765 151317.094129 -32325.85322 5060.739585 178582.207765 146256.354544 183642.947349 151317.094129 -32325.85322 5060.739585 -0.0 -32325.85322 5060.739585
6 BC 2 0.704846 48645.794104 178582.207765 92029.687053 -82263.28705 -4289.233661 178582.207765 96318.920714 174292.974103 92029.687053 -82263.28705 -4289.233661 0.0 -82263.28705 -4289.233661
7 ABC 3 0.739246 45723.129778 178582.207765 98103.217193 -16784.385136 -65963.575512 2268.970076 178582.207765 161797.822628 112618.632253 95834.247117 180851.177841 164066.792704 114887.602329 98103.217193 -16784.385136 -65963.575512 2268.970076 0.0 0.0 0.0 0.0 -16784.385136 -65963.575512 2268.970076
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1385 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 152667.793060 -25939.927583 178607.720643 152667.79306 -25939.927583 -25939.927583
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 102256.393189 -3900.939754 -63916.00018 170073.333123 157206.053864 97190.993438 102256.393189 -12867.27926 -72882.339686 17932.679011 -10176.365976 -57640.573958
5 AC 2 0.520030 62033.710779 178562.160107 152160.907748 -29927.533949 3526.28159 178562.160107 148634.626158 182088.441697 152160.907748 -29927.533949 3526.28159 -0.0 -29927.533949 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 95427.443060 -74263.509264 -2557.538583 172248.490907 96913.151575 168619.122257 95427.44306 -75335.339331 -3629.36865 2143.660135 -73290.205907 -3530.84194
7 ABC 3 0.786106 41411.391808 166398.662065 109160.356479 -13759.826737 -45194.891385 1716.412535 166398.662065 149516.86715 117851.808625 107400.555541 167874.564543 151200.407462 119995.336691 109160.356479 -10834.980212 -42040.050982 1759.800938 -5839.176869 383.727127 -76.260627 -591.364961 -11694.208615 -47250.449134 1706.352162
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1385 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 165029.481902 -42560.648637 207590.130539 165029.481902 -42560.648637 -42560.648637
2 B 1 0.725528 46910.500909 162512.760008 105779.163578 -56733.59643 162512.760008 105779.163578 -56733.59643 -56733.59643
3 C 1 0.281669 75889.745660 163151.986814 163151.986814 0.0 163151.986814 163151.986814 0.0 0.0
4 AB 2 0.809248 39107.119385 182789.632430 120456.636131 -15829.524888 -46503.471411 182789.632430 159205.404641 128531.458118 120456.636131 -23584.227789 -54258.174312 15509.405802 -18885.280307 -43447.715991
5 AC 2 0.495569 63594.818608 198958.451050 160199.875297 -38758.575753 0.0 198958.451050 160199.875297 198958.45105 160199.875297 -38758.575753 0.0 0.0 -38758.575753 0.0
6 BC 2 0.775553 42420.682377 157543.014898 112162.409447 -45380.60545 0.0 157543.014898 112162.409447 157543.014898 112162.409447 -45380.60545 0.0 0.0 -45380.60545 0.0
7 ABC 3 0.814589 38555.644091 180774.726888 121951.847058 -17658.186844 -41164.692987 0.0 180774.726888 156021.16156 132514.655417 121951.847058 180774.726888 156021.16156 132514.655417 121951.847058 -10562.808359 -34069.314502 0.0 -14190.756969 0.0 0.0 0.0 -13921.247007 -44901.632824 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1385 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 163633.953125 -31452.28125 195086.234375 163633.953125 -31452.28125 -31452.28125
2 B 1 0.725640 46900.883370 162447.125000 105912.015625 -56535.109375 162447.125000 105912.015625 -56535.109375 -56535.109375
3 C 1 0.287263 75593.665172 163281.500000 163281.500000 0.0 163281.500000 163281.5 0.0 0.0
4 AB 2 0.790517 40982.155774 172155.718750 117276.554688 -13997.238281 -40881.925781 172155.718750 158158.5625 131273.875 117276.554688 -13997.15625 -40881.84375 -0.164062 -13997.198095 -40881.965968
5 AC 2 0.450673 66364.557401 202322.750000 162926.328125 -39396.421875 0.0 202322.750000 162926.328125 202322.75 162926.328125 -39396.421875 0.0 0.0 -39396.421875 0.0
6 BC 2 0.765065 43400.491297 158229.531250 108188.835938 -50040.695312 0.0 158229.531250 108188.835938 158229.53125 108188.835938 -50040.695312 0.0 0.0 -50040.695312 0.0
7 ABC 3 0.763958 43502.578558 174092.593750 120542.585938 -9750.007812 -43800.0 0.0 174092.593750 158989.828125 124939.835938 120542.585938 174092.59375 158989.828125 124939.835938 120542.585938 -4397.25 -38447.242188 0.0 -10705.515625 0.0 0.0 0.0 -5495.986995 -48054.020818 0.0
──────── Test point index: 1577 | True SalePrice: 167500 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1577 1394 5 7
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1577 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 168031.001741 -10551.206024 178582.207765 168031.001741 -10551.206024 -10551.206024
2 B 1 0.651246 52878.678358 178582.207765 131561.813688 -47020.394077 178582.207765 131561.813688 -47020.394077 -47020.394077
3 C 1 0.288147 75546.762088 178582.207765 192438.838398 13856.630634 178582.207765 192438.838398 13856.630634 13856.630634
4 AB 2 0.743635 45336.691057 178582.207765 138179.545011 -5578.992075 -34823.670679 178582.207765 173003.215689 143758.537086 138179.545011 -5578.992075 -34823.670679 -0.0 -5578.992075 -34823.670679
5 AC 2 0.521656 61928.537197 178582.207765 157666.478111 -13606.945164 -7308.784489 178582.207765 164975.2626 171273.423275 157666.478111 -13606.945164 -7308.784489 0.0 -13606.945164 -7308.784489
6 BC 2 0.704846 48645.794104 178582.207765 142361.625580 -42415.148004 6194.565819 178582.207765 136167.059761 184776.773584 142361.62558 -42415.148004 6194.565819 0.0 -42415.148004 6194.565819
7 ABC 3 0.739246 45723.129778 178582.207765 134229.291078 -7065.063576 -34010.977661 -3276.875449 178582.207765 171517.144188 144571.230104 137506.166527 175305.332315 168240.268739 141294.354654 134229.291078 -7065.063576 -34010.977661 -3276.875449 0.0 0.0 0.0 0.0 -7065.063576 -34010.977661 -3276.875449
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1577 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 167688.807853 -10918.91279 178607.720643 167688.807853 -10918.91279 -10918.91279
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 125365.127565 -5416.23332 -39291.972239 170073.333123 164657.099804 130781.360885 125365.127565 -5416.23332 -39291.972239 -0.0 -5416.23332 -39291.972239
5 AC 2 0.520030 62033.710779 178562.160107 160872.040505 -12597.41887 -5092.700732 178562.160107 165964.741237 173469.459375 160872.040505 -12597.41887 -5092.700732 0.0 -12597.41887 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 136087.643616 -41402.427667 5241.580375 172248.490907 130846.06324 177490.071282 136087.643616 -41402.427667 5241.580375 0.0 -41402.427667 5241.580375
7 ABC 3 0.786106 41411.391808 166398.662065 129540.855448 -5707.452073 -28536.740728 -2613.613817 166398.662065 159180.744127 136833.552157 132636.565949 164267.144932 157049.226994 133737.841655 129540.855448 -4196.986208 -27508.371547 -3095.710501 -3020.93173 0.0 964.193368 -0.0 -4596.881043 -29262.746623 -2998.178952
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1577 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 158197.633277 -49392.497261 207590.130539 158197.633277 -49392.497261 -49392.497261
2 B 1 0.725528 46910.500909 162512.760008 135047.645999 -27465.11401 162512.760008 135047.645999 -27465.11401 -27465.11401
3 C 1 0.281669 75889.745660 163151.986814 200750.756833 37598.770019 163151.986814 200750.756833 37598.770019 37598.770019
4 AB 2 0.809248 39107.119385 182789.632430 143978.553894 -20139.459509 -18671.619027 182789.632430 158731.955162 160199.795645 143978.553894 -24057.677268 -22589.836785 7836.435517 -20016.166366 -18794.91217
5 AC 2 0.495569 63594.818608 198958.451050 160288.675586 -39393.024488 723.249025 198958.451050 159286.217471 199402.490983 160288.675586 -39672.233579 444.039933 558.418182 -39107.494434 437.71897
6 BC 2 0.775553 42420.682377 157543.014898 146000.659414 -24129.076307 12586.720824 157543.014898 138391.504384 175107.301515 146000.659414 -19151.510514 17564.286617 -9955.131587 -139270.548369 127728.192886
7 ABC 3 0.814589 38555.644091 180774.726888 144293.581473 -17663.859578 -16549.892104 -2267.393734 180774.726888 157994.898906 162510.184045 147623.574186 181599.417785 157099.337185 154811.986994 144293.581473 -10518.405521 -12805.755712 -3329.992713 -13981.675079 -4368.204338 2434.430993 6088.456956 -17260.782371 -15897.033364 -3323.329681
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1577 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 161489.765625 -33596.46875 195086.234375 161489.765625 -33596.46875 -33596.46875
2 B 1 0.725640 46900.883370 162447.125000 135150.843750 -27296.28125 162447.125000 135150.84375 -27296.28125 -27296.28125
3 C 1 0.287263 75593.665172 163281.500000 200845.468750 37563.96875 163281.500000 200845.46875 37563.96875 37563.96875
4 AB 2 0.790517 40982.155774 172155.718750 140205.796875 -14059.90625 -17890.015625 172155.718750 158095.8125 154265.703125 140205.796875 -14059.90625 -17890.015625 0.0 -14059.90625 -17890.015625
5 AC 2 0.450673 66364.557401 202322.750000 156790.515625 -37226.015625 -8306.21875 202322.750000 165096.734375 194016.53125 156790.515625 -37226.015625 -8306.21875 0.0 -37226.015625 -8306.21875
6 BC 2 0.765065 43400.491297 158229.531250 147097.218750 -24235.5625 13103.25 158229.531250 133993.96875 171332.78125 147097.21875 -24235.5625 13103.25 0.0 -24235.5625 13103.25
7 ABC 3 0.763958 43502.578558 174092.593750 140161.140625 -12161.981771 -19219.497396 -2549.973958 174092.593750 159404.796875 156317.75 144875.234375 173706.75 160825.28125 149797.296875 140161.140625 -9636.15625 -20664.140625 -4714.09375 -3245.3125 -1806.359375 6134.578125 0.03125 -11881.19128 -17882.297117 -4167.964728
──────── Test point index: 1427 | True SalePrice: 228000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1427 2097 7 9
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1427 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 243061.597764 64479.389999 178582.207765 243061.597764 64479.389999 64479.389999
2 B 1 0.651246 52878.678358 178582.207765 219911.155344 41328.947579 178582.207765 219911.155344 41328.947579 41328.947579
3 C 1 0.288147 75546.762088 178582.207765 239341.282015 60759.074251 178582.207765 239341.282015 60759.074251 60759.074251
4 AB 2 0.743635 45336.691057 178582.207765 243284.482223 34093.733457 30608.541001 178582.207765 212675.941221 209190.748766 243284.482223 34093.733457 30608.541001 -0.0 34093.733457 30608.541001
5 AC 2 0.521656 61928.537197 178582.207765 229687.667931 83153.292804 -32047.832637 178582.207765 261735.500568 146534.375127 229687.667931 83153.292804 -32047.832637 -0.0 83153.292804 -32047.832637
6 BC 2 0.704846 48645.794104 178582.207765 243025.502635 37281.130089 27162.164781 178582.207765 215863.337854 205744.372546 243025.502635 37281.130089 27162.164781 0.0 37281.130089 27162.164781
7 ABC 3 0.739246 45723.129778 178582.207765 237283.112321 43175.253017 29894.21804 -14368.5665 178582.207765 221757.460781 208476.425805 251651.678822 164213.641264 207388.894281 194107.859304 237283.112321 43175.253017 29894.21804 -14368.5665 0.0 0.0 0.0 0.0 43175.253017 29894.21804 -14368.5665
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1427 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 245334.200495 66726.479852 178607.720643 245334.200495 66726.479852 66726.479852
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 242702.445299 64866.486624 177835.958675 242702.445299 64866.486624 64866.486624
4 AB 2 0.749226 44839.624931 170073.333123 237708.422890 33099.099739 34535.990028 170073.333123 203172.432862 204609.323151 237708.42289 33099.099739 34535.990028 0.0 33099.099739 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 233215.479543 76983.984812 -22330.665376 178562.160107 255546.144919 156231.494731 233215.479543 76983.984812 -22330.665376 -0.0 76983.984812 -22330.665376
6 BC 2 0.753073 44494.356934 172248.490907 239102.818452 40130.920326 26723.40722 172248.490907 208639.482439 195231.969333 239102.818452 36390.991532 22983.478426 7479.857587 40975.443979 25878.883566
7 ABC 3 0.786106 41411.391808 166398.662065 239259.121067 44066.105089 37444.071416 -8649.717503 166398.662065 210508.022139 203189.349614 247212.199718 157052.305711 201161.665785 195236.270963 239259.121067 44022.850104 38097.455282 -7953.078651 86.50997 -0.0 -1393.277703 0.0 44069.226154 36376.718985 -7585.486138
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1427 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 252701.449543 45111.319005 207590.130539 252701.449543 45111.319005 45111.319005
2 B 1 0.725528 46910.500909 162512.760008 204048.490253 41535.730245 162512.760008 204048.490253 41535.730245 41535.730245
3 C 1 0.281669 75889.745660 163151.986814 244759.355467 81607.368653 163151.986814 244759.355467 81607.368653 81607.368653
4 AB 2 0.809248 39107.119385 182789.632430 231949.983960 9957.177125 39203.174405 182789.632430 200242.930323 229488.927603 231949.98396 17453.297893 46699.295173 -14992.241537 13374.521882 35785.829647
5 AC 2 0.495569 63594.818608 198958.451050 208130.158491 57146.079137 -47974.371696 198958.451050 245717.964506 140597.513674 208130.158491 46759.513456 -58360.937376 20773.131361 -36966.54656 46138.254001
6 BC 2 0.775553 42420.682377 157543.014898 228024.938026 28388.744304 42093.178825 157543.014898 191916.250153 205620.684674 228024.938026 34373.235256 48077.669777 -11968.981904 29383.446113 41098.477016
7 ABC 3 0.814589 38555.644091 180774.726888 219744.435062 27479.06654 30231.98324 -18741.341606 180774.726888 197602.300802 222445.468198 229987.655239 162349.67956 192136.794186 172799.46019 219744.435062 46944.974872 27607.640876 -10243.220178 -17157.860246 -39402.787831 4777.713562 26443.247118 -44359.598494 89752.004664 -6422.697996
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1427 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 263538.906250 68452.671875 195086.234375 263538.90625 68452.671875 68452.671875
2 B 1 0.725640 46900.883370 162447.125000 204150.109375 41702.984375 162447.125000 204150.109375 41702.984375 41702.984375
3 C 1 0.287263 75593.665172 163281.500000 244796.203125 81514.703125 163281.500000 244796.203125 81514.703125 81514.703125
4 AB 2 0.790517 40982.155774 172155.718750 243396.140625 42658.3125 28582.109375 172155.718750 214814.03125 200737.828125 243396.140625 42658.3125 28582.109375 0.0 42658.3125 28582.109375
5 AC 2 0.450673 66364.557401 202322.750000 245032.578125 80836.21875 -38126.390625 202322.750000 283158.90625 164196.296875 245032.578125 80836.15625 -38126.453125 0.125 80836.392836 -38126.564711
6 BC 2 0.765065 43400.491297 158229.531250 229811.984375 36260.523438 35321.929688 158229.531250 194490.0625 193551.46875 229811.984375 36260.53125 35321.9375 -0.015625 36260.523335 35321.92979
7 ABC 3 0.763958 43502.578558 174092.593750 230401.875000 35464.075521 27258.528646 -6413.322917 174092.593750 192931.859375 204259.75 227068.3125 157932.390625 206051.96875 178312.984375 230401.875 52088.890625 24349.90625 3333.5625 -3969.3125 -29280.328125 9786.546875 0.015625 21864.857504 31693.553008 2750.870739
──────── Test point index: 518 | True SalePrice: 229456 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
518 1314 8 6
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 518 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 159492.669334 -19089.53843 178582.207765 159492.669334 -19089.53843 -19089.53843
2 B 1 0.651246 52878.678358 178582.207765 264085.826172 85503.618407 178582.207765 264085.826172 85503.618407 85503.618407
3 C 1 0.288147 75546.762088 178582.207765 168987.616589 -9594.591175 178582.207765 168987.616589 -9594.591175 -9594.591175
4 AB 2 0.743635 45336.691057 178582.207765 231813.185372 -10093.669234 63324.646841 178582.207765 168488.538531 241906.854606 231813.185372 -10093.669234 63324.646841 -0.0 -10093.669234 63324.646841
5 AC 2 0.521656 61928.537197 178582.207765 159024.879799 -24618.06755 5060.739585 178582.207765 153964.140214 183642.947349 159024.879799 -24618.06755 5060.739585 -0.0 -24618.06755 5060.739585
6 BC 2 0.704846 48645.794104 178582.207765 251422.243239 77129.269135 -4289.233661 178582.207765 255711.4769 174292.974103 251422.243239 77129.269135 -4289.233661 0.0 77129.269135 -4289.233661
7 ABC 3 0.739246 45723.129778 178582.207765 229915.682179 -12782.311553 61846.815891 2268.970076 178582.207765 165799.896212 240429.023655 227646.712102 180851.177841 168068.866288 242697.993732 229915.682179 -12782.311553 61846.815891 2268.970076 -0.0 0.0 -0.0 0.0 -12782.311553 61846.815891 2268.970076
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 518 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 158852.916798 -19754.803845 178607.720643 158852.916798 -19754.803845 -19754.803845
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 231724.102765 -9799.20152 71449.971161 170073.333123 160274.131604 241523.304285 231724.102765 -9799.20152 71449.971161 0.0 -9799.20152 71449.971161
5 AC 2 0.520030 62033.710779 178562.160107 159296.837486 -22791.604211 3526.28159 178562.160107 155770.555896 182088.441697 159296.837486 -22791.604211 3526.28159 -0.0 -22791.604211 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 247204.347482 79700.28227 -4744.425695 172248.490907 253063.830223 168619.122257 247204.347482 80815.339316 -3629.36865 -2230.114091 78480.362824 -3524.506249
7 ABC 3 0.786106 41411.391808 166398.662065 234949.880496 -13026.504801 80198.756963 1378.966268 166398.662065 153352.757642 246727.241511 233693.993385 167874.564543 154907.088962 247983.128622 234949.880496 -13033.248126 80042.791534 1255.887111 65.772545 0.0 298.444208 -78.428843 -13031.131977 80323.293461 1259.056947
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 518 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 171220.025307 -36370.105231 207590.130539 171220.025307 -36370.105231 -36370.105231
2 B 1 0.725528 46910.500909 162512.760008 269359.021972 106846.261963 162512.760008 269359.021972 106846.261963 106846.261963
3 C 1 0.281669 75889.745660 163151.986814 163151.986814 0.0 163151.986814 163151.986814 0.0 0.0
4 AB 2 0.809248 39107.119385 182789.632430 206902.567063 -32568.12444 56681.059072 182789.632430 157787.322563 247036.506074 206902.567063 -25002.309868 64246.873644 -15131.629144 -15362.10383 39475.038463
5 AC 2 0.495569 63594.818608 198958.451050 162764.968504 -36193.482546 0.0 198958.451050 162764.968504 198958.45105 162764.968504 -36193.482546 0.0 0.0 -36193.482546 0.0
6 BC 2 0.775553 42420.682377 157543.014898 247791.796952 90248.782054 0.0 157543.014898 247791.796952 157543.014898 247791.796952 90248.782054 0.0 0.0 90248.782054 0.0
7 ABC 3 0.814589 38555.644091 180774.726888 217744.674206 -23083.482263 60053.42958 0.0 180774.726888 157664.68684 240801.598683 217744.674206 180774.726888 157664.68684 240801.598683 217744.674206 -23056.924477 60079.987366 0.0 -53.115572 0.0 0.0 0.0 -23023.845589 59993.792907 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 518 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 168911.546875 -26174.6875 195086.234375 168911.546875 -26174.6875 -26174.6875
2 B 1 0.725640 46900.883370 162447.125000 269223.312500 106776.1875 162447.125000 269223.3125 106776.1875 106776.1875
3 C 1 0.287263 75593.665172 163281.500000 163281.500000 0.0 163281.500000 163281.5 0.0 0.0
4 AB 2 0.790517 40982.155774 172155.718750 237431.468750 -15747.609375 81023.359375 172155.718750 156408.109375 253179.078125 237431.46875 -15747.609375 81023.359375 0.0 -15747.609375 81023.359375
5 AC 2 0.450673 66364.557401 202322.750000 172020.203125 -30302.546875 0.0 202322.750000 172020.203125 202322.75 172020.203125 -30302.546875 0.0 0.0 -30302.546875 0.0
6 BC 2 0.765065 43400.491297 158229.531250 256159.062500 97929.53125 0.0 158229.531250 256159.0625 158229.53125 256159.0625 97929.53125 0.0 0.0 97929.53125 0.0
7 ABC 3 0.763958 43502.578558 174092.593750 218661.468750 -18585.375 63154.25 0.0 174092.593750 158039.046875 239778.671875 218661.46875 174092.59375 158039.046875 239778.671875 218661.46875 -21117.203125 60622.421875 0.0 5063.65625 0.0 0.0 0.0 -23823.940639 68392.815639 0.0
──────── Test point index: 2181 | True SalePrice: 184750 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2181 4676 10 11
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 2181 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 518316.088722 339733.880958 178582.207765 518316.088722 339733.880958 339733.880958
2 B 1 0.651246 52878.678358 178582.207765 352435.167828 173852.960063 178582.207765 352435.167828 173852.960063 173852.960063
3 C 1 0.288147 75546.762088 178582.207765 286243.725633 107661.517868 178582.207765 286243.725633 107661.517868 107661.517868
4 AB 2 0.743635 45336.691057 178582.207765 486974.704646 179635.63836 128756.858521 178582.207765 358217.846125 307339.066286 486974.704646 179635.63836 128756.858521 -0.0 179635.63836 128756.858521
5 AC 2 0.521656 61928.537197 178582.207765 559919.677705 438124.350726 -56786.880785 178582.207765 616706.55849 121795.326979 559919.677705 438124.350726 -56786.880785 -0.0 438124.350726 -56786.880785
6 BC 2 0.704846 48645.794104 178582.207765 383537.518735 156825.547228 48129.763743 178582.207765 335407.754993 226711.971507 383537.518735 156825.547228 48129.763743 0.0 156825.547228 48129.763743
7 ABC 3 0.739246 45723.129778 178582.207765 506358.996464 227485.034659 125752.011592 -25460.257552 178582.207765 406067.242423 304334.219357 531819.254016 153121.950213 380606.984872 278873.961805 506358.996464 227485.034659 125752.011592 -25460.257552 0.0 0.0 0.0 -0.0 227485.034659 125752.011592 -25460.257552
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2181 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 539859.565336 361251.844693 178607.720643 539859.565336 361251.844693 361251.844693
2 B 1 0.653707 52691.751258 167830.652805 357997.585228 190166.932423 167830.652805 357997.585228 190166.932423 190166.932423
3 C 1 0.292963 75290.806037 177835.958675 292775.570492 114939.611817 177835.958675 292775.570492 114939.611817 114939.611817
4 AB 2 0.749226 44839.624931 170073.333123 627703.358242 283815.021212 173815.003907 170073.333123 427650.70996 317650.692654 627703.358242 257577.376836 147577.359531 52475.288752 290938.574453 166691.450666
5 AC 2 0.520030 62033.710779 178562.160107 544612.564823 405257.464227 -39207.059511 178562.160107 584181.199765 139716.676027 544612.564823 405619.039658 -38845.48408 -723.150861 404819.298909 -38768.894193
6 BC 2 0.753073 44494.356934 172248.490907 449973.929119 213372.304348 64353.133863 172248.490907 368768.996693 219749.826208 449973.929119 196520.505786 47501.335301 33703.597125 223663.354657 54062.083555
7 ABC 3 0.786106 41411.391808 166398.662065 551661.554364 232178.895794 168410.842209 -15326.845705 166398.662065 398805.39055 333803.025303 565753.943523 149837.398023 382244.194975 319710.453053 551661.554364 231951.10131 169417.359389 -14092.389159 455.695642 -0.18309 -2468.806416 0.114622 232214.32228 166916.960554 -13868.390536
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 2181 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 386181.566891 178591.436353 207590.130539 386181.566891 178591.436353 178591.436353
2 B 1 0.725528 46910.500909 162512.760008 449028.228831 286515.468823 162512.760008 449028.228831 286515.468823 286515.468823
3 C 1 0.281669 75889.745660 163151.986814 263083.621020 99931.634206 163151.986814 263083.62102 99931.634206 99931.634206
4 AB 2 0.809248 39107.119385 182789.632430 484943.892199 153184.769918 148969.489851 182789.632430 309526.773946 305311.493879 484943.892199 126737.141516 122521.861449 52895.256804 153632.032241 148522.227527
5 AC 2 0.495569 63594.818608 198958.451050 349440.508415 187545.161334 -37063.103969 198958.451050 333891.337958 109283.072655 349440.508415 134932.886908 -89675.378395 105224.548852 448654.358034 -298172.300669
6 BC 2 0.775553 42420.682377 157543.014898 446590.475295 259507.573265 29539.887133 157543.014898 415560.092484 185592.406352 446590.475295 258017.077587 28049.391455 2980.991356 260705.776757 28341.68364
7 ABC 3 0.814589 38555.644091 180774.726888 428058.132886 159209.721838 141768.899392 -53695.215232 180774.726888 325012.394681 294212.618308 500688.91377 140769.185128 263204.117702 259122.249184 428058.132886 168935.883703 164854.015184 -72630.780883 -46500.951129 37540.411759 10822.503904 -15737.67654 198613.286624 151586.376295 -102916.256921
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 2181 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 459439.187500 264352.9375 195086.234375 459439.1875 264352.953125 264352.953125
2 B 1 0.725640 46900.883370 162447.125000 450958.187500 288511.0625 162447.125000 450958.1875 288511.0625 288511.0625
3 C 1 0.287263 75593.665172 163281.500000 296172.187500 132890.6875 163281.500000 296172.1875 132890.6875 132890.6875
4 AB 2 0.790517 40982.155774 172155.718750 542171.750000 151095.578125 218920.453125 172155.718750 323251.0 391075.875 542171.75 151095.28125 218920.15625 0.59375 151095.523707 218920.507543
5 AC 2 0.450673 66364.557401 202322.750000 515571.625000 359299.875 -46050.984375 202322.750000 561622.5625 156271.71875 515571.625 359299.8125 -46051.03125 0.09375 359299.920032 -46051.045032
6 BC 2 0.765065 43400.491297 158229.531250 495223.062500 273418.5 63575.007812 158229.531250 431648.125 221804.609375 495223.0625 273418.59375 63575.078125 -0.140625 273418.479654 63575.051596
7 ABC 3 0.763958 43502.578558 174092.593750 665873.625000 286827.369792 212944.104167 -7990.442708 174092.593750 389124.9375 293089.8125 635943.75 128182.125 358982.34375 307250.9375 665873.625 358622.6875 306891.28125 29929.875 -127822.46875 -15768.75 -60072.46875 0.875 275189.929416 193213.868836 23377.232998
──────── Test point index: 1880 | True SalePrice: 139900 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1880 1422 5 6
=== Subset predictions + SHAP + μ + ReSHAP — Linear Regression | Test idx 1880 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.523397 61815.726690 178582.207765 171019.418083 -7562.789682 178582.207765 171019.418083 -7562.789682 -7562.789682
2 B 1 0.651246 52878.678358 178582.207765 131561.813688 -47020.394077 178582.207765 131561.813688 -47020.394077 -47020.394077
3 C 1 0.288147 75546.762088 178582.207765 168987.616589 -9594.591175 178582.207765 168987.616589 -9594.591175 -9594.591175
4 AB 2 0.743635 45336.691057 178582.207765 139759.682016 -3998.85507 -34823.670679 178582.207765 174583.352695 143758.537086 139759.682016 -3998.85507 -34823.670679 -0.0 -3998.85507 -34823.670679
5 AC 2 0.521656 61928.537197 178582.207765 173889.895020 -9753.052329 5060.739585 178582.207765 168829.155436 183642.947349 173889.89502 -9753.052329 5060.739585 -0.0 -9753.052329 5060.739585
6 BC 2 0.704846 48645.794104 178582.207765 131877.826099 -42415.148004 -4289.233661 178582.207765 136167.059761 174292.974103 131877.826099 -42415.148004 -4289.233661 0.0 -42415.148004 -4289.233661
7 ABC 3 0.739246 45723.129778 178582.207765 141776.173395 -5064.026785 -34010.977661 2268.970076 178582.207765 173518.18098 144571.230104 139507.203319 180851.177841 175787.151056 146840.20018 141776.173395 -5064.026785 -34010.977661 2268.970076 0.0 0.0 0.0 0.0 -5064.026785 -34010.977661 2268.970076
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1880 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 170781.369722 -7826.350921 178607.720643 170781.369722 -7826.350921 -7826.350921
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 126899.166435 -3882.19445 -39291.972239 170073.333123 166191.138674 130781.360885 126899.166435 -3882.19445 -39291.972239 -0.0 -3882.19445 -39291.972239
5 AC 2 0.520030 62033.710779 178562.160107 173058.987696 -9029.454001 3526.28159 178562.160107 169532.706106 182088.441697 173058.987696 -9029.454001 3526.28159 0.0 -9029.454001 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 127216.694590 -41402.427667 -3629.36865 172248.490907 130846.06324 168619.122257 127216.69459 -41402.427667 -3629.36865 0.0 -41402.427667 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 135968.805726 -4090.931363 -28148.640247 1809.715271 166398.662065 161225.073835 136833.552157 133825.277661 167874.564543 162700.976312 138977.080222 135968.805726 -3008.274496 -26732.170586 2143.528066 -2165.313734 0.0 -667.625588 -0.0 -3227.298054 -29404.286969 2201.728683
=== Subset predictions + SHAP + μ + ReSHAP — LightGBM | Test idx 1880 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.494836 63641.022042 207590.130539 166761.015241 -40829.115298 207590.130539 166761.015241 -40829.115298 -40829.115298
2 B 1 0.725528 46910.500909 162512.760008 135047.645999 -27465.11401 162512.760008 135047.645999 -27465.11401 -27465.11401
3 C 1 0.281669 75889.745660 163151.986814 163151.986814 0.0 163151.986814 163151.986814 0.0 0.0
4 AB 2 0.809248 39107.119385 182789.632430 144744.578437 -16805.361797 -21239.692196 182789.632430 164634.126044 160199.795645 144744.578437 -18155.506386 -22589.836785 2700.289179 -16952.29852 -21092.755473
5 AC 2 0.495569 63594.818608 198958.451050 173974.140928 -24984.310122 0.0 198958.451050 173974.140928 198958.45105 173974.140928 -24984.310122 0.0 0.0 -24984.310122 0.0
6 BC 2 0.775553 42420.682377 157543.014898 138391.504384 -19151.510514 0.0 157543.014898 138391.504384 157543.014898 138391.504384 -19151.510514 0.0 0.0 -19151.510514 0.0
7 ABC 3 0.814589 38555.644091 180774.726888 149076.732985 -15068.189235 -16629.804668 0.0 180774.726888 164071.799478 162510.184045 149076.732985 180774.726888 164071.799478 162510.184045 149076.732985 -13433.45106 -14995.066493 0.0 -3269.476351 0.0 0.0 0.0 -14978.390941 -16719.602961 0.0
=== Subset predictions + SHAP + μ + ReSHAP — XGBoost | Test idx 1880 ===
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.462856 65624.546618 195086.234375 165546.140625 -29540.09375 195086.234375 165546.140625 -29540.09375 -29540.09375
2 B 1 0.725640 46900.883370 162447.125000 135150.843750 -27296.28125 162447.125000 135150.84375 -27296.28125 -27296.28125
3 C 1 0.287263 75593.665172 163281.500000 163281.500000 0.0 163281.500000 163281.5 0.0 0.0
4 AB 2 0.790517 40982.155774 172155.718750 141969.359375 -12296.34375 -17890.015625 172155.718750 159859.375 154265.703125 141969.359375 -12296.34375 -17890.015625 0.0 -12296.34375 -17890.015625
5 AC 2 0.450673 66364.557401 202322.750000 170629.843750 -31692.90625 0.0 202322.750000 170629.84375 202322.75 170629.84375 -31692.90625 0.0 0.0 -31692.90625 0.0
6 BC 2 0.765065 43400.491297 158229.531250 133993.968750 -24235.5625 0.0 158229.531250 133993.96875 158229.53125 133993.96875 -24235.5625 0.0 0.0 -24235.5625 0.0
7 ABC 3 0.763958 43502.578558 174092.593750 146376.593750 -11661.476562 -16054.523438 0.0 174092.593750 160710.796875 156317.75 146376.59375 174092.59375 160710.796875 156317.75 146376.59375 -9941.15625 -14334.203125 0.0 -3440.640625 0.0 0.0 0.0 -11350.154796 -16365.845204 0.0

P / R Investigation¶

In [154]:
# ============================================================
# 5) Model factory (parameterized) + training/eval runner
# ============================================================
def make_model_factory(settings, random_state):
    """Return a factory function build_models_for_subset(k) honoring SETTINGS & available libs."""
    def build_models_for_subset(n_features):
        models = {}
        if settings["USE_LINEAR"]:
            models["Linear Regression"] = Pipeline([("model", LinearRegression())])
        if settings["USE_MLP"]:
            models["MLP"] = Pipeline([
                ("scaler", StandardScaler()),
                ("model", MLPRegressor(
                    hidden_layer_sizes=settings["MLP_hidden"],
                    activation="relu", solver="adam",
                    max_iter=settings["MLP_max_iter"],
                    random_state=random_state,
                    early_stopping=settings["MLP_early_stopping"],
                    n_iter_no_change=settings["MLP_n_iter_no_change"],
                    tol=settings["MLP_tol"],
                    learning_rate_init=settings["MLP_lr"],
                    alpha=settings["MLP_alpha"]
                ))
            ])
        if settings["USE_LGB"] and HAS_LGB:
            models["LightGBM"] = lgb.LGBMRegressor(
                n_estimators=500, learning_rate=0.05, num_leaves=31,
                random_state=random_state, verbose=-1
            )
        if settings["USE_XGB"] and HAS_XGB:
            models["XGBoost"] = xgb.XGBRegressor(
                n_estimators=500, learning_rate=0.05, max_depth=6,
                subsample=0.8, colsample_bytree=0.8,
                objective="reg:squarederror",
                random_state=random_state, n_jobs=-1
            )
        return models
    return build_models_for_subset

def _num(x):
    try:
        return float(x)
    except Exception:
        return np.nan

def compute_P_R_metrics(tbl):
    """
    Compute diagnostics + P, R, P/R from AB and ABC rows.
    Returns dict with raw f-values, ψ, ReSHAP, denominators, and metrics.
    """
    out = {"P": np.nan, "R": np.nan, "P/R": np.nan}

    try:
        row_ab  = tbl.loc[tbl["Feature subset"] == "AB"].iloc[0]
        row_abc = tbl.loc[tbl["Feature subset"] == "ABC"].iloc[0]
    except Exception:
        return out

    # --- raw lattice values ---
    f_ab_s  = _num(row_ab["f(AB)"])
    f_ab_a  = _num(row_ab["f(∅)"])
    f_abc_s = _num(row_abc["f(ABC)"])
    f_abc_a = _num(row_abc["f(∅)"])

    # --- SHAP ψ values ---
    psi_ab_a   = abs(_num(row_ab["φ B"]))
    psi_abc_a  = abs(_num(row_abc["φ B"]))


    # --- ReSHAP values ---
    resh_ab_a  = abs(_num(row_ab["ReSHAP B"]))
    resh_abc_a = abs(_num(row_abc["ReSHAP B"]))

    # --- denominators ---
    d_ab  = abs(f_ab_s  - f_ab_a)
    d_abc = abs(f_abc_s - f_abc_a)

    # safe fractions
    with np.errstate(divide="ignore", invalid="ignore"):
        p_ab  = psi_ab_a  / d_ab  if d_ab  not in (0, np.nan) else np.nan
        p_abc = psi_abc_a / d_abc if d_abc not in (0, np.nan) else np.nan
        r_ab  = resh_ab_a / d_ab  if d_ab  not in (0, np.nan) else np.nan
        r_abc = resh_abc_a/ d_abc if d_abc not in (0, np.nan) else np.nan

        P = p_ab - p_abc if p_abc not in (0, np.nan) else np.nan
        R = r_ab - r_abc if r_abc not in (0, np.nan) else np.nan

        PR = P / R       if R    not in (0, np.nan) else np.nan

    return {
        # raw values
        "f_AB(S)": f_ab_s, "f_AB(A)": f_ab_a,
        "f_ABC(S)": f_abc_s, "f_ABC(A)": f_abc_a,
        # diagnostics
        "psi_AB(A)": psi_ab_a, "den_AB": d_ab,
        "psi_ABC(A)": psi_abc_a, "den_ABC": d_abc,
        "ReSHAP_AB(A)": resh_ab_a, "ReSHAP_ABC(A)": resh_abc_a,
        # metrics
        "P": P, "R": R, "P/R": PR
    }


def run_pipeline(settings):
    """
    End-to-end:
    - slice data by MODE
    - split train/test
    - pick one or many test rows (per settings)
    - train each subset model per chosen algorithms (once)
    - for each chosen test row: compute SHAP, lattice f, μ, and ReSHAP
    - display per-model tables per test row
    """
    # Warnings
    if settings["SILENCE_MLP_WARNINGS"]:
        warnings.filterwarnings("ignore", category=ConvergenceWarning)
    else:
        warnings.filterwarnings("default", category=ConvergenceWarning)

    # ---- Data slice
    MODE = settings["MODE"]
    RANDOM_STATE = settings["RANDOM_STATE"]

    df = df_ames.copy()
    df.columns = df.columns.str.strip()

    need = ["Gr Liv Area", "Overall Qual", "SalePrice"]
    if MODE == "totrms":
        need += ["TotRms AbvGrd"]
    base_df = df[need].apply(pd.to_numeric, errors="coerce").dropna()

    if MODE == "two":
        X_full = base_df[["Gr Liv Area", "Overall Qual"]].copy()
        feature_names_full = ["Gr Liv Area", "Overall Qual"]
    elif MODE == "dup_grliv":
        X_full = base_df[["Gr Liv Area", "Overall Qual"]].copy()
        X_full["Gr Liv Area (DUP)"] = X_full["Gr Liv Area"]
        feature_names_full = ["Gr Liv Area", "Overall Qual", "Gr Liv Area (DUP)"]
    elif MODE == "dup_qual":
        X_full = base_df[["Gr Liv Area", "Overall Qual"]].copy()
        X_full["Overall Qual (DUP)"] = X_full["Overall Qual"]
        feature_names_full = ["Gr Liv Area", "Overall Qual", "Overall Qual (DUP)"]
    elif MODE == "totrms":
        X_full = base_df[["Gr Liv Area", "Overall Qual", "TotRms AbvGrd"]].copy()
        feature_names_full = ["Gr Liv Area", "Overall Qual", "TotRms AbvGrd"]
    else:
        raise ValueError("Invalid MODE")

    y_full = base_df["SalePrice"]
    m = len(feature_names_full)
    letters = "ABC"[:m]

    X_tr_full, X_te_full, y_tr, y_te = train_test_split(
        X_full, y_full, test_size=0.2, random_state=RANDOM_STATE
    )

    # ---- Pick one or many test rows
    if settings["PICK_RANDOM_POINT"]:
        random.seed(RANDOM_STATE)
        n = min(int(settings.get("N_RANDOM_POINTS", 1)), len(X_te_full))
        test_indices = random.sample(list(X_te_full.index), n)
    else:
        if settings["CUSTOM_TEST_INDEX"] is not None:
            idx = settings["CUSTOM_TEST_INDEX"]
            if idx not in X_te_full.index:
                raise ValueError(f"CUSTOM_TEST_INDEX {idx} not in test indices.")
            test_indices = [idx]
        else:
            test_indices = [X_te_full.index[0]]

    print(f"Mode: {MODE} | Features (A,B{',C' if m==3 else ''}): {list(zip(list(letters), feature_names_full))}")
    print(f"Evaluating {len(test_indices)} test point(s): {test_indices}")

    # ---- Build subsets & models
    build_models_for_subset = make_model_factory(settings, RANDOM_STATE)
    subsets = all_subsets_indices(m, letters)

    # Determine model order (respect settings + availability)
    model_names = list(build_models_for_subset(1).keys())
    if settings["USE_LGB"] and HAS_LGB and "LightGBM" not in model_names:
        model_names.append("LightGBM")
    if settings["USE_XGB"] and HAS_XGB and "XGBoost" not in model_names:
        model_names.append("XGBoost")

    # container: model_name -> list of dicts {"P":..,"R":..,"P/R":..}
    model_metrics = {mn: [] for mn in model_names}
    
    # ---- Train per subset (once)
    fitted = {}  # (model_name, label) -> dict(model, r2, rmse, x0_S)
    for label, comb in subsets:
        k = len(comb)
        if k == 0:
            # Constant model
            y_hat_test = np.full_like(y_te.values, fill_value=y_tr.mean(), dtype=float)
            r2  = r2_score(y_te.values, y_hat_test)
            rmse = float(np.sqrt(mean_squared_error(y_te.values, y_hat_test)))
            for mn in model_names:
                fitted[(mn, label)] = {"model": None, "r2": r2, "rmse": rmse, "x0_S": None}
            continue

        X_tr_S = X_tr_full.iloc[:, list(comb)].values
        X_te_S = X_te_full.iloc[:, list(comb)].values

        # Baseline
        bl = settings["BASELINE"]
        if bl == "mean":
            x0_S = X_tr_full.iloc[:, list(comb)].mean(axis=0).values.astype(float)
        elif bl == "median":
            x0_S = X_tr_full.iloc[:, list(comb)].median(axis=0).values.astype(float)
        elif bl == "zero":
            x0_S = np.zeros(len(comb), dtype=float)
        else:
            raise ValueError("BASELINE must be one of {'mean','median','zero'}")

        for mn, mdl in build_models_for_subset(k).items():
            mdl.fit(X_tr_S, y_tr.values)
            y_hat_test = mdl.predict(X_te_S)
            r2  = r2_score(y_te.values, y_hat_test)
            rmse = float(np.sqrt(mean_squared_error(y_te.values, y_hat_test)))
            fitted[(mn, label)] = {"model": mdl, "r2": r2, "rmse": rmse, "x0_S": x0_S}

    # ---- Evaluate each chosen point
    for idx in test_indices:
        x_point_full = X_te_full.loc[idx]
        y_true = y_te.loc[idx]

        print(f"\n──────── Test point index: {idx} | True SalePrice: {y_true} ────────")
        display(pd.DataFrame(x_point_full).T)

        for mn in model_names:
            rows = []
            for label, comb in subsets:
                k = len(comb)
                entry = fitted[(mn, label)]
                r2, rmse = entry["r2"], entry["rmse"]

                # defaults
                phi_cols = [""]*m
                mu_cols  = {}
                resh_cols = {}
                lattice_dict = {}
                baseline = float(y_tr.mean()) if k==0 else None
                prediction = baseline

                if k == 0:
                    lattice_dict = {"f(∅)": baseline}
                else:
                    model = entry["model"]
                    x0_S = entry["x0_S"]
                    x_point_S = x_point_full.iloc[list(comb)].values

                    # φ for this point
                    phi_S, f_xS, f_x0S = shap_for_subset_model(model, x_point_S, x0_S)
                    baseline  = float(f_x0S)
                    prediction = float(f_xS)

                    # Align φ into global A,B,(C)
                    for j in range(m):
                        if j in comb:
                            pos = list(comb).index(j)
                            phi_cols[j] = float(phi_S[pos])
                        else:
                            phi_cols[j] = ""

                    # Lattice
                    lattice_dict = lattice_predictions_for_row(model, comb, letters, x_point_full, x0_S)

                    # μ and ReSHAP
                    if k == 2:
                        sub_letters = "".join(letters[j] for j in comb)  # "AB"/"AC"/"BC"
                        mu_cols = mu_from_f_upper_2f(lattice_dict, sub_letters)
                        resh_cols = reshAP_from_mu_recursive(mu_cols, sub_letters)
                    elif k == 3:
                        mu_cols = mu_from_f_upper_3f(lattice_dict)
                        resh_cols = reshAP_from_mu_recursive(mu_cols, "ABC")
                    elif k == 1:
                        sl = "".join(letters[j] for j in comb)  # "A"/"B"/"C"
                        f0 = float(lattice_dict["f(∅)"])
                        fA = float(lattice_dict[f"f({sl})"])
                        mu_cols = {f"μ({sl})": (fA - f0)}
                        resh_cols = reshAP_from_mu_recursive(mu_cols, sl)

                # row
                row = {
                    "Feature subset": label,
                    "n_features": k,
                    "R2 (subset test)": r2,
                    "RMSE (subset test)": rmse,
                    "Baseline f(∅)": baseline,
                    "Prediction f(S)": prediction,
                    "φ A": phi_cols[0] if m>=1 else "",
                    "φ B": phi_cols[1] if m>=2 else "",
                    "φ C": phi_cols[2] if m>=3 else "",
                    "f(∅)": lattice_dict.get("f(∅)", ""),
                    "f(A)": lattice_dict.get("f(A)", ""),
                    "f(B)": lattice_dict.get("f(B)", ""),
                    "f(AB)": lattice_dict.get("f(AB)", ""),
                }
                if m == 3:
                    row.update({
                        "f(C)": lattice_dict.get("f(C)", ""),
                        "f(AC)": lattice_dict.get("f(AC)", ""),
                        "f(BC)": lattice_dict.get("f(BC)", ""),
                        "f(ABC)": lattice_dict.get("f(ABC)", ""),
                    })
                for key in ["μ(A)","μ(B)","μ(C)","μ(AB)","μ(AC)","μ(BC)","μ(ABC)"]:
                    row[key] = mu_cols.get(key, "")
                for key in ["ReSHAP A","ReSHAP B","ReSHAP C"]:
                    row[key] = resh_cols.get(key, "")

                rows.append(row)

            # display table for this point + model
            base_cols = ["Feature subset","n_features","R2 (subset test)","RMSE (subset test)",
                         "Baseline f(∅)","Prediction f(S)","φ A","φ B"]
            if m==3: base_cols += ["φ C"]
            lattice_cols = ["f(∅)","f(A)","f(B)","f(AB)"]
            if m==3: lattice_cols += ["f(C)","f(AC)","f(BC)","f(ABC)"]
            mu_cols_all = ["μ(A)","μ(B)","μ(AB)"]
            if m==3: mu_cols_all = ["μ(A)","μ(B)","μ(C)","μ(AB)","μ(AC)","μ(BC)","μ(ABC)"]
            resh_cols_all = ["ReSHAP A","ReSHAP B"]
            if m==3: resh_cols_all += ["ReSHAP C"]

            ordered_cols = base_cols + lattice_cols + mu_cols_all + resh_cols_all
            tbl = pd.DataFrame(rows)[ordered_cols]
            
            # --- NEW: compute P, R, P/R ---
            metrics = compute_P_R_metrics(tbl)
            #model_metrics[mn].append(metrics)

            print(f"\n=== Subset predictions + SHAP + μ + ReSHAP — {mn} | Test idx {idx} ===")
            
            # Raw f-values
            print(f"f_AB(S)   = {metrics['f_AB(S)']}")
            print(f"f_AB(A)   = {metrics['f_AB(A)']}")
            print(f"f_ABC(S)  = {metrics['f_ABC(S)']}")
            print(f"f_ABC(A)  = {metrics['f_ABC(A)']}")
            
            # SHAP ψ terms + denominators
            print(f"|psi_AB(A)|    = {metrics['psi_AB(A)']}")
            print(f"|f_AB(S)-f_A|  = {metrics['den_AB']}")
            print(f"|psi_ABC(A)|   = {metrics['psi_ABC(A)']}")
            print(f"|f_ABC(S)-f_A| = {metrics['den_ABC']}")
            
            # ReSHAP terms
            print(f"|ReSHAP_AB(A)|  = {metrics['ReSHAP_AB(A)']}")
            print(f"|ReSHAP_ABC(A)| = {metrics['ReSHAP_ABC(A)']}")
            
            # Final metrics
            print(f"\nP   = {metrics['P']}")
            print(f"R   = {metrics['R']}")
            print(f"P/R = {metrics['P/R']}")
            
            display(tbl)

            # Build the ordered table (required to extract the values),
            # but do not display it per point
            ordered_cols = base_cols + lattice_cols + mu_cols_all + resh_cols_all
            tbl = pd.DataFrame(rows)[ordered_cols]
            
            # Compute P/R metrics for this (model, test point) and collect
            metrics = compute_P_R_metrics(tbl)
            model_metrics[mn].append(metrics)
    # ===== Overall Summary: mean of |P/R| across all chosen points =====
    summary_rows = []
    for mn, vals in model_metrics.items():
        # take absolute of per-point ratios, then average
        PR_abs_list = []
        for v in vals:
            pr = v.get("P/R", np.nan)
            if np.isfinite(pr):
                PR_abs_list.append(abs(pr))
    
        mean_abs_PR = float(np.mean(PR_abs_list)) if PR_abs_list else np.nan
        n_pts = len(vals)  # should match N_RANDOM_POINTS if all valid
    
        summary_rows.append({
            "Model": mn,
            "mean(|P/R|)": mean_abs_PR,
            "n_points": n_pts
        })
    
    summary_df = pd.DataFrame(summary_rows).set_index("Model")
    print("\n================ Overall Summary (averaged over test points) ================")
    display(summary_df[["mean(|P/R|)", "n_points"]])
 
In [150]:
# ============================================================
# 6) Settings & Run (edit here, then run this cell)
# ============================================================

SETTINGS = dict(
    # Data slice / feature modes:
    MODE="dup_qual",           # 'two' | 'dup_grliv' | 'dup_qual' | 'totrms'

    # Baseline used inside each subset for f(∅):
    BASELINE="mean",           # 'mean' | 'median' | 'zero'

    # Randomness & test row:
    RANDOM_STATE=42,
    PICK_RANDOM_POINT=True,    # True → draw a random test row (with the seed)
    N_RANDOM_POINTS=100,         # ← NEW: number of distinct random test points to assess
    CUSTOM_TEST_INDEX=None,    # if PICK_RANDOM_POINT=False, use this test index (must be in X_te)

    # Which models to run:
    USE_LINEAR=False,
    USE_MLP=True,
    USE_LGB=False,              # will only run if LightGBM is installed
    USE_XGB=False,              # will only run if XGBoost is installed

    # MLP hyperparameters (used if USE_MLP=True):
    MLP_hidden=(64, 32),
    MLP_max_iter=5000,
    MLP_early_stopping=True,
    MLP_n_iter_no_change=20,
    MLP_tol=1e-4,
    MLP_lr=1e-3,
    MLP_alpha=1e-4,

    # Warnings:
    SILENCE_MLP_WARNINGS=True
)

# ---- Go! (expects df_ames to already be in memory) ----
run_pipeline(SETTINGS)

          
Mode: dup_qual | Features (A,B,C): [('A', 'Gr Liv Area'), ('B', 'Overall Qual'), ('C', 'Overall Qual (DUP)')]
Evaluating 100 test point(s): [1093, 1032, 2783, 411, 1385, 1577, 1427, 518, 2181, 1880, 961, 1476, 1423, 1398, 2212, 96, 1080, 2044, 1397, 2786, 2233, 2684, 2359, 2227, 781, 2351, 1873, 1337, 2550, 211, 1557, 1910, 881, 554, 1462, 87, 2652, 478, 1505, 1283, 485, 134, 2605, 56, 282, 2240, 2111, 581, 1189, 70, 1878, 259, 495, 1506, 2841, 1608, 2916, 1691, 2394, 188, 1117, 555, 175, 366, 456, 506, 1538, 2381, 1820, 1600, 1486, 1221, 651, 1465, 120, 1058, 2659, 755, 1226, 2873, 2554, 1364, 291, 2662, 2834, 1264, 218, 2028, 2726, 1756, 856, 2718, 283, 2525, 527, 2421, 1509, 1662, 594, 1362]

──────── Test point index: 1093 | True SalePrice: 226750 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1093 1892 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1093 ===
f_AB(S)   = 226477.06687724203
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 225058.58968850703
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 21867.743725858847
|f_AB(S)-f_A|  = 56403.733753776585
|psi_ABC(A)|   = 24436.11332120607
|f_ABC(S)-f_A| = 58553.46195598788
|ReSHAP_AB(A)|  = 21867.743725858843
|ReSHAP_ABC(A)| = 24414.61874371478

P   = -0.029629663181037713
R   = -0.029262569984952302
P/R = 1.0125448036954436
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 222692.229667 44084.509025 178607.720643 222692.229667 44084.509025 44084.509025
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 226477.066877 21867.743726 34535.990028 170073.333123 191941.076849 204609.323151 226477.066877 21867.743726 34535.990028 0.0 21867.743726 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 226477.066877 21867.743726 34535.990028 170073.333123 191941.076849 204609.323151 226477.066877 21867.743726 34535.990028 0.0 21867.743726 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 225058.589689 24436.113321 17063.785283 17053.563352 166505.127733 190864.068295 183491.465006 207850.405569 183481.243075 207840.183638 200468.13085 225058.589689 24590.458838 17218.40605 17208.18412 -231.518276 -231.518276 -232.068777 231.518276 24414.618744 17074.501554 17064.341658
──────── Test point index: 1032 | True SalePrice: 306000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1032 2340 8 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1032 ===
f_AB(S)   = 293320.0938323849
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 303870.4711899712
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 49104.5775969823
|f_AB(S)-f_A|  = 123246.76070891946
|psi_ABC(A)|   = 52128.01218914255
|f_ABC(S)-f_A| = 137365.34345745205
|ReSHAP_AB(A)|  = 48532.66851551276
|ReSHAP_ABC(A)| = 52098.50042511663

P   = 0.01894044145569085
R   = 0.014514924951027519
P/R = 1.3048942050747596
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 272173.219573 93565.49893 178607.720643 272173.219573 93565.49893 93565.49893
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
4 AB 2 0.749226 44839.624931 170073.333123 293320.093832 49104.577597 74142.183112 170073.333123 216485.69877 241523.304285 293320.093832 46412.365646 71449.971161 5384.423901 48532.668516 74714.092193
5 AC 2 0.749226 44839.624931 170073.333123 293320.093832 49104.577597 74142.183112 170073.333123 216485.69877 241523.304285 293320.093832 46412.365646 71449.971161 5384.423901 48532.668516 74714.092193
6 BC 2 0.697364 49258.516562 161570.634319 276370.891790 57370.593657 57429.663814 161570.634319 218830.444431 218889.514587 276370.89179 57259.810111 57318.880268 221.567092 57370.536544 57429.720927
7 ABC 3 0.797024 40340.721518 166505.127733 303870.471190 52128.012189 42686.830311 42550.500957 166505.127733 218204.839889 202112.051547 254454.213752 201975.722193 254317.884398 251528.308985 303870.47119 52342.162205 49552.586792 49416.257438 0.0 0.0 -13303.212929 -642.450048 52098.500425 42692.148956 42574.694076
──────── Test point index: 2783 | True SalePrice: 174000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2783 1282 6 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2783 ===
f_AB(S)   = 156142.95321827102
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 151298.1784716801
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 11552.38879976097
|f_AB(S)-f_A|  = 13930.379905194422
|psi_ABC(A)|   = 12868.449331459371
|f_ABC(S)-f_A| = 15206.949260839057
|ReSHAP_AB(A)|  = 11552.38879976098
|ReSHAP_ABC(A)| = 12868.449331459384

P   = -0.016927028361455765
R   = -0.016927028361455987
P/R = 0.9999999999999869
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 155318.560376 -23289.160267 178607.720643 155318.560376 -23289.160267 -23289.160267
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
4 AB 2 0.749226 44839.624931 170073.333123 156142.953218 -11552.3888 -2377.991105 170073.333123 158520.944324 167695.342018 156142.953218 -11552.3888 -2377.991105 -0.0 -11552.3888 -2377.991105
5 AC 2 0.749226 44839.624931 170073.333123 156142.953218 -11552.3888 -2377.991105 170073.333123 158520.944324 167695.342018 156142.953218 -11552.3888 -2377.991105 -0.0 -11552.3888 -2377.991105
6 BC 2 0.697364 49258.516562 161570.634319 157757.239664 -1905.715523 -1907.679133 161570.634319 159664.918797 159662.955187 157757.239664 -1905.715523 -1907.679133 0.0 -1905.715523 -1907.679133
7 ABC 3 0.797024 40340.721518 166505.127733 151298.178472 -12868.449331 -1169.601883 -1168.898047 166505.127733 153636.678401 165335.52585 152467.076519 165336.229686 152467.780354 164166.627803 151298.178472 -12868.449331 -1169.601883 -1168.898047 0.0 -0.0 0.0 0.0 -12868.449331 -1169.601883 -1168.898047
──────── Test point index: 411 | True SalePrice: 123000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
411 1055 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 411 ===
f_AB(S)   = 180620.26208389158
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 173745.6972434084
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 23989.061067491624
|f_AB(S)-f_A|  = 10546.928960426128
|psi_ABC(A)|   = 26578.654253096665
|f_ABC(S)-f_A| = 7240.56951088924
|ReSHAP_AB(A)|  = 23989.061067491784
|ReSHAP_ABC(A)| = 28405.7825894105

P   = -1.3962893446798752
R   = -1.6486352870212708
P/R = 0.8469364665866576
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 130246.719509 -48361.001134 178607.720643 130246.719509 -48361.001134 -48361.001134
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 180620.262084 -23989.061067 34535.990028 170073.333123 146084.272056 204609.323151 180620.262084 -23989.061067 34535.990028 0.0 -23989.061067 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 180620.262084 -23989.061067 34535.990028 170073.333123 146084.272056 204609.323151 180620.262084 -23989.061067 34535.990028 0.0 -23989.061067 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 173745.697243 -26578.654253 16914.722847 16904.500917 166505.127733 140213.519233 183491.465006 156769.544354 183481.243075 156759.322423 200468.13085 173745.697243 -26722.433607 16986.374821 16976.15289 0.512954 0.512954 -430.3497 429.799199 -28405.782589 17828.539913 17817.812187
──────── Test point index: 1385 | True SalePrice: 108959 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1385 1258 4 4
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1385 ===
f_AB(S)   = 102256.39318916843
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 112880.82914882246
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 3900.93975408173
|f_AB(S)-f_A|  = 67816.93993429701
|psi_ABC(A)|   = 9482.786534445278
|f_ABC(S)-f_A| = 53624.29858369668
|ReSHAP_AB(A)|  = 10176.365975821107
|ReSHAP_ABC(A)| = 7717.1863247588835

P   = -0.11931588045816871
R   = 0.006144284757585827
P/R = -19.419002400704088
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 152667.793060 -25939.927583 178607.720643 152667.79306 -25939.927583 -25939.927583
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
4 AB 2 0.749226 44839.624931 170073.333123 102256.393189 -3900.939754 -63916.00018 170073.333123 157206.053864 97190.993438 102256.393189 -12867.27926 -72882.339686 17932.679011 -10176.365976 -57640.573958
5 AC 2 0.749226 44839.624931 170073.333123 102256.393189 -3900.939754 -63916.00018 170073.333123 157206.053864 97190.993438 102256.393189 -12867.27926 -72882.339686 17932.679011 -10176.365976 -57640.573958
6 BC 2 0.697364 49258.516562 161570.634319 123838.112811 -18897.059703 -18835.461806 161570.634319 120942.918792 121004.516689 123838.112811 -40627.715527 -40566.117631 43461.311649 -18880.573688 -18851.94782
7 ABC 3 0.797024 40340.721518 166505.127733 112880.829149 -9482.786534 -21858.224524 -22283.287525 166505.127733 152171.994209 136767.390763 128548.419598 136359.403222 128106.281139 118760.008604 112880.829149 -5879.179455 -15225.45199 -15667.590449 -2373.942628 -2339.791709 -8398.122621 -3740.219732 -7717.186325 -22657.362436 -23249.749823
──────── Test point index: 1577 | True SalePrice: 167500 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1577 1394 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1577 ===
f_AB(S)   = 125365.12756503362
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 132295.25727372261
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 5416.233319647181
|f_AB(S)-f_A|  = 44708.20555843183
|psi_ABC(A)|   = 4840.536576073733
|f_ABC(S)-f_A| = 34209.87045879653
|ReSHAP_AB(A)|  = 5416.233319647175
|ReSHAP_ABC(A)| = 4264.736729123379

P   = -0.020349028372194822
R   = -0.0035176332447603847
P/R = 5.784863559185791
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 167688.807853 -10918.91279 178607.720643 167688.807853 -10918.91279 -10918.91279
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 125365.127565 -5416.23332 -39291.972239 170073.333123 164657.099804 130781.360885 125365.127565 -5416.23332 -39291.972239 -0.0 -5416.23332 -39291.972239
5 AC 2 0.749226 44839.624931 170073.333123 125365.127565 -5416.23332 -39291.972239 170073.333123 164657.099804 130781.360885 125365.127565 -5416.23332 -39291.972239 -0.0 -5416.23332 -39291.972239
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 132295.257274 -4840.536576 -14615.114407 -14754.219476 166505.127733 160471.8713 147859.950307 143065.757178 147746.497458 142900.999889 135963.76522 132295.257274 -3668.507946 -10605.742616 -10770.499904 -1176.989622 -1125.685183 -6800.371506 -62.073681 -4264.736729 -14881.224561 -15063.909168
──────── Test point index: 1427 | True SalePrice: 228000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1427 2097 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1427 ===
f_AB(S)   = 237708.4228899503
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 237724.9007031596
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 33099.099738567136
|f_AB(S)-f_A|  = 67635.08976648486
|psi_ABC(A)|   = 36998.77975590934
|f_ABC(S)-f_A| = 71219.77297064045
|ReSHAP_AB(A)|  = 33099.099738567136
|ReSHAP_ABC(A)| = 36929.55476788414

P   = -0.03012388508840691
R   = -0.029151893952550023
P/R = 1.0333422980146325
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 245334.200495 66726.479852 178607.720643 245334.200495 66726.479852 66726.479852
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 237708.422890 33099.099739 34535.990028 170073.333123 203172.432862 204609.323151 237708.42289 33099.099739 34535.990028 0.0 33099.099739 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 237708.422890 33099.099739 34535.990028 170073.333123 203172.432862 204609.323151 237708.42289 33099.099739 34535.990028 0.0 33099.099739 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 237724.900703 36998.779756 17115.607573 17105.385642 166505.127733 203374.91244 183491.465006 220361.249714 183481.243075 220351.027783 200468.13085 237724.900703 37256.769853 17373.87292 17363.650989 -386.985146 -386.985146 -387.535647 386.985146 36929.554768 17150.16583 17140.052373
──────── Test point index: 518 | True SalePrice: 229456 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
518 1314 8 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 518 ===
f_AB(S)   = 231724.10276500593
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 240477.12923371006
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 9799.201519728464
|f_AB(S)-f_A|  = 61650.76964154048
|psi_ABC(A)|   = 10994.046018125073
|f_ABC(S)-f_A| = 73972.00150119091
|ReSHAP_AB(A)|  = 9799.201519728464
|ReSHAP_ABC(A)| = 11081.352034301839

P   = 0.010322522772031217
R   = 0.00914226518260583
P/R = 1.129099032444493
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 158852.916798 -19754.803845 178607.720643 158852.916798 -19754.803845 -19754.803845
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
4 AB 2 0.749226 44839.624931 170073.333123 231724.102765 -9799.20152 71449.971161 170073.333123 160274.131604 241523.304285 231724.102765 -9799.20152 71449.971161 0.0 -9799.20152 71449.971161
5 AC 2 0.749226 44839.624931 170073.333123 231724.102765 -9799.20152 71449.971161 170073.333123 160274.131604 241523.304285 231724.102765 -9799.20152 71449.971161 0.0 -9799.20152 71449.971161
6 BC 2 0.697364 49258.516562 161570.634319 276370.891790 57370.593657 57429.663814 161570.634319 218830.444431 218889.514587 276370.89179 57259.810111 57318.880268 221.567092 57370.536544 57429.720927
7 ABC 3 0.797024 40340.721518 166505.127733 240477.129234 -10994.046018 42533.309176 42432.738344 166505.127733 155589.590658 202112.051547 191060.871796 201975.722193 190996.059486 251528.308985 240477.129234 -11051.179751 49481.069747 49416.257438 71.517045 0.0 -14009.788609 64.125632 -11081.352034 42600.597855 42452.755681
──────── Test point index: 2181 | True SalePrice: 184750 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2181 4676 10 10
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2181 ===
f_AB(S)   = 627703.3582424738
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 550268.367511139
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 283815.0212120809
|f_AB(S)-f_A|  = 457630.0251190084
|psi_ABC(A)|   = 195700.81821175863
|f_ABC(S)-f_A| = 383763.23977861984
|ReSHAP_AB(A)|  = 290938.5744531937
|ReSHAP_ABC(A)| = 195280.23432588394

P   = 0.11023247580007733
R   = 0.12689460620083182
P/R = 0.8686931548975069
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 539859.565336 361251.844693 178607.720643 539859.565336 361251.844693 361251.844693
2 B 1 0.653707 52691.751258 167830.652805 357997.585228 190166.932423 167830.652805 357997.585228 190166.932423 190166.932423
3 C 1 0.653707 52691.751258 167830.652805 357997.585228 190166.932423 167830.652805 357997.585228 190166.932423 190166.932423
4 AB 2 0.749226 44839.624931 170073.333123 627703.358242 283815.021212 173815.003907 170073.333123 427650.70996 317650.692654 627703.358242 257577.376836 147577.359531 52475.288752 290938.574453 166691.450666
5 AC 2 0.749226 44839.624931 170073.333123 627703.358242 283815.021212 173815.003907 170073.333123 427650.70996 317650.692654 627703.358242 257577.376836 147577.359531 52475.288752 290938.574453 166691.450666
6 BC 2 0.697364 49258.516562 161570.634319 408111.076369 122989.130608 123551.311442 161570.634319 278211.79659 278773.977424 408111.076369 116641.16227 117203.343104 12695.936675 122973.869586 123566.572464
7 ABC 3 0.797024 40340.721518 166505.127733 550268.367511 195700.818212 94169.666582 93892.754985 166505.127733 360595.490204 253313.842793 449848.307854 253036.646783 449571.68067 353791.024821 550268.367511 196477.34269 100696.686841 100420.059658 57.691197 57.122371 -11443.869191 -2501.793787 195280.234326 94371.214413 94111.79104
──────── Test point index: 1880 | True SalePrice: 139900 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1880 1422 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1880 ===
f_AB(S)   = 126899.16643506207
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 133334.95627988945
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 3882.1944496187134
|f_AB(S)-f_A|  = 43174.16668840338
|psi_ABC(A)|   = 3493.0708161314396
|f_ABC(S)-f_A| = 33170.1714526297
|ReSHAP_AB(A)|  = 3882.1944496186957
|ReSHAP_ABC(A)| = 2967.6265278924257

P   = -0.015388206566436274
R   = 0.00045266085233444664
P/R = -33.995001969082935
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 170781.369722 -7826.350921 178607.720643 170781.369722 -7826.350921 -7826.350921
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 126899.166435 -3882.19445 -39291.972239 170073.333123 166191.138674 130781.360885 126899.166435 -3882.19445 -39291.972239 -0.0 -3882.19445 -39291.972239
5 AC 2 0.749226 44839.624931 170073.333123 126899.166435 -3882.19445 -39291.972239 170073.333123 166191.138674 130781.360885 126899.166435 -3882.19445 -39291.972239 -0.0 -3882.19445 -39291.972239
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 133334.956280 -3493.070816 -14768.033664 -14909.066973 166505.127733 162180.669524 147859.950307 144361.585467 147746.497458 144192.971698 135963.76522 133334.95628 -2628.80894 -10858.015418 -11026.629187 -924.716819 -869.5559 -6961.068639 98.623451 -2967.626528 -15011.574086 -15190.970839
──────── Test point index: 961 | True SalePrice: 201000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
961 1466 8 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 961 ===
f_AB(S)   = 240051.74234516037
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 249868.73544945248
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 1471.5619395740214
|f_AB(S)-f_A|  = 69978.40922169492
|psi_ABC(A)|   = 1652.7836419233354
|f_ABC(S)-f_A| = 83363.60771693333
|ReSHAP_AB(A)|  = 1471.5619395740214
|ReSHAP_ABC(A)| = 1659.9791468513438

P   = 0.001202598547987925
R   = 0.0011162838463444763
P/R = 1.0773232560214014
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 175641.109802 -2966.610841 178607.720643 175641.109802 -2966.610841 -2966.610841
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
4 AB 2 0.749226 44839.624931 170073.333123 240051.742345 -1471.56194 71449.971161 170073.333123 168601.771184 241523.304285 240051.742345 -1471.56194 71449.971161 0.0 -1471.56194 71449.971161
5 AC 2 0.749226 44839.624931 170073.333123 240051.742345 -1471.56194 71449.971161 170073.333123 168601.771184 241523.304285 240051.742345 -1471.56194 71449.971161 0.0 -1471.56194 71449.971161
6 BC 2 0.697364 49258.516562 161570.634319 276370.891790 57370.593657 57429.663814 161570.634319 218830.444431 218889.514587 276370.89179 57259.810111 57318.880268 221.567092 57370.536544 57429.720927
7 ABC 3 0.797024 40340.721518 166505.127733 249868.735449 -1652.783642 42576.360357 42440.031002 166505.127733 164865.923877 202112.051547 200452.478012 201975.722193 200316.148657 251528.308985 249868.735449 -1659.573535 49552.586792 49416.257438 0.0 -0.0 -13966.032658 20.36968 -1659.979147 42570.353329 42453.233535
──────── Test point index: 1476 | True SalePrice: 185900 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1476 1582 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1476 ===
f_AB(S)   = 209493.06510192703
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 205909.80637388042
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 4883.741950543859
|f_AB(S)-f_A|  = 39419.73197846158
|psi_ABC(A)|   = 5440.627224896758
|f_ABC(S)-f_A| = 39404.67864136127
|ReSHAP_AB(A)|  = 4883.741950543859
|ReSHAP_ABC(A)| = 5441.1295093596245

P   = -0.014179794469220075
R   = -0.014192541292190969
P/R = 0.9991018646549292
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 188453.151831 9845.431188 178607.720643 188453.151831 9845.431188 9845.431188
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 209493.065102 4883.741951 34535.990028 170073.333123 174957.075074 204609.323151 209493.065102 4883.741951 34535.990028 0.0 4883.741951 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 209493.065102 4883.741951 34535.990028 170073.333123 174957.075074 204609.323151 209493.065102 4883.741951 34535.990028 0.0 4883.741951 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 205909.806374 5440.627225 16987.136674 16976.914743 166505.127733 171945.230808 183491.465006 188931.568082 183481.243075 188921.346151 200468.13085 205909.806374 5441.675524 16988.460223 16978.238292 -1.572448 -1.572448 -2.122949 1.572448 5441.129509 16986.885329 16976.663803
──────── Test point index: 1423 | True SalePrice: 177439 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1423 1356 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1423 ===
f_AB(S)   = 197111.1799366974
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 192115.5898806097
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 7498.143214685784
|f_AB(S)-f_A|  = 27037.84681323194
|psi_ABC(A)|   = 8352.406815002149
|f_ABC(S)-f_A| = 25610.462148090563
|ReSHAP_AB(A)|  = 7498.143214685784
|ReSHAP_ABC(A)| = 8352.864905468125

P   = -0.048812329363258744
R   = -0.048830216212130406
P/R = 0.9996336930233125
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 163491.759602 -15115.961041 178607.720643 163491.759602 -15115.961041 -15115.961041
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 197111.179937 -7498.143215 34535.990028 170073.333123 162575.189909 204609.323151 197111.179937 -7498.143215 34535.990028 0.0 -7498.143215 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 197111.179937 -7498.143215 34535.990028 170073.333123 162575.189909 204609.323151 197111.179937 -7498.143215 34535.990028 0.0 -7498.143215 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 192115.589881 -8352.406815 16986.545447 16976.323516 166505.127733 158152.787995 183491.465006 175139.125269 183481.243075 175128.903338 200468.13085 192115.589881 -8352.54097 16986.686543 16976.464612 0.201232 0.201232 -0.349269 -0.201232 -8352.864905 16986.774284 16976.552769
──────── Test point index: 1398 | True SalePrice: 173000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1398 1291 6 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1398 ===
f_AB(S)   = 156636.03714078016
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 151847.4350438877
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 11059.304877251838
|f_AB(S)-f_A|  = 13437.29598268529
|psi_ABC(A)|   = 12319.192759251766
|f_ABC(S)-f_A| = 14657.692688631447
|ReSHAP_AB(A)|  = 11059.304877251847
|ReSHAP_ABC(A)| = 12319.192759251775

P   = -0.017428672836123926
R   = -0.017428672836123926
P/R = 1.0
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 156312.598120 -22295.122523 178607.720643 156312.59812 -22295.122523 -22295.122523
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
4 AB 2 0.749226 44839.624931 170073.333123 156636.037141 -11059.304877 -2377.991105 170073.333123 159014.028246 167695.342018 156636.037141 -11059.304877 -2377.991105 -0.0 -11059.304877 -2377.991105
5 AC 2 0.749226 44839.624931 170073.333123 156636.037141 -11059.304877 -2377.991105 170073.333123 159014.028246 167695.342018 156636.037141 -11059.304877 -2377.991105 -0.0 -11059.304877 -2377.991105
6 BC 2 0.697364 49258.516562 161570.634319 157757.239664 -1905.715523 -1907.679133 161570.634319 159664.918797 159662.955187 157757.239664 -1905.715523 -1907.679133 0.0 -1905.715523 -1907.679133
7 ABC 3 0.797024 40340.721518 166505.127733 151847.435044 -12319.192759 -1169.601883 -1168.898047 166505.127733 154185.934973 165335.52585 153016.333091 165336.229686 153017.036926 164166.627803 151847.435044 -12319.192759 -1169.601883 -1168.898047 0.0 -0.0 0.0 0.0 -12319.192759 -1169.601883 -1168.898047
──────── Test point index: 2212 | True SalePrice: 153500 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2212 1226 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2212 ===
f_AB(S)   = 116160.89434486291
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 126988.7317798773
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 14620.466539817877
|f_AB(S)-f_A|  = 53912.43877860253
|psi_ABC(A)|   = 12414.996423732644
|f_ABC(S)-f_A| = 39516.39595264185
|ReSHAP_AB(A)|  = 14620.46653981787
|ReSHAP_ABC(A)| = 12415.201021371544

P   = -0.04298418292176187
R   = -0.04298936045969198
P/R = 0.9998795623411293
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 149133.436638 -29474.284005 178607.720643 149133.436638 -29474.284005 -29474.284005
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 116160.894345 -14620.46654 -39291.972239 170073.333123 155452.866584 130781.360885 116160.894345 -14620.46654 -39291.972239 -0.0 -14620.46654 -39291.972239
5 AC 2 0.749226 44839.624931 170073.333123 116160.894345 -14620.46654 -39291.972239 170073.333123 155452.866584 130781.360885 116160.894345 -14620.46654 -39291.972239 -0.0 -14620.46654 -39291.972239
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 126988.731780 -12414.996424 -13462.374768 -13639.02476 166505.127733 150219.081952 147859.950307 135939.2374 147746.497458 135699.390265 135963.76522 126988.73178 -8975.03344 -8710.658485 -8950.50562 -3072.073752 -2945.679467 -5569.186067 -1293.259121 -12415.201021 -13409.361 -13691.833931
──────── Test point index: 96 | True SalePrice: 151000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
96 1200 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 96 ===
f_AB(S)   = 188564.3919465389
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 182594.9810719605
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 16044.931204844266
|f_AB(S)-f_A|  = 18491.058823073457
|psi_ABC(A)|   = 17872.907918950816
|f_ABC(S)-f_A| = 16089.853339441353
|ReSHAP_AB(A)|  = 16044.931204844266
|ReSHAP_ABC(A)| = 17858.203814356053

P   = -0.2431056222810154
R   = -0.24219174791034492
P/R = 1.0037733505726578
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 146261.772045 -32345.948598 178607.720643 146261.772045 -32345.948598 -32345.948598
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 188564.391947 -16044.931205 34535.990028 170073.333123 154028.401919 204609.323151 188564.391947 -16044.931205 34535.990028 0.0 -16044.931205 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 188564.391947 -16044.931205 34535.990028 170073.333123 154028.401919 204609.323151 188564.391947 -16044.931205 34535.990028 0.0 -16044.931205 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 182594.981072 -17872.907919 16986.491595 16976.269664 166505.127733 148632.340743 183491.465006 165618.678017 183481.243075 165608.456086 200468.13085 182594.981072 -17873.149778 16986.524986 16976.303055 0.362789 0.362789 -0.187712 -0.362789 -17858.203814 16979.097723 16968.959431
──────── Test point index: 1080 | True SalePrice: 195000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1080 1455 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1080 ===
f_AB(S)   = 202535.103084298
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 198157.55335524218
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 2074.2200670851744
|f_AB(S)-f_A|  = 32461.76996083255
|psi_ABC(A)|   = 2310.5374606022765
|f_ABC(S)-f_A| = 31652.425622723036
|ReSHAP_AB(A)|  = 2074.2200670851744
|ReSHAP_ABC(A)| = 2310.5920263376074

P   = -0.009099850046829117
R   = -0.009101573950582378
P/R = 0.9998105927872891
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 174426.174782 -4181.545861 178607.720643 174426.174782 -4181.545861 -4181.545861
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 202535.103084 -2074.220067 34535.990028 170073.333123 167999.113056 204609.323151 202535.103084 -2074.220067 34535.990028 0.0 -2074.220067 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 202535.103084 -2074.220067 34535.990028 170073.333123 167999.113056 204609.323151 202535.103084 -2074.220067 34535.990028 0.0 -2074.220067 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 198157.553355 -2310.537461 16986.592507 16976.370576 166505.127733 164194.610289 183491.465006 181180.947563 183481.243075 181170.725632 200468.13085 198157.553355 -2310.577495 16986.827723 16976.605792 0.060051 0.060051 -0.490449 -0.060051 -2310.592026 16986.619703 16976.397946
──────── Test point index: 2044 | True SalePrice: 219000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2044 2267 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2044 ===
f_AB(S)   = 247022.23031512307
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 248228.67081287148
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 42412.9071637399
|f_AB(S)-f_A|  = 76948.89719165763
|psi_ABC(A)|   = 47416.60070176087
|f_ABC(S)-f_A| = 81723.54308035233
|ReSHAP_AB(A)|  = 42412.9071637399
|ReSHAP_ABC(A)| = 47304.00678400433

P   = -0.02902457295019656
R   = -0.027646831438844832
P/R = 1.04983361346125
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 264110.468986 85502.748343 178607.720643 264110.468986 85502.748343 85502.748343
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 247022.230315 42412.907164 34535.990028 170073.333123 212486.240287 204609.323151 247022.230315 42412.907164 34535.990028 0.0 42412.907164 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 247022.230315 42412.907164 34535.990028 170073.333123 212486.240287 204609.323151 247022.230315 42412.907164 34535.990028 0.0 42412.907164 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 248228.670813 47416.600702 17158.582155 17148.360224 166505.127733 213749.758804 183491.465006 230736.096078 183481.243075 230725.874147 200468.13085 248228.670813 47760.539963 17502.796666 17492.574735 -515.908891 -515.908891 -516.459392 515.908891 47304.006784 17214.805917 17204.730379
──────── Test point index: 1397 | True SalePrice: 144000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1397 904 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1397 ===
f_AB(S)   = 106720.21185667881
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 119012.0709937853
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 28161.531286573518
|f_AB(S)-f_A|  = 63353.121266786635
|psi_ABC(A)|   = 24281.398265618318
|f_ABC(S)-f_A| = 47493.05673873385
|ReSHAP_AB(A)|  = 28564.38750644621
|ReSHAP_ABC(A)| = 28246.66684794837

P   = -0.06674519325243439
R   = -0.1438778352135297
P/R = 0.46390184529380474
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 113568.975143 -65038.745499 178607.720643 113568.975143 -65038.745499 -65038.745499
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 106720.211857 -28161.531287 -35191.58998 170073.333123 137811.419578 130781.360885 106720.211857 -32261.913545 -39291.972239 8200.764517 -28564.387506 -34788.73376
5 AC 2 0.749226 44839.624931 170073.333123 106720.211857 -28161.531287 -35191.58998 170073.333123 137811.419578 130781.360885 106720.211857 -32261.913545 -39291.972239 8200.764517 -28564.387506 -34788.73376
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 119012.070994 -24281.398266 -11511.192973 -11700.4655 166505.127733 133521.368995 147859.950307 125027.028151 147746.497458 124761.935948 135963.76522 119012.070994 -16951.694226 -5749.864954 -6014.957158 -6032.867284 -5881.227929 -2744.475889 -4117.969298 -28246.666848 -9437.484934 -9808.904957
──────── Test point index: 2786 | True SalePrice: 149900 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2786 894 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2786 ===
f_AB(S)   = 106942.91881037506
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 118761.74363168926
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 28324.11332223047
|f_AB(S)-f_A|  = 63130.41431309038
|psi_ABC(A)|   = 24582.74381523358
|f_ABC(S)-f_A| = 47743.38410082989
|ReSHAP_AB(A)|  = 28727.390081850535
|ReSHAP_ABC(A)| = 28642.863117301968

P   = -0.06623285720412436
R   = -0.14488532146618383
P/R = 0.4571398712711079
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 112464.488761 -66143.231881 178607.720643 112464.488761 -66143.231881 -66143.231881
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 106942.918810 -28324.113322 -34806.300991 170073.333123 137263.548553 130781.360885 106942.91881 -32809.78457 -39291.972239 8971.342496 -28727.390082 -34403.024231
5 AC 2 0.749226 44839.624931 170073.333123 106942.918810 -28324.113322 -34806.300991 170073.333123 137263.548553 130781.360885 106942.91881 -32809.78457 -39291.972239 8971.342496 -28727.390082 -34403.024231
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 118761.743632 -24582.743815 -11485.683879 -11674.956406 166505.127733 133110.517007 147859.950307 124784.170852 147746.497458 124519.078649 135963.76522 118761.743632 -17202.021588 -5757.335017 -6022.427221 -6025.397221 -5873.757866 -2569.011138 -4293.43405 -28642.863117 -9366.147326 -9734.373657
──────── Test point index: 2233 | True SalePrice: 87000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2233 803 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2233 ===
f_AB(S)   = 109883.57230562333
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 116483.76463661541
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 29346.599738402576
|f_AB(S)-f_A|  = 60189.76081784212
|psi_ABC(A)|   = 27204.64643465454
|f_ABC(S)-f_A| = 50021.36309590374
|ReSHAP_AB(A)|  = 29510.62352082107
|ReSHAP_ABC(A)| = 32051.176307496746

P   = -0.056292583773007276
R   = -0.15045667311230876
P/R = 0.3741448126464124
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 102507.256627 -76100.464016 178607.720643 102507.256627 -76100.464016 -76100.464016
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 109883.572306 -29346.599738 -30843.161079 170073.333123 132277.922226 130781.360885 109883.572306 -37795.410898 -39291.972239 16897.622319 -29510.623521 -30679.137297
5 AC 2 0.749226 44839.624931 170073.333123 109883.572306 -29346.599738 -30843.161079 170073.333123 132277.922226 130781.360885 109883.572306 -37795.410898 -39291.972239 16897.622319 -29510.623521 -30679.137297
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 116483.764637 -27204.646435 -11323.929331 -11492.78733 166505.127733 129406.196058 147859.950307 122880.348411 147746.497458 122656.085263 135963.76522 116483.764637 -19480.000584 -6172.320626 -6396.583775 -5610.411611 -5499.601312 -353.52702 -6508.918168 -32051.176307 -8847.083623 -9123.103165
──────── Test point index: 2684 | True SalePrice: 113000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2684 1048 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2684 ===
f_AB(S)   = 107477.33854400982
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 122616.78500796811
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 23838.29656283486
|f_AB(S)-f_A|  = 62595.994579455626
|psi_ABC(A)|   = 19594.359163268895
|f_ABC(S)-f_A| = 43888.34272455104
|ReSHAP_AB(A)|  = 23963.500502552048
|ReSHAP_ABC(A)| = 21909.779468873574

P   = -0.06563148244615186
R   = -0.11638836007907422
P/R = 0.5639007406029422
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 129473.579042 -49134.141601 178607.720643 129473.579042 -49134.141601 -49134.141601
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 107477.338544 -23838.296563 -38757.698017 170073.333123 145700.762338 130781.360885 107477.338544 -24372.570785 -39291.972239 1068.548444 -23963.500503 -38632.494077
5 AC 2 0.749226 44839.624931 170073.333123 107477.338544 -23838.296563 -38757.698017 170073.333123 145700.762338 130781.360885 107477.338544 -24372.570785 -39291.972239 1068.548444 -23963.500503 -38632.494077
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 122616.785008 -19594.359163 -12050.077388 -12243.906173 166505.127733 139849.505693 147859.950307 129159.851005 147746.497458 128885.646284 135963.76522 122616.785008 -13346.980212 -6268.861276 -6543.065997 -5513.870962 -5353.119091 -4420.793413 -2441.651775 -21909.779469 -10794.621963 -11183.941293
──────── Test point index: 2359 | True SalePrice: 138800 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2359 1096 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2359 ===
f_AB(S)   = 109403.87612744633
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 123818.35634602906
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 21560.13731109222
|f_AB(S)-f_A|  = 60669.456996019115
|psi_ABC(A)|   = 17763.70134251151
|f_ABC(S)-f_A| = 42686.77138649009
|ReSHAP_AB(A)|  = 21612.654972638087
|ReSHAP_ABC(A)| = 19368.419679572336

P   = -0.06077012923023217
R   = -0.0974973654041214
P/R = 0.6233002192248295
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 134775.113674 -43832.606968 178607.720643 134775.113674 -43832.606968 -43832.606968
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 109403.876127 -21560.137311 -39109.319685 170073.333123 148330.543259 130781.360885 109403.876127 -21742.789865 -39291.972239 365.305108 -21612.654973 -39056.802023
5 AC 2 0.749226 44839.624931 170073.333123 109403.876127 -21560.137311 -39109.319685 170073.333123 148330.543259 130781.360885 109403.876127 -21742.789865 -39291.972239 365.305108 -21612.654973 -39056.802023
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 123818.356346 -17763.701343 -12386.436181 -12536.633863 166505.127733 142439.132639 147859.950307 130816.995079 147746.497458 130630.052566 135963.76522 123818.356346 -12145.408874 -6811.69622 -6998.638733 -4971.036018 -4897.546354 -4810.441341 -2052.003847 -19368.41968 -11524.558354 -11793.793353
──────── Test point index: 2227 | True SalePrice: 200000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2227 1020 8 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2227 ===
f_AB(S)   = 215616.69462970717
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 222311.78563220834
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 25906.609655027205
|f_AB(S)-f_A|  = 45543.36150624172
|psi_ABC(A)|   = 28749.08259063671
|f_ABC(S)-f_A| = 55806.65789968919
|ReSHAP_AB(A)|  = 25906.609655027172
|ReSHAP_ABC(A)| = 30245.634103018205

P   = 0.05367887053083353
R   = 0.02686215064999875
P/R = 1.998308744159918
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 126381.017172 -52226.70347 178607.720643 126381.017172 -52226.70347 -52226.70347
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
4 AB 2 0.749226 44839.624931 170073.333123 215616.694630 -25906.609655 71449.971161 170073.333123 144166.723468 241523.304285 215616.69463 -25906.609655 71449.971161 -0.0 -25906.609655 71449.971161
5 AC 2 0.749226 44839.624931 170073.333123 215616.694630 -25906.609655 71449.971161 170073.333123 144166.723468 241523.304285 215616.69463 -25906.609655 71449.971161 -0.0 -25906.609655 71449.971161
6 BC 2 0.697364 49258.516562 161570.634319 276370.891790 57370.593657 57429.663814 161570.634319 218830.444431 218889.514587 276370.89179 57259.810111 57318.880268 221.567092 57370.536544 57429.720927
7 ABC 3 0.797024 40340.721518 166505.127733 222311.785632 -28749.082591 42316.421468 42239.319023 166505.127733 138481.811042 202112.051547 173045.416909 201975.722193 173027.541373 251528.308985 222311.785632 -29216.523353 49284.244259 49266.368723 268.342533 149.888715 -14720.638391 774.975414 -30245.634103 43181.260644 42871.031358
──────── Test point index: 781 | True SalePrice: 161500 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
781 1788 4 4
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 781 ===
f_AB(S)   = 110037.26481708279
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 127403.76582784308
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 14508.078222644712
|f_AB(S)-f_A|  = 60036.06830638266
|psi_ABC(A)|   = 13126.964114259454
|f_ABC(S)-f_A| = 39101.36190467607
|ReSHAP_AB(A)|  = 17117.515558367766
|ReSHAP_ABC(A)| = 3214.492813915103

P   = -0.09406025404090518
R   = 0.20291130009142544
P/R = -0.46355355270270604
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 211205.571296 32597.850654 178607.720643 211205.571296 32597.850654 32597.850654
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
4 AB 2 0.749226 44839.624931 170073.333123 110037.264817 14508.078223 -74544.146529 170073.333123 186243.218189 97190.993438 110037.264817 16169.885066 -72882.339686 -3323.613686 17117.515558 -77153.583865
5 AC 2 0.749226 44839.624931 170073.333123 110037.264817 14508.078223 -74544.146529 170073.333123 186243.218189 97190.993438 110037.264817 16169.885066 -72882.339686 -3323.613686 17117.515558 -77153.583865
6 BC 2 0.697364 49258.516562 161570.634319 123838.112811 -18897.059703 -18835.461806 161570.634319 120942.918792 121004.516689 123838.112811 -40627.715527 -40566.117631 43461.311649 -18880.573688 -18851.94782
7 ABC 3 0.797024 40340.721518 166505.127733 127403.765828 13126.964114 -25938.93965 -26289.386369 166505.127733 184517.103461 136767.390763 149435.009332 136359.403222 149142.103435 118760.008604 127403.765828 8643.757224 -21738.337607 -22031.243504 4138.942989 4023.861345 -13343.756522 1205.41417 3214.492814 -20851.686223 -21464.168495
──────── Test point index: 2351 | True SalePrice: 179900 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2351 1552 6 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2351 ===
f_AB(S)   = 170935.47089354537
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 167775.87563790806
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 3240.128875513372
|f_AB(S)-f_A|  = 862.1377700799203
|psi_ABC(A)|   = 3609.2478347685924
|f_ABC(S)-f_A| = 1270.747905388911
|ReSHAP_AB(A)|  = 3240.128875513277
|ReSHAP_ABC(A)| = 3609.2478347686083

P   = 0.917994739766367
R   = 0.9179947397662436
P/R = 1.0000000000001346
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 185139.692685 6531.972043 178607.720643 185139.692685 6531.972043 6531.972043
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
4 AB 2 0.749226 44839.624931 170073.333123 170935.470894 3240.128876 -2377.991105 170073.333123 173313.461999 167695.342018 170935.470894 3240.128876 -2377.991105 -0.0 3240.128876 -2377.991105
5 AC 2 0.749226 44839.624931 170073.333123 170935.470894 3240.128876 -2377.991105 170073.333123 173313.461999 167695.342018 170935.470894 3240.128876 -2377.991105 -0.0 3240.128876 -2377.991105
6 BC 2 0.697364 49258.516562 161570.634319 157757.239664 -1905.715523 -1907.679133 161570.634319 159664.918797 159662.955187 157757.239664 -1905.715523 -1907.679133 0.0 -1905.715523 -1907.679133
7 ABC 3 0.797024 40340.721518 166505.127733 167775.875638 3609.247835 -1169.601883 -1168.898047 166505.127733 170114.375567 165335.52585 168944.773685 165336.229686 168945.47752 164166.627803 167775.875638 3609.247835 -1169.601883 -1168.898047 -0.0 -0.0 0.0 0.0 3609.247835 -1169.601883 -1168.898047
──────── Test point index: 1873 | True SalePrice: 127000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1873 1052 6 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1873 ===
f_AB(S)   = 143541.91964303734
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 138365.9175551035
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 24153.42237499467
|f_AB(S)-f_A|  = 26531.413480428106
|psi_ABC(A)|   = 26133.705099126437
|f_ABC(S)-f_A| = 28139.21017741566
|ReSHAP_AB(A)|  = 24153.42237499467
|ReSHAP_ABC(A)| = 26429.476431582254

P   = -0.018358428010712302
R   = -0.028869431361232767
P/R = 0.6359123524464311
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 129915.373594 -48692.347049 178607.720643 129915.373594 -48692.347049 -48692.347049
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
4 AB 2 0.749226 44839.624931 170073.333123 143541.919643 -24153.422375 -2377.991105 170073.333123 145919.910748 167695.342018 143541.919643 -24153.422375 -2377.991105 0.0 -24153.422375 -2377.991105
5 AC 2 0.749226 44839.624931 170073.333123 143541.919643 -24153.422375 -2377.991105 170073.333123 145919.910748 167695.342018 143541.919643 -24153.422375 -2377.991105 0.0 -24153.422375 -2377.991105
6 BC 2 0.697364 49258.516562 161570.634319 157757.239664 -1905.715523 -1907.679133 161570.634319 159664.918797 159662.955187 157757.239664 -1905.715523 -1907.679133 0.0 -1905.715523 -1907.679133
7 ABC 3 0.797024 40340.721518 166505.127733 138365.917555 -26133.705099 -1001.205956 -1004.299123 166505.127733 140057.51343 165335.52585 139186.532106 165336.229686 139179.641936 164166.627803 138365.917555 -25800.710248 -813.724381 -820.614551 -355.877502 -348.283496 -57.256943 57.256943 -26429.476432 -851.380586 -858.35316
──────── Test point index: 1337 | True SalePrice: 157000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1337 1497 6 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1337 ===
f_AB(S)   = 167922.18025598946
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 164419.3076966394
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 226.83823795747594
|f_AB(S)-f_A|  = 2151.15286747599
|psi_ABC(A)|   = 252.67989349994363
|f_ABC(S)-f_A| = 2085.820035879762
|ReSHAP_AB(A)|  = 226.83823795751118
|ReSHAP_ABC(A)| = 252.67989349993428

P   = -0.015692138729620622
R   = -0.01569213872959975
P/R = 1.00000000000133
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 179065.017585 457.296943 178607.720643 179065.017585 457.296943 457.296943
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
4 AB 2 0.749226 44839.624931 170073.333123 167922.180256 226.838238 -2377.991105 170073.333123 170300.171361 167695.342018 167922.180256 226.838238 -2377.991105 -0.0 226.838238 -2377.991105
5 AC 2 0.749226 44839.624931 170073.333123 167922.180256 226.838238 -2377.991105 170073.333123 170300.171361 167695.342018 167922.180256 226.838238 -2377.991105 -0.0 226.838238 -2377.991105
6 BC 2 0.697364 49258.516562 161570.634319 157757.239664 -1905.715523 -1907.679133 161570.634319 159664.918797 159662.955187 157757.239664 -1905.715523 -1907.679133 0.0 -1905.715523 -1907.679133
7 ABC 3 0.797024 40340.721518 166505.127733 164419.307697 252.679893 -1169.601883 -1168.898047 166505.127733 166757.807626 165335.52585 165588.205744 165336.229686 165588.909579 164166.627803 164419.307697 252.679893 -1169.601883 -1168.898047 0.0 0.0 0.0 -0.0 252.679893 -1169.601883 -1168.898047
──────── Test point index: 2550 | True SalePrice: 151500 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2550 1248 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2550 ===
f_AB(S)   = 117366.21059988528
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 127616.00102777171
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 13415.150284795513
|f_AB(S)-f_A|  = 52707.12252358017
|psi_ABC(A)|   = 11457.072824654097
|f_ABC(S)-f_A| = 38889.12670474744
|ReSHAP_AB(A)|  = 13415.150284795505
|ReSHAP_ABC(A)| = 11290.077867512693

P   = -0.040086110345254
R   = -0.03579198056421651
P/R = 1.1199746343551216
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 151563.306678 -27044.413965 178607.720643 151563.306678 -27044.413965 -27044.413965
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 117366.210600 -13415.150285 -39291.972239 170073.333123 156658.182839 130781.360885 117366.2106 -13415.150285 -39291.972239 -0.0 -13415.150285 -39291.972239
5 AC 2 0.749226 44839.624931 170073.333123 117366.210600 -13415.150285 -39291.972239 170073.333123 156658.182839 130781.360885 117366.2106 -13415.150285 -39291.972239 -0.0 -13415.150285 -39291.972239
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 127616.001028 -11457.072825 -13627.701944 -13804.351936 166505.127733 151561.709128 147859.950307 136843.111772 147746.497458 136603.264638 135963.76522 127616.001028 -8347.764192 -8987.26361 -9227.110745 -2795.468627 -2669.074342 -5731.333746 -1131.111442 -11290.077868 -13661.855409 -13937.193428
──────── Test point index: 211 | True SalePrice: 100000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
211 1392 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 211 ===
f_AB(S)   = 125255.5533600316
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 132220.9930589964
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 5525.807524649201
|f_AB(S)-f_A|  = 44817.77976343385
|psi_ABC(A)|   = 4936.491933185443
|f_ABC(S)-f_A| = 34284.134673522756
|ReSHAP_AB(A)|  = 5525.807524649195
|ReSHAP_ABC(A)| = 4359.3518185714765

P   = -0.020692662605612144
R   = -0.0038586336849498887
P/R = 5.362691640391066
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 167467.910576 -11139.810067 178607.720643 167467.910576 -11139.810067 -11139.810067
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 125255.553360 -5525.807525 -39291.972239 170073.333123 164547.525599 130781.360885 125255.55336 -5525.807525 -39291.972239 -0.0 -5525.807525 -39291.972239
5 AC 2 0.749226 44839.624931 170073.333123 125255.553360 -5525.807525 -39291.972239 170073.333123 164547.525599 130781.360885 125255.55336 -5525.807525 -39291.972239 -0.0 -5525.807525 -39291.972239
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 132220.993059 -4936.491933 -14604.268836 -14743.373905 166505.127733 160349.814284 147859.950307 142974.212337 147746.497458 142809.455049 135963.76522 132220.993059 -3742.772161 -10588.46199 -10753.219278 -1194.270248 -1142.965809 -6787.139957 -75.305231 -4359.351819 -14870.869984 -15053.912871
──────── Test point index: 1557 | True SalePrice: 81000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1557 894 3 3
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1557 ===
f_AB(S)   = 132820.53798010125
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 88854.29925338605
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 2711.4728282182623
|f_AB(S)-f_A|  = 37252.7951433642
|psi_ABC(A)|   = 21450.125636622422
|f_ABC(S)-f_A| = 77650.8284791331
|ReSHAP_AB(A)|  = 12542.467677157292
|ReSHAP_ABC(A)| = 19977.8385574657

P   = -0.20345245185992694
R   = 0.07940742309892806
P/R = -2.562133915445915
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 112464.488761 -66143.231881 178607.720643 112464.488761 -66143.231881 -66143.231881
2 B 1 0.653707 52691.751258 167830.652805 125996.448016 -41834.204789 167830.652805 125996.448016 -41834.204789 -41834.204789
3 C 1 0.653707 52691.751258 167830.652805 125996.448016 -41834.204789 167830.652805 125996.448016 -41834.204789 -41834.204789
4 AB 2 0.749226 44839.624931 170073.333123 132820.537980 -2711.472828 -34541.322315 170073.333123 137263.548553 105433.699066 132820.53798 -32809.78457 -64639.634057 60196.623484 -12542.467677 -24710.327466
5 AC 2 0.749226 44839.624931 170073.333123 132820.537980 -2711.472828 -34541.322315 170073.333123 137263.548553 105433.699066 132820.53798 -32809.78457 -64639.634057 60196.623484 -12542.467677 -24710.327466
6 BC 2 0.697364 49258.516562 161570.634319 126608.036695 -17522.773705 -17439.823919 161570.634319 122322.512736 122405.462522 126608.036695 -39248.121584 -39165.171798 43450.695757 -17499.791467 -17462.806157
7 ABC 3 0.797024 40340.721518 166505.127733 88854.299253 -21450.125637 -27957.723472 -28242.97937 166505.127733 133110.517007 127934.299416 111890.335647 127647.967046 111606.156221 103767.178141 88854.299253 -14912.878888 -22751.856968 -23036.036394 -1128.931937 -1131.084881 1531.675608 -16221.715019 -19977.838557 -28659.95002 -29013.039902
──────── Test point index: 1910 | True SalePrice: 126000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1910 1560 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1910 ===
f_AB(S)   = 134459.7865802023
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 138604.82585533083
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 3678.4256955215023
|f_AB(S)-f_A|  = 35613.54654326316
|psi_ABC(A)|   = 3430.663637998679
|f_ABC(S)-f_A| = 27900.30187718832
|ReSHAP_AB(A)|  = 3678.4256955215287
|ReSHAP_ABC(A)| = 2193.3577558648713

P   = -0.019674262446960453
R   = 0.024673138795513805
P/R = -0.797396010698798
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 186023.281791 7415.561148 178607.720643 186023.281791 7415.561148 7415.561148
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 134459.786580 3678.425696 -39291.972239 170073.333123 173751.758819 130781.360885 134459.78658 3678.425696 -39291.972239 -0.0 3678.425696 -39291.972239
5 AC 2 0.749226 44839.624931 170073.333123 134459.786580 3678.425696 -39291.972239 170073.333123 173751.758819 130781.360885 134459.78658 3678.425696 -39291.972239 -0.0 3678.425696 -39291.972239
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 138604.825855 3430.663638 -15636.12217 -15694.843345 166505.127733 170602.603631 147859.950307 151358.673012 147746.497458 151354.683513 135963.76522 138604.825855 2641.060635 -12749.857657 -12753.847157 967.12542 857.66207 -6494.072962 -368.372226 2193.357756 -14974.29417 -15119.365463
──────── Test point index: 881 | True SalePrice: 190000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
881 1500 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 881 ===
f_AB(S)   = 205000.52269684372
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 200903.90759342205
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 391.19954546053486
|f_AB(S)-f_A|  = 34927.18957337827
|psi_ABC(A)|   = 435.7691928163501
|f_ABC(S)-f_A| = 34398.7798609029
|ReSHAP_AB(A)|  = 391.19954546052065
|ReSHAP_ABC(A)| = 435.77632006316867

P   = -0.001467728320874679
R   = -0.001467935515631626
P/R = 0.9998588529572718
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 179396.363500 788.642857 178607.720643 179396.3635 788.642857 788.642857
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 205000.522697 391.199545 34535.990028 170073.333123 170464.532669 204609.323151 205000.522697 391.199545 34535.990028 0.0 391.199545 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 205000.522697 391.199545 34535.990028 170073.333123 170464.532669 204609.323151 205000.522697 391.199545 34535.990028 0.0 391.199545 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 200903.907593 435.769193 16986.6163 16976.394369 166505.127733 166940.89315 183491.465006 183927.230424 183481.243075 183917.008493 200468.13085 200903.907593 435.776743 16986.8991 16976.677169 -0.011326 -0.011326 -0.561827 0.011326 435.77632 16986.612653 16976.390888
──────── Test point index: 554 | True SalePrice: 110000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
554 980 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 554 ===
f_AB(S)   = 106012.19526378946
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 120914.5589457151
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 26433.629687979585
|f_AB(S)-f_A|  = 64061.13785967599
|psi_ABC(A)|   = 21896.971146536525
|f_ABC(S)-f_A| = 45590.56878680404
|ReSHAP_AB(A)|  = 26710.10676561293
|ReSHAP_ABC(A)| = 25050.14147119592

P   = -0.06766478698439254
R   = -0.13251173208279016
P/R = 0.5106324241699385
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 121963.071645 -56644.648998 178607.720643 121963.071645 -56644.648998 -56644.648998
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 106012.195264 -26433.629688 -37627.508172 170073.333123 141975.239368 130781.360885 106012.195264 -28098.093755 -39291.972239 3328.928134 -26710.106766 -37351.031094
5 AC 2 0.749226 44839.624931 170073.333123 106012.195264 -26433.629688 -37627.508172 170073.333123 141975.239368 130781.360885 106012.195264 -28098.093755 -39291.972239 3328.928134 -26710.106766 -37351.031094
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 120914.558946 -21896.971147 -11761.424342 -11932.173299 166505.127733 136684.208441 147859.950307 127096.458541 147746.497458 126868.413476 135963.76522 120914.558946 -15049.206274 -5953.854531 -6181.899595 -5828.877707 -5714.285492 -3633.89537 -3228.549818 -25050.141471 -10104.571143 -10435.856173
──────── Test point index: 1462 | True SalePrice: 255900 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1462 2290 8 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1462 ===
f_AB(S)   = 289400.26577273966
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 300781.12704005593
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 45774.986004634266
|f_AB(S)-f_A|  = 119326.93264927421
|psi_ABC(A)|   = 49051.307622147935
|f_ABC(S)-f_A| = 134275.99930753678
|ReSHAP_AB(A)|  = 45267.81976020132
|ReSHAP_ABC(A)| = 49032.06810742249

P   = 0.018307720829765117
R   = 0.014200779746590508
P/R = 1.289205322275395
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 266650.787664 88043.067021 178607.720643 266650.787664 88043.067021 88043.067021
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
4 AB 2 0.749226 44839.624931 170073.333123 289400.265773 45774.986005 73551.946645 170073.333123 213746.343645 241523.304285 289400.265773 43673.010521 71449.971161 4203.950967 45267.81976 74059.112889
5 AC 2 0.749226 44839.624931 170073.333123 289400.265773 45774.986005 73551.946645 170073.333123 213746.343645 241523.304285 289400.265773 43673.010521 71449.971161 4203.950967 45267.81976 74059.112889
6 BC 2 0.697364 49258.516562 161570.634319 276370.891790 57370.593657 57429.663814 161570.634319 218830.444431 218889.514587 276370.89179 57259.810111 57318.880268 221.567092 57370.536544 57429.720927
7 ABC 3 0.797024 40340.721518 166505.127733 300781.127040 49051.307622 42680.51052 42544.181165 166505.127733 215153.414488 202112.051547 251364.869602 201975.722193 251228.540248 251528.308985 300781.12704 49252.818055 49552.586792 49416.257438 0.0 0.0 -13341.131678 -604.5313 49032.068107 42680.677259 42563.253941
──────── Test point index: 87 | True SalePrice: 133000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
87 1040 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 87 ===
f_AB(S)   = 107209.99535683448
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 122416.52311829128
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 24191.116566426586
|f_AB(S)-f_A|  = 62863.33776663097
|psi_ABC(A)|   = 19883.469796604117
|f_ABC(S)-f_A| = 44088.604614227865
|ReSHAP_AB(A)|  = 24331.12096651936
|ReSHAP_ABC(A)| = 22311.56398385221

P   = -0.06616811946367918
R   = -0.11901405214749777
P/R = 0.5559689655947097
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 128589.989936 -50017.730707 178607.720643 128589.989936 -50017.730707 -50017.730707
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 107209.995357 -24191.116566 -38672.2212 170073.333123 145262.465518 130781.360885 107209.995357 -24810.867605 -39291.972239 1239.502077 -24331.120967 -38532.2168
5 AC 2 0.749226 44839.624931 170073.333123 107209.995357 -24191.116566 -38672.2212 170073.333123 145262.465518 130781.360885 107209.995357 -24810.867605 -39291.972239 1239.502077 -24331.120967 -38532.2168
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 122416.523118 -19883.469797 -12003.276393 -12201.858425 166505.127733 139449.794672 147859.950307 128897.245262 147746.497458 128613.534048 135963.76522 122416.523118 -13547.242102 -6197.01093 -6480.722144 -5585.721308 -5415.462943 -4355.53848 -2506.906708 -22311.563984 -10686.863556 -11090.177074
──────── Test point index: 2652 | True SalePrice: 139000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2652 1566 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2652 ===
f_AB(S)   = 134788.5091952084
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 138851.03973833672
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 4007.1483105276056
|f_AB(S)-f_A|  = 35284.82392825704
|psi_ABC(A)|   = 3750.508691804416
|f_ABC(S)-f_A| = 27654.087994182424
|ReSHAP_AB(A)|  = 4007.1483105276147
|ReSHAP_ABC(A)| = 2338.161435198861

P   = -0.022056442707606844
R   = 0.029015473202354716
P/R = -0.7601613991881021
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 186685.973620 8078.252977 178607.720643 186685.97362 8078.252977 8078.252977
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 134788.509195 4007.148311 -39291.972239 170073.333123 174080.481434 130781.360885 134788.509195 4007.148311 -39291.972239 -0.0 4007.148311 -39291.972239
5 AC 2 0.749226 44839.624931 170073.333123 134788.509195 4007.148311 -39291.972239 170073.333123 174080.481434 130781.360885 134788.509195 4007.148311 -39291.972239 -0.0 4007.148311 -39291.972239
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 138851.039738 3750.508692 -15672.72268 -15731.874006 166505.127733 170968.77468 147859.950307 151706.253393 147746.497458 151701.403592 135963.76522 138851.039738 2887.274518 -12850.363854 -12855.213655 1067.631617 959.028567 -6412.157433 -450.287755 2338.161435 -14921.778808 -15070.470621
──────── Test point index: 478 | True SalePrice: 184100 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
478 1419 8 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 478 ===
f_AB(S)   = 237476.74852761257
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 246964.75194853212
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 4046.555757121794
|f_AB(S)-f_A|  = 67403.41540414712
|psi_ABC(A)|   = 4542.2842192893295
|f_ABC(S)-f_A| = 80459.62421601298
|ReSHAP_AB(A)|  = 4046.5557571217614
|ReSHAP_ABC(A)| = 4567.434598051294

P   = 0.0035806708210761665
R   = 0.003268086975363918
P/R = 1.0956473460065856
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 170450.023807 -8157.696836 178607.720643 170450.023807 -8157.696836 -8157.696836
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
4 AB 2 0.749226 44839.624931 170073.333123 237476.748528 -4046.555757 71449.971161 170073.333123 166026.777366 241523.304285 237476.748528 -4046.555757 71449.971161 -0.0 -4046.555757 71449.971161
5 AC 2 0.749226 44839.624931 170073.333123 237476.748528 -4046.555757 71449.971161 170073.333123 166026.777366 241523.304285 237476.748528 -4046.555757 71449.971161 -0.0 -4046.555757 71449.971161
6 BC 2 0.697364 49258.516562 161570.634319 276370.891790 57370.593657 57429.663814 161570.634319 218830.444431 218889.514587 276370.89179 57259.810111 57318.880268 221.567092 57370.536544 57429.720927
7 ABC 3 0.797024 40340.721518 166505.127733 246964.751949 -4542.284219 42565.216321 42436.692114 166505.127733 161997.584 202112.051547 197548.494511 201975.722193 197427.77545 251528.308985 246964.751949 -4563.557036 49536.976499 49416.257438 15.610294 0.0 -13986.065988 40.40301 -4567.434598 42573.985223 42453.073591
──────── Test point index: 1505 | True SalePrice: 120000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1505 1053 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1505 ===
f_AB(S)   = 107644.42803599441
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 122741.9486890161
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 23617.78406059002
|f_AB(S)-f_A|  = 62428.90508747104
|psi_ABC(A)|   = 19409.410942710052
|f_ABC(S)-f_A| = 43763.17904350304
|ReSHAP_AB(A)|  = 23733.03353965206
|ReSHAP_ABC(A)| = 21652.075245297136

P   = -0.0651951926198393
R   = -0.11459456239521032
P/R = 0.5689204728143737
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 130025.822232 -48581.89841 178607.720643 130025.822232 -48581.89841 -48581.89841
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 107644.428036 -23617.784061 -38811.121027 170073.333123 145974.697851 130781.360885 107644.428036 -24098.635272 -39291.972239 961.702424 -23733.03354 -38695.871548
5 AC 2 0.749226 44839.624931 170073.333123 107644.428036 -23617.784061 -38811.121027 170073.333123 145974.697851 130781.360885 107644.428036 -24098.635272 -39291.972239 961.702424 -23733.03354 -38695.871548
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 122741.948689 -19409.410943 -12082.741018 -12271.027083 166505.127733 140109.515364 147859.950307 129323.979593 147746.497458 129060.860314 135963.76522 122741.948689 -13221.816531 -6318.911625 -6582.030904 -5463.820612 -5314.154183 -4466.624146 -2395.821042 -21652.075245 -10868.515896 -11242.587902
──────── Test point index: 1283 | True SalePrice: 143000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1283 1133 6 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1283 ===
f_AB(S)   = 147979.67494561963
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 142462.4699418163
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 19715.66707241237
|f_AB(S)-f_A|  = 22093.65817784582
|psi_ABC(A)|   = 21841.138274923836
|f_ABC(S)-f_A| = 24042.657790702855
|ReSHAP_AB(A)|  = 19715.66707241238
|ReSHAP_ABC(A)| = 21949.9206125408

P   = -0.016065066172275766
R   = -0.02058962159222677
P/R = 0.7802506763087299
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 138861.713287 -39746.007356 178607.720643 138861.713287 -39746.007356 -39746.007356
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
4 AB 2 0.749226 44839.624931 170073.333123 147979.674946 -19715.667072 -2377.991105 170073.333123 150357.666051 167695.342018 147979.674946 -19715.667072 -2377.991105 -0.0 -19715.667072 -2377.991105
5 AC 2 0.749226 44839.624931 170073.333123 147979.674946 -19715.667072 -2377.991105 170073.333123 150357.666051 167695.342018 147979.674946 -19715.667072 -2377.991105 -0.0 -19715.667072 -2377.991105
6 BC 2 0.697364 49258.516562 161570.634319 157757.239664 -1905.715523 -1907.679133 161570.634319 159664.918797 159662.955187 157757.239664 -1905.715523 -1907.679133 0.0 -1905.715523 -1907.679133
7 ABC 3 0.797024 40340.721518 166505.127733 142462.469942 -21841.138275 -1099.841693 -1101.677823 166505.127733 144546.310336 165335.52585 143477.626249 165336.229686 143473.250153 164166.627803 142462.469942 -21704.157861 -1010.780211 -1015.156307 -158.821671 -153.74174 -57.903876 57.903876 -21949.920613 -1044.222051 -1048.515128
──────── Test point index: 485 | True SalePrice: 244000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
485 2054 6 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 485 ===
f_AB(S)   = 198438.59634905544
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 198412.18666548748
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 30743.254331023432
|f_AB(S)-f_A|  = 28365.263225589995
|psi_ABC(A)|   = 34245.558862348014
|f_ABC(S)-f_A| = 31907.058932968328
|ReSHAP_AB(A)|  = 30743.254331023432
|ReSHAP_ABC(A)| = 34245.558862348

P   = 0.010543631715321578
R   = 0.010543631715322022
P/R = 0.9999999999999579
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 240584.909053 61977.18841 178607.720643 240584.909053 61977.18841 61977.18841
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
4 AB 2 0.749226 44839.624931 170073.333123 198438.596349 30743.254331 -2377.991105 170073.333123 200816.587454 167695.342018 198438.596349 30743.254331 -2377.991105 0.0 30743.254331 -2377.991105
5 AC 2 0.749226 44839.624931 170073.333123 198438.596349 30743.254331 -2377.991105 170073.333123 200816.587454 167695.342018 198438.596349 30743.254331 -2377.991105 0.0 30743.254331 -2377.991105
6 BC 2 0.697364 49258.516562 161570.634319 157757.239664 -1905.715523 -1907.679133 161570.634319 159664.918797 159662.955187 157757.239664 -1905.715523 -1907.679133 0.0 -1905.715523 -1907.679133
7 ABC 3 0.797024 40340.721518 166505.127733 198412.186665 34245.558862 -1169.601883 -1168.898047 166505.127733 200750.686595 165335.52585 199581.084712 165336.229686 199581.788548 164166.627803 198412.186665 34245.558862 -1169.601883 -1168.898047 -0.0 0.0 0.0 0.0 34245.558862 -1169.601883 -1168.898047
──────── Test point index: 134 | True SalePrice: 178000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
134 1128 6 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 134 ===
f_AB(S)   = 147705.73943311456
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 142191.8593196713
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 19989.602584917448
|f_AB(S)-f_A|  = 22367.593690350885
|psi_ABC(A)|   = 22123.098563191565
|f_ABC(S)-f_A| = 24313.268412847858
|ReSHAP_AB(A)|  = 19989.602584917448
|ReSHAP_ABC(A)| = 22242.717072215062

P   = -0.016232863090277827
R   = -0.02115274909623699
P/R = 0.7674115083776796
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 138309.470096 -40298.250547 178607.720643 138309.470096 -40298.250547 -40298.250547
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
4 AB 2 0.749226 44839.624931 170073.333123 147705.739433 -19989.602585 -2377.991105 170073.333123 150083.730539 167695.342018 147705.739433 -19989.602585 -2377.991105 0.0 -19989.602585 -2377.991105
5 AC 2 0.749226 44839.624931 170073.333123 147705.739433 -19989.602585 -2377.991105 170073.333123 150083.730539 167695.342018 147705.739433 -19989.602585 -2377.991105 0.0 -19989.602585 -2377.991105
6 BC 2 0.697364 49258.516562 161570.634319 157757.239664 -1905.715523 -1907.679133 161570.634319 159664.918797 159662.955187 157757.239664 -1905.715523 -1907.679133 0.0 -1905.715523 -1907.679133
7 ABC 3 0.797024 40340.721518 166505.127733 142191.859320 -22123.098563 -1094.16686 -1096.00299 166505.127733 144252.666156 165335.52585 143196.000186 165336.229686 143191.62409 164166.627803 142191.85932 -21974.768483 -999.764771 -1004.140867 -169.837112 -164.75718 -56.901199 56.901199 -22242.717072 -1033.126048 -1037.425293
──────── Test point index: 2605 | True SalePrice: 170000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2605 1466 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2605 ===
f_AB(S)   = 129309.7989451068
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 134967.4525839328
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 1471.561939574014
|f_AB(S)-f_A|  = 40763.53417835865
|psi_ABC(A)|   = 1329.0574911171377
|f_ABC(S)-f_A| = 31537.675148586335
|ReSHAP_AB(A)|  = 1471.561939574021
|ReSHAP_ABC(A)| = 1050.7890853233182

P   = -0.006041938395263284
R   = 0.002781426183222392
P/R = -2.1722447396621036
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 175641.109802 -2966.610841 178607.720643 175641.109802 -2966.610841 -2966.610841
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 129309.798945 -1471.56194 -39291.972239 170073.333123 168601.771184 130781.360885 129309.798945 -1471.56194 -39291.972239 0.0 -1471.56194 -39291.972239
5 AC 2 0.749226 44839.624931 170073.333123 129309.798945 -1471.56194 -39291.972239 170073.333123 168601.771184 130781.360885 129309.798945 -1471.56194 -39291.972239 0.0 -1471.56194 -39291.972239
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 134967.452584 -1329.057491 -15047.582404 -15161.035254 166505.127733 164865.923877 147859.950307 146508.294325 147746.497458 146394.841476 135963.76522 134967.452584 -996.312636 -11427.388892 -11540.841741 -355.343346 -355.343346 -6930.24066 67.795472 -1050.789085 -15169.630188 -15317.255875
──────── Test point index: 56 | True SalePrice: 185088 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
56 1733 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 56 ===
f_AB(S)   = 217765.91757958042
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 215234.4752917765
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 13156.594428197277
|f_AB(S)-f_A|  = 47692.58445611497
|psi_ABC(A)|   = 14692.38667185085
|f_ABC(S)-f_A| = 48729.34755925735
|ReSHAP_AB(A)|  = 13156.59442819729
|ReSHAP_ABC(A)| = 14696.967392263681

P   = -0.025647546493850315
R   = -0.02574154981304616
P/R = 0.9963481872739378
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 205130.896196 26523.175554 178607.720643 205130.896196 26523.175554 26523.175554
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 217765.917580 13156.594428 34535.990028 170073.333123 183229.927552 204609.323151 217765.91758 13156.594428 34535.990028 -0.0 13156.594428 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 217765.917580 13156.594428 34535.990028 170073.333123 183229.927552 204609.323151 217765.91758 13156.594428 34535.990028 -0.0 13156.594428 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 215234.475292 14692.386672 17023.591409 17013.369478 166505.127733 181160.535519 183491.465006 198146.872793 183481.243075 198136.650862 200468.13085 215234.475292 14766.344442 17097.824429 17087.602498 -110.936655 -110.936655 -111.487156 110.936655 14696.967392 17021.287737 17011.09243
──────── Test point index: 282 | True SalePrice: 113000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
282 1487 6 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 282 ===
f_AB(S)   = 167374.3092309793
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 163809.02261640874
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 321.0327870526962
|f_AB(S)-f_A|  = 2699.023892486148
|psi_ABC(A)|   = 357.6051867307203
|f_ABC(S)-f_A| = 2696.1051161104115
|ReSHAP_AB(A)|  = 321.03278705268514
|ReSHAP_ABC(A)| = 357.6051867307144

P   = -0.013693668874813214
R   = -0.013693668874815157
P/R = 0.9999999999998581
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 177960.531204 -647.189439 178607.720643 177960.531204 -647.189439 -647.189439
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
4 AB 2 0.749226 44839.624931 170073.333123 167374.309231 -321.032787 -2377.991105 170073.333123 169752.300336 167695.342018 167374.309231 -321.032787 -2377.991105 -0.0 -321.032787 -2377.991105
5 AC 2 0.749226 44839.624931 170073.333123 167374.309231 -321.032787 -2377.991105 170073.333123 169752.300336 167695.342018 167374.309231 -321.032787 -2377.991105 -0.0 -321.032787 -2377.991105
6 BC 2 0.697364 49258.516562 161570.634319 157757.239664 -1905.715523 -1907.679133 161570.634319 159664.918797 159662.955187 157757.239664 -1905.715523 -1907.679133 0.0 -1905.715523 -1907.679133
7 ABC 3 0.797024 40340.721518 166505.127733 163809.022616 -357.605187 -1169.601883 -1168.898047 166505.127733 166147.522546 165335.52585 164977.920663 165336.229686 164978.624499 164166.627803 163809.022616 -357.605187 -1169.601883 -1168.898047 0.0 -0.0 0.0 -0.0 -357.605187 -1169.601883 -1168.898047
──────── Test point index: 2240 | True SalePrice: 152000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2240 1558 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2240 ===
f_AB(S)   = 134350.2123752003
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 138522.7545609955
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 3568.8514905194825
|f_AB(S)-f_A|  = 35723.12074826515
|psi_ABC(A)|   = 3325.4967024944926
|f_ABC(S)-f_A| = 27982.37317152365
|ReSHAP_AB(A)|  = 3568.851490519474
|ReSHAP_ABC(A)| = 2141.049473990765

P   = -0.018939426675299284
R   = 0.02338890699545161
P/R = -0.8097610837044412
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 185802.384515 7194.663872 178607.720643 185802.384515 7194.663872 7194.663872
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 134350.212375 3568.851491 -39291.972239 170073.333123 173642.184614 130781.360885 134350.212375 3568.851491 -39291.972239 0.0 3568.851491 -39291.972239
5 AC 2 0.749226 44839.624931 170073.333123 134350.212375 3568.851491 -39291.972239 170073.333123 173642.184614 130781.360885 134350.212375 3568.851491 -39291.972239 0.0 3568.851491 -39291.972239
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 138522.754561 3325.496702 -15623.73485 -15684.135024 166505.127733 170480.546615 147859.950307 151248.979516 147746.497458 151241.632017 135963.76522 138522.754561 2558.989341 -12718.877456 -12726.224955 936.145218 830.039868 -6512.689644 -349.755544 2141.049474 -14989.706588 -15133.716058
──────── Test point index: 2111 | True SalePrice: 191000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2111 1795 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2111 ===
f_AB(S)   = 221162.71793464344
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 219065.26203767146
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 16553.39478326027
|f_AB(S)-f_A|  = 51089.384811177995
|psi_ABC(A)|   = 18491.82725210257
|f_ABC(S)-f_A| = 52560.134305152314
|ReSHAP_AB(A)|  = 16553.39478326027
|ReSHAP_ABC(A)| = 18488.29541684652

P   = -0.027813801376688707
R   = -0.027746605291618875
P/R = 1.0024217768034538
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 211978.711764 33370.991121 178607.720643 211978.711764 33370.991121 33370.991121
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 221162.717935 16553.394783 34535.990028 170073.333123 186626.727907 204609.323151 221162.717935 16553.394783 34535.990028 0.0 16553.394783 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 221162.717935 16553.394783 34535.990028 170073.333123 186626.727907 204609.323151 221162.717935 16553.394783 34535.990028 0.0 16553.394783 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 219065.262038 18491.827252 17039.264492 17029.042561 166505.127733 184944.303017 183491.465006 201930.640291 183481.243075 201920.41836 200468.13085 219065.262038 18597.131188 17144.843678 17134.621747 -157.955903 -157.955903 -158.506404 157.955903 18488.295417 17041.01032 17030.828568
──────── Test point index: 581 | True SalePrice: 185750 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
581 1489 6 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 581 ===
f_AB(S)   = 167483.88343598138
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 163931.0796324549
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 211.45858205063269
|f_AB(S)-f_A|  = 2589.4496874840697
|psi_ABC(A)|   = 235.54817068457487
|f_ABC(S)-f_A| = 2574.0481000642467
|ReSHAP_AB(A)|  = 211.45858205063269
|ReSHAP_ABC(A)| = 235.54817068455728

P   = -0.009847254513400808
R   = -0.00984725451339398
P/R = 1.0000000000006934
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 178181.428480 -426.292163 178607.720643 178181.42848 -426.292163 -426.292163
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
4 AB 2 0.749226 44839.624931 170073.333123 167483.883436 -211.458582 -2377.991105 170073.333123 169861.874541 167695.342018 167483.883436 -211.458582 -2377.991105 0.0 -211.458582 -2377.991105
5 AC 2 0.749226 44839.624931 170073.333123 167483.883436 -211.458582 -2377.991105 170073.333123 169861.874541 167695.342018 167483.883436 -211.458582 -2377.991105 0.0 -211.458582 -2377.991105
6 BC 2 0.697364 49258.516562 161570.634319 157757.239664 -1905.715523 -1907.679133 161570.634319 159664.918797 159662.955187 157757.239664 -1905.715523 -1907.679133 0.0 -1905.715523 -1907.679133
7 ABC 3 0.797024 40340.721518 166505.127733 163931.079632 -235.548171 -1169.601883 -1168.898047 166505.127733 166269.579562 165335.52585 165099.977679 165336.229686 165100.681515 164166.627803 163931.079632 -235.548171 -1169.601883 -1168.898047 -0.0 -0.0 -0.0 0.0 -235.548171 -1169.601883 -1168.898047
──────── Test point index: 1189 | True SalePrice: 215000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1189 1620 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1189 ===
f_AB(S)   = 211574.97499696564
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 208252.55751296802
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 6965.65184558247
|f_AB(S)-f_A|  = 41501.64187350019
|psi_ABC(A)|   = 7767.599807843663
|f_ABC(S)-f_A| = 41747.429780448874
|ReSHAP_AB(A)|  = 6965.65184558247
|ReSHAP_ABC(A)| = 7773.275623935621

P   = -0.018221357041178887
R   = -0.018357313101750927
P/R = 0.9925939019605722
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 192650.200082 14042.479439 178607.720643 192650.200082 14042.479439 14042.479439
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 211574.974997 6965.651846 34535.990028 170073.333123 177038.984969 204609.323151 211574.974997 6965.651846 34535.990028 0.0 6965.651846 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 211574.974997 6965.651846 34535.990028 170073.333123 177038.984969 204609.323151 211574.974997 6965.651846 34535.990028 0.0 6965.651846 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 208252.557513 7767.599808 16995.025952 16984.804021 166505.127733 174264.314113 183491.465006 191250.651387 183481.243075 191240.429456 200468.13085 208252.557513 7784.426663 17012.128057 17001.906126 -25.240283 -25.240283 -25.790783 25.240283 7773.275624 16992.185623 16981.968533
──────── Test point index: 70 | True SalePrice: 254900 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
70 1947 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 70 ===
f_AB(S)   = 229490.35751479786
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 228456.86825341382
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 24881.034363414714
|f_AB(S)-f_A|  = 59417.02439133241
|psi_ABC(A)|   = 27806.58480368744
|f_ABC(S)-f_A| = 61951.74052089467
|ReSHAP_AB(A)|  = 24881.034363414718
|ReSHAP_ABC(A)| = 27773.176481958275

P   = -0.03009005745499932
R   = -0.029550793804805298
P/R = 1.0182487026831182
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 228766.904767 50159.184125 178607.720643 228766.904767 50159.184125 50159.184125
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 229490.357515 24881.034363 34535.990028 170073.333123 194954.367487 204609.323151 229490.357515 24881.034363 34535.990028 -0.0 24881.034363 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 229490.357515 24881.034363 34535.990028 170073.333123 194954.367487 204609.323151 229490.357515 24881.034363 34535.990028 -0.0 24881.034363 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 228456.868253 27806.584804 17077.688824 17067.466893 166505.127733 194220.636236 183491.465006 211206.97351 183481.243075 211196.751579 200468.13085 228456.868253 27988.737403 17260.116674 17249.894743 -273.228899 -273.228899 -273.7794 273.228899 27773.176482 17094.355721 17084.208318
──────── Test point index: 1878 | True SalePrice: 119000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1878 1728 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1878 ===
f_AB(S)   = 143664.019800373
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 145747.00341007992
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 12882.658915692205
|f_AB(S)-f_A|  = 26409.313323092443
|psi_ABC(A)|   = 12621.982098625851
|f_ABC(S)-f_A| = 20758.124322439224
|ReSHAP_AB(A)|  = 12882.65891569222
|ReSHAP_ABC(A)| = 3673.090898725311

P   = -0.12024280485537309
R   = 0.31086026885668355
P/R = -0.3868066038082494
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 204578.653006 25970.932363 178607.720643 204578.653006 25970.932363 25970.932363
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 143664.019800 12882.658916 -39291.972239 170073.333123 182955.992039 130781.360885 143664.0198 12882.658916 -39291.972239 -0.0 12882.658916 -39291.972239
5 AC 2 0.749226 44839.624931 170073.333123 143664.019800 12882.658916 -39291.972239 170073.333123 182955.992039 130781.360885 143664.0198 12882.658916 -39291.972239 -0.0 12882.658916 -39291.972239
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 145747.003410 12621.982099 -16664.597399 -16715.509022 166505.127733 180855.392979 147859.950307 161529.85194 147746.497458 161541.481543 135963.76522 145747.00341 9783.23819 -15794.478133 -15782.84853 4011.745895 3886.663443 -3531.062906 -3331.382281 -3673.090899 -8361.780287 -8723.253136
──────── Test point index: 259 | True SalePrice: 150000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
259 1026 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 259 ===
f_AB(S)   = 106797.73472042411
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 122066.06481135685
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 24780.756602138885
|f_AB(S)-f_A|  = 63275.59840304134
|psi_ABC(A)|   = 20376.277699634542
|f_ABC(S)-f_A| = 44439.0629211623
|ReSHAP_AB(A)|  = 24949.277312083075
|ReSHAP_ABC(A)| = 22990.943290169096

P   = -0.06688965030412508
R   = -0.12306346916557398
P/R = 0.5435378244873738
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 127043.709002 -51564.011641 178607.720643 127043.709002 -51564.011641 -51564.011641
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 106797.734720 -24780.756602 -38494.841801 170073.333123 144495.446083 130781.360885 106797.73472 -25577.88704 -39291.972239 1594.260876 -24949.277312 -38326.321091
5 AC 2 0.749226 44839.624931 170073.333123 106797.734720 -24780.756602 -38494.841801 170073.333123 144495.446083 130781.360885 106797.73472 -25577.88704 -39291.972239 1594.260876 -24949.277312 -38326.321091
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 122066.064811 -20376.2777 -11936.916318 -12125.868904 166505.127733 138766.519563 147859.950307 128442.925524 147746.497458 128178.473201 135963.76522 122066.064811 -13897.700409 -6112.408389 -6376.860712 -5670.323848 -5519.324375 -4211.18565 -2651.259538 -22990.94329 -10533.97053 -10914.149101
──────── Test point index: 495 | True SalePrice: 403000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
495 2794 8 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 495 ===
f_AB(S)   = 336863.4565755761
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 331921.7160712017
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 83926.74633905319
|f_AB(S)-f_A|  = 166790.12345211065
|psi_ABC(A)|   = 80060.5789229812
|f_ABC(S)-f_A| = 165416.58833868257
|ReSHAP_AB(A)|  = 84011.05021149389
|ReSHAP_ABC(A)| = 79911.04951664736

P   = 0.01919409298458924
R   = 0.020603498349568372
P/R = 0.9315938807543012
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 322316.901308 143709.180665 178607.720643 322316.901308 143709.180665 143709.180665
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
4 AB 2 0.749226 44839.624931 170073.333123 336863.456576 83926.746339 82863.377113 170073.333123 242586.673511 241523.304285 336863.456576 72513.340387 71449.971161 22826.811904 84011.050211 82779.073241
5 AC 2 0.749226 44839.624931 170073.333123 336863.456576 83926.746339 82863.377113 170073.333123 242586.673511 241523.304285 336863.456576 72513.340387 71449.971161 22826.811904 84011.050211 82779.073241
6 BC 2 0.697364 49258.516562 161570.634319 276370.891790 57370.593657 57429.663814 161570.634319 218830.444431 218889.514587 276370.89179 57259.810111 57318.880268 221.567092 57370.536544 57429.720927
7 ABC 3 0.797024 40340.721518 166505.127733 331921.716071 80060.578923 42746.169385 42609.840031 166505.127733 245900.050329 202112.051547 282505.458633 201975.722193 282369.129279 251528.308985 331921.716071 80393.407086 49552.586792 49416.257438 0.0 0.0 -12947.178487 -998.48449 79911.049517 42811.661252 42693.87757
──────── Test point index: 1506 | True SalePrice: 96000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1506 1137 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1506 ===
f_AB(S)   = 111323.90661997691
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 124676.19584354322
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 19476.986463556103
|f_AB(S)-f_A|  = 58749.42650348853
|psi_ABC(A)|   = 16185.076067839022
|f_ABC(S)-f_A| = 41828.93188897593
|ReSHAP_AB(A)|  = 19483.56340965042
|ReSHAP_ABC(A)| = 17170.939338949982

P   = -0.05540854944452128
R   = -0.07886553280613445
P/R = 0.7025698993339128
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 139303.507840 -39304.212803 178607.720643 139303.50784 -39304.212803 -39304.212803
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 111323.906620 -19476.986464 -39272.44004 170073.333123 150576.814461 130781.360885 111323.90662 -19496.518662 -39291.972239 39.064398 -19483.56341 -39265.863094
5 AC 2 0.749226 44839.624931 170073.333123 111323.906620 -19476.986464 -39272.44004 170073.333123 150576.814461 130781.360885 111323.90662 -19496.518662 -39291.972239 39.064398 -19483.56341 -39265.863094
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 124676.195844 -16185.076068 -12743.225157 -12900.630664 166505.127733 144787.544738 147859.950307 132353.827133 147746.497458 132152.468968 135963.76522 124676.195844 -11287.569377 -7476.273124 -7677.631289 -4306.459113 -4218.553798 -4957.444481 -1905.000707 -17170.939339 -12193.239083 -12464.753467
──────── Test point index: 2841 | True SalePrice: 83000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2841 1414 4 4
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2841 ===
f_AB(S)   = 97282.61595175826
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 116785.9359975205
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 2114.434377707563
|f_AB(S)-f_A|  = 72790.71717170719
|psi_ABC(A)|   = 3245.897264889398
|f_ABC(S)-f_A| = 49719.19173499865
|ReSHAP_AB(A)|  = 4073.5767608227534
|ReSHAP_ABC(A)| = 2218.6749197461463

P   = -0.03623646213874965
R   = 0.011338745838026236
P/R = -3.1958086596513233
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 169897.780616 -8709.940026 178607.720643 169897.780616 -8709.940026 -8709.940026
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
4 AB 2 0.749226 44839.624931 170073.333123 97282.615952 -2114.434378 -70676.282794 170073.333123 165752.841854 97190.993438 97282.615952 -4320.49127 -72882.339686 4412.113784 -4073.576761 -68717.140411
5 AC 2 0.749226 44839.624931 170073.333123 97282.615952 -2114.434378 -70676.282794 170073.333123 165752.841854 97190.993438 97282.615952 -4320.49127 -72882.339686 4412.113784 -4073.576761 -68717.140411
6 BC 2 0.697364 49258.516562 161570.634319 123838.112811 -18897.059703 -18835.461806 161570.634319 120942.918792 121004.516689 123838.112811 -40627.715527 -40566.117631 43461.311649 -18880.573688 -18851.94782
7 ABC 3 0.797024 40340.721518 166505.127733 116785.935998 -3245.897265 -23044.007582 -23429.286888 166505.127733 161692.44146 136767.390763 133793.749612 136359.403222 133431.178542 118760.008604 116785.935998 -1974.072607 -16645.242544 -17007.813614 -954.152073 -999.568544 -11253.449304 -884.893048 -2218.67492 -23482.269311 -24018.247505
──────── Test point index: 1608 | True SalePrice: 137500 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1608 889 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1608 ===
f_AB(S)   = 107061.02784964658
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 118636.57995064126
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 28402.026558847276
|f_AB(S)-f_A|  = 63012.30527381887
|psi_ABC(A)|   = 24733.416590041215
|f_ABC(S)-f_A| = 47868.54778187789
|ReSHAP_AB(A)|  = 28803.613514097906
|ReSHAP_ABC(A)| = 28841.304047720376

P   = -0.06595674617989367
R   = -0.1453995982484027
P/R = 0.4536239919123589
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 111912.245571 -66695.475072 178607.720643 111912.245571 -66695.475072 -66695.475072
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 107061.027850 -28402.026559 -34610.278715 170073.333123 136989.613041 130781.360885 107061.02785 -33083.720083 -39291.972239 9363.387048 -28803.613514 -34208.69176
5 AC 2 0.749226 44839.624931 170073.333123 107061.027850 -28402.026559 -34610.278715 170073.333123 136989.613041 130781.360885 107061.02785 -33083.720083 -39291.972239 9363.387048 -28803.613514 -34208.69176
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 118636.579951 -24733.41659 -11472.929333 -11662.201859 166505.127733 132905.091013 147859.950307 124662.742203 147746.497458 124397.649999 135963.76522 118636.579951 -17327.18527 -5761.070048 -6026.162252 -6021.662189 -5870.022835 -2481.278762 -4381.166426 -28841.304048 -9330.312349 -9696.931385
──────── Test point index: 2916 | True SalePrice: 92000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2916 630 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2916 ===
f_AB(S)   = 117058.04417650774
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 112153.1012723541
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 30489.59599288046
|f_AB(S)-f_A|  = 53015.288946957706
|psi_ABC(A)|   = 31365.691182312887
|f_ABC(S)-f_A| = 54352.02646016504
|ReSHAP_AB(A)|  = 28946.80750793697
|ReSHAP_ABC(A)| = 37072.843299231055

P   = -0.0019746228757993034
R   = -0.136078928399464
P/R = 0.014510864386018204
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 92304.316697 -86303.403946 178607.720643 92304.316697 -86303.403946 -86303.403946
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 117058.044177 -30489.595993 -22525.692954 170073.333123 122817.457846 130781.360885 117058.044177 -47255.875278 -39291.972239 33532.558569 -28946.807508 -24068.481439
5 AC 2 0.749226 44839.624931 170073.333123 117058.044177 -30489.595993 -22525.692954 170073.333123 122817.457846 130781.360885 117058.044177 -47255.875278 -39291.972239 33532.558569 -28946.807508 -24068.481439
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 112153.101272 -31365.691182 -11418.393533 -11567.941745 166505.127733 124118.870406 147859.950307 119995.893397 147746.497458 119810.249823 135963.76522 112153.101272 -23810.663948 -7657.148551 -7842.792125 -4125.583687 -4053.392963 3534.171542 -10396.61673 -37072.843299 -8550.72475 -8728.458411
──────── Test point index: 1691 | True SalePrice: 492000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1691 2656 9 9
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1691 ===
f_AB(S)   = 373823.67457176367
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 374526.4841356591
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 79572.83473905953
|f_AB(S)-f_A|  = 203750.34144829822
|psi_ABC(A)|   = 71572.78505254856
|f_ABC(S)-f_A| = 208021.35640313997
|ReSHAP_AB(A)|  = 75474.84984408767
|ReSHAP_ABC(A)| = 71563.35900852413

P   = 0.046476264416440705
R   = 0.0264088016882032
P/R = 1.7598778227488312
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 307074.989239 128467.268596 178607.720643 307074.989239 128467.268596 128467.268596
2 B 1 0.653707 52691.751258 167830.652805 309677.662236 141847.009431 167830.652805 309677.662236 141847.009431 141847.009431
3 C 1 0.653707 52691.751258 167830.652805 309677.662236 141847.009431 167830.652805 309677.662236 141847.009431 141847.009431
4 AB 2 0.749226 44839.624931 170073.333123 373823.674572 79572.834739 124177.506709 170073.333123 233832.613448 278437.285418 373823.674572 63759.280324 108363.952295 31627.108829 75474.849844 128275.491604
5 AC 2 0.749226 44839.624931 170073.333123 373823.674572 79572.834739 124177.506709 170073.333123 233832.613448 278437.285418 373823.674572 63759.280324 108363.952295 31627.108829 75474.849844 128275.491604
6 BC 2 0.697364 49258.516562 161570.634319 342240.984080 90290.340553 90380.009207 161570.634319 248413.207248 248502.875901 342240.98408 86842.572929 86932.241582 6895.53525 90288.561488 90381.788272
7 ABC 3 0.797024 40340.721518 166505.127733 374526.484136 71572.785053 68327.667016 68120.904334 166505.127733 237489.848425 227712.94717 299579.764403 227506.184488 299373.001721 302659.666903 374526.484136 71866.817233 75153.482415 74946.719733 0.0 0.0 -13063.566437 -882.09654 71563.359009 68322.983986 68135.013409
──────── Test point index: 2394 | True SalePrice: 329900 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2394 2358 8 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2394 ===
f_AB(S)   = 294769.92235876445
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 304982.6350839407
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 50322.575782681204
|f_AB(S)-f_A|  = 124696.589235299
|psi_ABC(A)|   = 53235.62583326062
|f_ABC(S)-f_A| = 138477.50735142155
|ReSHAP_AB(A)|  = 49730.83572570526
|ReSHAP_ABC(A)| = 53202.25597417941

P   = 0.01912497957859899
R   = 0.014620517330350025
P/R = 1.308091851093283
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 274161.295060 95553.574418 178607.720643 274161.29506 95553.574418 95553.574418
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
4 AB 2 0.749226 44839.624931 170073.333123 294769.922359 50322.575783 74374.013453 170073.333123 217471.866615 241523.304285 294769.922359 47398.533491 71449.971161 5848.084583 49730.835726 74965.75351
5 AC 2 0.749226 44839.624931 170073.333123 294769.922359 50322.575783 74374.013453 170073.333123 217471.866615 241523.304285 294769.922359 47398.533491 71449.971161 5848.084583 49730.835726 74965.75351
6 BC 2 0.697364 49258.516562 161570.634319 276370.891790 57370.593657 57429.663814 161570.634319 218830.444431 218889.514587 276370.89179 57259.810111 57318.880268 221.567092 57370.536544 57429.720927
7 ABC 3 0.797024 40340.721518 166505.127733 304982.635084 53235.625833 42689.105436 42552.776082 166505.127733 219303.353034 202112.051547 255566.377646 201975.722193 255430.048292 251528.308985 304982.635084 53454.326099 49552.586792 49416.257438 0.0 0.0 -13289.56218 -656.100798 53202.255974 42696.35892 42578.892458
──────── Test point index: 188 | True SalePrice: 125500 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
188 1363 4 4
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 188 ===
f_AB(S)   = 98499.79154778406
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 115509.26645083074
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 2902.9176934705683
|f_AB(S)-f_A|  = 71573.54157568139
|psi_ABC(A)|   = 5338.570875894875
|f_ABC(S)-f_A| = 50995.86128168841
|ReSHAP_AB(A)|  = 6365.484794537765
|ReSHAP_ABC(A)| = 3895.805297686954

P   = -0.064127824907625
R   = 0.012541744948342093
P/R = -5.113150137541438
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 164264.900069 -14342.820574 178607.720643 164264.900069 -14342.820574 -14342.820574
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
4 AB 2 0.749226 44839.624931 170073.333123 98499.791548 -2902.917693 -68670.623882 170073.333123 162958.699626 97190.993438 98499.791548 -7114.633497 -72882.339686 8423.431607 -6365.484795 -65208.056781
5 AC 2 0.749226 44839.624931 170073.333123 98499.791548 -2902.917693 -68670.623882 170073.333123 162958.699626 97190.993438 98499.791548 -7114.633497 -72882.339686 8423.431607 -6365.484795 -65208.056781
6 BC 2 0.697364 49258.516562 161570.634319 123838.112811 -18897.059703 -18835.461806 161570.634319 120942.918792 121004.516689 123838.112811 -40627.715527 -40566.117631 43461.311649 -18880.573688 -18851.94782
7 ABC 3 0.797024 40340.721518 166505.127733 115509.266451 -5338.570876 -22633.585501 -23023.704905 166505.127733 158579.987551 136767.390763 131909.692333 136359.403222 131537.441066 118760.008604 115509.266451 -3250.742153 -16028.174615 -16400.425882 -1571.220002 -1606.956277 -10642.120603 -1496.22175 -3895.805298 -23277.400796 -23822.655188
──────── Test point index: 1117 | True SalePrice: 257000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1117 1861 8 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1117 ===
f_AB(S)   = 261692.6478330617
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 274274.55423378304
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 20169.34354832732
|f_AB(S)-f_A|  = 91619.31470959625
|psi_ABC(A)|   = 22653.182437334137
|f_ABC(S)-f_A| = 107769.4265012639
|ReSHAP_AB(A)|  = 20169.343548327328
|ReSHAP_ABC(A)| = 22687.47088511966

P   = 0.009942466318905469
R   = 0.009624301428373061
P/R = 1.0330584918708423
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 219268.321884 40660.601241 178607.720643 219268.321884 40660.601241 40660.601241
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
4 AB 2 0.749226 44839.624931 170073.333123 261692.647833 20169.343548 71449.971161 170073.333123 190242.676672 241523.304285 261692.647833 20169.343548 71449.971161 -0.0 20169.343548 71449.971161
5 AC 2 0.749226 44839.624931 170073.333123 261692.647833 20169.343548 71449.971161 170073.333123 190242.676672 241523.304285 261692.647833 20169.343548 71449.971161 -0.0 20169.343548 71449.971161
6 BC 2 0.697364 49258.516562 161570.634319 276370.891790 57370.593657 57429.663814 161570.634319 218830.444431 218889.514587 276370.89179 57259.810111 57318.880268 221.567092 57370.536544 57429.720927
7 ABC 3 0.797024 40340.721518 166505.127733 274274.554234 22653.182437 42626.286709 42489.957355 166505.127733 188972.184546 202112.051547 224858.296796 201975.722193 224721.967442 251528.308985 274274.554234 22746.245249 49552.586792 49416.257438 0.0 0.0 -13666.474543 -279.188435 22687.470885 42599.577907 42482.377709
──────── Test point index: 555 | True SalePrice: 135000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
555 980 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 555 ===
f_AB(S)   = 106012.19526378946
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 120914.5589457151
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 26433.629687979585
|f_AB(S)-f_A|  = 64061.13785967599
|psi_ABC(A)|   = 21896.971146536525
|f_ABC(S)-f_A| = 45590.56878680404
|ReSHAP_AB(A)|  = 26710.10676561293
|ReSHAP_ABC(A)| = 25050.14147119592

P   = -0.06766478698439254
R   = -0.13251173208279016
P/R = 0.5106324241699385
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 121963.071645 -56644.648998 178607.720643 121963.071645 -56644.648998 -56644.648998
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 106012.195264 -26433.629688 -37627.508172 170073.333123 141975.239368 130781.360885 106012.195264 -28098.093755 -39291.972239 3328.928134 -26710.106766 -37351.031094
5 AC 2 0.749226 44839.624931 170073.333123 106012.195264 -26433.629688 -37627.508172 170073.333123 141975.239368 130781.360885 106012.195264 -28098.093755 -39291.972239 3328.928134 -26710.106766 -37351.031094
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 120914.558946 -21896.971147 -11761.424342 -11932.173299 166505.127733 136684.208441 147859.950307 127096.458541 147746.497458 126868.413476 135963.76522 120914.558946 -15049.206274 -5953.854531 -6181.899595 -5828.877707 -5714.285492 -3633.89537 -3228.549818 -25050.141471 -10104.571143 -10435.856173
──────── Test point index: 175 | True SalePrice: 80000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
175 747 4 4
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 175 ===
f_AB(S)   = 125625.26581285498
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 100089.10094571539
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 6214.608131248067
|f_AB(S)-f_A|  = 44448.067310610466
|psi_ABC(A)|   = 26271.794928443534
|f_ABC(S)-f_A| = 66416.02678680376
|ReSHAP_AB(A)|  = 15968.08533804211
|ReSHAP_ABC(A)| = 27849.571804768064

P   = -0.25574680013285533
R   = -0.0600674161723066
P/R = 4.257662746791571
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 96715.478547 -81892.242096 178607.720643 96715.478547 -81892.242096 -81892.242096
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
4 AB 2 0.749226 44839.624931 170073.333123 125625.265813 -6214.608131 -38233.459179 170073.333123 129209.844486 97190.993438 125625.265813 -40863.488638 -72882.339686 69297.761013 -15968.085338 -28479.981973
5 AC 2 0.749226 44839.624931 170073.333123 125625.265813 -6214.608131 -38233.459179 170073.333123 129209.844486 97190.993438 125625.265813 -40863.488638 -72882.339686 69297.761013 -15968.085338 -28479.981973
6 BC 2 0.697364 49258.516562 161570.634319 123838.112811 -18897.059703 -18835.461806 161570.634319 120942.918792 121004.516689 123838.112811 -40627.715527 -40566.117631 43461.311649 -18880.573688 -18851.94782
7 ABC 3 0.797024 40340.721518 166505.127733 100089.100946 -26271.794928 -19922.257975 -20221.973884 166505.127733 127359.198655 136767.390763 115660.571082 136359.403222 115469.126804 118760.008604 100089.100946 -18670.907659 -15380.025859 -15571.470136 -2219.368759 -2435.912023 3681.398285 -15819.740638 -27849.571805 -19103.425799 -19463.029183
──────── Test point index: 366 | True SalePrice: 501837 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
366 2234 9 9
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 366 ===
f_AB(S)   = 331499.0334934045
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 348452.41951037437
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 46833.34042826263
|f_AB(S)-f_A|  = 161425.70036993906
|psi_ABC(A)|   = 45605.398507114
|f_ABC(S)-f_A| = 181947.29177785522
|ReSHAP_AB(A)|  = 44000.32737939966
|ReSHAP_ABC(A)| = 45651.727617080374

P   = 0.0394714951675072
R   = 0.021666915388913133
P/R = 1.8217404027757726
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 260465.663926 81857.943283 178607.720643 260465.663926 81857.943283 81857.943283
2 B 1 0.653707 52691.751258 167830.652805 309677.662236 141847.009431 167830.652805 309677.662236 141847.009431 141847.009431
3 C 1 0.653707 52691.751258 167830.652805 309677.662236 141847.009431 167830.652805 309677.662236 141847.009431 141847.009431
4 AB 2 0.749226 44839.624931 170073.333123 331499.033493 46833.340428 114592.359942 170073.333123 210678.265905 278437.285418 331499.033493 40604.932781 108363.952295 12456.815294 44000.327379 117425.372991
5 AC 2 0.749226 44839.624931 170073.333123 331499.033493 46833.340428 114592.359942 170073.333123 210678.265905 278437.285418 331499.033493 40604.932781 108363.952295 12456.815294 44000.327379 117425.372991
6 BC 2 0.697364 49258.516562 161570.634319 342240.984080 90290.340553 90380.009207 161570.634319 248413.207248 248502.875901 342240.98408 86842.572929 86932.241582 6895.53525 90288.561488 90381.788272
7 ABC 3 0.797024 40340.721518 166505.127733 348452.419510 45605.398507 68274.327976 68067.565294 166505.127733 211735.818039 227712.94717 273505.699777 227506.184488 273298.937095 302659.666903 348452.41951 45792.752607 75153.482415 74946.719733 -0.0 0.0 -13383.600677 -562.062301 45651.727617 68241.655493 68053.908668
──────── Test point index: 456 | True SalePrice: 552000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
456 2492 10 10
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 456 ===
f_AB(S)   = 408170.405029704
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 415524.13358721934
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 72629.85880086129
|f_AB(S)-f_A|  = 238097.07190623853
|psi_ABC(A)|   = 61480.97418782578
|f_ABC(S)-f_A| = 249019.0058547002
|ReSHAP_AB(A)|  = 64420.743005324344
|ReSHAP_ABC(A)| = 61546.17272393104

P   = 0.058150359604101975
R   = 0.02341051728681942
P/R = 2.4839416785054027
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 288961.412577 110353.691934 178607.720643 288961.412577 110353.691934 110353.691934
2 B 1 0.653707 52691.751258 167830.652805 357997.585228 190166.932423 167830.652805 357997.585228 190166.932423 190166.932423
3 C 1 0.653707 52691.751258 167830.652805 357997.585228 190166.932423 167830.652805 357997.585228 190166.932423 190166.932423
4 AB 2 0.749226 44839.624931 170073.333123 408170.405030 72629.858801 165467.213105 170073.333123 224813.33835 317650.692654 408170.40503 54740.005226 147577.359531 35779.707149 64420.743005 173676.328901
5 AC 2 0.749226 44839.624931 170073.333123 408170.405030 72629.858801 165467.213105 170073.333123 224813.33835 317650.692654 408170.40503 54740.005226 147577.359531 35779.707149 64420.743005 173676.328901
6 BC 2 0.697364 49258.516562 161570.634319 408111.076369 122989.130608 123551.311442 161570.634319 278211.79659 278773.977424 408111.076369 116641.16227 117203.343104 12695.936675 122973.869586 123566.572464
7 ABC 3 0.797024 40340.721518 166505.127733 415524.133587 61480.974188 93907.613838 93630.417829 166505.127733 227481.173109 253313.842793 315047.611214 253036.646783 314770.415204 353791.024821 415524.133587 61733.108766 100753.718383 100476.522374 0.659655 0.659655 -13187.280278 -758.382699 61546.172724 93865.538761 93607.294369
──────── Test point index: 506 | True SalePrice: 244600 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
506 1468 8 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 506 ===
f_AB(S)   = 240161.31655016236
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 249992.3092154491
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 1361.987734572016
|f_AB(S)-f_A|  = 70087.98342669691
|psi_ABC(A)|   = 1529.7154592435497
|f_ABC(S)-f_A| = 83487.18148292994
|ReSHAP_AB(A)|  = 1361.987734572001
|ReSHAP_ABC(A)| = 1536.3467044214392

P   = 0.0011097843141570883
R   = 0.0010303560107921793
P/R = 1.0770882127468169
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 175862.007078 -2745.713565 178607.720643 175862.007078 -2745.713565 -2745.713565
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
4 AB 2 0.749226 44839.624931 170073.333123 240161.316550 -1361.987735 71449.971161 170073.333123 168711.345389 241523.304285 240161.31655 -1361.987735 71449.971161 -0.0 -1361.987735 71449.971161
5 AC 2 0.749226 44839.624931 170073.333123 240161.316550 -1361.987735 71449.971161 170073.333123 168711.345389 241523.304285 240161.31655 -1361.987735 71449.971161 -0.0 -1361.987735 71449.971161
6 BC 2 0.697364 49258.516562 161570.634319 276370.891790 57370.593657 57429.663814 161570.634319 218830.444431 218889.514587 276370.89179 57259.810111 57318.880268 221.567092 57370.536544 57429.720927
7 ABC 3 0.797024 40340.721518 166505.127733 249992.309215 -1529.715459 42576.613148 42440.283794 166505.127733 164987.980893 202112.051547 200576.051778 201975.722193 200439.722423 251528.308985 249992.309215 -1535.999769 49552.586792 49416.257438 0.0 0.0 -13964.515908 18.85293 -1536.346704 42570.323951 42453.204237
──────── Test point index: 1538 | True SalePrice: 311500 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1538 1954 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1538 ===
f_AB(S)   = 229873.867232305
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 228889.3764344019
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 25264.54408092184
|f_AB(S)-f_A|  = 59800.53410883955
|psi_ABC(A)|   = 28235.553901457784
|f_ABC(S)-f_A| = 62384.24870188275
|ReSHAP_AB(A)|  = 25264.544080921845
|ReSHAP_ABC(A)| = 28200.567542985642

P   = -0.030126858860890227
R   = -0.029566038474331813
P/R = 1.0189683980505302
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 229540.045235 50932.324592 178607.720643 229540.045235 50932.324592 50932.324592
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 229873.867232 25264.544081 34535.990028 170073.333123 195337.877204 204609.323151 229873.867232 25264.544081 34535.990028 -0.0 25264.544081 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 229873.867232 25264.544081 34535.990028 170073.333123 195337.877204 204609.323151 229873.867232 25264.544081 34535.990028 -0.0 25264.544081 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 228889.376434 28235.553901 17079.458366 17069.236435 166505.127733 194647.835793 183491.465006 211634.173066 183481.243075 211623.951135 200468.13085 228889.376434 28421.245584 17265.425299 17255.203368 -278.537524 -278.537524 -279.088025 278.537524 28200.567543 17096.913486 17086.767673
──────── Test point index: 2381 | True SalePrice: 252000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2381 1618 8 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2381 ===
f_AB(S)   = 248379.3819253148
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 259260.3416651949
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 6856.077640580435
|f_AB(S)-f_A|  = 78306.04880184936
|psi_ABC(A)|   = 7700.398241740317
|f_ABC(S)-f_A| = 92755.21393267575
|ReSHAP_AB(A)|  = 6856.077640580447
|ReSHAP_ABC(A)| = 7724.129668989228

P   = 0.0045363971744319664
R   = 0.004280547112287494
P/R = 1.0597704114527893
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 192429.302805 13821.582163 178607.720643 192429.302805 13821.582163 13821.582163
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
4 AB 2 0.749226 44839.624931 170073.333123 248379.381925 6856.077641 71449.971161 170073.333123 176929.410764 241523.304285 248379.381925 6856.077641 71449.971161 -0.0 6856.077641 71449.971161
5 AC 2 0.749226 44839.624931 170073.333123 248379.381925 6856.077641 71449.971161 170073.333123 176929.410764 241523.304285 248379.381925 6856.077641 71449.971161 -0.0 6856.077641 71449.971161
6 BC 2 0.697364 49258.516562 161570.634319 276370.891790 57370.593657 57429.663814 161570.634319 218830.444431 218889.514587 276370.89179 57259.810111 57318.880268 221.567092 57370.536544 57429.720927
7 ABC 3 0.797024 40340.721518 166505.127733 259260.341665 7700.398242 42595.572523 42459.243168 166505.127733 174142.257097 202112.051547 209844.084227 201975.722193 209707.754873 251528.308985 259260.341665 7732.03268 49552.586792 49416.257438 0.0 -0.0 -13850.759662 -94.903316 7724.129669 42574.107193 42456.977071
──────── Test point index: 1820 | True SalePrice: 132000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1820 1057 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1820 ===
f_AB(S)   = 107783.16623912643
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 122842.07963385455
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 23438.840754021992
|f_AB(S)-f_A|  = 62290.16688433902
|psi_ABC(A)|   = 19260.820364036623
|f_ABC(S)-f_A| = 43663.0480986646
|ReSHAP_AB(A)|  = 23546.348983204898
|ReSHAP_ABC(A)| = 21445.44304820643

P   = -0.06483928017130197
R   = -0.11314702730823056
P/R = 0.5730533246328198
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 130467.616785 -48140.103858 178607.720643 130467.616785 -48140.103858 -48140.103858
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 107783.166239 -23438.840754 -38851.32613 170073.333123 146193.846261 130781.360885 107783.166239 -23879.486862 -39291.972239 881.292217 -23546.348983 -38743.817901
5 AC 2 0.749226 44839.624931 170073.333123 107783.166239 -23438.840754 -38851.32613 170073.333123 146193.846261 130781.360885 107783.166239 -23879.486862 -39291.972239 881.292217 -23546.348983 -38743.817901
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 122842.079634 -19260.820364 -12109.187923 -12293.039811 166505.127733 140319.419108 147859.950307 129455.282465 147746.497458 129201.031538 135963.76522 122842.079634 -13121.685586 -6358.951904 -6613.202831 -5423.780333 -5282.982256 -4505.184739 -2357.260449 -21445.443048 -10927.81302 -11289.792031
──────── Test point index: 1600 | True SalePrice: 81000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1600 630 4 4
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1600 ===
f_AB(S)   = 130975.86874234876
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 97160.27080919182
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 6735.49998639281
|f_AB(S)-f_A|  = 39097.46438111669
|psi_ABC(A)|   = 29304.457769193
|f_ABC(S)-f_A| = 69344.85692332733
|ReSHAP_AB(A)|  = 15378.827636362057
|ReSHAP_ABC(A)| = 31681.370719943778

P   = -0.250315613025228
R   = -0.06352100484450407
P/R = 3.9406746420020724
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 92304.316697 -86303.403946 178607.720643 92304.316697 -86303.403946 -86303.403946
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
4 AB 2 0.749226 44839.624931 170073.333123 130975.868742 -6735.499986 -32361.964395 170073.333123 122817.457846 97190.993438 130975.868742 -47255.875278 -72882.339686 81040.750582 -15378.827636 -23718.636745
5 AC 2 0.749226 44839.624931 170073.333123 130975.868742 -6735.499986 -32361.964395 170073.333123 122817.457846 97190.993438 130975.868742 -47255.875278 -72882.339686 81040.750582 -15378.827636 -23718.636745
6 BC 2 0.697364 49258.516562 161570.634319 123838.112811 -18897.059703 -18835.461806 161570.634319 120942.918792 121004.516689 123838.112811 -40627.715527 -40566.117631 43461.311649 -18880.573688 -18851.94782
7 ABC 3 0.797024 40340.721518 166505.127733 97160.270809 -29304.457769 -19870.341622 -20170.057532 166505.127733 124118.870406 136767.390763 112731.740945 136359.403222 112540.296668 118760.008604 97160.270809 -21599.737795 -15380.025859 -15571.470136 -2219.368759 -2435.912023 3992.896398 -16131.23875 -31681.37072 -18659.639769 -19003.846434
──────── Test point index: 1486 | True SalePrice: 145900 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1486 2200 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1486 ===
f_AB(S)   = 169523.53218085258
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 170971.46285931196
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 38742.17129617178
|f_AB(S)-f_A|  = 549.8009426128701
|psi_ABC(A)|   = 40231.922575684635
|f_ABC(S)-f_A| = 4466.335126792808
|ReSHAP_AB(A)|  = 38742.17129617281
|ReSHAP_ABC(A)| = 26986.56781599386

P   = 61.457999159417064
R   = 64.42359721323137
P/R = 0.9539672079471335
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 256710.410228 78102.689585 178607.720643 256710.410228 78102.689585 78102.689585
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 169523.532181 38742.171296 -39291.972239 170073.333123 208815.50442 130781.360885 169523.532181 38742.171296 -39291.972239 -0.0 38742.171296 -39291.972239
5 AC 2 0.749226 44839.624931 170073.333123 169523.532181 38742.171296 -39291.972239 170073.333123 208815.50442 130781.360885 169523.532181 38742.171296 -39291.972239 -0.0 38742.171296 -39291.972239
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 170971.462859 40231.922576 -17857.409389 -17908.17806 166505.127733 209660.848766 147859.950307 190329.615182 147746.497458 190341.530691 135963.76522 170971.462859 35007.697639 -19370.067832 -19358.152323 7587.335594 7461.967236 38.834248 -6901.279435 26986.567816 -11295.460512 -11224.772177
──────── Test point index: 1221 | True SalePrice: 118000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1221 1077 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1221 ===
f_AB(S)   = 108576.15180773073
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 123342.73435804661
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 22494.47694470967
|f_AB(S)-f_A|  = 61497.18131573472
|psi_ABC(A)|   = 18501.40049659516
|f_ABC(S)-f_A| = 43162.39337447254
|ReSHAP_AB(A)|  = 22571.403961825843
|ReSHAP_ABC(A)| = 20390.64666315909

P   = -0.06286568911531415
R   = -0.10538543952175145
P/R = 0.5965310710910755
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 132676.589549 -45931.131094 178607.720643 132676.589549 -45931.131094 -45931.131094
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 108576.151808 -22494.476945 -39002.704371 170073.333123 147289.588311 130781.360885 108576.151808 -22783.744812 -39291.972239 578.535736 -22571.403962 -38925.777354
5 AC 2 0.749226 44839.624931 170073.333123 108576.151808 -22494.476945 -39002.704371 170073.333123 147289.588311 130781.360885 108576.151808 -22783.744812 -39291.972239 578.535736 -22571.403962 -38925.777354
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 123342.734358 -18501.400497 -12252.744794 -12408.248084 166505.127733 141401.861596 147859.950307 130122.096256 147746.497458 129924.542526 135963.76522 123342.734358 -12621.030862 -6581.808168 -6779.361898 -5200.92407 -5116.823189 -4697.957172 -2164.488015 -20390.646663 -11242.463725 -11529.282987
──────── Test point index: 651 | True SalePrice: 152500 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
651 1527 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 651 ===
f_AB(S)   = 206479.77446437112
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 202551.72013633
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 1870.4513129879488
|f_AB(S)-f_A|  = 36406.44134090567
|psi_ABC(A)|   = 2083.553184867531
|f_ABC(S)-f_A| = 36046.59240381085
|ReSHAP_AB(A)|  = 1870.4513129879488
|ReSHAP_ABC(A)| = 2083.580580186932

P   = -0.006424735080895735
R   = -0.006425495078376059
P/R = 0.999881721568369
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 182378.476731 3770.756088 178607.720643 182378.476731 3770.756088 3770.756088
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 206479.774464 1870.451313 34535.990028 170073.333123 171943.784436 204609.323151 206479.774464 1870.451313 34535.990028 0.0 1870.451313 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 206479.774464 1870.451313 34535.990028 170073.333123 171943.784436 204609.323151 206479.774464 1870.451313 34535.990028 0.0 1870.451313 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 202551.720136 2083.553185 16986.630575 16976.408644 166505.127733 168588.662867 183491.465006 185575.000141 183481.243075 185564.77821 200468.13085 202551.720136 2083.589286 16986.941927 16976.719996 -0.054152 -0.054152 -0.604653 0.054152 2083.58058 16986.616792 16976.395031
──────── Test point index: 1465 | True SalePrice: 183000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1465 1640 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1465 ===
f_AB(S)   = 212670.71704698596
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 209488.29517293413
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 8061.393895602814
|f_AB(S)-f_A|  = 42597.38392352051
|psi_ABC(A)|   = 8993.225801473242
|f_ABC(S)-f_A| = 42983.16744041498
|ReSHAP_AB(A)|  = 8061.393895602832
|ReSHAP_ABC(A)| = 9000.633970583822

P   = -0.019980468742661628
R   = -0.020152819212664375
P/R = 0.9914478233450118
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 194859.172845 16251.452203 178607.720643 194859.172845 16251.452203 16251.452203
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 212670.717047 8061.393896 34535.990028 170073.333123 178134.727019 204609.323151 212670.717047 8061.393896 34535.990028 -0.0 8061.393896 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 212670.717047 8061.393896 34535.990028 170073.333123 178134.727019 204609.323151 212670.717047 8061.393896 34535.990028 -0.0 8061.393896 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 209488.295173 8993.225801 17000.081785 16989.859854 166505.127733 175484.884273 183491.465006 192471.221547 183481.243075 192460.999616 200468.13085 209488.295173 9020.164323 17027.295557 17017.073626 -40.407782 -40.407782 -40.958283 40.407782 9000.633971 16996.373606 16986.159864
──────── Test point index: 120 | True SalePrice: 159000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
120 1209 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 120 ===
f_AB(S)   = 115229.51360234566
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 126511.54372942832
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 15551.84728233514
|f_AB(S)-f_A|  = 54843.81952111979
|psi_ABC(A)|   = 13152.703445682326
|f_ABC(S)-f_A| = 39993.58400309083
|ReSHAP_AB(A)|  = 15551.847282335137
|ReSHAP_ABC(A)| = 13303.043173166638

P   = -0.0453042501995401
R   = -0.049063346345379344
P/R = 0.9233828015036469
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 147255.809789 -31351.910854 178607.720643 147255.809789 -31351.910854 -31351.910854
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 115229.513602 -15551.847282 -39291.972239 170073.333123 154521.485841 130781.360885 115229.513602 -15551.847282 -39291.972239 -0.0 -15551.847282 -39291.972239
5 AC 2 0.749226 44839.624931 170073.333123 115229.513602 -15551.847282 -39291.972239 170073.333123 154521.485841 130781.360885 115229.513602 -15551.847282 -39291.972239 -0.0 -15551.847282 -39291.972239
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 126511.543729 -13152.703446 -13332.115283 -13508.765275 166505.127733 149181.597315 147859.950307 135240.789021 147746.497458 135000.941886 135963.76522 126511.543729 -9452.221491 -8489.398157 -8729.245291 -3293.334081 -3166.939796 -5451.410138 -1411.03505 -13303.043173 -13201.193807 -13489.347023
──────── Test point index: 1058 | True SalePrice: 390000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1058 2748 9 9
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1058 ===
f_AB(S)   = 383071.3997071001
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 380210.8773715032
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 87023.66929886007
|f_AB(S)-f_A|  = 212998.06658363467
|psi_ABC(A)|   = 77231.31602816649
|f_ABC(S)-f_A| = 213705.74963898407
|ReSHAP_AB(A)|  = 83165.24570575196
|ReSHAP_ABC(A)| = 77204.51001382181

P   = 0.04717462302602771
R   = 0.029185228061580804
P/R = 1.6163869929845776
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 317236.263952 138628.543309 178607.720643 317236.263952 138628.543309 138628.543309
2 B 1 0.653707 52691.751258 167830.652805 309677.662236 141847.009431 167830.652805 309677.662236 141847.009431 141847.009431
3 C 1 0.653707 52691.751258 167830.652805 309677.662236 141847.009431 167830.652805 309677.662236 141847.009431 141847.009431
4 AB 2 0.749226 44839.624931 170073.333123 383071.399707 87023.669299 125974.397285 170073.333123 239486.557432 278437.285418 383071.399707 69413.224309 108363.952295 35220.88998 83165.245706 129832.820878
5 AC 2 0.749226 44839.624931 170073.333123 383071.399707 87023.669299 125974.397285 170073.333123 239486.557432 278437.285418 383071.399707 69413.224309 108363.952295 35220.88998 83165.245706 129832.820878
6 BC 2 0.697364 49258.516562 161570.634319 342240.984080 90290.340553 90380.009207 161570.634319 248413.207248 248502.875901 342240.98408 86842.572929 86932.241582 6895.53525 90288.561488 90381.788272
7 ABC 3 0.797024 40340.721518 166505.127733 380210.877372 77231.316028 68340.598146 68133.835464 166505.127733 243096.65488 227712.94717 305264.157638 227506.184488 305057.394956 302659.666903 380210.877372 77551.210469 75153.482415 74946.719733 0.0 0.0 -12985.979657 -959.683321 77204.510014 68344.634884 68156.604741
──────── Test point index: 2659 | True SalePrice: 163000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2659 1768 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2659 ===
f_AB(S)   = 219683.466167116
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 217397.01619671716
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 15074.143015732843
|f_AB(S)-f_A|  = 49610.13304365057
|psi_ABC(A)|   = 16837.2321607026
|f_ABC(S)-f_A| = 50891.88846419801
|ReSHAP_AB(A)|  = 15074.143015732843
|ReSHAP_ABC(A)| = 16837.659643214676

P   = -0.026991041451799702
R   = -0.026999441268056867
P/R = 0.9996888892561231
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 208996.598533 30388.87789 178607.720643 208996.598533 30388.87789 30388.87789
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 219683.466167 15074.143016 34535.990028 170073.333123 185147.476139 204609.323151 219683.466167 15074.143016 34535.990028 0.0 15074.143016 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 219683.466167 15074.143016 34535.990028 170073.333123 185147.476139 204609.323151 219683.466167 15074.143016 34535.990028 0.0 15074.143016 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 217397.016197 16837.232161 17032.439117 17022.217186 166505.127733 183296.5333 183491.465006 200282.870574 183481.243075 200272.648643 200468.13085 217397.016197 16928.885347 17124.367554 17114.145623 -137.479779 -137.479779 -138.03028 137.479779 16837.659643 17032.208271 17022.02055
──────── Test point index: 755 | True SalePrice: 103000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
755 1208 4 4
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 755 ===
f_AB(S)   = 104542.97563767005
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 111629.19233834231
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 4127.326092356314
|f_AB(S)-f_A|  = 65530.35748579539
|psi_ABC(A)|   = 11391.41774138881
|f_ABC(S)-f_A| = 54875.935394176835
|ReSHAP_AB(A)|  = 11557.46623947375
|ReSHAP_ABC(A)| = 9654.67914996479

P   = -0.1446015187514519
R   = 0.0004316400026232736
P/R = -335.0049065717783
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 147145.361151 -31462.359492 178607.720643 147145.361151 -31462.359492 -31462.359492
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
4 AB 2 0.749226 44839.624931 170073.333123 104542.975638 -4127.326092 -61403.031393 170073.333123 154466.698739 97190.993438 104542.975638 -15606.634385 -72882.339686 22958.616585 -11557.466239 -53972.891246
5 AC 2 0.749226 44839.624931 170073.333123 104542.975638 -4127.326092 -61403.031393 170073.333123 154466.698739 97190.993438 104542.975638 -15606.634385 -72882.339686 22958.616585 -11557.466239 -53972.891246
6 BC 2 0.697364 49258.516562 161570.634319 123838.112811 -18897.059703 -18835.461806 161570.634319 120942.918792 121004.516689 123838.112811 -40627.715527 -40566.117631 43461.311649 -18880.573688 -18851.94782
7 ABC 3 0.797024 40340.721518 166505.127733 111629.192338 -11391.417741 -21528.337067 -21956.180586 166505.127733 149120.568807 136767.390763 127128.368707 136359.403222 126680.669212 118760.008604 111629.192338 -7130.816266 -15051.476873 -15499.176368 -2547.917744 -2508.20579 -6940.723227 -5197.619125 -9654.67915 -22313.25378 -22908.002464
──────── Test point index: 1226 | True SalePrice: 165500 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1226 1661 6 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1226 ===
f_AB(S)   = 176907.26506615613
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 174427.98301242228
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 9211.923048124125
|f_AB(S)-f_A|  = 6833.931942690688
|psi_ABC(A)|   = 10261.355209282841
|f_ABC(S)-f_A| = 7922.855279903131
|ReSHAP_AB(A)|  = 9211.923048124125
|ReSHAP_ABC(A)| = 10261.35520928287

P   = 0.05280948245734285
R   = 0.052809482457339296
P/R = 1.0000000000000673
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 197178.594247 18570.873605 178607.720643 197178.594247 18570.873605 18570.873605
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
4 AB 2 0.749226 44839.624931 170073.333123 176907.265066 9211.923048 -2377.991105 170073.333123 179285.256172 167695.342018 176907.265066 9211.923048 -2377.991105 0.0 9211.923048 -2377.991105
5 AC 2 0.749226 44839.624931 170073.333123 176907.265066 9211.923048 -2377.991105 170073.333123 179285.256172 167695.342018 176907.265066 9211.923048 -2377.991105 0.0 9211.923048 -2377.991105
6 BC 2 0.697364 49258.516562 161570.634319 157757.239664 -1905.715523 -1907.679133 161570.634319 159664.918797 159662.955187 157757.239664 -1905.715523 -1907.679133 0.0 -1905.715523 -1907.679133
7 ABC 3 0.797024 40340.721518 166505.127733 174427.983012 10261.355209 -1169.601883 -1168.898047 166505.127733 176766.482942 165335.52585 175596.881059 165336.229686 175597.584895 164166.627803 174427.983012 10261.355209 -1169.601883 -1168.898047 0.0 0.0 0.0 0.0 10261.355209 -1169.601883 -1168.898047
──────── Test point index: 2873 | True SalePrice: 145400 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2873 1285 6 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2873 ===
f_AB(S)   = 156307.31452577407
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 151481.2639957493
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 11388.027492257941
|f_AB(S)-f_A|  = 13766.018597691378
|psi_ABC(A)|   = 12685.363807390162
|f_ABC(S)-f_A| = 15023.863736769854
|ReSHAP_AB(A)|  = 11388.027492257941
|ReSHAP_ABC(A)| = 12685.363807390158

P   = -0.017091197843278394
R   = -0.017091197843278172
P/R = 1.000000000000013
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 155649.906291 -22957.814352 178607.720643 155649.906291 -22957.814352 -22957.814352
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
4 AB 2 0.749226 44839.624931 170073.333123 156307.314526 -11388.027492 -2377.991105 170073.333123 158685.305631 167695.342018 156307.314526 -11388.027492 -2377.991105 0.0 -11388.027492 -2377.991105
5 AC 2 0.749226 44839.624931 170073.333123 156307.314526 -11388.027492 -2377.991105 170073.333123 158685.305631 167695.342018 156307.314526 -11388.027492 -2377.991105 0.0 -11388.027492 -2377.991105
6 BC 2 0.697364 49258.516562 161570.634319 157757.239664 -1905.715523 -1907.679133 161570.634319 159664.918797 159662.955187 157757.239664 -1905.715523 -1907.679133 0.0 -1905.715523 -1907.679133
7 ABC 3 0.797024 40340.721518 166505.127733 151481.263996 -12685.363807 -1169.601883 -1168.898047 166505.127733 153819.763925 165335.52585 152650.162043 165336.229686 152650.865878 164166.627803 151481.263996 -12685.363807 -1169.601883 -1168.898047 -0.0 -0.0 -0.0 0.0 -12685.363807 -1169.601883 -1168.898047
──────── Test point index: 2554 | True SalePrice: 121600 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2554 1230 4 4
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2554 ===
f_AB(S)   = 103536.87936032933
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 112179.91253495356
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 4027.7161035155077
|f_AB(S)-f_A|  = 66536.45376313612
|psi_ABC(A)|   = 10552.139040329663
|f_ABC(S)-f_A| = 54325.21519756559
|ReSHAP_AB(A)|  = 10978.144842365382
|ReSHAP_ABC(A)| = 8789.951429936962

P   = -0.1337061939686547
R   = 0.0031920160596582425
P/R = -41.887694632391714
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 149575.231191 -29032.489452 178607.720643 149575.231191 -29032.489452 -29032.489452
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
4 AB 2 0.749226 44839.624931 170073.333123 103536.879360 -4027.716104 -62508.73766 170073.333123 155672.014994 97190.993438 103536.87936 -14401.31813 -72882.339686 20747.204053 -10978.144842 -55558.308921
5 AC 2 0.749226 44839.624931 170073.333123 103536.879360 -4027.716104 -62508.73766 170073.333123 155672.014994 97190.993438 103536.87936 -14401.31813 -72882.339686 20747.204053 -10978.144842 -55558.308921
6 BC 2 0.697364 49258.516562 161570.634319 123838.112811 -18897.059703 -18835.461806 161570.634319 120942.918792 121004.516689 123838.112811 -40627.715527 -40566.117631 43461.311649 -18880.573688 -18851.94782
7 ABC 3 0.797024 40340.721518 166505.127733 112179.912535 -10552.13904 -21674.006578 -22099.069579 166505.127733 150463.195984 136767.390763 127750.076919 136359.403222 127307.938459 118760.008604 112179.912535 -6580.096069 -15128.025924 -15570.164384 -2471.368693 -2437.217775 -7585.09314 -4553.249212 -8789.95143 -22471.659126 -23063.604642
──────── Test point index: 1364 | True SalePrice: 127000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1364 1236 6 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1364 ===
f_AB(S)   = 153622.74650322428
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 148490.86710261903
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 14072.595514807705
|f_AB(S)-f_A|  = 16450.58662024117
|psi_ABC(A)|   = 15675.760700520439
|f_ABC(S)-f_A| = 18014.26062990012
|ReSHAP_AB(A)|  = 14072.595514807725
|ReSHAP_ABC(A)| = 15675.760700520448

P   = -0.014739758106623091
R   = -0.014739758106622314
P/R = 1.0000000000000526
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 150237.923020 -28369.797623 178607.720643 150237.92302 -28369.797623 -28369.797623
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
4 AB 2 0.749226 44839.624931 170073.333123 153622.746503 -14072.595515 -2377.991105 170073.333123 156000.737609 167695.342018 153622.746503 -14072.595515 -2377.991105 -0.0 -14072.595515 -2377.991105
5 AC 2 0.749226 44839.624931 170073.333123 153622.746503 -14072.595515 -2377.991105 170073.333123 156000.737609 167695.342018 153622.746503 -14072.595515 -2377.991105 -0.0 -14072.595515 -2377.991105
6 BC 2 0.697364 49258.516562 161570.634319 157757.239664 -1905.715523 -1907.679133 161570.634319 159664.918797 159662.955187 157757.239664 -1905.715523 -1907.679133 0.0 -1905.715523 -1907.679133
7 ABC 3 0.797024 40340.721518 166505.127733 148490.867103 -15675.760701 -1169.601883 -1168.898047 166505.127733 150829.367032 165335.52585 149659.765149 165336.229686 149660.468985 164166.627803 148490.867103 -15675.760701 -1169.601883 -1168.898047 -0.0 -0.0 0.0 0.0 -15675.760701 -1169.601883 -1168.898047
──────── Test point index: 291 | True SalePrice: 100000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
291 1666 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 291 ===
f_AB(S)   = 140267.21944531
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 142954.60445510183
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 9485.858560629196
|f_AB(S)-f_A|  = 29806.113678155438
|psi_ABC(A)|   = 9168.670782013069
|f_ABC(S)-f_A| = 23550.52327741732
|ReSHAP_AB(A)|  = 9485.858560629184
|ReSHAP_ABC(A)| = 1880.127436661754

P   = -0.07106708603660966
R   = 0.23841832367147592
P/R = -0.29807728257722016
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 197730.837438 19123.116795 178607.720643 197730.837438 19123.116795 19123.116795
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 140267.219445 9485.858561 -39291.972239 170073.333123 179559.191684 130781.360885 140267.219445 9485.858561 -39291.972239 0.0 9485.858561 -39291.972239
5 AC 2 0.749226 44839.624931 170073.333123 140267.219445 9485.858561 -39291.972239 170073.333123 179559.191684 130781.360885 140267.219445 9485.858561 -39291.972239 0.0 9485.858561 -39291.972239
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 142954.604455 9168.670782 -16334.141218 -16385.052842 166505.127733 177071.625482 147859.950307 157746.084443 147746.497458 157757.714045 135963.76522 142954.604455 6990.839235 -14803.10959 -14791.479988 3020.377353 2895.294901 -4522.431449 -2340.013739 1880.127437 -12593.332137 -12837.318577
──────── Test point index: 2662 | True SalePrice: 144000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2662 1690 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2662 ===
f_AB(S)   = 215410.07217203674
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 212577.63932284934
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 10800.749020653573
|f_AB(S)-f_A|  = 45336.7390485713
|psi_ABC(A)|   = 12057.2907855472
|f_ABC(S)-f_A| = 46072.51159033019
|ReSHAP_AB(A)|  = 10800.749020653573
|ReSHAP_ABC(A)| = 12064.986657662275

P   = -0.02346856621042731
R   = -0.023635604469237792
P/R = 0.9929327697530272
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 200381.604755 21773.884112 178607.720643 200381.604755 21773.884112 21773.884112
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 215410.072172 10800.749021 34535.990028 170073.333123 180874.082144 204609.323151 215410.072172 10800.749021 34535.990028 0.0 10800.749021 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 215410.072172 10800.749021 34535.990028 170073.333123 180874.082144 204609.323151 215410.072172 10800.749021 34535.990028 0.0 10800.749021 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 212577.639323 12057.290786 17012.721368 17002.499437 166505.127733 178536.309674 183491.465006 195522.646948 183481.243075 195512.425017 200468.13085 212577.639323 12109.508473 17065.214305 17054.992375 -78.326531 -78.326531 -78.877032 78.326531 12064.986658 17008.864593 16998.660339
──────── Test point index: 2834 | True SalePrice: 92000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2834 1338 6 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2834 ===
f_AB(S)   = 159211.0309583279
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 154715.7749209718
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 8484.31105970408
|f_AB(S)-f_A|  = 10862.302165137546
|psi_ABC(A)|   = 9450.852882167645
|f_ABC(S)-f_A| = 11789.352811547345
|ReSHAP_AB(A)|  = 8484.311059704096
|ReSHAP_ABC(A)| = 9450.852882167645

P   = -0.020564526802746452
R   = -0.02056452680274501
P/R = 1.0000000000000702
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 161503.684114 -17104.036528 178607.720643 161503.684114 -17104.036528 -17104.036528
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
4 AB 2 0.749226 44839.624931 170073.333123 159211.030958 -8484.31106 -2377.991105 170073.333123 161589.022064 167695.342018 159211.030958 -8484.31106 -2377.991105 -0.0 -8484.31106 -2377.991105
5 AC 2 0.749226 44839.624931 170073.333123 159211.030958 -8484.31106 -2377.991105 170073.333123 161589.022064 167695.342018 159211.030958 -8484.31106 -2377.991105 -0.0 -8484.31106 -2377.991105
6 BC 2 0.697364 49258.516562 161570.634319 157757.239664 -1905.715523 -1907.679133 161570.634319 159664.918797 159662.955187 157757.239664 -1905.715523 -1907.679133 0.0 -1905.715523 -1907.679133
7 ABC 3 0.797024 40340.721518 166505.127733 154715.774921 -9450.852882 -1169.601883 -1168.898047 166505.127733 157054.27485 165335.52585 155884.672968 165336.229686 155885.376803 164166.627803 154715.774921 -9450.852882 -1169.601883 -1168.898047 0.0 0.0 0.0 0.0 -9450.852882 -1169.601883 -1168.898047
──────── Test point index: 1264 | True SalePrice: 148000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1264 1081 6 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1264 ===
f_AB(S)   = 145130.74561556676
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 139746.86905410097
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 22564.59640246522
|f_AB(S)-f_A|  = 24942.587507898686
|psi_ABC(A)|   = 24667.01709686509
|f_ABC(S)-f_A| = 26758.258678418177
|ReSHAP_AB(A)|  = 22564.596402465242
|ReSHAP_ABC(A)| = 24873.172188923003

P   = -0.017185462528023887
R   = -0.024889816332786308
P/R = 0.6904616047883889
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 133118.384102 -45489.336541 178607.720643 133118.384102 -45489.336541 -45489.336541
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
4 AB 2 0.749226 44839.624931 170073.333123 145130.745616 -22564.596402 -2377.991105 170073.333123 147508.736721 167695.342018 145130.745616 -22564.596402 -2377.991105 -0.0 -22564.596402 -2377.991105
5 AC 2 0.749226 44839.624931 170073.333123 145130.745616 -22564.596402 -2377.991105 170073.333123 147508.736721 167695.342018 145130.745616 -22564.596402 -2377.991105 -0.0 -22564.596402 -2377.991105
6 BC 2 0.697364 49258.516562 161570.634319 157757.239664 -1905.715523 -1907.679133 161570.634319 159664.918797 159662.955187 157757.239664 -1905.715523 -1907.679133 0.0 -1905.715523 -1907.679133
7 ABC 3 0.797024 40340.721518 166505.127733 139746.869054 -24667.017097 -1044.850457 -1046.391125 166505.127733 141618.350094 165335.52585 140643.255451 165336.229686 140639.470279 164166.627803 139746.869054 -24419.758749 -892.601225 -896.386397 -277.000658 -272.51165 -82.493418 82.493418 -24873.172189 -940.628785 -944.457704
──────── Test point index: 218 | True SalePrice: 131500 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
218 1039 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 218 ===
f_AB(S)   = 107176.57745843756
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 122391.49038208168
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 24235.219066875557
|f_AB(S)-f_A|  = 62896.755665027886
|psi_ABC(A)|   = 19919.342460426353
|f_ABC(S)-f_A| = 44113.637350437464
|ReSHAP_AB(A)|  = 24376.977189295034
|ReSHAP_ABC(A)| = 22361.31941418904

P   = -0.06622866050383652
R   = -0.11933134680489899
P/R = 0.5549980141606647
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 128479.541298 -50128.179345 178607.720643 128479.541298 -50128.179345 -50128.179345
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 107176.577458 -24235.219067 -38661.536598 170073.333123 145207.678416 130781.360885 107176.577458 -24865.654708 -39291.972239 1260.871281 -24376.977189 -38519.778476
5 AC 2 0.749226 44839.624931 170073.333123 107176.577458 -24235.219067 -38661.536598 170073.333123 145207.678416 130781.360885 107176.577458 -24865.654708 -39291.972239 1260.871281 -24376.977189 -38519.778476
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 122391.490382 -19919.34246 -11997.839541 -12196.455349 166505.127733 139400.06891 147859.950307 128864.419544 147746.497458 128580.640778 135963.76522 122391.490382 -13572.274838 -6189.150396 -6472.929162 -5593.581841 -5423.255925 -4346.49897 -2515.946218 -22361.319414 -10674.361291 -11077.956645
──────── Test point index: 2028 | True SalePrice: 113000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2028 1173 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2028 ===
f_AB(S)   = 113257.17791230907
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 125557.42379731752
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 17524.182972371716
|f_AB(S)-f_A|  = 56816.15521115638
|psi_ABC(A)|   = 14692.145240206553
|f_ABC(S)-f_A| = 40947.70393520163
|ReSHAP_AB(A)|  = 17524.18297237171
|ReSHAP_ABC(A)| = 15222.513334088377

P   = -0.05036604188017496
R   = -0.06331836992086604
P/R = 0.795441227295039
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 143279.658814 -35328.061828 178607.720643 143279.658814 -35328.061828 -35328.061828
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 113257.177912 -17524.182972 -39291.972239 170073.333123 152549.150151 130781.360885 113257.177912 -17524.182972 -39291.972239 -0.0 -17524.182972 -39291.972239
5 AC 2 0.749226 44839.624931 170073.333123 113257.177912 -17524.182972 -39291.972239 170073.333123 152549.150151 130781.360885 113257.177912 -17524.182972 -39291.972239 -0.0 -17524.182972 -39291.972239
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 125557.423797 -14692.14524 -13045.398348 -13210.160347 166505.127733 146984.571027 147859.950307 133761.721865 147746.497458 133545.650716 135963.76522 125557.423797 -10406.341423 -7988.226919 -8204.298068 -3794.505319 -3691.887019 -5234.622243 -1627.822945 -15222.513334 -12724.294271 -13000.89633
──────── Test point index: 2726 | True SalePrice: 164000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2726 1000 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2726 ===
f_AB(S)   = 106239.2144026331
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 121415.21366990716
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 25772.24909354761
|f_AB(S)-f_A|  = 63834.11872083235
|psi_ABC(A)|   = 21253.958471905233
|f_ABC(S)-f_A| = 45089.91406261199
|ReSHAP_AB(A)|  = 26000.2851215747
|ReSHAP_ABC(A)| = 24184.396773261113

P   = -0.06763051357703631
R   = -0.12904918482485922
P/R = 0.5240677317630634
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 124172.044409 -54435.676234 178607.720643 124172.044409 -54435.676234 -54435.676234
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 106239.214403 -25772.249094 -38061.869627 170073.333123 143070.981418 130781.360885 106239.214403 -27002.351705 -39291.972239 2460.205223 -26000.285122 -37833.833599
5 AC 2 0.749226 44839.624931 170073.333123 106239.214403 -25772.249094 -38061.869627 170073.333123 143070.981418 130781.360885 106239.214403 -27002.351705 -39291.972239 2460.205223 -26000.285122 -37833.833599
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 121415.213670 -21253.958472 -11832.603317 -12003.352274 166505.127733 137543.000557 147859.950307 127666.049725 147746.497458 127438.004661 135963.76522 121415.21367 -14548.55155 -6022.790991 -6250.836055 -5759.941247 -5645.349032 -3854.15984 -3008.285347 -24184.396773 -10286.742438 -10618.774852
──────── Test point index: 1756 | True SalePrice: 170000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1756 1430 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1756 ===
f_AB(S)   = 201165.42552177262
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 196631.80100069783
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 3443.897629610554
|f_AB(S)-f_A|  = 31092.09239830717
|psi_ABC(A)|   = 3836.2633791681633
|f_ABC(S)-f_A| = 30126.67326817868
|ReSHAP_AB(A)|  = 3443.897629610554
|ReSHAP_ABC(A)| = 3836.375348695911

P   = -0.016573348030342203
R   = -0.016577064654701976
P/R = 0.9997757971970799
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 171664.958827 -6942.761816 178607.720643 171664.958827 -6942.761816 -6942.761816
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 201165.425522 -3443.89763 34535.990028 170073.333123 166629.435494 204609.323151 201165.425522 -3443.89763 34535.990028 0.0 -3443.89763 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 201165.425522 -3443.89763 34535.990028 170073.333123 166629.435494 204609.323151 201165.425522 -3443.89763 34535.990028 0.0 -3443.89763 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 196631.801001 -3836.263379 16986.579289 16976.357358 166505.127733 162668.897588 183491.465006 179655.234862 183481.243075 179645.012931 200468.13085 196631.801001 -3836.329849 16986.788069 16976.566138 0.099705 0.099705 -0.450795 -0.099705 -3836.375349 16986.635178 16976.413439
──────── Test point index: 856 | True SalePrice: 136500 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
856 864 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 856 ===
f_AB(S)   = 107787.13159503626
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 118010.76154540118
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 28723.813467415122
|f_AB(S)-f_A|  = 62286.201528429185
|psi_ABC(A)|   = 25476.2125028759
|f_ABC(S)-f_A| = 48494.366187117965
|ReSHAP_AB(A)|  = 29099.74243052245
|ReSHAP_ABC(A)| = 29814.019060136554

P   = -0.06418522120293707
R   = -0.14759941270359273
P/R = 0.43486095254208784
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 109151.029616 -69456.691027 178607.720643 109151.029616 -69456.691027 -69456.691027
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 107787.131595 -28723.813467 -33562.388061 170073.333123 135619.935478 130781.360885 107787.131595 -34453.397645 -39291.972239 11459.168356 -29099.742431 -33186.459098
5 AC 2 0.749226 44839.624931 170073.333123 107787.131595 -28723.813467 -33562.388061 170073.333123 135619.935478 130781.360885 107787.131595 -34453.397645 -39291.972239 11459.168356 -29099.742431 -33186.459098
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 118010.761545 -25476.212503 -11420.385647 -11597.768038 166505.127733 131877.961044 147859.950307 124075.412703 147746.497458 123834.10077 135963.76522 118010.761545 -17953.003675 -5823.339225 -6064.651158 -5959.393012 -5831.533929 -1979.209116 -4883.236072 -29814.01906 -9176.484097 -9503.86303
──────── Test point index: 2718 | True SalePrice: 157000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2718 1949 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2718 ===
f_AB(S)   = 155771.96945309758
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 156513.64749124637
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 24990.608568416785
|f_AB(S)-f_A|  = 14301.363670367864
|psi_ABC(A)|   = 25202.39697441313
|f_ABC(S)-f_A| = 9991.480241272773
|ReSHAP_AB(A)|  = 24990.608568416817
|ReSHAP_ABC(A)| = 392854.7642970336

P   = -0.7749603444833821
R   = -37.57154688067281
P/R = 0.02062625600549946
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 228987.802044 50380.081401 178607.720643 228987.802044 50380.081401 50380.081401
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 155771.969453 24990.608568 -39291.972239 170073.333123 195063.941692 130781.360885 155771.969453 24990.608568 -39291.972239 -0.0 24990.608568 -39291.972239
5 AC 2 0.749226 44839.624931 170073.333123 155771.969453 24990.608568 -39291.972239 170073.333123 195063.941692 130781.360885 155771.969453 24990.608568 -39291.972239 -0.0 24990.608568 -39291.972239
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 156513.647491 25202.396974 -17571.482796 -17622.39442 166505.127733 194342.693252 147859.950307 175017.152214 147746.497458 175028.781816 135963.76522 156513.647491 20549.882271 -18515.134325 -18503.504722 6732.402087 6607.319635 -810.406714 -6052.038473 392854.764297 -203362.478359 -199483.76618
──────── Test point index: 283 | True SalePrice: 80000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
283 924 4 4
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 283 ===
f_AB(S)   = 117530.76394515927
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 104519.89525481508
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 5413.200493756005
|f_AB(S)-f_A|  = 52542.56917830618
|psi_ABC(A)|   = 20989.973950620544
|f_ABC(S)-f_A| = 61985.23247770406
|ReSHAP_AB(A)|  = 15738.338811558964
|ReSHAP_ABC(A)| = 20955.81562463015

P   = -0.23560357963658285
R   = -0.03854254477715763
P/R = 6.1128184710889695
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 115777.947907 -62829.772736 178607.720643 115777.947907 -62829.772736 -62829.772736
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
4 AB 2 0.749226 44839.624931 170073.333123 117530.763945 -5413.200494 -47129.368685 170073.333123 138907.161628 97190.993438 117530.763945 -31166.171495 -72882.339686 51505.942003 -15738.338812 -36804.230367
5 AC 2 0.749226 44839.624931 170073.333123 117530.763945 -5413.200494 -47129.368685 170073.333123 138907.161628 97190.993438 117530.763945 -31166.171495 -72882.339686 51505.942003 -15738.338812 -36804.230367
6 BC 2 0.697364 49258.516562 161570.634319 123838.112811 -18897.059703 -18835.461806 161570.634319 120942.918792 121004.516689 123838.112811 -40627.715527 -40566.117631 43461.311649 -18880.573688 -18851.94782
7 ABC 3 0.797024 40340.721518 166505.127733 104519.895255 -20989.973951 -20347.771309 -20647.487218 166505.127733 134343.07297 136767.390763 120091.365391 136359.403222 119899.921114 118760.008604 104519.895255 -14240.113349 -15380.025859 -15571.470136 -2219.368759 -2435.912023 1128.318279 -13266.660631 -20955.815625 -20321.04714 -20708.369713
──────── Test point index: 2525 | True SalePrice: 262500 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2525 2156 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2525 ===
f_AB(S)   = 240940.86193751023
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 241370.32680005959
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 36331.53878612707
|f_AB(S)-f_A|  = 70867.52881404478
|psi_ABC(A)|   = 40614.376437116625
|f_ABC(S)-f_A| = 74865.19906754044
|ReSHAP_AB(A)|  = 36331.53878612707
|ReSHAP_ABC(A)| = 40530.30092353466

P   = -0.029831733326994292
R   = -0.02870870801444403
P/R = 1.0391179328580458
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 251850.670148 73242.949505 178607.720643 251850.670148 73242.949505 73242.949505
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
4 AB 2 0.749226 44839.624931 170073.333123 240940.861938 36331.538786 34535.990028 170073.333123 206404.87191 204609.323151 240940.861938 36331.538786 34535.990028 -0.0 36331.538786 34535.990028
5 AC 2 0.749226 44839.624931 170073.333123 240940.861938 36331.538786 34535.990028 170073.333123 206404.87191 204609.323151 240940.861938 36331.538786 34535.990028 -0.0 36331.538786 34535.990028
6 BC 2 0.697364 49258.516562 161570.634319 216953.246765 27677.039076 27705.573369 161570.634319 189247.681614 189276.215906 216953.246765 27677.047294 27705.581587 -0.016436 27677.039081 27705.573365
7 ABC 3 0.797024 40340.721518 166505.127733 241370.326800 40614.376437 17130.522281 17120.30035 166505.127733 206975.594413 183491.465006 223961.931687 183481.243075 223951.709756 200468.13085 241370.3268 40902.19595 17418.617044 17408.395113 -431.729269 -431.729269 -432.27977 431.729269 40530.300924 17172.499183 17162.398961
──────── Test point index: 527 | True SalePrice: 264500 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
527 1836 8 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 527 ===
f_AB(S)   = 260322.97027053632
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 272729.88215882535
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 18799.66598580194
|f_AB(S)-f_A|  = 90249.63714707087
|psi_ABC(A)|   = 21114.83015383681
|f_ABC(S)-f_A| = 106224.7544263062
|ReSHAP_AB(A)|  = 18799.66598580195
|ReSHAP_ABC(A)| = 21149.760558894763

P   = 0.009532342589317316
R   = 0.009203507702886221
P/R = 1.0357292998546601
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 216507.105929 37899.385286 178607.720643 216507.105929 37899.385286 37899.385286
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
4 AB 2 0.749226 44839.624931 170073.333123 260322.970271 18799.665986 71449.971161 170073.333123 188872.999109 241523.304285 260322.970271 18799.665986 71449.971161 -0.0 18799.665986 71449.971161
5 AC 2 0.749226 44839.624931 170073.333123 260322.970271 18799.665986 71449.971161 170073.333123 188872.999109 241523.304285 260322.970271 18799.665986 71449.971161 -0.0 18799.665986 71449.971161
6 BC 2 0.697364 49258.516562 161570.634319 276370.891790 57370.593657 57429.663814 161570.634319 218830.444431 218889.514587 276370.89179 57259.810111 57318.880268 221.567092 57370.536544 57429.720927
7 ABC 3 0.797024 40340.721518 166505.127733 272729.882159 21114.830154 42623.126813 42486.797459 166505.127733 187446.471846 202112.051547 223313.624721 201975.722193 223177.295367 251528.308985 272729.882159 21201.573174 49552.586792 49416.257438 0.0 0.0 -13685.433917 -260.229061 21149.760559 42596.092238 42478.90163
──────── Test point index: 2421 | True SalePrice: 215000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
2421 1456 8 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2421 ===
f_AB(S)   = 239503.87132015018
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 249250.86661946942
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 2019.4329645841935
|f_AB(S)-f_A|  = 69430.53819668473
|psi_ABC(A)|   = 2268.124555322264
|f_ABC(S)-f_A| = 82745.73888695028
|ReSHAP_AB(A)|  = 2019.432964584178
|ReSHAP_ABC(A)| = 2278.211998114703

P   = 0.0016748874219854847
R   = 0.0015529785124978393
P/R = 1.0785000619819038
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 174536.623420 -4071.097223 178607.720643 174536.62342 -4071.097223 -4071.097223
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
4 AB 2 0.749226 44839.624931 170073.333123 239503.871320 -2019.432965 71449.971161 170073.333123 168053.900159 241523.304285 239503.87132 -2019.432965 71449.971161 -0.0 -2019.432965 71449.971161
5 AC 2 0.749226 44839.624931 170073.333123 239503.871320 -2019.432965 71449.971161 170073.333123 168053.900159 241523.304285 239503.87132 -2019.432965 71449.971161 -0.0 -2019.432965 71449.971161
6 BC 2 0.697364 49258.516562 161570.634319 276370.891790 57370.593657 57429.663814 161570.634319 218830.444431 218889.514587 276370.89179 57259.810111 57318.880268 221.567092 57370.536544 57429.720927
7 ABC 3 0.797024 40340.721518 166505.127733 249250.866619 -2268.124555 42575.096398 42438.767044 166505.127733 164255.638797 202112.051547 199834.609182 201975.722193 199698.279827 251528.308985 249250.866619 -2277.442365 49552.586792 49416.257438 0.0 0.0 -13973.616408 27.95343 -2278.211998 42570.535591 42453.415295
──────── Test point index: 1509 | True SalePrice: 135000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1509 1470 4 4
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1509 ===
f_AB(S)   = 96953.97560639406
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 118187.76922525826
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 744.7156803612088
|f_AB(S)-f_A|  = 73119.35751707139
|psi_ABC(A)|   = 946.147878412036
|f_ABC(S)-f_A| = 48317.358507260884
|ReSHAP_AB(A)|  = 1235.2596947568961
|ReSHAP_ABC(A)| = 593.9164700238595

P   = -0.009397013499012903
R   = 0.004601754191655288
P/R = -2.042050293788665
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 176082.904354 -2524.816288 178607.720643 176082.904354 -2524.816288 -2524.816288
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
4 AB 2 0.749226 44839.624931 170073.333123 96953.975606 -744.71568 -72374.641837 170073.333123 168820.919594 97190.993438 96953.975606 -1252.41353 -72882.339686 1015.395698 -1235.259695 -71884.097822
5 AC 2 0.749226 44839.624931 170073.333123 96953.975606 -744.71568 -72374.641837 170073.333123 168820.919594 97190.993438 96953.975606 -1252.41353 -72882.339686 1015.395698 -1235.259695 -71884.097822
6 BC 2 0.697364 49258.516562 161570.634319 123838.112811 -18897.059703 -18835.461806 161570.634319 120942.918792 121004.516689 123838.112811 -40627.715527 -40566.117631 43461.311649 -18880.573688 -18851.94782
7 ABC 3 0.797024 40340.721518 166505.127733 118187.769225 -946.147878 -23492.755426 -23878.455202 166505.127733 165110.037909 136767.390763 135873.988564 136359.403222 135510.576554 118760.008604 118187.769225 -572.239379 -17322.807329 -17686.219339 -276.587289 -321.16282 -11913.242016 -225.100336 -593.91647 -23595.263539 -24128.178498
──────── Test point index: 1662 | True SalePrice: 139000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1662 1109 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1662 ===
f_AB(S)   = 109992.72067384052
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 124098.97227225179
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 20909.59887163851
|f_AB(S)-f_A|  = 60080.612449624925
|psi_ABC(A)|   = 17265.180981127167
|f_ABC(S)-f_A| = 42406.15546026736
|ReSHAP_AB(A)|  = 20946.2166374308
|ReSHAP_ABC(A)| = 18666.404645923336

P   = -0.05911283016922192
R   = -0.0915462838346684
P/R = 0.6457152348857661
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 136210.945971 -42396.774672 178607.720643 136210.945971 -42396.774672 -42396.774672
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 109992.720674 -20909.598872 -39171.013578 170073.333123 149042.775591 130781.360885 109992.720674 -21030.557532 -39291.972239 241.917322 -20946.216637 -39134.395812
5 AC 2 0.749226 44839.624931 170073.333123 109992.720674 -20909.598872 -39171.013578 170073.333123 149042.775591 130781.360885 109992.720674 -21030.557532 -39291.972239 241.917322 -20946.216637 -39134.395812
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 124098.972272 -17265.180981 -12495.388399 -12645.58608 166505.127733 143171.360402 147859.950307 131299.712474 147746.497458 131112.769962 135963.76522 124098.972272 -11864.792948 -7013.797689 -7200.740202 -4768.934548 -4695.444885 -4857.850238 -2004.59495 -18666.404646 -11737.380832 -12002.369982
──────── Test point index: 594 | True SalePrice: 143000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
594 1568 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 594 ===
f_AB(S)   = 134898.08340021042
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 138933.11103267202
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 4116.722515529611
|f_AB(S)-f_A|  = 35175.24972325502
|psi_ABC(A)|   = 3857.704019300853
|f_ABC(S)-f_A| = 27572.016699847125
|ReSHAP_AB(A)|  = 4116.722515529602
|ReSHAP_ABC(A)| = 2383.019544988055

P   = -0.022879106487948123
R   = 0.030605718012429006
P/R = -0.7475435302206241
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 186906.870896 8299.150254 178607.720643 186906.870896 8299.150254 8299.150254
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 134898.083400 4116.722516 -39291.972239 170073.333123 174190.055639 130781.360885 134898.0834 4116.722516 -39291.972239 0.0 4116.722516 -39291.972239
5 AC 2 0.749226 44839.624931 170073.333123 134898.083400 4116.722516 -39291.972239 170073.333123 174190.055639 130781.360885 134898.0834 4116.722516 -39291.972239 0.0 4116.722516 -39291.972239
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 138933.111033 3857.704019 -15685.284697 -15744.436022 166505.127733 171090.831696 147859.950307 151823.711065 147746.497458 151818.861264 135963.76522 138933.111033 2969.345813 -12885.750232 -12890.600032 1103.017994 994.414945 -6381.370399 -481.074789 2383.019545 -14902.55064 -15052.485605
──────── Test point index: 1362 | True SalePrice: 120000 ────────
Gr Liv Area Overall Qual Overall Qual (DUP)
1362 1308 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1362 ===
f_AB(S)   = 120653.43674994627
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 129345.99699343345
f_ABC(A)  = 166505.12773251915
|psi_AB(A)|    = 10127.924134734545
|f_AB(S)-f_A|  = 49419.89637351918
|psi_ABC(A)|   = 8838.133367002074
|f_ABC(S)-f_A| = 37159.1307390857
|ReSHAP_AB(A)|  = 10127.924134734549
|ReSHAP_ABC(A)| = 8375.059015912975

P   = -0.032909369007184
R   = -0.020447442641344182
P/R = 1.6094613680755427
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 158190.224969 -20417.495674 178607.720643 158190.224969 -20417.495674 -20417.495674
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
4 AB 2 0.749226 44839.624931 170073.333123 120653.436750 -10127.924135 -39291.972239 170073.333123 159945.408989 130781.360885 120653.43675 -10127.924135 -39291.972239 0.0 -10127.924135 -39291.972239
5 AC 2 0.749226 44839.624931 170073.333123 120653.436750 -10127.924135 -39291.972239 170073.333123 159945.408989 130781.360885 120653.43675 -10127.924135 -39291.972239 0.0 -10127.924135 -39291.972239
6 BC 2 0.697364 49258.516562 161570.634319 121067.551636 -20235.310692 -20267.771992 161570.634319 130082.172203 130049.710903 121067.551636 -31488.462117 -31520.923416 22506.302849 -20241.108113 -20261.974571
7 ABC 3 0.797024 40340.721518 166505.127733 129345.996993 -8838.133367 -14072.17369 -14248.823682 166505.127733 155223.41961 147859.950307 139308.223698 147746.497458 139068.376564 135963.76522 129345.996993 -6617.768227 -9722.37957 -9962.226705 -2060.352667 -1933.958382 -6192.816341 -669.628847 -8375.059016 -14264.014043 -14520.05768
================ Overall Summary (averaged over test points) ================
mean(|P/R|) n_points
Model
MLP 5.49048 100
In [156]:
# ============================================================
# 6) Settings & Run (edit here, then run this cell)
# ============================================================

SETTINGS = dict(
    # Data slice / feature modes:
    MODE="totrms",           # 'two' | 'dup_grliv' | 'dup_qual' | 'totrms'

    # Baseline used inside each subset for f(∅):
    BASELINE="mean",           # 'mean' | 'median' | 'zero'

    # Randomness & test row:
    RANDOM_STATE=42,
    PICK_RANDOM_POINT=True,    # True → draw a random test row (with the seed)
    N_RANDOM_POINTS=100,         # ← NEW: number of distinct random test points to assess
    CUSTOM_TEST_INDEX=None,    # if PICK_RANDOM_POINT=False, use this test index (must be in X_te)

    # Which models to run:
    USE_LINEAR=False,
    USE_MLP=True,
    USE_LGB=False,              # will only run if LightGBM is installed
    USE_XGB=False,              # will only run if XGBoost is installed

    # MLP hyperparameters (used if USE_MLP=True):
    MLP_hidden=(64, 32),
    MLP_max_iter=5000,
    MLP_early_stopping=True,
    MLP_n_iter_no_change=20,
    MLP_tol=1e-4,
    MLP_lr=1e-3,
    MLP_alpha=1e-4,

    # Warnings:
    SILENCE_MLP_WARNINGS=True
)

# ---- Go! (expects df_ames to already be in memory) ----
run_pipeline(SETTINGS)
Mode: totrms | Features (A,B,C): [('A', 'Gr Liv Area'), ('B', 'Overall Qual'), ('C', 'TotRms AbvGrd')]
Evaluating 100 test point(s): [1093, 1032, 2783, 411, 1385, 1577, 1427, 518, 2181, 1880, 961, 1476, 1423, 1398, 2212, 96, 1080, 2044, 1397, 2786, 2233, 2684, 2359, 2227, 781, 2351, 1873, 1337, 2550, 211, 1557, 1910, 881, 554, 1462, 87, 2652, 478, 1505, 1283, 485, 134, 2605, 56, 282, 2240, 2111, 581, 1189, 70, 1878, 259, 495, 1506, 2841, 1608, 2916, 1691, 2394, 188, 1117, 555, 175, 366, 456, 506, 1538, 2381, 1820, 1600, 1486, 1221, 651, 1465, 120, 1058, 2659, 755, 1226, 2873, 2554, 1364, 291, 2662, 2834, 1264, 218, 2028, 2726, 1756, 856, 2718, 283, 2525, 527, 2421, 1509, 1662, 594, 1362]

──────── Test point index: 1093 | True SalePrice: 226750 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1093 1892 7 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1093 ===
f_AB(S)   = 226477.06687724203
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 227390.7173978073
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34535.99002791774
|f_AB(S)-f_A|  = 56403.733753776585
|psi_ABC(A)|   = 37189.86674034053
|f_ABC(S)-f_A| = 60992.0553323206
|ReSHAP_AB(A)|  = 34535.99002791774
|ReSHAP_ABC(A)| = 36638.014587376914

P   = 0.0025503530835897514
R   = 0.011598288096712372
P/R = 0.21989047541530457
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 222692.229667 44084.509025 178607.720643 222692.229667 44084.509025 44084.509025
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 217665.882703 39829.924028 177835.958675 217665.882703 39829.924028 39829.924028
4 AB 2 0.749226 44839.624931 170073.333123 226477.066877 21867.743726 34535.990028 170073.333123 191941.076849 204609.323151 226477.066877 21867.743726 34535.990028 0.0 21867.743726 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 215711.861930 50861.384877 -13711.683054 178562.160107 229423.544984 164850.477053 215711.86193 50861.384877 -13711.683054 -0.0 50861.384877 -13711.683054
6 BC 2 0.753073 44494.356934 172248.490907 224013.701667 37021.836446 14743.374314 172248.490907 208639.482439 186361.020308 224013.701667 36390.991532 14112.529401 1261.689827 37300.119113 14465.091647
7 ABC 3 0.786106 41411.391808 166398.662065 227390.717398 29113.368662 37189.86674 -5311.18007 166398.662065 195540.60821 203189.349614 232274.140795 160659.725322 189801.671466 198305.926217 227390.717398 29084.79118 37589.045932 -4883.423397 57.154964 0.0 -855.513347 0.0 29109.723602 36638.014587 -4755.682857
──────── Test point index: 1032 | True SalePrice: 306000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1032 2340 8 9
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1032 ===
f_AB(S)   = 293320.0938323849
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 300504.07793133816
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 74142.18311193716
|f_AB(S)-f_A|  = 123246.76070891946
|psi_ABC(A)|   = 80964.56142862815
|f_ABC(S)-f_A| = 134105.41586585145
|ReSHAP_AB(A)|  = 74714.0921934067
|ReSHAP_ABC(A)| = 80125.87757910366

P   = -0.0021630870761817356
R   = 0.008731185671548292
P/R = -0.2477426500309617
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 272173.219573 93565.49893 178607.720643 272173.219573 93565.49893 93565.49893
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.292963 75290.806037 177835.958675 242702.445299 64866.486624 177835.958675 242702.445299 64866.486624 64866.486624
4 AB 2 0.749226 44839.624931 170073.333123 293320.093832 49104.577597 74142.183112 170073.333123 216485.69877 241523.304285 293320.093832 46412.365646 71449.971161 5384.423901 48532.668516 74714.092193
5 AC 2 0.520030 62033.710779 178562.160107 264180.317516 107948.822785 -22330.665376 178562.160107 286510.982892 156231.494731 264180.317516 107948.822785 -22330.665376 -0.0 107948.822785 -22330.665376
6 BC 2 0.753073 44494.356934 172248.490907 297600.953652 91592.161818 33760.300928 172248.490907 253063.830223 195231.969333 297600.953652 80815.339316 22983.478426 21553.645004 97596.50477 27755.957976
7 ABC 3 0.786106 41411.391808 166398.662065 300504.077931 61790.575612 80964.561429 -8649.721175 166398.662065 228249.883529 246727.241511 308457.156582 157052.305711 218903.527175 238774.140826 300504.077931 61729.937105 81600.550757 -7953.078651 121.284358 -0.022033 -1393.277703 0.022033 61782.157018 80125.877579 -7802.618732
──────── Test point index: 2783 | True SalePrice: 174000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2783 1282 6 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2783 ===
f_AB(S)   = 156142.95321827102
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 150684.75613624236
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 2377.9911054334516
|f_AB(S)-f_A|  = 13930.379905194422
|psi_ABC(A)|   = 2307.32369244559
|f_ABC(S)-f_A| = 15713.905929244356
|ReSHAP_AB(A)|  = 2377.991105433442
|ReSHAP_ABC(A)| = 2746.09175208566

P   = 0.023872164060555784
R   = -0.0040501145642329595
P/R = -5.894194764605842
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 155318.560376 -23289.160267 178607.720643 155318.560376 -23289.160267 -23289.160267
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 156142.953218 -11552.3888 -2377.991105 170073.333123 158520.944324 167695.342018 156142.953218 -11552.3888 -2377.991105 -0.0 -11552.3888 -2377.991105
5 AC 2 0.520030 62033.710779 178562.160107 155219.163350 -26869.278347 3526.28159 178562.160107 151692.88176 182088.441697 155219.16335 -26869.278347 3526.28159 -0.0 -26869.278347 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 166113.404190 -2505.718067 -3629.36865 172248.490907 169742.77284 168619.122257 166113.40419 -2505.718067 -3629.36865 0.0 -2505.718067 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 150684.756136 -14985.381285 -2307.323692 1578.799048 166398.662065 151134.985759 163880.012288 148967.132883 167874.564543 152744.82611 165355.914765 150684.756136 -14671.158629 -2060.069973 1717.623253 -458.579804 -241.720776 -107.782902 107.782902 -15242.497951 -2746.091752 2274.683774
──────── Test point index: 411 | True SalePrice: 123000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
411 1055 7 4
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 411 ===
f_AB(S)   = 180620.26208389158
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 179699.29508187875
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34535.99002791775
|f_AB(S)-f_A|  = 10546.928960426128
|psi_ABC(A)|   = 33361.450634454224
|f_ABC(S)-f_A| = 13300.633016392036
|ReSHAP_AB(A)|  = 34535.99002791791
|ReSHAP_ABC(A)| = 28061.353522773963

P   = 0.7662463800014185
R   = 1.1647309562729755
P/R = 0.657874143272822
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 130246.719509 -48361.001134 178607.720643 130246.719509 -48361.001134 -48361.001134
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 121774.215974 -56061.742701 177835.958675 121774.215974 -56061.742701 -56061.742701
4 AB 2 0.749226 44839.624931 170073.333123 180620.262084 -23989.061067 34535.990028 170073.333123 146084.272056 204609.323151 180620.262084 -23989.061067 34535.990028 0.0 -23989.061067 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 143531.127090 -55795.279251 20764.246234 178562.160107 122766.880856 199326.406341 143531.12709 -55795.279251 20764.246234 -0.0 -55795.279251 20764.246234
6 BC 2 0.753073 44494.356934 172248.490907 187268.215739 36390.991532 -21371.266701 172248.490907 208639.482439 150877.224206 187268.215739 36390.991532 -21371.266701 0.0 36390.991532 -21371.266701
7 ABC 3 0.786106 41411.391808 166398.662065 179699.295082 -29266.33181 33361.450634 9205.514191 166398.662065 138530.777358 203189.349614 171283.139239 175089.403763 151071.212204 211668.204835 179699.295082 -31968.909753 28628.082878 8416.155843 7950.718193 62.699378 4124.279002 -3912.392525 34152.042926 -28061.353523 7209.943613
──────── Test point index: 1385 | True SalePrice: 108959 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1385 1258 4 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1385 ===
f_AB(S)   = 102256.39318916843
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 109160.35647912517
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 63916.000180215284
|f_AB(S)-f_A|  = 67816.93993429701
|psi_ABC(A)|   = 45194.891384522685
|f_ABC(S)-f_A| = 57238.305586361545
|ReSHAP_AB(A)|  = 57640.57395847591
|ReSHAP_ABC(A)| = 47250.44913395942

P   = 0.15288668080009515
R   = 0.024439617256626023
P/R = 6.255690471529165
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 152667.793060 -25939.927583 178607.720643 152667.79306 -25939.927583 -25939.927583
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 102256.393189 -3900.939754 -63916.00018 170073.333123 157206.053864 97190.993438 102256.393189 -12867.27926 -72882.339686 17932.679011 -10176.365976 -57640.573958
5 AC 2 0.520030 62033.710779 178562.160107 152160.907748 -29927.533949 3526.28159 178562.160107 148634.626158 182088.441697 152160.907748 -29927.533949 3526.28159 -0.0 -29927.533949 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 95427.443060 -74263.509264 -2557.538583 172248.490907 96913.151575 168619.122257 95427.44306 -75335.339331 -3629.36865 2143.660135 -73290.205907 -3530.84194
7 ABC 3 0.786106 41411.391808 166398.662065 109160.356479 -13759.826737 -45194.891385 1716.412535 166398.662065 149516.86715 117851.808625 107400.555541 167874.564543 151200.407462 119995.336691 109160.356479 -10834.980212 -42040.050982 1759.800938 -5839.176869 383.727127 -76.260627 -591.364961 -11694.208615 -47250.449134 1706.352162
──────── Test point index: 1577 | True SalePrice: 167500 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1577 1394 5 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1577 ===
f_AB(S)   = 125365.12756503362
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 129540.85544750438
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 39291.97223878465
|f_AB(S)-f_A|  = 44708.20555843183
|psi_ABC(A)|   = 28536.740727617955
|f_ABC(S)-f_A| = 36857.806617982336
|ReSHAP_AB(A)|  = 39291.972238784656
|ReSHAP_ABC(A)| = 29262.746622977957

P   = 0.10461499600429047
R   = 0.08491751636626477
P/R = 1.2319601476928137
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 167688.807853 -10918.91279 178607.720643 167688.807853 -10918.91279 -10918.91279
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 125365.127565 -5416.23332 -39291.972239 170073.333123 164657.099804 130781.360885 125365.127565 -5416.23332 -39291.972239 -0.0 -5416.23332 -39291.972239
5 AC 2 0.520030 62033.710779 178562.160107 160872.040505 -12597.41887 -5092.700732 178562.160107 165964.741237 173469.459375 160872.040505 -12597.41887 -5092.700732 0.0 -12597.41887 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 136087.643616 -41402.427667 5241.580375 172248.490907 130846.06324 177490.071282 136087.643616 -41402.427667 5241.580375 0.0 -41402.427667 5241.580375
7 ABC 3 0.786106 41411.391808 166398.662065 129540.855448 -5707.452073 -28536.740728 -2613.613817 166398.662065 159180.744127 136833.552157 132636.565949 164267.144932 157049.226994 133737.841655 129540.855448 -4196.986208 -27508.371547 -3095.710501 -3020.93173 0.0 964.193368 -0.0 -4596.881043 -29262.746623 -2998.178952
──────── Test point index: 1427 | True SalePrice: 228000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1427 2097 7 9
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1427 ===
f_AB(S)   = 237708.4228899503
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 239259.1210672805
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34535.99002791772
|f_AB(S)-f_A|  = 67635.08976648486
|psi_ABC(A)|   = 37444.07141553455
|f_ABC(S)-f_A| = 72860.45900179379
|ReSHAP_AB(A)|  = 34535.99002791772
|ReSHAP_ABC(A)| = 36376.71898540872

P   = -0.0032924710149698777
R   = 0.011356797528610851
P/R = -0.2899119233811513
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 245334.200495 66726.479852 178607.720643 245334.200495 66726.479852 66726.479852
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 242702.445299 64866.486624 177835.958675 242702.445299 64866.486624 64866.486624
4 AB 2 0.749226 44839.624931 170073.333123 237708.422890 33099.099739 34535.990028 170073.333123 203172.432862 204609.323151 237708.42289 33099.099739 34535.990028 0.0 33099.099739 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 233215.479543 76983.984812 -22330.665376 178562.160107 255546.144919 156231.494731 233215.479543 76983.984812 -22330.665376 -0.0 76983.984812 -22330.665376
6 BC 2 0.753073 44494.356934 172248.490907 239102.818452 40130.920326 26723.40722 172248.490907 208639.482439 195231.969333 239102.818452 36390.991532 22983.478426 7479.857587 40975.443979 25878.883566
7 ABC 3 0.786106 41411.391808 166398.662065 239259.121067 44066.105089 37444.071416 -8649.717503 166398.662065 210508.022139 203189.349614 247212.199718 157052.305711 201161.665785 195236.270963 239259.121067 44022.850104 38097.455282 -7953.078651 86.50997 -0.0 -1393.277703 0.0 44069.226154 36376.718985 -7585.486138
──────── Test point index: 518 | True SalePrice: 229456 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
518 1314 8 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 518 ===
f_AB(S)   = 231724.10276500593
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 234949.88049571522
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 71449.97116126894
|f_AB(S)-f_A|  = 61650.76964154048
|psi_ABC(A)|   = 80198.75696310816
|f_ABC(S)-f_A| = 68551.2184302285
|ReSHAP_AB(A)|  = 71449.97116126894
|ReSHAP_ABC(A)| = 80323.29346066226

P   = -0.010963069265601089
R   = -0.012779761957662394
P/R = 0.8578461243582032
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 158852.916798 -19754.803845 178607.720643 158852.916798 -19754.803845 -19754.803845
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 231724.102765 -9799.20152 71449.971161 170073.333123 160274.131604 241523.304285 231724.102765 -9799.20152 71449.971161 0.0 -9799.20152 71449.971161
5 AC 2 0.520030 62033.710779 178562.160107 159296.837486 -22791.604211 3526.28159 178562.160107 155770.555896 182088.441697 159296.837486 -22791.604211 3526.28159 -0.0 -22791.604211 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 247204.347482 79700.28227 -4744.425695 172248.490907 253063.830223 168619.122257 247204.347482 80815.339316 -3629.36865 -2230.114091 78480.362824 -3524.506249
7 ABC 3 0.786106 41411.391808 166398.662065 234949.880496 -13026.504801 80198.756963 1378.966268 166398.662065 153352.757642 246727.241511 233693.993385 167874.564543 154907.088962 247983.128622 234949.880496 -13033.248126 80042.791534 1255.887111 65.772545 0.0 298.444208 -78.428843 -13031.131977 80323.293461 1259.056947
──────── Test point index: 2181 | True SalePrice: 184750 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2181 4676 10 11
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2181 ===
f_AB(S)   = 627703.3582424738
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 551661.5543637843
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 173815.00390692742
|f_AB(S)-f_A|  = 457630.0251190084
|psi_ABC(A)|   = 168410.84220926402
|f_ABC(S)-f_A| = 385262.89229829755
|ReSHAP_AB(A)|  = 166691.45066581471
|ReSHAP_ABC(A)| = 166916.96055414234

P   = -0.05731670961646296
R   = -0.06900532941480697
P/R = 0.8306127961786688
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 539859.565336 361251.844693 178607.720643 539859.565336 361251.844693 361251.844693
2 B 1 0.653707 52691.751258 167830.652805 357997.585228 190166.932423 167830.652805 357997.585228 190166.932423 190166.932423
3 C 1 0.292963 75290.806037 177835.958675 292775.570492 114939.611817 177835.958675 292775.570492 114939.611817 114939.611817
4 AB 2 0.749226 44839.624931 170073.333123 627703.358242 283815.021212 173815.003907 170073.333123 427650.70996 317650.692654 627703.358242 257577.376836 147577.359531 52475.288752 290938.574453 166691.450666
5 AC 2 0.520030 62033.710779 178562.160107 544612.564823 405257.464227 -39207.059511 178562.160107 584181.199765 139716.676027 544612.564823 405619.039658 -38845.48408 -723.150861 404819.298909 -38768.894193
6 BC 2 0.753073 44494.356934 172248.490907 449973.929119 213372.304348 64353.133863 172248.490907 368768.996693 219749.826208 449973.929119 196520.505786 47501.335301 33703.597125 223663.354657 54062.083555
7 ABC 3 0.786106 41411.391808 166398.662065 551661.554364 232178.895794 168410.842209 -15326.845705 166398.662065 398805.39055 333803.025303 565753.943523 149837.398023 382244.194975 319710.453053 551661.554364 231951.10131 169417.359389 -14092.389159 455.695642 -0.18309 -2468.806416 0.114622 232214.32228 166916.960554 -13868.390536
──────── Test point index: 1880 | True SalePrice: 139900 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1880 1422 5 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1880 ===
f_AB(S)   = 126899.16643506207
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 135968.8057261776
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 39291.97223878466
|f_AB(S)-f_A|  = 43174.16668840338
|psi_ABC(A)|   = 28148.640247220817
|f_ABC(S)-f_A| = 30429.856339309103
|ReSHAP_AB(A)|  = 39291.97223878468
|ReSHAP_ABC(A)| = 29404.28696903205

P   = -0.01495300613193451
R   = -0.056216648911252665
P/R = 0.2659889271511064
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 170781.369722 -7826.350921 178607.720643 170781.369722 -7826.350921 -7826.350921
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 126899.166435 -3882.19445 -39291.972239 170073.333123 166191.138674 130781.360885 126899.166435 -3882.19445 -39291.972239 -0.0 -3882.19445 -39291.972239
5 AC 2 0.520030 62033.710779 178562.160107 173058.987696 -9029.454001 3526.28159 178562.160107 169532.706106 182088.441697 173058.987696 -9029.454001 3526.28159 0.0 -9029.454001 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 127216.694590 -41402.427667 -3629.36865 172248.490907 130846.06324 168619.122257 127216.69459 -41402.427667 -3629.36865 0.0 -41402.427667 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 135968.805726 -4090.931363 -28148.640247 1809.715271 166398.662065 161225.073835 136833.552157 133825.277661 167874.564543 162700.976312 138977.080222 135968.805726 -3008.274496 -26732.170586 2143.528066 -2165.313734 0.0 -667.625588 -0.0 -3227.298054 -29404.286969 2201.728683
──────── Test point index: 961 | True SalePrice: 201000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
961 1466 8 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 961 ===
f_AB(S)   = 240051.74234516037
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 242956.24941910134
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 71449.97116126894
|f_AB(S)-f_A|  = 69978.40922169492
|psi_ABC(A)|   = 80489.37702549108
|f_ABC(S)-f_A| = 76557.58735361462
|ReSHAP_AB(A)|  = 71449.97116126894
|ReSHAP_ABC(A)| = 80321.17341829235

P   = -0.03032848336130267
R   = -0.0281313973112709
P/R = 1.0781008502962455
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 175641.109802 -2966.610841 178607.720643 175641.109802 -2966.610841 -2966.610841
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 240051.742345 -1471.56194 71449.971161 170073.333123 168601.771184 241523.304285 240051.742345 -1471.56194 71449.971161 0.0 -1471.56194 71449.971161
5 AC 2 0.520030 62033.710779 178562.160107 170046.807312 -3422.652063 -5092.700732 178562.160107 175139.508044 173469.459375 170046.807312 -3422.652063 -5092.700732 0.0 -3422.652063 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 262265.809925 82795.538979 7221.780039 172248.490907 253063.830223 177490.071282 262265.809925 80815.339316 5241.580375 3960.399327 84534.517466 5482.801552
7 ABC 3 0.786106 41411.391808 166398.662065 242956.249419 -1959.147034 80489.377025 -1972.642638 166398.662065 164437.591946 246727.241511 244770.017562 164267.144932 162306.074813 244913.473368 242956.249419 -1957.223949 80650.174606 -1813.768143 -3.84617 -0.0 -317.74899 0.0 -1957.128288 80321.173418 -1806.457777
──────── Test point index: 1476 | True SalePrice: 185900 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1476 1582 7 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1476 ===
f_AB(S)   = 209493.06510192703
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 210961.66142616406
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34535.99002791772
|f_AB(S)-f_A|  = 39419.73197846158
|psi_ABC(A)|   = 36682.61689912642
|f_ABC(S)-f_A| = 44562.99936067735
|ReSHAP_AB(A)|  = 34535.99002791772
|ReSHAP_ABC(A)| = 36781.742792495315

P   = 0.05294610007747347
R   = 0.05072170102913731
P/R = 1.04385497732141
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 188453.151831 9845.431188 178607.720643 188453.151831 9845.431188 9845.431188
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 209493.065102 4883.741951 34535.990028 170073.333123 174957.075074 204609.323151 209493.065102 4883.741951 34535.990028 0.0 4883.741951 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 193447.358378 11358.916681 3526.28159 178562.160107 189921.076788 182088.441697 193447.358378 11358.916681 3526.28159 -0.0 11358.916681 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 205010.113789 36390.991532 -3629.36865 172248.490907 208639.482439 168619.122257 205010.113789 36390.991532 -3629.36865 0.0 36390.991532 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 210961.661426 6506.168401 36682.616899 1374.214061 166398.662065 172906.957877 203189.349614 209684.880959 167874.564543 174382.860354 204453.365614 210961.661426 6508.295812 36578.801072 1276.780467 0.0 -12.764467 199.12201 12.764467 6499.486453 36781.742792 1281.770115
──────── Test point index: 1423 | True SalePrice: 177439 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1423 1356 7 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1423 ===
f_AB(S)   = 197111.1799366974
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 194472.46896438987
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34535.99002791772
|f_AB(S)-f_A|  = 27037.84681323194
|psi_ABC(A)|   = 36691.833499260276
|f_ABC(S)-f_A| = 28073.80689890316
|ReSHAP_AB(A)|  = 34535.99002791772
|ReSHAP_ABC(A)| = 36776.56483033887

P   = -0.02965720498652491
R   = -0.03267536819409145
P/R = 0.9076318531549922
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 163491.759602 -15115.961041 178607.720643 163491.759602 -15115.961041 -15115.961041
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 197111.179937 -7498.143215 34535.990028 170073.333123 162575.189909 204609.323151 197111.179937 -7498.143215 34535.990028 0.0 -7498.143215 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 164648.784790 -17439.656907 3526.28159 178562.160107 161122.5032 182088.441697 164648.78479 -17439.656907 3526.28159 -0.0 -17439.656907 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 205010.113789 36390.991532 -3629.36865 172248.490907 208639.482439 168619.122257 205010.113789 36390.991532 -3629.36865 0.0 36390.991532 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 194472.468964 -9985.276209 36691.833499 1367.249609 166398.662065 156406.296667 203189.349614 193216.581853 167874.564543 157882.199144 204453.365614 194472.468964 -9980.89665 36590.26982 1255.887111 -11.468748 8.128889 220.015366 -8.128889 -9964.410636 36776.56483 1261.652705
──────── Test point index: 1398 | True SalePrice: 173000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1398 1291 6 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1398 ===
f_AB(S)   = 156636.03714078016
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 151220.98093081932
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 2377.9911054334516
|f_AB(S)-f_A|  = 13437.29598268529
|psi_ABC(A)|   = 2339.347753569702
|f_ABC(S)-f_A| = 15177.681134667393
|ReSHAP_AB(A)|  = 2377.991105433442
|ReSHAP_ABC(A)| = 2578.215008376592

P   = 0.022838691970114816
R   = 0.00710063207232825
P/R = 3.216430838476913
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 156312.598120 -22295.122523 178607.720643 156312.59812 -22295.122523 -22295.122523
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 156636.037141 -11059.304877 -2377.991105 170073.333123 159014.028246 167695.342018 156636.037141 -11059.304877 -2377.991105 -0.0 -11059.304877 -2377.991105
5 AC 2 0.520030 62033.710779 178562.160107 156366.009201 -25722.432496 3526.28159 178562.160107 152839.727611 182088.441697 156366.009201 -25722.432496 3526.28159 0.0 -25722.432496 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 166113.404190 -2505.718067 -3629.36865 172248.490907 169742.77284 168619.122257 166113.40419 -2505.718067 -3629.36865 0.0 -2505.718067 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 151220.980931 -14409.268919 -2339.347754 1570.935538 166398.662065 151743.122186 163880.012288 149526.948209 167874.564543 153352.962537 165355.914765 151220.980931 -14134.933834 -2131.981606 1694.032722 -386.668171 -218.130245 -84.192371 84.192371 -14637.56033 -2578.215008 2038.094203
──────── Test point index: 2212 | True SalePrice: 153500 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2212 1226 5 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2212 ===
f_AB(S)   = 116160.89434486291
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 122408.5851779739
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 39291.972238784656
|f_AB(S)-f_A|  = 53912.43877860253
|psi_ABC(A)|   = 26173.23507683187
|f_ABC(S)-f_A| = 43990.07688751281
|ReSHAP_AB(A)|  = 39291.97223878466
|ReSHAP_ABC(A)| = 27147.87046024405

P   = 0.1338304504941601
R   = 0.11167464963242446
P/R = 1.1983959737922718
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 149133.436638 -29474.284005 178607.720643 149133.436638 -29474.284005 -29474.284005
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 116160.894345 -14620.46654 -39291.972239 170073.333123 155452.866584 130781.360885 116160.894345 -14620.46654 -39291.972239 -0.0 -14620.46654 -39291.972239
5 AC 2 0.520030 62033.710779 178562.160107 139464.251289 -34005.208085 -5092.700732 178562.160107 144556.952022 173469.459375 139464.251289 -34005.208085 -5092.700732 0.0 -34005.208085 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 136087.643616 -41402.427667 5241.580375 172248.490907 130846.06324 177490.071282 136087.643616 -41402.427667 5241.580375 0.0 -41402.427667 5241.580375
7 ABC 3 0.786106 41411.391808 166398.662065 122408.585178 -15150.135056 -26173.235077 -2666.706754 166398.662065 147533.834304 136833.552157 125504.295679 164267.144932 145083.759549 133737.841655 122408.585178 -11329.256477 -22675.174371 -3095.710501 -7854.128906 0.0 645.635747 318.557621 -13845.751558 -27147.87046 -2996.454869
──────── Test point index: 96 | True SalePrice: 151000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
96 1200 7 4
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 96 ===
f_AB(S)   = 188564.3919465389
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 190286.00249544013
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34535.99002791772
|f_AB(S)-f_A|  = 18491.058823073457
|psi_ABC(A)|   = 35265.20370907041
|f_ABC(S)-f_A| = 23887.340429953416
|ReSHAP_AB(A)|  = 34535.99002791772
|ReSHAP_ABC(A)| = 41202.603392437064

P   = 0.3913994271022294
R   = 0.14284100341427974
P/R = 2.740105556155043
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 146261.772045 -32345.948598 178607.720643 146261.772045 -32345.948598 -32345.948598
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 121774.215974 -56061.742701 177835.958675 121774.215974 -56061.742701 -56061.742701
4 AB 2 0.749226 44839.624931 170073.333123 188564.391947 -16044.931205 34535.990028 170073.333123 154028.401919 204609.323151 188564.391947 -16044.931205 34535.990028 0.0 -16044.931205 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 162008.088020 -37318.318321 20764.246234 178562.160107 141243.841786 199326.406341 162008.08802 -37318.318321 20764.246234 0.0 -37318.318321 20764.246234
6 BC 2 0.753073 44494.356934 172248.490907 187268.215739 36390.991532 -21371.266701 172248.490907 208639.482439 150877.224206 187268.215739 36390.991532 -21371.266701 0.0 36390.991532 -21371.266701
7 ABC 3 0.786106 41411.391808 166398.662065 190286.002495 -20413.173412 35265.203709 9035.310133 166398.662065 146011.558154 203189.349614 181849.083355 175089.403763 157489.242053 211668.204835 190286.002495 -21382.20234 32796.760442 8436.91914 3782.04063 41.936081 3040.764759 -2828.878282 -25390.689606 41202.603392 8075.426643
──────── Test point index: 1080 | True SalePrice: 195000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1080 1455 7 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1080 ===
f_AB(S)   = 202535.103084298
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 201689.16596738965
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34535.99002791772
|f_AB(S)-f_A|  = 32461.76996083255
|psi_ABC(A)|   = 36685.64786315784
|f_ABC(S)-f_A| = 35290.503901902935
|ReSHAP_AB(A)|  = 34535.99002791772
|ReSHAP_ABC(A)| = 36783.2298736926

P   = 0.024364192217888325
R   = 0.021599085468321677
P/R = 1.1280196216465783
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 174426.174782 -4181.545861 178607.720643 174426.174782 -4181.545861 -4181.545861
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 202535.103084 -2074.220067 34535.990028 170073.333123 167999.113056 204609.323151 202535.103084 -2074.220067 34535.990028 0.0 -2074.220067 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 177264.089149 -4824.352547 3526.28159 178562.160107 173737.80756 182088.441697 177264.089149 -4824.352547 3526.28159 -0.0 -4824.352547 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 205010.113789 36390.991532 -3629.36865 172248.490907 208639.482439 168619.122257 205010.113789 36390.991532 -3629.36865 0.0 36390.991532 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 201689.165967 -2763.296094 36685.647863 1368.152133 166398.662065 163634.462419 203189.349614 200430.571285 167874.564543 165110.364896 204453.365614 201689.165967 -2764.199647 36578.801072 1258.594683 0.0 5.421317 217.307794 -5.421317 -2753.823395 36783.229874 1261.097423
──────── Test point index: 2044 | True SalePrice: 219000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2044 2267 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2044 ===
f_AB(S)   = 247022.23031512307
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 257786.0901949473
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34535.99002791772
|f_AB(S)-f_A|  = 76948.89719165763
|psi_ABC(A)|   = 36894.13547133035
|f_ABC(S)-f_A| = 91387.42812946058
|ReSHAP_AB(A)|  = 34535.99002791772
|ReSHAP_ABC(A)| = 36707.36137406971

P   = 0.04510594006353663
R   = 0.04714970145483055
P/R = 0.9566537787465771
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 264110.468986 85502.748343 178607.720643 264110.468986 85502.748343 85502.748343
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 247022.230315 42412.907164 34535.990028 170073.333123 212486.240287 204609.323151 247022.230315 42412.907164 34535.990028 0.0 42412.907164 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 272116.088037 98646.628662 -5092.700732 178562.160107 277208.788769 173469.459375 272116.088037 98646.628662 -5092.700732 -0.0 98646.628662 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 213881.062815 36390.991532 5241.580375 172248.490907 208639.482439 177490.071282 213881.062815 36390.991532 5241.580375 0.0 36390.991532 5241.580375
7 ABC 3 0.786106 41411.391808 166398.662065 257786.090195 56465.935296 36894.135471 -1972.642638 166398.662065 222920.023934 203189.349614 259599.858338 164267.144932 220788.506801 201375.581472 257786.090195 56410.508723 36997.583394 -1813.768143 110.853146 0.0 -317.74899 -0.0 56477.454554 36707.361374 -1797.387799
──────── Test point index: 1397 | True SalePrice: 144000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1397 904 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1397 ===
f_AB(S)   = 106720.21185667881
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 117982.72297993771
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 35191.58998021312
|f_AB(S)-f_A|  = 63353.121266786635
|psi_ABC(A)|   = 24918.75357014032
|f_ABC(S)-f_A| = 48415.939085549006
|ReSHAP_AB(A)|  = 34788.73376034042
|ReSHAP_ABC(A)| = 24116.34974258097

P   = 0.04080234243925829
R   = 0.05101657515373692
P/R = 0.7997859973214911
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 113568.975143 -65038.745499 178607.720643 113568.975143 -65038.745499 -65038.745499
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 142556.194914 -35279.763761 177835.958675 142556.194914 -35279.763761 -35279.763761
4 AB 2 0.749226 44839.624931 170073.333123 106720.211857 -28161.531287 -35191.58998 170073.333123 137811.419578 130781.360885 106720.211857 -32261.913545 -39291.972239 8200.764517 -28564.387506 -34788.73376
5 AC 2 0.520030 62033.710779 178562.160107 115670.619937 -75036.804082 12145.263912 178562.160107 103525.356025 190707.424019 115670.619937 -75036.804082 12145.263912 -0.0 -75036.804082 12145.263912
6 BC 2 0.753073 44494.356934 172248.490907 120357.952063 -40396.324417 -11494.214426 172248.490907 130846.06324 159748.173232 120357.952063 -41402.427667 -12500.317675 2012.206498 -39856.861973 -12033.676871
7 ABC 3 0.786106 41411.391808 166398.662065 117982.722980 -29713.654391 -24918.75357 6216.468876 166398.662065 132038.648112 136833.552157 111799.408296 171481.984153 139421.421192 144216.318789 117982.72298 -26233.59581 -21438.698212 6183.314684 -5826.967152 1199.451949 1199.458396 -3498.902941 -30041.81675 -24116.349743 5742.227407
──────── Test point index: 2786 | True SalePrice: 149900 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2786 894 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2786 ===
f_AB(S)   = 106942.91881037506
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 117485.8199600873
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34806.30099085991
|f_AB(S)-f_A|  = 63130.41431309038
|psi_ABC(A)|   = 24946.043052949408
|f_ABC(S)-f_A| = 48912.84210539941
|ReSHAP_AB(A)|  = 34403.02423123985
|ReSHAP_ABC(A)| = 24073.25281372594

P   = 0.041329566278664354
R   = 0.05278535647814786
P/R = 0.7829740866820519
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 112464.488761 -66143.231881 178607.720643 112464.488761 -66143.231881 -66143.231881
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 142556.194914 -35279.763761 177835.958675 142556.194914 -35279.763761 -35279.763761
4 AB 2 0.749226 44839.624931 170073.333123 106942.918810 -28324.113322 -34806.300991 170073.333123 137263.548553 130781.360885 106942.91881 -32809.78457 -39291.972239 8971.342496 -28727.390082 -34403.024231
5 AC 2 0.520030 62033.710779 178562.160107 114396.346769 -76311.07725 12145.263912 178562.160107 102251.082857 190707.424019 114396.346769 -76311.07725 12145.263912 -0.0 -76311.07725 12145.263912
6 BC 2 0.753073 44494.356934 172248.490907 120357.952063 -40396.324417 -11494.214426 172248.490907 130846.06324 159748.173232 120357.952063 -41402.427667 -12500.317675 2012.206498 -39856.861973 -12033.676871
7 ABC 3 0.786106 41411.391808 166398.662065 117485.819960 -30165.485248 -24946.043053 6198.686196 166398.662065 131614.105261 136833.552157 111355.854794 171481.984153 138996.881295 144216.318789 117485.81996 -26730.498829 -21511.061335 6129.965166 -5754.604029 1252.801466 1252.810867 -3552.255413 -30509.225425 -24073.252814 5669.636133
──────── Test point index: 2233 | True SalePrice: 87000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2233 803 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2233 ===
f_AB(S)   = 109883.57230562333
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 113263.73578325482
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 30843.161079439546
|f_AB(S)-f_A|  = 60189.76081784212
|psi_ABC(A)|   = 25163.907478222012
|f_ABC(S)-f_A| = 53134.9262822319
|ReSHAP_AB(A)|  = 30679.13729702105
|ReSHAP_ABC(A)| = 24173.838425619506

P   = 0.03884696171745816
R   = 0.05475496304330718
P/R = 0.7094692345373889
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 102507.256627 -76100.464016 178607.720643 102507.256627 -76100.464016 -76100.464016
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 142556.194914 -35279.763761 177835.958675 142556.194914 -35279.763761 -35279.763761
4 AB 2 0.749226 44839.624931 170073.333123 109883.572306 -29346.599738 -30843.161079 170073.333123 132277.922226 130781.360885 109883.572306 -37795.410898 -39291.972239 16897.622319 -29510.623521 -30679.137297
5 AC 2 0.520030 62033.710779 178562.160107 103278.354377 -87668.016359 12384.210628 178562.160107 90655.197032 190707.424019 103278.354377 -87906.963075 12145.263912 477.893433 -87352.45914 12068.653409
6 BC 2 0.753073 44494.356934 172248.490907 120357.952063 -40396.324417 -11494.214426 172248.490907 130846.06324 159748.173232 120357.952063 -41402.427667 -12500.317675 2012.206498 -39856.861973 -12033.676871
7 ABC 3 0.786106 41411.391808 166398.662065 113263.735783 -34246.676178 -25163.907478 6275.657374 166398.662065 127750.765314 136833.552157 106902.870523 171481.984153 135133.568232 144216.318789 113263.735783 -30952.583006 -21869.832449 6360.86526 -5395.832915 1021.901373 1021.937658 -3321.382203 -35017.03877 -24173.838426 6055.950913
──────── Test point index: 2684 | True SalePrice: 113000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2684 1048 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2684 ===
f_AB(S)   = 107477.33854400982
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 125231.30969455883
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 38757.698016620765
|f_AB(S)-f_A|  = 62595.994579455626
|psi_ABC(A)|   = 24544.08906053718
|f_ABC(S)-f_A| = 41167.35237092788
|ReSHAP_AB(A)|  = 38632.494076903575
|ReSHAP_ABC(A)| = 25273.545422600477

P   = 0.022969414598161153
R   = 0.0032499310864539943
P/R = 7.067662047943029
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 129473.579042 -49134.141601 178607.720643 129473.579042 -49134.141601 -49134.141601
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 142556.194914 -35279.763761 177835.958675 142556.194914 -35279.763761 -35279.763761
4 AB 2 0.749226 44839.624931 170073.333123 107477.338544 -23838.296563 -38757.698017 170073.333123 145700.762338 130781.360885 107477.338544 -24372.570785 -39291.972239 1068.548444 -23963.500503 -38632.494077
5 AC 2 0.520030 62033.710779 178562.160107 134020.153550 -56687.270469 12145.263912 178562.160107 121874.889638 190707.424019 134020.15355 -56687.270469 12145.263912 -0.0 -56687.270469 12145.263912
6 BC 2 0.753073 44494.356934 172248.490907 120357.952063 -40396.324417 -11494.214426 172248.490907 130846.06324 159748.173232 120357.952063 -41402.427667 -12500.317675 2012.206498 -39856.861973 -12033.676871
7 ABC 3 0.786106 41411.391808 166398.662065 125231.309695 -23197.045346 -24544.089061 6573.782035 166398.662065 138209.162667 136833.552157 117947.485513 171481.984153 145534.795709 144216.318789 125231.309695 -18985.009095 -20303.486014 7283.824182 -6962.17935 98.942451 41.80886 -2341.253405 -23526.748594 -25273.545423 7632.941646
──────── Test point index: 2359 | True SalePrice: 138800 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2359 1096 5 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2359 ===
f_AB(S)   = 109403.87612744633
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 122128.80508411251
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 39109.31968492689
|f_AB(S)-f_A|  = 60669.456996019115
|psi_ABC(A)|   = 24866.695086224703
|f_ABC(S)-f_A| = 44269.85698137421
|ReSHAP_AB(A)|  = 39056.80202338103
|ReSHAP_ABC(A)| = 25481.823840015735

P   = 0.08292231879214962
R   = 0.06816170339919392
P/R = 1.2165529125131909
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 134775.113674 -43832.606968 178607.720643 134775.113674 -43832.606968 -43832.606968
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 109403.876127 -21560.137311 -39109.319685 170073.333123 148330.543259 130781.360885 109403.876127 -21742.789865 -39291.972239 365.305108 -21612.654973 -39056.802023
5 AC 2 0.520030 62033.710779 178562.160107 131517.682433 -50570.759264 3526.28159 178562.160107 127991.400843 182088.441697 131517.682433 -50570.759264 3526.28159 -0.0 -50570.759264 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 127216.694590 -41402.427667 -3629.36865 172248.490907 130846.06324 168619.122257 127216.69459 -41402.427667 -3629.36865 0.0 -41402.427667 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 122128.805084 -21301.360952 -24866.695086 1898.199057 166398.662065 140467.247729 136833.552157 119985.277018 167874.564543 142474.052919 138977.080222 122128.805084 -16848.275138 -20345.247835 2143.528066 -8552.236486 0.0 -136.722876 -530.902713 -20973.87594 -25481.82384 2185.842799
──────── Test point index: 2227 | True SalePrice: 200000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2227 1020 8 4
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2227 ===
f_AB(S)   = 215616.69462970717
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 219665.82845009153
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 71449.97116126893
|f_AB(S)-f_A|  = 45543.36150624172
|psi_ABC(A)|   = 75859.74849086588
|f_ABC(S)-f_A| = 53267.16638460482
|ReSHAP_AB(A)|  = 71449.9711612689
|ReSHAP_ABC(A)| = 86068.07517163397

P   = 0.14469682571381437
R   = -0.04694705875830052
P/R = -3.082127603749641
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 126381.017172 -52226.70347 178607.720643 126381.017172 -52226.70347 -52226.70347
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.292963 75290.806037 177835.958675 121774.215974 -56061.742701 177835.958675 121774.215974 -56061.742701 -56061.742701
4 AB 2 0.749226 44839.624931 170073.333123 215616.694630 -25906.609655 71449.971161 170073.333123 144166.723468 241523.304285 215616.69463 -25906.609655 71449.971161 -0.0 -25906.609655 71449.971161
5 AC 2 0.520030 62033.710779 178562.160107 139071.171003 -60255.235338 20764.246234 178562.160107 118306.924769 199326.406341 139071.171003 -60255.235338 20764.246234 0.0 -60255.235338 20764.246234
6 BC 2 0.753073 44494.356934 172248.490907 226164.925338 78051.520224 -24135.085793 172248.490907 253063.830223 150877.224206 226164.925338 80815.339316 -21371.266701 -5527.638184 73300.410816 -19383.976384
7 ABC 3 0.786106 41411.391808 166398.662065 219665.828450 -31290.771865 75859.748491 8698.189759 166398.662065 136963.317364 246727.241511 212270.602281 175089.403763 149585.322564 254122.43913 219665.82845 -34456.61068 70080.505886 7395.226169 8952.52948 -0.02855 5226.779031 -3931.234952 -40152.487328 86068.075172 7351.578541
──────── Test point index: 781 | True SalePrice: 161500 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
781 1788 4 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 781 ===
f_AB(S)   = 110037.26481708279
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 122259.18650452576
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 74544.14652902738
|f_AB(S)-f_A|  = 60036.06830638266
|psi_ABC(A)|   = 54253.87043492385
|f_ABC(S)-f_A| = 44139.47556096096
|ReSHAP_AB(A)|  = 77153.58386475043
|ReSHAP_ABC(A)| = 46447.20265761696

P   = 0.012509794670889862
R   = 0.23283791656397979
P/R = 0.05372748071061008
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 211205.571296 32597.850654 178607.720643 211205.571296 32597.850654 32597.850654
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.292963 75290.806037 177835.958675 217665.882703 39829.924028 177835.958675 217665.882703 39829.924028 39829.924028
4 AB 2 0.749226 44839.624931 170073.333123 110037.264817 14508.078223 -74544.146529 170073.333123 186243.218189 97190.993438 110037.264817 16169.885066 -72882.339686 -3323.613686 17117.515558 -77153.583865
5 AC 2 0.520030 62033.710779 178562.160107 202459.420987 37608.943934 -13711.683054 178562.160107 216171.104041 164850.477053 202459.420987 37608.943934 -13711.683054 -0.0 37608.943934 -13711.683054
6 BC 2 0.753073 44494.356934 172248.490907 107736.151807 -76980.103916 12467.764816 172248.490907 96913.151575 186361.020308 107736.151807 -75335.339331 14112.529401 -3289.529169 -79383.141065 14870.801966
7 ABC 3 0.786106 41411.391808 166398.662065 122259.186505 17050.930721 -54253.870435 -6936.535847 166398.662065 187947.383582 117851.808625 130381.694359 160659.725322 182208.446838 109694.419484 122259.186505 12564.767021 -59949.260334 -8122.507854 8983.954496 -34.881287 2383.571111 34.881287 10075.902821 -46447.202658 -7768.175725
──────── Test point index: 2351 | True SalePrice: 179900 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2351 1552 6 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2351 ===
f_AB(S)   = 170935.47089354537
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 169673.8573190838
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 2377.9911054334516
|f_AB(S)-f_A|  = 862.1377700799203
|psi_ABC(A)|   = 2518.649777395214
|f_ABC(S)-f_A| = 3275.1952535970777
|ReSHAP_AB(A)|  = 2377.9911054333566
|ReSHAP_ABC(A)| = 2518.6497773953256

P   = 1.9892420179630206
R   = 1.989242017962876
P/R = 1.0000000000000726
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 185139.692685 6531.972043 178607.720643 185139.692685 6531.972043 6531.972043
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 170935.470894 3240.128876 -2377.991105 170073.333123 173313.461999 167695.342018 170935.470894 3240.128876 -2377.991105 -0.0 3240.128876 -2377.991105
5 AC 2 0.520030 62033.710779 178562.160107 189624.538875 7536.097178 3526.28159 178562.160107 186098.257285 182088.441697 189624.538875 7536.097178 3526.28159 -0.0 7536.097178 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 166113.404190 -2505.718067 -3629.36865 172248.490907 169742.77284 168619.122257 166113.40419 -2505.718067 -3629.36865 0.0 -2505.718067 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 169673.857319 4317.942554 -2518.649777 1475.902477 166398.662065 170716.604619 163880.012288 168197.954842 167874.564543 172192.507096 165355.914765 169673.857319 4317.942554 -2518.649777 1475.902477 0.0 -0.0 -0.0 0.0 4317.942554 -2518.649777 1475.902477
──────── Test point index: 1873 | True SalePrice: 127000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1873 1052 6 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1873 ===
f_AB(S)   = 143541.91964303734
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 139242.5730689896
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 2377.991105433437
|f_AB(S)-f_A|  = 26531.413480428106
|psi_ABC(A)|   = 1890.606435811962
|f_ABC(S)-f_A| = 27156.08899649713
|ReSHAP_AB(A)|  = 2377.991105433437
|ReSHAP_ABC(A)| = 1246.1427943596798

P   = 0.02000928327461532
R   = 0.043741111577851904
P/R = 0.4574479832091629
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 129915.373594 -48692.347049 178607.720643 129915.373594 -48692.347049 -48692.347049
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 143541.919643 -24153.422375 -2377.991105 170073.333123 145919.910748 167695.342018 143541.919643 -24153.422375 -2377.991105 0.0 -24153.422375 -2377.991105
5 AC 2 0.520030 62033.710779 178562.160107 125910.880495 -56177.561202 3526.28159 178562.160107 122384.598905 182088.441697 125910.880495 -56177.561202 3526.28159 -0.0 -56177.561202 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 166113.404190 -2505.718067 -3629.36865 172248.490907 169742.77284 168619.122257 166113.40419 -2505.718067 -3629.36865 0.0 -2505.718067 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 139242.573069 -27068.211768 -1890.606436 1802.729208 166398.662065 138389.552353 163880.012288 137099.045003 167874.564543 140491.164036 165355.914765 139242.573069 -26113.341696 -1248.590967 2143.528066 -1270.058811 -667.625588 -41.916382 41.916382 -28009.539719 -1246.142794 2099.593517
──────── Test point index: 1337 | True SalePrice: 157000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1337 1497 6 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1337 ===
f_AB(S)   = 167922.18025598946
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 169265.62928988968
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 2377.991105433466
|f_AB(S)-f_A|  = 2151.15286747599
|psi_ABC(A)|   = 2518.6497773952287
|f_ABC(S)-f_A| = 2866.9672244029643
|ReSHAP_AB(A)|  = 2377.991105433501
|ReSHAP_ABC(A)| = 2518.6497773952474

P   = 0.22694296137652215
R   = 0.2269429613765318
P/R = 0.9999999999999575
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 179065.017585 457.296943 178607.720643 179065.017585 457.296943 457.296943
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.292963 75290.806037 177835.958675 142556.194914 -35279.763761 177835.958675 142556.194914 -35279.763761 -35279.763761
4 AB 2 0.749226 44839.624931 170073.333123 167922.180256 226.838238 -2377.991105 170073.333123 170300.171361 167695.342018 167922.180256 226.838238 -2377.991105 -0.0 226.838238 -2377.991105
5 AC 2 0.520030 62033.710779 178562.160107 191235.018775 527.594756 12145.263912 178562.160107 179089.754863 190707.424019 191235.018775 527.594756 12145.263912 0.0 527.594756 12145.263912
6 BC 2 0.753073 44494.356934 172248.490907 157242.455164 -2505.718067 -12500.317675 172248.490907 169742.77284 159748.173232 157242.455164 -2505.718067 -12500.317675 0.0 -2505.718067 -12500.317675
7 ABC 3 0.786106 41411.391808 166398.662065 169265.629290 302.294914 -2518.649777 5083.322088 166398.662065 166700.95698 163880.012288 164182.307202 171481.984153 171784.279067 168963.334376 169265.62929 302.294914 -2518.649777 5083.322088 -0.0 0.0 0.0 0.0 302.294914 -2518.649777 5083.322088
──────── Test point index: 2550 | True SalePrice: 151500 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2550 1248 5 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2550 ===
f_AB(S)   = 117366.21059988528
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 128581.81151844961
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 39291.972238784656
|f_AB(S)-f_A|  = 52707.12252358017
|psi_ABC(A)|   = 25746.26799531745
|f_ABC(S)-f_A| = 37816.8505470371
|ReSHAP_AB(A)|  = 39291.97223878466
|ReSHAP_ABC(A)| = 27340.830207040002

P   = 0.06466276167339435
R   = 0.022497372735486576
P/R = 2.874236135644299
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 151563.306678 -27044.413965 178607.720643 151563.306678 -27044.413965 -27044.413965
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 117366.210600 -13415.150285 -39291.972239 170073.333123 156658.182839 130781.360885 117366.2106 -13415.150285 -39291.972239 -0.0 -13415.150285 -39291.972239
5 AC 2 0.520030 62033.710779 178562.160107 150886.634580 -31201.807117 3526.28159 178562.160107 147360.35299 182088.441697 150886.63458 -31201.807117 3526.28159 -0.0 -31201.807117 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 127216.694590 -41402.427667 -3629.36865 172248.490907 130846.06324 168619.122257 127216.69459 -41402.427667 -3629.36865 0.0 -41402.427667 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 128581.811518 -13916.071057 -25746.267995 1845.488506 166398.662065 148890.242186 136833.552157 126438.283453 167874.564543 150580.784069 138977.080222 128581.811518 -10395.268704 -21998.97255 2143.528066 -6898.51177 0.0 -452.986183 -214.639406 -12680.966039 -27340.830207 2204.945699
──────── Test point index: 211 | True SalePrice: 100000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
211 1392 5 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 211 ===
f_AB(S)   = 125255.5533600316
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 134695.18603519004
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 39291.97223878465
|f_AB(S)-f_A|  = 44817.77976343385
|psi_ABC(A)|   = 27690.273463725494
|f_ABC(S)-f_A| = 31703.476030296675
|ReSHAP_AB(A)|  = 39291.972238784656
|ReSHAP_ABC(A)| = 29187.087011493815

P   = 0.003290583527092328
R   = -0.043922332379602635
P/R = -0.07491823290833395
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 167467.910576 -11139.810067 178607.720643 167467.910576 -11139.810067 -11139.810067
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 125255.553360 -5525.807525 -39291.972239 170073.333123 164547.525599 130781.360885 125255.55336 -5525.807525 -39291.972239 -0.0 -5525.807525 -39291.972239
5 AC 2 0.520030 62033.710779 178562.160107 169236.168194 -12852.273503 3526.28159 178562.160107 165709.886604 182088.441697 169236.168194 -12852.273503 3526.28159 -0.0 -12852.273503 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 127216.694590 -41402.427667 -3629.36865 172248.490907 130846.06324 168619.122257 127216.69459 -41402.427667 -3629.36865 0.0 -41402.427667 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 134695.186035 -5822.917838 -27690.273464 1809.715271 166398.662065 159034.720577 136833.552157 132551.65797 167874.564543 160510.623054 138977.080222 134695.186035 -4281.894187 -25815.437019 2143.528066 -3082.047301 0.0 -667.625588 -0.0 -4720.371617 -29187.087011 2203.982598
──────── Test point index: 1557 | True SalePrice: 81000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1557 894 3 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1557 ===
f_AB(S)   = 132820.53798010125
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 76575.96762997033
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34541.32231514594
|f_AB(S)-f_A|  = 37252.7951433642
|psi_ABC(A)|   = 61416.32314269362
|f_ABC(S)-f_A| = 89822.69443551639
|ReSHAP_AB(A)|  = 24710.32746620691
|ReSHAP_ABC(A)| = 62894.695628008325

P   = 0.2434636233159938
R   = -0.036894684925716015
P/R = -6.598880673630496
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 112464.488761 -66143.231881 178607.720643 112464.488761 -66143.231881 -66143.231881
2 B 1 0.653707 52691.751258 167830.652805 125996.448016 -41834.204789 167830.652805 125996.448016 -41834.204789 -41834.204789
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 132820.537980 -2711.472828 -34541.322315 170073.333123 137263.548553 105433.699066 132820.53798 -32809.78457 -64639.634057 60196.623484 -12542.467677 -24710.327466
5 AC 2 0.520030 62033.710779 178562.160107 105777.364447 -76311.07725 3526.28159 178562.160107 102251.082857 182088.441697 105777.364447 -76311.07725 3526.28159 -0.0 -76311.07725 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 97520.342912 -74240.342302 -487.805694 172248.490907 94866.585649 168619.122257 97520.342912 -77381.905258 -3629.36865 6283.125913 -71380.268317 -3347.879678
7 ABC 3 0.786106 41411.391808 166398.662065 76575.967630 -29910.352685 -61416.323143 1503.981392 166398.662065 131614.105261 100665.790842 75125.807359 167874.564543 133757.642728 101694.016231 76575.96763 -25118.048601 -57181.675098 1450.160271 -8998.873214 -421.934882 693.377196 -245.700108 -28389.968597 -62894.695628 1461.96979
──────── Test point index: 1910 | True SalePrice: 126000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1910 1560 5 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1910 ===
f_AB(S)   = 134459.7865802023
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 131348.97917055085
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 39291.97223878466
|f_AB(S)-f_A|  = 35613.54654326316
|psi_ABC(A)|   = 31888.946407951455
|f_ABC(S)-f_A| = 35049.68289493586
|ReSHAP_AB(A)|  = 39291.97223878469
|ReSHAP_ABC(A)| = 29887.194046985864

P   = 0.19346600854350116
R   = 0.25057786222177914
P/R = 0.7720794120761955
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 186023.281791 7415.561148 178607.720643 186023.281791 7415.561148 7415.561148
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 217665.882703 39829.924028 177835.958675 217665.882703 39829.924028 39829.924028
4 AB 2 0.749226 44839.624931 170073.333123 134459.786580 3678.425696 -39291.972239 170073.333123 173751.758819 130781.360885 134459.78658 3678.425696 -39291.972239 -0.0 3678.425696 -39291.972239
5 AC 2 0.520030 62033.710779 178562.160107 173405.992765 8555.515712 -13711.683054 178562.160107 187117.675819 164850.477053 173405.992765 8555.515712 -13711.683054 0.0 8555.515712 -13711.683054
6 BC 2 0.753073 44494.356934 172248.490907 144958.592641 -41402.427667 14112.529401 172248.490907 130846.06324 186361.020308 144958.592641 -41402.427667 14112.529401 0.0 -41402.427667 14112.529401
7 ABC 3 0.786106 41411.391808 166398.662065 131348.979171 3876.206419 -31888.946408 -7036.942906 166398.662065 171300.698821 136833.552157 139683.928239 160659.725322 165561.762078 128498.603088 131348.979171 2850.376082 -34212.782907 -8334.949069 2051.660674 0.0 2596.012325 -0.0 2663.910657 -29887.194047 -7826.399505
──────── Test point index: 881 | True SalePrice: 190000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
881 1500 7 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 881 ===
f_AB(S)   = 205000.52269684372
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 204974.69585435695
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34535.99002791774
|f_AB(S)-f_A|  = 34927.18957337827
|psi_ABC(A)|   = 36684.57389952465
|f_ABC(S)-f_A| = 38576.033788870234
|ReSHAP_AB(A)|  = 34535.99002791775
|ReSHAP_ABC(A)| = 36783.59130779518

P   = 0.037831564729679634
R   = 0.03526475320072764
P/R = 1.0727868848065278
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 179396.363500 788.642857 178607.720643 179396.3635 788.642857 788.642857
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 205000.522697 391.199545 34535.990028 170073.333123 170464.532669 204609.323151 205000.522697 391.199545 34535.990028 0.0 391.199545 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 182998.318404 909.876707 3526.28159 178562.160107 179472.036814 182088.441697 182998.318404 909.876707 3526.28159 -0.0 909.876707 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 205010.113789 36390.991532 -3629.36865 172248.490907 208639.482439 168619.122257 205010.113789 36390.991532 -3629.36865 0.0 36390.991532 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 204974.695854 521.159829 36684.5739 1370.30006 166398.662065 166919.992306 203189.349614 203709.65739 167874.564543 168395.894783 204453.365614 204974.695854 521.33024 36578.801072 1265.038465 0.0 -1.022465 210.864013 1.022465 521.045656 36783.591308 1271.396825
──────── Test point index: 554 | True SalePrice: 110000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
554 980 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 554 ===
f_AB(S)   = 106012.19526378946
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 121759.18593080073
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 37627.508171696405
|f_AB(S)-f_A|  = 64061.13785967599
|psi_ABC(A)|   = 24729.650950041087
|f_ABC(S)-f_A| = 44639.47613468599
|ReSHAP_AB(A)|  = 37351.03109406306
|ReSHAP_ABC(A)| = 24598.808413789706

P   = 0.03338257696110691
R   = 0.03199784024382035
P/R = 1.0432759432116356
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 121963.071645 -56644.648998 178607.720643 121963.071645 -56644.648998 -56644.648998
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 142556.194914 -35279.763761 177835.958675 142556.194914 -35279.763761 -35279.763761
4 AB 2 0.749226 44839.624931 170073.333123 106012.195264 -26433.629688 -37627.508172 170073.333123 141975.239368 130781.360885 106012.195264 -28098.093755 -39291.972239 3328.928134 -26710.106766 -37351.031094
5 AC 2 0.520030 62033.710779 178562.160107 125355.096011 -65352.328008 12145.263912 178562.160107 113209.832099 190707.424019 125355.096011 -65352.328008 12145.263912 -0.0 -65352.328008 12145.263912
6 BC 2 0.753073 44494.356934 172248.490907 120357.952063 -40396.324417 -11494.214426 172248.490907 130846.06324 159748.173232 120357.952063 -41402.427667 -12500.317675 2012.206498 -39856.861973 -12033.676871
7 ABC 3 0.786106 41411.391808 166398.662065 121759.185931 -26298.045331 -24729.65095 6388.220146 166398.662065 135265.157776 136833.552157 115060.614213 171481.984153 142647.924409 144216.318789 121759.185931 -22457.132859 -20888.738478 6698.571717 -6376.926885 684.194915 684.194915 -2983.63946 -26561.252706 -24598.808414 6520.584985
──────── Test point index: 1462 | True SalePrice: 255900 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1462 2290 8 9
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1462 ===
f_AB(S)   = 289400.26577273966
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 296860.6489255956
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 73551.94664463995
|f_AB(S)-f_A|  = 119326.93264927421
|psi_ABC(A)|   = 80968.14130740541
|f_ABC(S)-f_A| = 130461.98686010891
|ReSHAP_AB(A)|  = 74059.1128890729
|ReSHAP_ABC(A)| = 80130.67326879693

P   = -0.004236152404847537
R   = 0.006433321485523225
P/R = -0.6584704983856422
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 266650.787664 88043.067021 178607.720643 266650.787664 88043.067021 88043.067021
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.292963 75290.806037 177835.958675 242702.445299 64866.486624 177835.958675 242702.445299 64866.486624 64866.486624
4 AB 2 0.749226 44839.624931 170073.333123 289400.265773 45774.986005 73551.946645 170073.333123 213746.343645 241523.304285 289400.265773 43673.010521 71449.971161 4203.950967 45267.81976 74059.112889
5 AC 2 0.520030 62033.710779 178562.160107 257808.951678 101577.456947 -22330.665376 178562.160107 280139.617054 156231.494731 257808.951678 101577.456947 -22330.665376 -0.0 101577.456947 -22330.665376
6 BC 2 0.753073 44494.356934 172248.490907 297600.953652 91592.161818 33760.300928 172248.490907 253063.830223 195231.969333 297600.953652 80815.339316 22983.478426 21553.645004 97596.50477 27755.957976
7 ABC 3 0.786106 41411.391808 166398.662065 296860.648926 58143.566727 80968.141307 -8649.721175 166398.662065 224599.294766 246727.241511 304813.727577 157052.305711 215252.938411 238774.140826 296860.648926 58086.508099 81607.710514 -7953.078651 114.124601 -0.022033 -1393.277703 0.022033 58133.946747 80130.673269 -7802.633156
──────── Test point index: 87 | True SalePrice: 133000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
87 1040 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 87 ===
f_AB(S)   = 107209.99535683448
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 124816.5569457882
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 38672.22120020438
|f_AB(S)-f_A|  = 62863.33776663097
|psi_ABC(A)|   = 24565.984964812622
|f_ABC(S)-f_A| = 41582.10511969852
|ReSHAP_AB(A)|  = 38532.21680011161
|ReSHAP_ABC(A)| = 25170.86980126426

P   = 0.02439663453865526
R   = 0.007622752916615627
P/R = 3.2005018141775152
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 128589.989936 -50017.730707 178607.720643 128589.989936 -50017.730707 -50017.730707
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 142556.194914 -35279.763761 177835.958675 142556.194914 -35279.763761 -35279.763761
4 AB 2 0.749226 44839.624931 170073.333123 107209.995357 -24191.116566 -38672.2212 170073.333123 145262.465518 130781.360885 107209.995357 -24810.867605 -39291.972239 1239.502077 -24331.120967 -38532.2168
5 AC 2 0.520030 62033.710779 178562.160107 133000.735016 -57706.689003 12145.263912 178562.160107 120855.471104 190707.424019 133000.735016 -57706.689003 12145.263912 -0.0 -57706.689003 12145.263912
6 BC 2 0.753073 44494.356934 172248.490907 120357.952063 -40396.324417 -11494.214426 172248.490907 130846.06324 159748.173232 120357.952063 -41402.427667 -12500.317675 2012.206498 -39856.861973 -12033.676871
7 ABC 3 0.786106 41411.391808 166398.662065 124816.556946 -23568.006286 -24565.984965 6551.886131 166398.662065 137850.664513 136833.552157 117607.853595 171481.984153 145195.163791 144216.318789 124816.556946 -19399.761844 -20378.606845 7208.70335 -6887.058518 174.063282 135.795928 -2435.240473 -23880.09463 -25170.869801 7468.859312
──────── Test point index: 2652 | True SalePrice: 139000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2652 1566 5 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2652 ===
f_AB(S)   = 134788.5091952084
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 136842.94167583322
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 39291.97223878465
|f_AB(S)-f_A|  = 35284.82392825704
|psi_ABC(A)|   = 31164.71028632442
|f_ABC(S)-f_A| = 29555.720389653492
|ReSHAP_AB(A)|  = 39291.972238784656
|ReSHAP_ABC(A)| = 29414.357688104697

P   = 0.05912656899895463
R   = 0.11834869504236101
P/R = 0.4995962902488382
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 186685.973620 8078.252977 178607.720643 186685.97362 8078.252977 8078.252977
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 134788.509195 4007.148311 -39291.972239 170073.333123 174080.481434 130781.360885 134788.509195 4007.148311 -39291.972239 -0.0 4007.148311 -39291.972239
5 AC 2 0.520030 62033.710779 178562.160107 182789.538988 9320.079613 -5092.700732 178562.160107 187882.23972 173469.459375 182789.538988 9320.079613 -5092.700732 0.0 9320.079613 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 136087.643616 -41402.427667 5241.580375 172248.490907 130846.06324 177490.071282 136087.643616 -41402.427667 5241.580375 0.0 -41402.427667 5241.580375
7 ABC 3 0.786106 41411.391808 166398.662065 136842.941676 4222.603714 -31164.710286 -2613.613817 166398.662065 171738.769473 136833.552157 139938.652177 164267.144932 169607.25234 133737.841655 136842.941676 3105.100021 -32764.310664 -3095.710501 2235.007387 0.0 964.193368 0.0 2871.111275 -29414.357688 -3012.473977
──────── Test point index: 478 | True SalePrice: 184100 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
478 1419 8 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 478 ===
f_AB(S)   = 237476.74852761257
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 239531.42615370333
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 71449.97116126891
|f_AB(S)-f_A|  = 67403.41540414712
|psi_ABC(A)|   = 80492.74211154168
|f_ABC(S)-f_A| = 73132.76408821662
|ReSHAP_AB(A)|  = 71449.97116126888
|ReSHAP_ABC(A)| = 80320.51350921749

P   = -0.040603709696341195
R   = -0.03824869680340037
P/R = 1.0615710622781651
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 170450.023807 -8157.696836 178607.720643 170450.023807 -8157.696836 -8157.696836
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 237476.748528 -4046.555757 71449.971161 170073.333123 166026.777366 241523.304285 237476.748528 -4046.555757 71449.971161 -0.0 -4046.555757 71449.971161
5 AC 2 0.520030 62033.710779 178562.160107 164057.723424 -9411.735951 -5092.700732 178562.160107 169150.424156 173469.459375 164057.723424 -9411.735951 -5092.700732 0.0 -9411.735951 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 262265.809925 82795.538979 7221.780039 172248.490907 253063.830223 177490.071282 262265.809925 80815.339316 5241.580375 3960.399327 84534.517466 5482.801552
7 ABC 3 0.786106 41411.391808 166398.662065 239531.426154 -5387.335385 80492.742112 -1972.642638 166398.662065 161006.038509 246727.241511 241345.194297 164267.144932 158874.521376 244913.473368 239531.426154 -5382.047214 80656.904778 -1813.768143 -10.576343 0.0 -317.74899 0.0 -5381.29102 80320.513509 -1806.458401
──────── Test point index: 1505 | True SalePrice: 120000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1505 1053 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1505 ===
f_AB(S)   = 107644.42803599441
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 125469.10617795784
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 38811.12102688102
|f_AB(S)-f_A|  = 62428.90508747104
|psi_ABC(A)|   = 24538.11267202786
|f_ABC(S)-f_A| = 40929.55588752887
|ReSHAP_AB(A)|  = 38695.87154781898
|ReSHAP_ABC(A)| = 25330.93557731857

P   = 0.022164496033146297
R   = 0.0009479790945913225
P/R = 23.3807856730232
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 130025.822232 -48581.89841 178607.720643 130025.822232 -48581.89841 -48581.89841
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 142556.194914 -35279.763761 177835.958675 142556.194914 -35279.763761 -35279.763761
4 AB 2 0.749226 44839.624931 170073.333123 107644.428036 -23617.784061 -38811.121027 170073.333123 145974.697851 130781.360885 107644.428036 -24098.635272 -39291.972239 961.702424 -23733.03354 -38695.871548
5 AC 2 0.520030 62033.710779 178562.160107 134657.290134 -56050.133885 12145.263912 178562.160107 122512.026222 190707.424019 134657.290134 -56050.133885 12145.263912 -0.0 -56050.133885 12145.263912
6 BC 2 0.753073 44494.356934 172248.490907 120357.952063 -40396.324417 -11494.214426 172248.490907 130846.06324 159748.173232 120357.952063 -41402.427667 -12500.317675 2012.206498 -39856.861973 -12033.676871
7 ABC 3 0.786106 41411.391808 166398.662065 125469.106178 -22971.201639 -24538.112672 6579.758424 166398.662065 138436.627354 136833.552157 118159.755461 171481.984153 145747.065657 144216.318789 125469.106178 -18747.212612 -20277.959479 7309.350717 -6987.705884 73.415916 1.087586 -2300.532131 -23292.915827 -25330.935577 7694.295517
──────── Test point index: 1283 | True SalePrice: 143000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1283 1133 6 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1283 ===
f_AB(S)   = 147979.67494561963
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 138016.91128585787
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 2377.9911054334516
|f_AB(S)-f_A|  = 22093.65817784582
|psi_ABC(A)|   = 2082.1581182692803
|f_ABC(S)-f_A| = 28381.75077962884
|ReSHAP_AB(A)|  = 2377.99110543344
|ReSHAP_ABC(A)| = 1702.794213739044

P   = 0.03426972579351578
R   = 0.047636198759747034
P/R = 0.7194051306728944
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 138861.713287 -39746.007356 178607.720643 138861.713287 -39746.007356 -39746.007356
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 147979.674946 -19715.667072 -2377.991105 170073.333123 150357.666051 167695.342018 147979.674946 -19715.667072 -2377.991105 -0.0 -19715.667072 -2377.991105
5 AC 2 0.520030 62033.710779 178562.160107 127613.510831 -45855.948544 -5092.700732 178562.160107 132706.211563 173469.459375 127613.510831 -45855.948544 -5092.700732 0.0 -45855.948544 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 174984.353215 -2505.718067 5241.580375 172248.490907 169742.77284 177490.071282 174984.353215 -2505.718067 5241.580375 0.0 -2505.718067 5241.580375
7 ABC 3 0.786106 41411.391808 166398.662065 138016.911286 -23865.066031 -2082.158118 -2434.52663 166398.662065 142358.579091 163880.012288 140795.982193 164267.144932 139704.112525 161748.495155 138016.911286 -23731.583869 -1687.201239 -2779.070907 -831.448538 647.553773 124.60434 -124.60434 -24033.687383 -1702.794214 -2645.269183
──────── Test point index: 485 | True SalePrice: 244000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
485 2054 6 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 485 ===
f_AB(S)   = 198438.59634905544
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 202718.34889215324
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 2377.991105433437
|f_AB(S)-f_A|  = 28365.263225589995
|psi_ABC(A)|   = 2518.6497773952287
|f_ABC(S)-f_A| = 36319.686826666526
|ReSHAP_AB(A)|  = 2377.991105433437
|ReSHAP_ABC(A)| = 2518.6497773952124

P   = 0.01448793934163363
R   = 0.014487939341634073
P/R = 0.9999999999999694
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 240584.909053 61977.18841 178607.720643 240584.909053 61977.18841 61977.18841
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 198438.596349 30743.254331 -2377.991105 170073.333123 200816.587454 167695.342018 198438.596349 30743.254331 -2377.991105 0.0 30743.254331 -2377.991105
5 AC 2 0.520030 62033.710779 178562.160107 244974.069567 71504.610192 -5092.700732 178562.160107 250066.770299 173469.459375 244974.069567 71504.610192 -5092.700732 0.0 71504.610192 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 174984.353215 -2505.718067 5241.580375 172248.490907 169742.77284 177490.071282 174984.353215 -2505.718067 5241.580375 0.0 -2505.718067 5241.580375
7 ABC 3 0.786106 41411.391808 166398.662065 202718.348892 40969.853737 -2518.649777 -2131.517133 166398.662065 207368.515803 163880.012288 204849.866025 164267.144932 205236.99867 161748.495155 202718.348892 40969.853737 -2518.649777 -2131.517133 -0.0 0.0 0.0 0.0 40969.853737 -2518.649777 -2131.517133
──────── Test point index: 134 | True SalePrice: 178000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
134 1128 6 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 134 ===
f_AB(S)   = 147705.73943311456
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 147724.69835841385
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 2377.991105433437
|f_AB(S)-f_A|  = 22367.593690350885
|psi_ABC(A)|   = 1952.943219013813
|f_ABC(S)-f_A| = 18673.963707072864
|ReSHAP_AB(A)|  = 2377.991105433437
|ReSHAP_ABC(A)| = 1315.2858397816008

P   = 0.0017330468665296417
R   = 0.035879915160522094
P/R = 0.04830130892941676
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 138309.470096 -40298.250547 178607.720643 138309.470096 -40298.250547 -40298.250547
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.292963 75290.806037 177835.958675 142556.194914 -35279.763761 177835.958675 142556.194914 -35279.763761 -35279.763761
4 AB 2 0.749226 44839.624931 170073.333123 147705.739433 -19989.602585 -2377.991105 170073.333123 150083.730539 167695.342018 147705.739433 -19989.602585 -2377.991105 0.0 -19989.602585 -2377.991105
5 AC 2 0.520030 62033.710779 178562.160107 144214.338891 -46493.085128 12145.263912 178562.160107 132069.074979 190707.424019 144214.338891 -46493.085128 12145.263912 -0.0 -46493.085128 12145.263912
6 BC 2 0.753073 44494.356934 172248.490907 157242.455164 -2505.718067 -12500.317675 172248.490907 169742.77284 159748.173232 157242.455164 -2505.718067 -12500.317675 0.0 -2505.718067 -12500.317675
7 ABC 3 0.786106 41411.391808 166398.662065 147724.698358 -22806.994811 -1952.943219 6085.974323 166398.662065 142098.773077 163880.012288 140560.607183 171481.984153 149036.470403 168963.334376 147724.698358 -21238.636017 -1311.772044 7164.091175 -1206.877733 -2080.769088 -226.39385 226.39385 -25209.582846 -1315.28584 7850.904978
──────── Test point index: 2605 | True SalePrice: 170000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2605 1466 5 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2605 ===
f_AB(S)   = 129309.7989451068
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 132597.54270587457
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 39291.972238784634
|f_AB(S)-f_A|  = 40763.53417835865
|psi_ABC(A)|   = 29636.82100800672
|f_ABC(S)-f_A| = 33801.119359612145
|ReSHAP_AB(A)|  = 39291.97223878463
|ReSHAP_ABC(A)| = 29625.761017483088

P   = 0.08710005354631534
R   = 0.08742726136467072
P/R = 0.996257370833217
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 175641.109802 -2966.610841 178607.720643 175641.109802 -2966.610841 -2966.610841
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 129309.798945 -1471.56194 -39291.972239 170073.333123 168601.771184 130781.360885 129309.798945 -1471.56194 -39291.972239 0.0 -1471.56194 -39291.972239
5 AC 2 0.520030 62033.710779 178562.160107 170046.807312 -3422.652063 -5092.700732 178562.160107 175139.508044 173469.459375 170046.807312 -3422.652063 -5092.700732 0.0 -3422.652063 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 136087.643616 -41402.427667 5241.580375 172248.490907 130846.06324 177490.071282 136087.643616 -41402.427667 5241.580375 0.0 -41402.427667 5241.580375
7 ABC 3 0.786106 41411.391808 166398.662065 132597.542706 -1550.684534 -29636.821008 -2613.613817 166398.662065 164437.591946 136833.552157 135693.253207 164267.144932 162306.074813 133737.841655 132597.542706 -1140.298949 -29708.532107 -3095.710501 -820.77117 0.0 964.193368 -0.0 -1170.638008 -29625.761017 -3004.720334
──────── Test point index: 56 | True SalePrice: 185088 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
56 1733 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 56 ===
f_AB(S)   = 217765.91757958042
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 218874.2684136173
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34535.990027917695
|f_AB(S)-f_A|  = 47692.58445611497
|psi_ABC(A)|   = 36932.36857667148
|f_ABC(S)-f_A| = 52475.60634813059
|ReSHAP_AB(A)|  = 34535.99002791768
|ReSHAP_ABC(A)| = 36763.316595479824

P   = 0.020336822312463476
R   = 0.023558356906463573
P/R = 0.8632530016082648
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 205130.896196 26523.175554 178607.720643 205130.896196 26523.175554 26523.175554
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 217765.917580 13156.594428 34535.990028 170073.333123 183229.927552 204609.323151 217765.91758 13156.594428 34535.990028 -0.0 13156.594428 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 204069.900887 30600.441512 -5092.700732 178562.160107 209162.601619 173469.459375 204069.900887 30600.441512 -5092.700732 0.0 30600.441512 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 213881.062815 36390.991532 5241.580375 172248.490907 208639.482439 177490.071282 213881.062815 36390.991532 5241.580375 0.0 36390.991532 5241.580375
7 ABC 3 0.786106 41411.391808 166398.662065 218874.268414 17515.88041 36932.368577 -1972.642638 166398.662065 183931.735942 203189.349614 220688.036556 164267.144932 181800.218809 201375.581472 218874.268414 17498.686942 37074.049604 -1813.768143 34.386935 -0.0 -317.74899 0.0 17509.713074 36763.316595 -1797.423322
──────── Test point index: 282 | True SalePrice: 113000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
282 1487 6 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 282 ===
f_AB(S)   = 167374.3092309793
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 161320.672316365
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 2377.9911054334516
|f_AB(S)-f_A|  = 2699.023892486148
|psi_ABC(A)|   = 2518.6497773952287
|f_ABC(S)-f_A| = 5077.9897491217125
|ReSHAP_AB(A)|  = 2377.9911054334625
|ReSHAP_ABC(A)| = 2518.649777395239

P   = 0.3850624928006781
R   = 0.38506249280068006
P/R = 0.999999999999995
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 177960.531204 -647.189439 178607.720643 177960.531204 -647.189439 -647.189439
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 167374.309231 -321.032787 -2377.991105 170073.333123 169752.300336 167695.342018 167374.309231 -321.032787 -2377.991105 -0.0 -321.032787 -2377.991105
5 AC 2 0.520030 62033.710779 178562.160107 172722.780964 -746.678411 -5092.700732 178562.160107 177815.481696 173469.459375 172722.780964 -746.678411 -5092.700732 0.0 -746.678411 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 174984.353215 -2505.718067 5241.580375 172248.490907 169742.77284 177490.071282 174984.353215 -2505.718067 5241.580375 0.0 -2505.718067 5241.580375
7 ABC 3 0.786106 41411.391808 166398.662065 161320.672316 -427.822838 -2518.649777 -2131.517133 166398.662065 165970.839227 163880.012288 163452.18945 164267.144932 163839.322094 161748.495155 161320.672316 -427.822838 -2518.649777 -2131.517133 -0.0 0.0 0.0 0.0 -427.822838 -2518.649777 -2131.517133
──────── Test point index: 2240 | True SalePrice: 152000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2240 1558 5 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2240 ===
f_AB(S)   = 134350.2123752003
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 141742.54832532132
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 39291.972238784634
|f_AB(S)-f_A|  = 35723.12074826515
|psi_ABC(A)|   = 30226.569665732914
|f_ABC(S)-f_A| = 24656.113740165398
|ReSHAP_AB(A)|  = 39291.97223878463
|ReSHAP_ABC(A)| = 29419.153120787636

P   = -0.1260228226197906
R   = -0.09327570953237929
P/R = 1.3510786811655784
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 185802.384515 7194.663872 178607.720643 185802.384515 7194.663872 7194.663872
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 134350.212375 3568.851491 -39291.972239 170073.333123 173642.184614 130781.360885 134350.212375 3568.851491 -39291.972239 0.0 3568.851491 -39291.972239
5 AC 2 0.520030 62033.710779 178562.160107 190389.102776 8300.661079 3526.28159 178562.160107 186862.821186 182088.441697 190389.102776 8300.661079 3526.28159 -0.0 8300.661079 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 127216.694590 -41402.427667 -3629.36865 172248.490907 130846.06324 168619.122257 127216.69459 -41402.427667 -3629.36865 0.0 -41402.427667 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 141742.548325 3760.740654 -30226.569666 1809.715271 166398.662065 171154.675271 136833.552157 139599.02026 167874.564543 172630.577748 138977.080222 141742.548325 2765.468103 -30888.029423 2143.528066 1990.545103 0.0 -667.625588 -0.0 2569.7253 -29419.153121 2193.314081
──────── Test point index: 2111 | True SalePrice: 191000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2111 1795 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2111 ===
f_AB(S)   = 221162.71793464344
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 223392.12038073796
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34535.99002791772
|f_AB(S)-f_A|  = 51089.384811177995
|psi_ABC(A)|   = 36927.92952698765
|f_ABC(S)-f_A| = 56993.45831525125
|ReSHAP_AB(A)|  = 34535.99002791772
|ReSHAP_ABC(A)| = 36758.21608696519

P   = 0.02805872937494014
R   = 0.031036499895264025
P/R = 0.9040558526131268
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 211978.711764 33370.991121 178607.720643 211978.711764 33370.991121 33370.991121
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 221162.717935 16553.394783 34535.990028 170073.333123 186626.727907 204609.323151 221162.717935 16553.394783 34535.990028 0.0 16553.394783 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 211970.394526 38500.935151 -5092.700732 178562.160107 217063.095258 173469.459375 211970.394526 38500.935151 -5092.700732 0.0 38500.935151 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 213881.062815 36390.991532 5241.580375 172248.490907 208639.482439 177490.071282 213881.062815 36390.991532 5241.580375 0.0 36390.991532 5241.580375
7 ABC 3 0.786106 41411.391808 166398.662065 223392.120381 22038.171426 36927.929527 -1972.642638 166398.662065 188458.466009 203189.349614 225205.888524 164267.144932 186326.948876 201375.581472 223392.120381 22016.538909 37065.171505 -1813.768143 43.265034 0.0 -317.74899 0.0 22032.661433 36758.216087 -1797.419205
──────── Test point index: 581 | True SalePrice: 185750 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
581 1489 6 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 581 ===
f_AB(S)   = 167483.88343598138
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 161466.69586689686
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 2377.991105433437
|f_AB(S)-f_A|  = 2589.4496874840697
|psi_ABC(A)|   = 2518.6497773952383
|f_ABC(S)-f_A| = 4931.9661985898565
|ReSHAP_AB(A)|  = 2377.991105433437
|ReSHAP_ABC(A)| = 2518.6497773952424

P   = 0.4076597714303194
R   = 0.4076597714303185
P/R = 1.0000000000000022
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 178181.428480 -426.292163 178607.720643 178181.42848 -426.292163 -426.292163
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 167483.883436 -211.458582 -2377.991105 170073.333123 169861.874541 167695.342018 167483.883436 -211.458582 -2377.991105 0.0 -211.458582 -2377.991105
5 AC 2 0.520030 62033.710779 178562.160107 172977.635597 -491.823778 -5092.700732 178562.160107 178070.336329 173469.459375 172977.635597 -491.823778 -5092.700732 0.0 -491.823778 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 174984.353215 -2505.718067 5241.580375 172248.490907 169742.77284 177490.071282 174984.353215 -2505.718067 5241.580375 0.0 -2505.718067 5241.580375
7 ABC 3 0.786106 41411.391808 166398.662065 161466.695867 -281.799288 -2518.649777 -2131.517133 166398.662065 166116.862778 163880.012288 163598.213 164267.144932 163985.345644 161748.495155 161466.695867 -281.799288 -2518.649777 -2131.517133 0.0 0.0 0.0 -0.0 -281.799288 -2518.649777 -2131.517133
──────── Test point index: 1189 | True SalePrice: 215000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1189 1620 7 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1189 ===
f_AB(S)   = 211574.97499696564
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 207570.463606568
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34535.99002791772
|f_AB(S)-f_A|  = 41501.64187350019
|psi_ABC(A)|   = 37209.34128088879
|f_ABC(S)-f_A| = 41171.80154108128
|ReSHAP_AB(A)|  = 34535.99002791772
|ReSHAP_ABC(A)| = 36659.502025257956

P   = -0.07159830136942535
R   = -0.05824354799332643
P/R = 1.2292915496430459
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 192650.200082 14042.479439 178607.720643 192650.200082 14042.479439 14042.479439
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 217665.882703 39829.924028 177835.958675 217665.882703 39829.924028 39829.924028
4 AB 2 0.749226 44839.624931 170073.333123 211574.974997 6965.651846 34535.990028 170073.333123 177038.984969 204609.323151 211574.974997 6965.651846 34535.990028 0.0 6965.651846 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 181051.631771 16201.154718 -13711.683054 178562.160107 194763.314825 164850.477053 181051.631771 16201.154718 -13711.683054 0.0 16201.154718 -13711.683054
6 BC 2 0.753073 44494.356934 172248.490907 224013.701667 37021.836446 14743.374314 172248.490907 208639.482439 186361.020308 224013.701667 36390.991532 14112.529401 1261.689827 37300.119113 14465.091647
7 ABC 3 0.786106 41411.391808 166398.662065 207570.463607 9273.640331 37209.341281 -5311.18007 166398.662065 175681.405337 203189.349614 212453.887004 160659.725322 169942.468594 198305.926217 207570.463607 9264.537389 37627.995013 -4883.423397 18.205883 0.0 -855.513347 -0.0 9268.134318 36659.502025 -4755.834802
──────── Test point index: 70 | True SalePrice: 254900 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
70 1947 7 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 70 ===
f_AB(S)   = 229490.35751479786
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 231398.48930412406
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34535.990027917695
|f_AB(S)-f_A|  = 59417.02439133241
|psi_ABC(A)|   = 37185.92887368557
|f_ABC(S)-f_A| = 64999.82723863734
|ReSHAP_AB(A)|  = 34535.99002791769
|ReSHAP_ABC(A)| = 36632.46599600579

P   = 0.009154650607864023
R   = 0.017669486741932583
P/R = 0.518105066749819
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 228766.904767 50159.184125 178607.720643 228766.904767 50159.184125 50159.184125
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 217665.882703 39829.924028 177835.958675 217665.882703 39829.924028 39829.924028
4 AB 2 0.749226 44839.624931 170073.333123 229490.357515 24881.034363 34535.990028 170073.333123 194954.367487 204609.323151 229490.357515 24881.034363 34535.990028 -0.0 24881.034363 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 222720.364351 57869.887298 -13711.683054 178562.160107 236432.047405 164850.477053 222720.364351 57869.887298 -13711.683054 0.0 57869.887298 -13711.683054
6 BC 2 0.753073 44494.356934 172248.490907 224013.701667 37021.836446 14743.374314 172248.490907 208639.482439 186361.020308 224013.701667 36390.991532 14112.529401 1261.689827 37300.119113 14465.091647
7 ABC 3 0.786106 41411.391808 166398.662065 231398.489304 33125.078435 37185.928874 -5311.18007 166398.662065 199556.255849 203189.349614 236281.912701 160659.725322 193817.319106 198305.926217 231398.489304 33092.563087 37581.170198 -4883.423397 65.030697 0.0 -855.513347 0.0 33123.013331 36632.465996 -4755.652088
──────── Test point index: 1878 | True SalePrice: 119000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1878 1728 5 10
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1878 ===
f_AB(S)   = 143664.019800373
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 130976.7179912214
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 39291.97223878465
|f_AB(S)-f_A|  = 26409.313323092443
|psi_ABC(A)|   = 35008.54908886797
|f_ABC(S)-f_A| = 35421.94407426531
|ReSHAP_AB(A)|  = 39291.97223878466
|ReSHAP_ABC(A)| = 28785.21257438238

P   = 0.4994780069296655
R   = 0.6751695641586484
P/R = 0.7397815799829217
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 204578.653006 25970.932363 178607.720643 204578.653006 25970.932363 25970.932363
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 267739.007895 89903.049221 177835.958675 267739.007895 89903.049221 89903.049221
4 AB 2 0.749226 44839.624931 170073.333123 143664.019800 12882.658916 -39291.972239 170073.333123 182955.992039 130781.360885 143664.0198 12882.658916 -39291.972239 -0.0 12882.658916 -39291.972239
5 AC 2 0.520030 62033.710779 178562.160107 177575.817337 29963.304928 -30949.647698 178562.160107 208525.465035 147612.512409 177575.817337 29963.304928 -30949.647698 -0.0 29963.304928 -30949.647698
6 BC 2 0.753073 44494.356934 172248.490907 162700.891581 -41500.204664 31952.605337 172248.490907 130846.06324 204298.873241 162700.891581 -41402.427667 32050.382334 -195.553994 -42268.164497 32720.565171
7 ABC 3 0.786106 41411.391808 166398.662065 130976.717991 14486.475984 -35008.549089 -14899.870969 166398.662065 183566.677066 136833.552157 147006.861742 153444.869037 170612.901102 118355.975866 130976.717991 12620.742125 -39636.18311 -16030.143751 4547.289939 -2447.43254 3076.367786 2447.415476 18292.297624 -28785.212574 -24929.029124
──────── Test point index: 259 | True SalePrice: 150000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
259 1026 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 259 ===
f_AB(S)   = 106797.73472042411
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 124090.7396354395
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 38494.84180090245
|f_AB(S)-f_A|  = 63275.59840304134
|psi_ABC(A)|   = 24604.30279729467
|f_ABC(S)-f_A| = 42307.92243004721
|ReSHAP_AB(A)|  = 38326.32109095826
|ReSHAP_ABC(A)| = 24997.378802157673

P   = 0.026814797566876658
R   = 0.014860679541330968
P/R = 1.8044126106278342
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 127043.709002 -51564.011641 178607.720643 127043.709002 -51564.011641 -51564.011641
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 142556.194914 -35279.763761 177835.958675 142556.194914 -35279.763761 -35279.763761
4 AB 2 0.749226 44839.624931 170073.333123 106797.734720 -24780.756602 -38494.841801 170073.333123 144495.446083 130781.360885 106797.73472 -25577.88704 -39291.972239 1594.260876 -24949.277312 -38326.321091
5 AC 2 0.520030 62033.710779 178562.160107 131216.752581 -59490.671438 12145.263912 178562.160107 119071.48867 190707.424019 131216.752581 -59490.671438 12145.263912 -0.0 -59490.671438 12145.263912
6 BC 2 0.753073 44494.356934 172248.490907 120357.952063 -40396.324417 -11494.214426 172248.490907 130846.06324 159748.173232 120357.952063 -41402.427667 -12500.317675 2012.206498 -39856.861973 -12033.676871
7 ABC 3 0.786106 41411.391808 166398.662065 124090.739635 -24217.187931 -24604.302797 6513.568299 166398.662065 137223.292743 136833.552157 117013.49774 171481.984153 144600.807935 144216.318789 124090.739635 -20125.579154 -20510.0683 7077.241896 -6755.597064 305.524737 300.273296 -2599.717841 -24506.014531 -24997.378802 7195.470903
──────── Test point index: 495 | True SalePrice: 403000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
495 2794 8 9
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 495 ===
f_AB(S)   = 336863.4565755761
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 333586.41330348013
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 82863.37711305746
|f_AB(S)-f_A|  = 166790.12345211065
|psi_ABC(A)|   = 80932.05612933067
|f_ABC(S)-f_A| = 167187.75123799342
|ReSHAP_AB(A)|  = 82779.07324061677
|ReSHAP_ABC(A)| = 80077.816800575

P   = 0.012733392993012405
R   = 0.017337405369019787
P/R = 0.7344462866263546
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 322316.901308 143709.180665 178607.720643 322316.901308 143709.180665 143709.180665
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.292963 75290.806037 177835.958675 242702.445299 64866.486624 177835.958675 242702.445299 64866.486624 64866.486624
4 AB 2 0.749226 44839.624931 170073.333123 336863.456576 83926.746339 82863.377113 170073.333123 242586.673511 241523.304285 336863.456576 72513.340387 71449.971161 22826.811904 84011.050211 82779.073241
5 AC 2 0.520030 62033.710779 178562.160107 322032.319325 165800.824594 -22330.665376 178562.160107 344362.984701 156231.494731 322032.319325 165800.824594 -22330.665376 0.0 165800.824594 -22330.665376
6 BC 2 0.753073 44494.356934 172248.490907 297600.953652 91592.161818 33760.300928 172248.490907 253063.830223 195231.969333 297600.953652 80815.339316 22983.478426 21553.645004 97596.50477 27755.957976
7 ABC 3 0.786106 41411.391808 166398.662065 333586.413303 94905.416283 80932.056129 -8649.721175 166398.662065 261397.2295 246727.241511 341539.491954 157052.305711 252050.873145 238774.140826 333586.413303 94812.272477 81535.540158 -7953.078651 186.294957 -0.022033 -1393.277703 0.022033 94912.421226 80077.816801 -7802.486788
──────── Test point index: 1506 | True SalePrice: 96000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1506 1137 5 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1506 ===
f_AB(S)   = 111323.90661997691
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 113390.94152762594
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 39272.44003993243
|f_AB(S)-f_A|  = 58749.42650348853
|psi_ABC(A)|   = 26100.314213107526
|f_ABC(S)-f_A| = 53007.720537860776
|ReSHAP_AB(A)|  = 39265.86309383811
|ReSHAP_ABC(A)| = 26529.706369378157

P   = 0.17608655744983576
R   = 0.16787404955349916
P/R = 1.0489206516324572
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 139303.507840 -39304.212803 178607.720643 139303.50784 -39304.212803 -39304.212803
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 217665.882703 39829.924028 177835.958675 217665.882703 39829.924028 39829.924028
4 AB 2 0.749226 44839.624931 170073.333123 111323.906620 -19476.986464 -39272.44004 170073.333123 150576.814461 130781.360885 111323.90662 -19496.518662 -39291.972239 39.064398 -19483.56341 -39265.863094
5 AC 2 0.520030 62033.710779 178562.160107 119504.237776 -45346.239277 -13711.683054 178562.160107 133215.92083 164850.477053 119504.237776 -45346.239277 -13711.683054 -0.0 -45346.239277 -13711.683054
6 BC 2 0.753073 44494.356934 172248.490907 144958.592641 -41402.427667 14112.529401 172248.490907 130846.06324 186361.020308 144958.592641 -41402.427667 14112.529401 0.0 -41402.427667 14112.529401
7 ABC 3 0.786106 41411.391808 166398.662065 113390.941528 -19670.20664 -26100.314213 -7237.199685 166398.662065 142566.423903 136833.552157 121725.890596 160659.725322 135625.946488 128498.603088 113390.941528 -15107.661561 -22235.00496 -8334.949069 -9926.117273 -0.0 1394.471653 1201.540671 -18699.585913 -26529.706369 -7778.428255
──────── Test point index: 2841 | True SalePrice: 83000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2841 1414 4 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2841 ===
f_AB(S)   = 97282.61595175826
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 106337.19301022212
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 70676.28279399962
|f_AB(S)-f_A|  = 72790.71717170719
|psi_ABC(A)|   = 48554.298316235065
|f_ABC(S)-f_A| = 60061.46905526459
|ReSHAP_AB(A)|  = 68717.14041088444
|ReSHAP_ABC(A)| = 48739.372048064855

P   = 0.16254176591729397
R   = 0.13254563216592874
P/R = 1.2263079760622684
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 169897.780616 -8709.940026 178607.720643 169897.780616 -8709.940026 -8709.940026
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.292963 75290.806037 177835.958675 217665.882703 39829.924028 177835.958675 217665.882703 39829.924028 39829.924028
4 AB 2 0.749226 44839.624931 170073.333123 97282.615952 -2114.434378 -70676.282794 170073.333123 165752.841854 97190.993438 97282.615952 -4320.49127 -72882.339686 4412.113784 -4073.576761 -68717.140411
5 AC 2 0.520030 62033.710779 178562.160107 154801.604518 -10048.872535 -13711.683054 178562.160107 168513.287572 164850.477053 154801.604518 -10048.872535 -13711.683054 0.0 -10048.872535 -13711.683054
6 BC 2 0.753073 44494.356934 172248.490907 107736.151807 -76980.103916 12467.764816 172248.490907 96913.151575 186361.020308 107736.151807 -75335.339331 14112.529401 -3289.529169 -79383.141065 14870.801966
7 ABC 3 0.786106 41411.391808 166398.662065 106337.193010 -4555.90111 -48554.298316 -6951.269629 166398.662065 160640.979633 117851.808625 114503.902211 160659.725322 154902.042889 109694.419484 106337.19301 -3357.226474 -48564.849879 -8166.709201 -2400.455959 9.32006 2427.772457 -9.32006 -3510.266942 -48739.372048 -7811.830066
──────── Test point index: 1608 | True SalePrice: 137500 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1608 889 5 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1608 ===
f_AB(S)   = 107061.02784964658
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 112936.04060889882
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34610.278714971595
|f_AB(S)-f_A|  = 63012.30527381887
|psi_ABC(A)|   = 24919.02071185545
|f_ABC(S)-f_A| = 53462.62145658789
|ReSHAP_AB(A)|  = 34208.69175972096
|ReSHAP_ABC(A)| = 24356.361558029635

P   = 0.0831604837780236
R   = 0.08731167899366699
P/R = 0.9524554416603935
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 111912.245571 -66695.475072 178607.720643 111912.245571 -66695.475072 -66695.475072
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 107061.027850 -28402.026559 -34610.278715 170073.333123 136989.613041 130781.360885 107061.02785 -33083.720083 -39291.972239 9363.387048 -28803.613514 -34208.69176
5 AC 2 0.520030 62033.710779 178562.160107 105140.227863 -76948.213834 3526.28159 178562.160107 101613.946273 182088.441697 105140.227863 -76948.213834 3526.28159 -0.0 -76948.213834 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 127216.694590 -41402.427667 -3629.36865 172248.490907 130846.06324 168619.122257 127216.69459 -41402.427667 -3629.36865 0.0 -41402.427667 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 112936.040609 -30350.733594 -24919.020712 1807.132849 166398.662065 131401.833835 136833.552157 111134.078042 167874.564543 133545.372779 138977.080222 112936.040609 -26041.039614 -20609.33217 1801.962566 -8288.15215 341.565499 341.576377 -1009.201966 -30883.654602 -24356.361558 1777.394703
──────── Test point index: 2916 | True SalePrice: 92000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2916 630 5 3
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2916 ===
f_AB(S)   = 117058.04417650774
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 115408.19773843972
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 22525.692954077247
|f_AB(S)-f_A|  = 53015.288946957706
|psi_ABC(A)|   = 24564.478523934875
|f_ABC(S)-f_A| = 50990.464327047
|ReSHAP_AB(A)|  = 24068.48143902074
|ReSHAP_ABC(A)| = 26403.095083124528

P   = -0.056856009564279164
R   = -0.06381323460032551
P/R = 0.8909752016236009
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 92304.316697 -86303.403946 178607.720643 92304.316697 -86303.403946 -86303.403946
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 135358.784698 -42477.173977 177835.958675 135358.784698 -42477.173977 -42477.173977
4 AB 2 0.749226 44839.624931 170073.333123 117058.044177 -30489.595993 -22525.692954 170073.333123 122817.457846 130781.360885 117058.044177 -47255.875278 -39291.972239 33532.558569 -28946.807508 -24068.481439
5 AC 2 0.520030 62033.710779 178562.160107 132279.506846 -91767.209537 45484.556276 178562.160107 70693.62285 207945.388663 132279.506846 -107868.537257 29383.228556 32202.65544 -63609.893243 17327.239982
6 BC 2 0.753073 44494.356934 172248.490907 109003.589395 -37402.606122 -25842.29539 172248.490907 130846.06324 142406.373973 109003.589395 -41402.427667 -29842.116934 7999.643089 -36753.585482 -26491.31603
7 ABC 3 0.786106 41411.391808 166398.662065 115408.197738 -41802.522062 -24564.478524 15376.536259 166398.662065 120406.173986 136833.552157 98834.003526 178903.498233 136646.087017 154694.795924 115408.197738 -39286.598185 -21237.889279 16574.194212 -2970.813031 1287.049555 -334.281181 -5022.126418 -43248.284479 -26403.095083 18660.915235
──────── Test point index: 1691 | True SalePrice: 492000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1691 2656 9 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1691 ===
f_AB(S)   = 373823.67457176367
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 370138.09639785107
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 124177.50670923869
|f_AB(S)-f_A|  = 203750.34144829822
|psi_ABC(A)|   = 124210.94998492223
|f_ABC(S)-f_A| = 203739.43433236436
|ReSHAP_AB(A)|  = 128275.49160421055
|ReSHAP_ABC(A)| = 123764.117289012

P   = -0.00019677445973831365
R   = 0.02210915877012054
P/R = -0.008900133278894573
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 307074.989239 128467.268596 178607.720643 307074.989239 128467.268596 128467.268596
2 B 1 0.653707 52691.751258 167830.652805 309677.662236 141847.009431 167830.652805 309677.662236 141847.009431 141847.009431
3 C 1 0.292963 75290.806037 177835.958675 217665.882703 39829.924028 177835.958675 217665.882703 39829.924028 39829.924028
4 AB 2 0.749226 44839.624931 170073.333123 373823.674572 79572.834739 124177.506709 170073.333123 233832.613448 278437.285418 373823.674572 63759.280324 108363.952295 31627.108829 75474.849844 128275.491604
5 AC 2 0.520030 62033.710779 178562.160107 313066.331934 148215.854881 -13711.683054 178562.160107 326778.014988 164850.477053 313066.331934 148215.854881 -13711.683054 -0.0 148215.854881 -13711.683054
6 BC 2 0.753073 44494.356934 172248.490907 338410.736320 145036.043299 21126.202114 172248.490907 310270.861492 186361.020308 338410.73632 138022.370586 14112.529401 14027.345426 150748.49371 15413.751703
7 ABC 3 0.786106 41411.391808 166398.662065 370138.096398 84839.664418 124210.949985 -5311.18007 166398.662065 251321.604513 290265.133407 375021.519795 160659.725322 245582.667769 285381.71001 370138.096398 84756.386388 124555.428629 -4883.423397 166.556059 -0.0 -855.513347 0.0 84823.829737 123764.117289 -4848.512693
──────── Test point index: 2394 | True SalePrice: 329900 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2394 2358 8 10
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2394 ===
f_AB(S)   = 294769.92235876445
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 298746.05711933435
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 74374.0134526178
|f_AB(S)-f_A|  = 124696.589235299
|psi_ABC(A)|   = 81232.14912712334
|f_ABC(S)-f_A| = 132347.39505384763
|ReSHAP_AB(A)|  = 74965.75350959373
|ReSHAP_ABC(A)| = 79975.46898779196

P   = -0.017339897717141506
R   = -0.0030991442228445187
P/R = 5.595059948912688
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 274161.295060 95553.574418 178607.720643 274161.29506 95553.574418 95553.574418
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.292963 75290.806037 177835.958675 267739.007895 89903.049221 177835.958675 267739.007895 89903.049221 89903.049221
4 AB 2 0.749226 44839.624931 170073.333123 294769.922359 50322.575783 74374.013453 170073.333123 217471.866615 241523.304285 294769.922359 47398.533491 71449.971161 5848.084583 49730.835726 74965.75351
5 AC 2 0.520030 62033.710779 178562.160107 257855.026896 110242.514487 -30949.647698 178562.160107 288804.674594 147612.512409 257855.026896 110242.514487 -30949.647698 -0.0 110242.514487 -30949.647698
6 BC 2 0.753073 44494.356934 172248.490907 315289.306185 95902.88613 47137.929148 172248.490907 253063.830223 204298.873241 315289.306185 80815.339316 32050.382334 30175.093628 102421.637444 40619.177834
7 ABC 3 0.786106 41411.391808 166398.662065 298746.057119 63103.518789 81232.149127 -11988.272862 166398.662065 229564.095484 246727.241511 309768.791024 153444.869037 216610.319519 235704.434168 298746.057119 63041.622951 82135.7376 -11022.733905 123.827531 -0.073437 -1931.04206 0.056373 63095.33153 79975.468988 -10723.405464
──────── Test point index: 188 | True SalePrice: 125500 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
188 1363 4 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 188 ===
f_AB(S)   = 98499.79154778406
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 114383.69875759371
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 68670.62388221081
|f_AB(S)-f_A|  = 71573.54157568139
|psi_ABC(A)|   = 46263.39628849274
|f_ABC(S)-f_A| = 52014.963307893006
|ReSHAP_AB(A)|  = 65208.05678114363
|ReSHAP_ABC(A)| = 48130.936224159224

P   = 0.07001670854350872
R   = -0.014264942225095223
P/R = -4.908306492846054
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 164264.900069 -14342.820574 178607.720643 164264.900069 -14342.820574 -14342.820574
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 98499.791548 -2902.917693 -68670.623882 170073.333123 162958.699626 97190.993438 98499.791548 -7114.633497 -72882.339686 8423.431607 -6365.484795 -65208.056781
5 AC 2 0.520030 62033.710779 178562.160107 165540.776008 -16547.665689 3526.28159 178562.160107 162014.494418 182088.441697 165540.776008 -16547.665689 3526.28159 -0.0 -16547.665689 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 95427.443060 -74263.509264 -2557.538583 172248.490907 96913.151575 168619.122257 95427.44306 -75335.339331 -3629.36865 2143.660135 -73290.205907 -3530.84194
7 ABC 3 0.786106 41411.391808 166398.662065 114383.698758 -7531.638614 -46263.396288 1780.071594 166398.662065 156917.379094 117851.808625 112329.101723 167874.564543 158393.281571 119995.336691 114383.698758 -5611.637933 -44009.582814 2054.597035 -3869.645038 88.931031 -578.694557 -88.931031 -5919.075181 -48130.936224 2035.048098
──────── Test point index: 1117 | True SalePrice: 257000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1117 1861 8 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1117 ===
f_AB(S)   = 261692.6478330617
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 271739.3385644671
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 71449.97116126893
|f_AB(S)-f_A|  = 91619.31470959625
|psi_ABC(A)|   = 80461.0959831507
|f_ABC(S)-f_A| = 105340.67649898038
|ReSHAP_AB(A)|  = 71449.97116126891
|ReSHAP_ABC(A)| = 80308.09908546145

P   = 0.016039163227533737
R   = 0.0174915641686908
P/R = 0.9169656339964838
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 219268.321884 40660.601241 178607.720643 219268.321884 40660.601241 40660.601241
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 261692.647833 20169.343548 71449.971161 170073.333123 190242.676672 241523.304285 261692.647833 20169.343548 71449.971161 -0.0 20169.343548 71449.971161
5 AC 2 0.520030 62033.710779 178562.160107 220380.597432 46911.138057 -5092.700732 178562.160107 225473.298164 173469.459375 220380.597432 46911.138057 -5092.700732 0.0 46911.138057 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 262265.809925 82795.538979 7221.780039 172248.490907 253063.830223 177490.071282 262265.809925 80815.339316 5241.580375 3960.399327 84534.517466 5482.801552
7 ABC 3 0.786106 41411.391808 166398.662065 271739.338564 26852.223154 80461.095983 -1972.642638 166398.662065 193277.243177 246727.241511 273553.106707 164267.144932 191145.726043 244913.473368 271739.338564 26825.865197 80593.612521 -1813.768143 52.715914 0.0 -317.74899 -0.0 26839.029942 80308.099085 -1806.452528
──────── Test point index: 555 | True SalePrice: 135000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
555 980 5 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 555 ===
f_AB(S)   = 106012.19526378946
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 117204.14227896054
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 37627.508171696405
|f_AB(S)-f_A|  = 64061.13785967599
|psi_ABC(A)|   = 24773.55580440155
|f_ABC(S)-f_A| = 49194.51978652617
|ReSHAP_AB(A)|  = 37351.03109406306
|ReSHAP_ABC(A)| = 24724.152705303888

P   = 0.08378506475474834
R   = 0.08047347312934772
P/R = 1.0411513446185898
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 121963.071645 -56644.648998 178607.720643 121963.071645 -56644.648998 -56644.648998
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 106012.195264 -26433.629688 -37627.508172 170073.333123 141975.239368 130781.360885 106012.195264 -28098.093755 -39291.972239 3328.928134 -26710.106766 -37351.031094
5 AC 2 0.520030 62033.710779 178562.160107 116736.113688 -65352.328008 3526.28159 178562.160107 113209.832099 182088.441697 116736.113688 -65352.328008 3526.28159 -0.0 -65352.328008 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 127216.694590 -41402.427667 -3629.36865 172248.490907 130846.06324 168619.122257 127216.69459 -41402.427667 -3629.36865 0.0 -41402.427667 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 117204.142279 -26341.950185 -24773.555804 1920.986203 166398.662065 135265.157776 136833.552157 115060.614213 167874.564543 137408.685842 138977.080222 117204.142279 -21772.937943 -20204.543563 2143.528066 -8692.940757 0.0 0.0 -667.625588 -26643.385478 -24724.152705 2173.018397
──────── Test point index: 175 | True SalePrice: 80000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
175 747 4 4
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 175 ===
f_AB(S)   = 125625.26581285498
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 96447.5346177467
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 38233.4591793624
|f_AB(S)-f_A|  = 44448.067310610466
|psi_ABC(A)|   = 44538.15322782295
|f_ABC(S)-f_A| = 69951.12744774001
|ReSHAP_AB(A)|  = 28479.981972568356
|ReSHAP_ABC(A)| = 45594.43816481503

P   = 0.2234788488850057
R   = -0.01105682865805857
P/R = -20.211839741418725
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 96715.478547 -81892.242096 178607.720643 96715.478547 -81892.242096 -81892.242096
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.292963 75290.806037 177835.958675 121774.215974 -56061.742701 177835.958675 121774.215974 -56061.742701 -56061.742701
4 AB 2 0.749226 44839.624931 170073.333123 125625.265813 -6214.608131 -38233.459179 170073.333123 129209.844486 97190.993438 125625.265813 -40863.488638 -72882.339686 69297.761013 -15968.085338 -28479.981973
5 AC 2 0.520030 62033.710779 178562.160107 116192.757401 -89088.270877 26718.868171 178562.160107 83519.267293 199326.406341 116192.757401 -95042.892814 20764.246234 11909.243874 -79804.475838 17435.073132
6 BC 2 0.753073 44494.356934 172248.490907 98208.184502 -64002.189518 -10038.116887 172248.490907 96913.151575 150877.224206 98208.184502 -75335.339331 -21371.266701 22666.299627 -57678.082564 -16362.223841
7 ABC 3 0.786106 41411.391808 166398.662065 96447.534618 -36496.332518 -44538.153228 11083.358298 166398.662065 125373.325347 117851.808625 83989.955462 175089.403763 137995.383376 129433.258679 96447.534618 -32985.724061 -41547.848758 12457.579156 -4108.296326 -876.129102 164.478873 -3055.187229 -37865.526669 -45594.438165 13508.837386
──────── Test point index: 366 | True SalePrice: 501837 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
366 2234 9 9
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 366 ===
f_AB(S)   = 331499.0334934045
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 336317.90033531305
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 114592.35994167643
|f_AB(S)-f_A|  = 161425.70036993906
|psi_ABC(A)|   = 124510.03781367264
|f_ABC(S)-f_A| = 169919.23826982634
|ReSHAP_AB(A)|  = 117425.3729905394
|ReSHAP_ABC(A)| = 123739.93565317345

P   = -0.022883296732149838
R   = -0.0008011804445965387
P/R = 28.561976127205014
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 260465.663926 81857.943283 178607.720643 260465.663926 81857.943283 81857.943283
2 B 1 0.653707 52691.751258 167830.652805 309677.662236 141847.009431 167830.652805 309677.662236 141847.009431 141847.009431
3 C 1 0.292963 75290.806037 177835.958675 242702.445299 64866.486624 177835.958675 242702.445299 64866.486624 64866.486624
4 AB 2 0.749226 44839.624931 170073.333123 331499.033493 46833.340428 114592.359942 170073.333123 210678.265905 278437.285418 331499.033493 40604.932781 108363.952295 12456.815294 44000.327379 117425.372991
5 AC 2 0.520030 62033.710779 178562.160107 250673.021939 94441.527209 -22330.665376 178562.160107 273003.687316 156231.494731 250673.021939 94441.527209 -22330.665376 0.0 94441.527209 -22330.665376
6 BC 2 0.753073 44494.356934 172248.490907 356099.088853 149444.745053 34405.852893 172248.490907 310270.861492 195231.969333 356099.088853 138022.370586 22983.478426 22844.748934 157606.046723 26244.551223
7 ABC 3 0.786106 41411.391808 166398.662065 336317.900335 54058.926485 124510.037814 -8649.726029 166398.662065 220510.635351 290265.133407 344270.978986 157052.305711 211164.278997 282312.003598 336317.900335 54005.896737 125153.621339 -7953.078651 106.076548 -0.051158 -1393.277703 0.051158 54037.828759 123739.935653 -7858.526143
──────── Test point index: 456 | True SalePrice: 552000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
456 2492 10 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 456 ===
f_AB(S)   = 408170.405029704
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 401725.54115516465
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 165467.21310537725
|f_AB(S)-f_A|  = 238097.07190623853
|psi_ABC(A)|   = 167760.579070353
|f_ABC(S)-f_A| = 235326.87908967794
|ReSHAP_AB(A)|  = 173676.32890091417
|ReSHAP_ABC(A)| = 167335.53353922864

P   = -0.01792625947010884
R   = 0.018357953341305855
P/R = -0.9764846405712508
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 288961.412577 110353.691934 178607.720643 288961.412577 110353.691934 110353.691934
2 B 1 0.653707 52691.751258 167830.652805 357997.585228 190166.932423 167830.652805 357997.585228 190166.932423 190166.932423
3 C 1 0.292963 75290.806037 177835.958675 217665.882703 39829.924028 177835.958675 217665.882703 39829.924028 39829.924028
4 AB 2 0.749226 44839.624931 170073.333123 408170.405030 72629.858801 165467.213105 170073.333123 224813.33835 317650.692654 408170.40503 54740.005226 147577.359531 35779.707149 64420.743005 173676.328901
5 AC 2 0.520030 62033.710779 178562.160107 292168.251986 127317.774933 -13711.683054 178562.160107 305879.93504 164850.477053 292168.251986 127317.774933 -13711.683054 0.0 127317.774933 -13711.683054
6 BC 2 0.753073 44494.356934 172248.490907 396908.871520 203534.178499 21126.202114 172248.490907 368768.996693 186361.020308 396908.87152 196520.505786 14112.529401 14027.345426 209608.011341 15052.369272
7 ABC 3 0.786106 41411.391808 166398.662065 401725.541155 72877.484903 167760.57907 -5311.184883 166398.662065 239347.673369 333803.025303 406608.964552 160659.725322 233608.736626 328919.573027 401725.541155 72805.968128 168116.80453 -4883.423397 143.043176 -0.028879 -855.513347 0.028879 72849.17323 167335.533539 -4857.827679
──────── Test point index: 506 | True SalePrice: 244600 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
506 1468 8 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 506 ===
f_AB(S)   = 240161.31655016236
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 246171.6418334022
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 71449.97116126893
|f_AB(S)-f_A|  = 70087.98342669691
|psi_ABC(A)|   = 80220.35165215925
|f_ABC(S)-f_A| = 79772.97976791547
|ReSHAP_AB(A)|  = 71449.97116126891
|ReSHAP_ABC(A)| = 80325.1012283833

P   = 0.013824479905631293
R   = 0.012511383961004263
P/R = 1.1049520939265962
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 175862.007078 -2745.713565 178607.720643 175862.007078 -2745.713565 -2745.713565
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 240161.316550 -1361.987735 71449.971161 170073.333123 168711.345389 241523.304285 240161.31655 -1361.987735 71449.971161 -0.0 -1361.987735 71449.971161
5 AC 2 0.520030 62033.710779 178562.160107 178920.644267 -3167.79743 3526.28159 178562.160107 175394.362677 182088.441697 178920.644267 -3167.79743 3526.28159 -0.0 -3167.79743 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 247204.347482 79700.28227 -4744.425695 172248.490907 253063.830223 168619.122257 247204.347482 80815.339316 -3629.36865 -2230.114091 78480.362824 -3524.506249
7 ABC 3 0.786106 41411.391808 166398.662065 246171.641833 -1813.266678 80220.351652 1365.894794 166398.662065 164583.615497 246727.241511 244915.754722 167874.564543 166059.517974 247983.128622 246171.641833 -1811.486788 80112.123859 1255.887111 -3.55978 -0.0 220.015366 -0.0 -1811.404433 80325.101228 1259.282972
──────── Test point index: 1538 | True SalePrice: 311500 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1538 1954 7 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1538 ===
f_AB(S)   = 229873.867232305
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 231908.56936492797
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34535.99002791771
|f_AB(S)-f_A|  = 59800.53410883955
|psi_ABC(A)|   = 37185.4276906567
|f_ABC(S)-f_A| = 65509.907299441256
|ReSHAP_AB(A)|  = 34535.99002791771
|ReSHAP_ABC(A)| = 36631.74067630974

P   = 0.009889163200806905
R   = 0.01834112164758206
P/R = 0.5391798490203357
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 229540.045235 50932.324592 178607.720643 229540.045235 50932.324592 50932.324592
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 217665.882703 39829.924028 177835.958675 217665.882703 39829.924028 39829.924028
4 AB 2 0.749226 44839.624931 170073.333123 229873.867232 25264.544081 34535.990028 170073.333123 195337.877204 204609.323151 229873.867232 25264.544081 34535.990028 -0.0 25264.544081 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 223612.355569 58761.878516 -13711.683054 178562.160107 237324.038623 164850.477053 223612.355569 58761.878516 -13711.683054 0.0 58761.878516 -13711.683054
6 BC 2 0.753073 44494.356934 172248.490907 224013.701667 37021.836446 14743.374314 172248.490907 208639.482439 186361.020308 224013.701667 36390.991532 14112.529401 1261.689827 37300.119113 14465.091647
7 ABC 3 0.786106 41411.391808 166398.662065 231908.569365 33635.659679 37185.427691 -5311.18007 166398.662065 200067.338276 203189.349614 236791.992762 160659.725322 194328.401533 198305.926217 231908.569365 33602.643147 37580.167832 -4883.423397 66.033063 -0.0 -855.513347 0.0 33633.814795 36631.740676 -4755.648171
──────── Test point index: 2381 | True SalePrice: 252000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2381 1618 8 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2381 ===
f_AB(S)   = 248379.3819253148
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 257101.9288506297
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 71449.97116126893
|f_AB(S)-f_A|  = 78306.04880184936
|psi_ABC(A)|   = 80209.61201582747
|f_ABC(S)-f_A| = 90703.26678514297
|ReSHAP_AB(A)|  = 71449.97116126891
|ReSHAP_ABC(A)| = 80323.35099478475

P   = 0.02813724508928661
R   = 0.026883277255617344
P/R = 1.0466449020238873
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 192429.302805 13821.582163 178607.720643 192429.302805 13821.582163 13821.582163
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 248379.381925 6856.077641 71449.971161 170073.333123 176929.410764 241523.304285 248379.381925 6856.077641 71449.971161 -0.0 6856.077641 71449.971161
5 AC 2 0.520030 62033.710779 178562.160107 198034.741781 15946.300084 3526.28159 178562.160107 194508.460191 182088.441697 198034.741781 15946.300084 3526.28159 -0.0 15946.300084 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 247204.347482 79700.28227 -4744.425695 172248.490907 253063.830223 168619.122257 247204.347482 80815.339316 -3629.36865 -2230.114091 78480.362824 -3524.506249
7 ABC 3 0.786106 41411.391808 166398.662065 257101.928851 9127.759975 80209.612016 1365.894794 166398.662065 175535.381787 246727.241511 255846.041739 167874.564543 177011.284264 247983.128622 257101.928851 9118.800229 80090.644587 1255.887111 17.919493 0.0 220.015366 -0.0 9120.631921 80323.350995 1259.283869
──────── Test point index: 1820 | True SalePrice: 132000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1820 1057 5 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1820 ===
f_AB(S)   = 107783.16623912643
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 120473.09948582864
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 38851.32613031703
|f_AB(S)-f_A|  = 62290.16688433902
|psi_ABC(A)|   = 24801.218645326375
|f_ABC(S)-f_A| = 45925.56257965807
|ReSHAP_AB(A)|  = 38743.81790113412
|ReSHAP_ABC(A)| = 25160.071911759627

P   = 0.08368442835290157
R   = 0.07414469988427563
P/R = 1.1286636601606785
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 130467.616785 -48140.103858 178607.720643 130467.616785 -48140.103858 -48140.103858
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 107783.166239 -23438.840754 -38851.32613 170073.333123 146193.846261 130781.360885 107783.166239 -23879.486862 -39291.972239 881.292217 -23546.348983 -38743.817901
5 AC 2 0.520030 62033.710779 178562.160107 126548.017079 -55540.424618 3526.28159 178562.160107 123021.735489 182088.441697 126548.017079 -55540.424618 3526.28159 0.0 -55540.424618 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 127216.694590 -41402.427667 -3629.36865 172248.490907 130846.06324 168619.122257 127216.69459 -41402.427667 -3629.36865 0.0 -41402.427667 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 120473.099486 -23036.458463 -24801.218645 1912.114529 166398.662065 138624.927362 136833.552157 118329.57142 167874.564543 140715.225382 138977.080222 120473.099486 -18503.980737 -20242.125896 2143.528066 -8655.358424 0.0 -53.230045 -614.395543 -22944.473689 -25160.071912 2178.983021
──────── Test point index: 1600 | True SalePrice: 81000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1600 630 4 3
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1600 ===
f_AB(S)   = 130975.86874234876
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 98116.00386519573
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 32361.96439472388
|f_AB(S)-f_A|  = 39097.46438111669
|psi_ABC(A)|   = 43400.05879762935
|f_ABC(S)-f_A| = 68282.65820029099
|ReSHAP_AB(A)|  = 23718.63674475463
|ReSHAP_ABC(A)| = 47529.59843625653

P   = 0.19213124106336388
R   = -0.08941720493659588
P/R = -2.1487055114236755
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 92304.316697 -86303.403946 178607.720643 92304.316697 -86303.403946 -86303.403946
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.292963 75290.806037 177835.958675 135358.784698 -42477.173977 177835.958675 135358.784698 -42477.173977 -42477.173977
4 AB 2 0.749226 44839.624931 170073.333123 130975.868742 -6735.499986 -32361.964395 170073.333123 122817.457846 97190.993438 130975.868742 -47255.875278 -72882.339686 81040.750582 -15378.827636 -23718.636745
5 AC 2 0.520030 62033.710779 178562.160107 132279.506846 -91767.209537 45484.556276 178562.160107 70693.62285 207945.388663 132279.506846 -107868.537257 29383.228556 32202.65544 -63609.893243 17327.239982
6 BC 2 0.753073 44494.356934 172248.490907 105666.304316 -56037.704494 -10544.482097 172248.490907 96913.151575 142406.373973 105666.304316 -75335.339331 -29842.116934 38595.269675 -47690.748553 -18891.438037
7 ABC 3 0.786106 41411.391808 166398.662065 98116.003865 -40667.125869 -43400.058798 15784.526466 166398.662065 120406.173986 117851.808625 79328.606098 178903.498233 136646.087017 133734.58652 98116.003865 -35618.582655 -38530.083152 18787.397767 -6638.828561 -2904.619873 -2547.484737 -830.456991 -45507.958388 -47529.598436 24754.898624
──────── Test point index: 1486 | True SalePrice: 145900 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1486 2200 5 9
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1486 ===
f_AB(S)   = 169523.53218085258
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 167065.77801101314
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 39291.97223878465
|f_AB(S)-f_A|  = 549.8009426128701
|psi_ABC(A)|   = 36295.24381451539
|f_ABC(S)-f_A| = 667.1159455264278
|ReSHAP_AB(A)|  = 39291.97223878568
|ReSHAP_ABC(A)| = 35645.44150969054

P   = 17.059614224666852
R   = 18.03366125379811
P/R = 0.9459872837011336
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 256710.410228 78102.689585 178607.720643 256710.410228 78102.689585 78102.689585
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 242702.445299 64866.486624 177835.958675 242702.445299 64866.486624 64866.486624
4 AB 2 0.749226 44839.624931 170073.333123 169523.532181 38742.171296 -39291.972239 170073.333123 208815.50442 130781.360885 169523.532181 38742.171296 -39291.972239 -0.0 38742.171296 -39291.972239
5 AC 2 0.520030 62033.710779 178562.160107 246340.493170 90108.998439 -22330.665376 178562.160107 268671.158546 156231.494731 246340.49317 90108.998439 -22330.665376 -0.0 90108.998439 -22330.665376
6 BC 2 0.753073 44494.356934 172248.490907 153829.541666 -41402.427667 22983.478426 172248.490907 130846.06324 195231.969333 153829.541666 -41402.427667 22983.478426 0.0 -41402.427667 22983.478426
7 ABC 3 0.786106 41411.391808 166398.662065 167065.778011 47013.354661 -36295.243815 -10050.994901 166398.662065 218028.234992 136833.552157 176412.134365 157052.305711 208681.878638 123259.364521 167065.778011 43806.41349 -41616.100626 -9346.356354 7823.159436 -4227.831281 0.0 4227.831281 -36514.57074 35645.44151 1536.245176
──────── Test point index: 1221 | True SalePrice: 118000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1221 1077 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1221 ===
f_AB(S)   = 108576.15180773073
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 126533.63458384936
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 39002.70437102505
|f_AB(S)-f_A|  = 61497.18131573472
|psi_ABC(A)|   = 24541.385836286896
|f_ABC(S)-f_A| = 39865.02748163736
|ReSHAP_AB(A)|  = 38925.777353908874
|ReSHAP_ABC(A)| = 25606.887369023294

P   = 0.018607469522984754
R   = -0.0093711596462277
P/R = -1.9856101299560158
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 132676.589549 -45931.131094 178607.720643 132676.589549 -45931.131094 -45931.131094
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 142556.194914 -35279.763761 177835.958675 142556.194914 -35279.763761 -35279.763761
4 AB 2 0.749226 44839.624931 170073.333123 108576.151808 -22494.476945 -39002.704371 170073.333123 147289.588311 130781.360885 108576.151808 -22783.744812 -39291.972239 578.535736 -22571.403962 -38925.777354
5 AC 2 0.520030 62033.710779 178562.160107 137715.545736 -52991.878283 12145.263912 178562.160107 125570.281824 190707.424019 137715.545736 -52991.878283 12145.263912 -0.0 -52991.878283 12145.263912
6 BC 2 0.753073 44494.356934 172248.490907 120357.952063 -40396.324417 -11494.214426 172248.490907 130846.06324 159748.173232 120357.952063 -41402.427667 -12500.317675 2012.206498 -39856.861973 -12033.676871
7 ABC 3 0.786106 41411.391808 166398.662065 126533.634584 -21900.126905 -24541.385836 6576.48526 166398.662065 139566.427399 136833.552157 119178.651214 171481.984153 146765.96141 144216.318789 126533.634584 -17682.684206 -20232.326826 7354.98337 -7033.338538 27.783263 -155.449359 -2143.995186 -22104.089068 -25606.887369 7845.948956
──────── Test point index: 651 | True SalePrice: 152500 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
651 1527 7 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 651 ===
f_AB(S)   = 206479.77446437112
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 206946.01378653734
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34535.99002791772
|f_AB(S)-f_A|  = 36406.44134090567
|psi_ABC(A)|   = 36683.92952134475
|f_ABC(S)-f_A| = 40547.351721050625
|ReSHAP_AB(A)|  = 34535.99002791772
|ReSHAP_ABC(A)| = 36783.29379238086

P   = 0.04390480596731405
R   = 0.04145423233380785
P/R = 1.059115161360922
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 182378.476731 3770.756088 178607.720643 182378.476731 3770.756088 3770.756088
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 206479.774464 1870.451313 34535.990028 170073.333123 171943.784436 204609.323151 206479.774464 1870.451313 34535.990028 0.0 1870.451313 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 186438.855956 4350.414259 3526.28159 178562.160107 182912.574366 182088.441697 186438.855956 4350.414259 3526.28159 -0.0 4350.414259 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 205010.113789 36390.991532 -3629.36865 172248.490907 208639.482439 168619.122257 205010.113789 36390.991532 -3629.36865 0.0 36390.991532 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 206946.013787 2491.833383 36683.929521 1371.588816 166398.662065 168891.310238 203189.349614 205677.109053 167874.564543 170367.212715 204453.365614 206946.013787 2492.648172 36578.801072 1268.904734 0.0 -4.888734 206.997743 4.888734 2489.708762 36783.293792 1274.349167
──────── Test point index: 1465 | True SalePrice: 183000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1465 1640 7 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1465 ===
f_AB(S)   = 212670.71704698596
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 215196.34439158865
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34535.990027917695
|f_AB(S)-f_A|  = 42597.38392352051
|psi_ABC(A)|   = 36681.23267933255
|f_ABC(S)-f_A| = 48797.68232610193
|ReSHAP_AB(A)|  = 34535.99002791768
|ReSHAP_ABC(A)| = 36779.02190552195

P   = 0.0590534750287498
R   = 0.057049502267586316
P/R = 1.0351269105164846
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 194859.172845 16251.452203 178607.720643 194859.172845 16251.452203 16251.452203
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 212670.717047 8061.393896 34535.990028 170073.333123 178134.727019 204609.323151 212670.717047 8061.393896 34535.990028 -0.0 8061.393896 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 200838.142750 18749.701053 3526.28159 178562.160107 197311.86116 182088.441697 200838.14275 18749.701053 3526.28159 0.0 18749.701053 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 205010.113789 36390.991532 -3629.36865 172248.490907 208639.482439 168619.122257 205010.113789 36390.991532 -3629.36865 0.0 36390.991532 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 215196.344392 10739.467146 36681.232679 1376.9825 166398.662065 177141.640843 203189.349614 213911.258606 167874.564543 178617.54332 204453.365614 215196.344392 10742.978777 36578.801072 1285.085786 0.0 -21.069786 190.816691 21.069786 10728.792561 36779.021906 1289.867859
──────── Test point index: 120 | True SalePrice: 159000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
120 1209 5 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 120 ===
f_AB(S)   = 115229.51360234566
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 126926.10592016575
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 39291.97223878465
|f_AB(S)-f_A|  = 54843.81952111979
|psi_ABC(A)|   = 25426.515890669867
|f_ABC(S)-f_A| = 39472.55614532097
|ReSHAP_AB(A)|  = 39291.97223878465
|ReSHAP_ABC(A)| = 26798.246598864036

P   = 0.07227710222812045
R   = 0.03752559783747378
P/R = 1.9260746368694266
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 147255.809789 -31351.910854 178607.720643 147255.809789 -31351.910854 -31351.910854
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 115229.513602 -15551.847282 -39291.972239 170073.333123 154521.485841 130781.360885 115229.513602 -15551.847282 -39291.972239 -0.0 -15551.847282 -39291.972239
5 AC 2 0.520030 62033.710779 178562.160107 145916.969227 -36171.47247 3526.28159 178562.160107 142390.687637 182088.441697 145916.969227 -36171.47247 3526.28159 0.0 -36171.47247 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 127216.694590 -41402.427667 -3629.36865 172248.490907 130846.06324 168619.122257 127216.69459 -41402.427667 -3629.36865 0.0 -41402.427667 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 126926.105920 -15908.74315 -25426.515891 1862.702895 166398.662065 146526.174821 136833.552157 124782.577855 167874.564543 148320.00304 138977.080222 126926.10592 -12050.974302 -21393.89712 2143.528066 -7503.5872 0.0 -349.699846 -317.925742 -14874.497764 -26798.246599 2200.188218
──────── Test point index: 1058 | True SalePrice: 390000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1058 2748 9 9
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1058 ===
f_AB(S)   = 383071.3997071001
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 373772.3505143461
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 125974.3972847746
|f_AB(S)-f_A|  = 212998.06658363467
|psi_ABC(A)|   = 124473.2366598425
|f_ABC(S)-f_A| = 207373.68844885938
|ReSHAP_AB(A)|  = 129832.82087788271
|ReSHAP_ABC(A)| = 123695.95412830851

P   = -0.008801938445805235
R   = 0.013061112307479306
P/R = -0.6739042003922514
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 317236.263952 138628.543309 178607.720643 317236.263952 138628.543309 138628.543309
2 B 1 0.653707 52691.751258 167830.652805 309677.662236 141847.009431 167830.652805 309677.662236 141847.009431 141847.009431
3 C 1 0.292963 75290.806037 177835.958675 242702.445299 64866.486624 177835.958675 242702.445299 64866.486624 64866.486624
4 AB 2 0.749226 44839.624931 170073.333123 383071.399707 87023.669299 125974.397285 170073.333123 239486.557432 278437.285418 383071.399707 69413.224309 108363.952295 35220.88998 83165.245706 129832.820878
5 AC 2 0.520030 62033.710779 178562.160107 316170.662754 159939.168023 -22330.665376 178562.160107 338501.32813 156231.494731 316170.662754 159939.168023 -22330.665376 0.0 159939.168023 -22330.665376
6 BC 2 0.753073 44494.356934 172248.490907 356099.088853 149444.745053 34405.852893 172248.490907 310270.861492 195231.969333 356099.088853 138022.370586 22983.478426 22844.748934 157606.046723 26244.551223
7 ABC 3 0.786106 41411.391808 166398.662065 373772.350514 91550.177818 124473.23666 -8649.726029 166398.662065 258038.687837 290265.133407 381725.429165 157052.305711 248692.331483 282312.003598 373772.350514 91460.346917 125080.019031 -7953.078651 179.678855 -0.051158 -1393.277703 0.051158 91536.204586 123695.954128 -7858.470265
──────── Test point index: 2659 | True SalePrice: 163000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2659 1768 7 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2659 ===
f_AB(S)   = 219683.466167116
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 224541.851625629
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34535.99002791772
|f_AB(S)-f_A|  = 49610.13304365057
|psi_ABC(A)|   = 36678.17784944263
|f_ABC(S)-f_A| = 58143.18956014229
|ReSHAP_AB(A)|  = 34535.99002791772
|ReSHAP_ABC(A)| = 36770.27012226064

P   = 0.06532289675494218
R   = 0.06373900922633058
P/R = 1.024849578740507
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 208996.598533 30388.87789 178607.720643 208996.598533 30388.87789 30388.87789
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 219683.466167 15074.143016 34535.990028 170073.333123 185147.476139 204609.323151 219683.466167 15074.143016 34535.990028 0.0 15074.143016 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 217148.839295 35060.397598 3526.28159 178562.160107 213622.557705 182088.441697 217148.839295 35060.397598 3526.28159 -0.0 35060.397598 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 205010.113789 36390.991532 -3629.36865 172248.490907 208639.482439 168619.122257 205010.113789 36390.991532 -3629.36865 0.0 36390.991532 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 224541.851626 20081.91955 36678.177849 1383.09216 166398.662065 186487.148077 203189.349614 223238.436861 167874.564543 187963.050554 204453.365614 224541.851626 20088.486011 36578.801072 1303.414765 0.0 -39.398765 172.487712 39.398765 20065.08425 36770.270122 1307.835188
──────── Test point index: 755 | True SalePrice: 103000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
755 1208 4 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 755 ===
f_AB(S)   = 104542.97563767005
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 111069.26722584515
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 61403.03139343908
|f_AB(S)-f_A|  = 65530.35748579539
|psi_ABC(A)|   = 44754.12856631706
|f_ABC(S)-f_A| = 55329.394839641565
|ReSHAP_AB(A)|  = 53972.89124632164
|ReSHAP_ABC(A)| = 46992.70848037272

P   = 0.12814945340067407
R   = -0.025694393289336648
P/R = -4.987448115922511
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 147145.361151 -31462.359492 178607.720643 147145.361151 -31462.359492 -31462.359492
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.292963 75290.806037 177835.958675 142556.194914 -35279.763761 177835.958675 142556.194914 -35279.763761 -35279.763761
4 AB 2 0.749226 44839.624931 170073.333123 104542.975638 -4127.326092 -61403.031393 170073.333123 154466.698739 97190.993438 104542.975638 -15606.634385 -72882.339686 22958.616585 -11557.466239 -53972.891246
5 AC 2 0.520030 62033.710779 178562.160107 154408.524232 -36298.899787 12145.263912 178562.160107 142263.26032 190707.424019 154408.524232 -36298.899787 12145.263912 -0.0 -36298.899787 12145.263912
6 BC 2 0.753073 44494.356934 172248.490907 94438.680856 -70322.415853 -7487.394197 172248.490907 96913.151575 159748.173232 94438.680856 -75335.339331 -12500.317675 10025.846956 -66736.319204 -11073.490847
7 ABC 3 0.786106 41411.391808 166398.662065 111069.267226 -16600.466782 -44754.128566 6025.200509 166398.662065 146468.995191 117851.808625 104916.040441 171481.984153 152844.523064 125154.386063 111069.267226 -14085.118837 -41775.255838 6153.226784 -4552.342252 1149.350653 222.301089 -2441.556439 -13801.024355 -46992.70848 5464.337996
──────── Test point index: 1226 | True SalePrice: 165500 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1226 1661 6 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1226 ===
f_AB(S)   = 176907.26506615613
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 170417.301602206
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 2377.991105433437
|f_AB(S)-f_A|  = 6833.931942690688
|psi_ABC(A)|   = 2518.6497773952337
|f_ABC(S)-f_A| = 4018.6395367192745
|ReSHAP_AB(A)|  = 2377.991105433437
|ReSHAP_ABC(A)| = 2518.6497773952055

P   = -0.2787736838217277
R   = -0.2787736838217207
P/R = 1.000000000000025
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 197178.594247 18570.873605 178607.720643 197178.594247 18570.873605 18570.873605
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.292963 75290.806037 177835.958675 217665.882703 39829.924028 177835.958675 217665.882703 39829.924028 39829.924028
4 AB 2 0.749226 44839.624931 170073.333123 176907.265066 9211.923048 -2377.991105 170073.333123 179285.256172 167695.342018 176907.265066 9211.923048 -2377.991105 0.0 9211.923048 -2377.991105
5 AC 2 0.520030 62033.710779 178562.160107 186276.151758 21425.674705 -13711.683054 178562.160107 199987.834812 164850.477053 186276.151758 21425.674705 -13711.683054 -0.0 21425.674705 -13711.683054
6 BC 2 0.753073 44494.356934 172248.490907 183855.302240 -2505.718067 14112.529401 172248.490907 169742.77284 186361.020308 183855.30224 -2505.718067 14112.529401 0.0 -2505.718067 14112.529401
7 ABC 3 0.786106 41411.391808 166398.662065 170417.301602 12276.226058 -2518.649777 -5738.936744 166398.662065 178674.888123 163880.012288 176156.238346 160659.725322 172935.95138 158141.075544 170417.301602 12276.226058 -2518.649777 -5738.936744 0.0 0.0 0.0 -0.0 12276.226058 -2518.649777 -5738.936744
──────── Test point index: 2873 | True SalePrice: 145400 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2873 1285 6 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2873 ===
f_AB(S)   = 156307.31452577407
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 150862.65796932078
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 2377.991105433437
|f_AB(S)-f_A|  = 13766.018597691378
|psi_ABC(A)|   = 2318.278301191584
|f_ABC(S)-f_A| = 15536.00409616594
|ReSHAP_AB(A)|  = 2377.991105433437
|ReSHAP_ABC(A)| = 2671.8361416687703

P   = 0.02352383560333371
R   = 0.0007665140753071176
P/R = 30.689372003910645
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 155649.906291 -22957.814352 178607.720643 155649.906291 -22957.814352 -22957.814352
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 156307.314526 -11388.027492 -2377.991105 170073.333123 158685.305631 167695.342018 156307.314526 -11388.027492 -2377.991105 0.0 -11388.027492 -2377.991105
5 AC 2 0.520030 62033.710779 178562.160107 155601.445300 -26486.996397 3526.28159 178562.160107 152075.16371 182088.441697 155601.4453 -26486.996397 3526.28159 0.0 -26486.996397 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 166113.404190 -2505.718067 -3629.36865 172248.490907 169742.77284 168619.122257 166113.40419 -2505.718067 -3629.36865 0.0 -2505.718067 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 150862.657969 -14793.623751 -2318.278301 1575.897956 166398.662065 151337.697901 163880.012288 149153.737992 167874.564543 152947.538252 165355.914765 150862.657969 -14493.256796 -2084.880283 1708.919978 -433.769495 -233.017501 -99.079627 99.079627 -15040.719725 -2671.836142 2176.551771
──────── Test point index: 2554 | True SalePrice: 121600 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2554 1230 4 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2554 ===
f_AB(S)   = 103536.87936032933
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 112074.85725689273
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 62508.73765962061
|f_AB(S)-f_A|  = 66536.45376313612
|psi_ABC(A)|   = 44818.14973285249
|f_ABC(S)-f_A| = 54323.80480859398
|ReSHAP_AB(A)|  = 55558.30892077074
|ReSHAP_ABC(A)| = 47161.39986115428

P   = 0.11444741756150356
R   = -0.033147912629836185
P/R = -3.4526281892782085
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 149575.231191 -29032.489452 178607.720643 149575.231191 -29032.489452 -29032.489452
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.292963 75290.806037 177835.958675 142556.194914 -35279.763761 177835.958675 142556.194914 -35279.763761 -35279.763761
4 AB 2 0.749226 44839.624931 170073.333123 103536.879360 -4027.716104 -62508.73766 170073.333123 155672.014994 97190.993438 103536.87936 -14401.31813 -72882.339686 20747.204053 -10978.144842 -55558.308921
5 AC 2 0.520030 62033.710779 178562.160107 157211.925201 -33495.498818 12145.263912 178562.160107 145066.661289 190707.424019 157211.925201 -33495.498818 12145.263912 -0.0 -33495.498818 12145.263912
6 BC 2 0.753073 44494.356934 172248.490907 94438.680856 -70322.415853 -7487.394197 172248.490907 96913.151575 159748.173232 94438.680856 -75335.339331 -12500.317675 10025.846956 -66736.319204 -11073.490847
7 ABC 3 0.786106 41411.391808 166398.662065 112074.857257 -15466.493906 -44818.149733 5960.83883 166398.662065 147772.477659 117851.808625 106009.227085 171481.984153 153937.028683 125154.386063 112074.857257 -13079.528806 -41862.171426 6065.630172 -4465.426665 1236.947266 98.920851 -2318.176201 -12363.507601 -47161.399861 5201.102653
──────── Test point index: 1364 | True SalePrice: 127000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1364 1236 6 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1364 ===
f_AB(S)   = 153622.74650322428
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 148095.68256971097
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 2377.991105433466
|f_AB(S)-f_A|  = 16450.58662024117
|psi_ABC(A)|   = 2161.708967196207
|f_ABC(S)-f_A| = 18302.97949577574
|ReSHAP_AB(A)|  = 2377.9911054334457
|ReSHAP_ABC(A)| = 1322.6381159075768

P   = 0.026446631307094626
R   = 0.07229003355109877
P/R = 0.3658406284788984
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 150237.923020 -28369.797623 178607.720643 150237.92302 -28369.797623 -28369.797623
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 153622.746503 -14072.595515 -2377.991105 170073.333123 156000.737609 167695.342018 153622.746503 -14072.595515 -2377.991105 -0.0 -14072.595515 -2377.991105
5 AC 2 0.520030 62033.710779 178562.160107 149357.506779 -32730.934918 3526.28159 178562.160107 145831.225189 182088.441697 149357.506779 -32730.934918 3526.28159 0.0 -32730.934918 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 166113.404190 -2505.718067 -3629.36865 172248.490907 169742.77284 168619.122257 166113.40419 -2505.718067 -3629.36865 0.0 -2505.718067 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 148095.682570 -17775.692545 -2161.708967 1634.422016 166398.662065 148143.821752 163880.012288 146266.428433 167874.564543 149864.138146 165355.914765 148095.68257 -17260.232196 -1768.455576 1829.254136 -750.194201 -353.351659 -108.937742 108.937742 -18227.457369 1322.638116 -1398.160243
──────── Test point index: 291 | True SalePrice: 100000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
291 1666 5 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 291 ===
f_AB(S)   = 140267.21944531
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 141191.67943668008
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 39291.972238784634
|f_AB(S)-f_A|  = 29806.113678155438
|psi_ABC(A)|   = 32650.7449845076
|f_ABC(S)-f_A| = 25206.98262880664
|ReSHAP_AB(A)|  = 39291.97223878462
|ReSHAP_ABC(A)| = 28260.820873054712

P   = 0.022946540413483874
R   = 0.19710162180890878
P/R = 0.11641984577747759
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 197730.837438 19123.116795 178607.720643 197730.837438 19123.116795 19123.116795
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 140267.219445 9485.858561 -39291.972239 170073.333123 179559.191684 130781.360885 140267.219445 9485.858561 -39291.972239 0.0 9485.858561 -39291.972239
5 AC 2 0.520030 62033.710779 178562.160107 195532.270664 22062.811289 -5092.700732 178562.160107 200624.971396 173469.459375 195532.270664 22062.811289 -5092.700732 0.0 22062.811289 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 136087.643616 -41402.427667 5241.580375 172248.490907 130846.06324 177490.071282 136087.643616 -41402.427667 5241.580375 0.0 -41402.427667 5241.580375
7 ABC 3 0.786106 41411.391808 166398.662065 141191.679437 10037.746542 -32650.744985 -2593.984187 166398.662065 179039.947 136833.552157 144228.501046 164267.144932 176908.429866 133737.841655 141191.679437 7453.837781 -35716.75043 -3036.82161 5187.447153 -58.888892 905.304476 58.888892 5972.411889 -28260.820873 -2918.573645
──────── Test point index: 2662 | True SalePrice: 144000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2662 1690 7 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2662 ===
f_AB(S)   = 215410.07217203674
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 215740.91946867868
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34535.99002791772
|f_AB(S)-f_A|  = 45336.7390485713
|psi_ABC(A)|   = 36935.44727241989
|f_ABC(S)-f_A| = 49342.257403191965
|ReSHAP_AB(A)|  = 34535.99002791772
|ReSHAP_ABC(A)| = 36766.462937135315

P   = 0.013209990968385266
R   = 0.016634729603745013
P/R = 0.7941211719733197
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 200381.604755 21773.884112 178607.720643 200381.604755 21773.884112 21773.884112
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 215410.072172 10800.749021 34535.990028 170073.333123 180874.082144 204609.323151 215410.072172 10800.749021 34535.990028 0.0 10800.749021 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 198590.526266 25121.066891 -5092.700732 178562.160107 203683.226998 173469.459375 198590.526266 25121.066891 -5092.700732 0.0 25121.066891 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 213881.062815 36390.991532 5241.580375 172248.490907 208639.482439 177490.071282 213881.062815 36390.991532 5241.580375 0.0 36390.991532 5241.580375
7 ABC 3 0.786106 41411.391808 166398.662065 215740.919469 14379.452769 36935.447272 -1972.642638 166398.662065 180792.229606 203189.349614 217554.687612 164267.144932 178660.712473 201375.581472 215740.919469 14365.337997 37080.206996 -1813.768143 28.229543 0.0 -317.74899 0.0 14373.220641 36766.462937 -1797.426175
──────── Test point index: 2834 | True SalePrice: 92000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2834 1338 6 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2834 ===
f_AB(S)   = 159211.0309583279
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 150441.91780173985
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 2377.991105433466
|f_AB(S)-f_A|  = 10862.302165137546
|psi_ABC(A)|   = 2506.8268251367604
|f_ABC(S)-f_A| = 15956.744263746863
|ReSHAP_AB(A)|  = 2377.99110543345
|ReSHAP_ABC(A)| = 2491.8833828280403

P   = 0.061820067589872574
R   = 0.06275656453822856
P/R = 0.9850773069678549
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 161503.684114 -17104.036528 178607.720643 161503.684114 -17104.036528 -17104.036528
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 159211.030958 -8484.31106 -2377.991105 170073.333123 161589.022064 167695.342018 159211.030958 -8484.31106 -2377.991105 -0.0 -8484.31106 -2377.991105
5 AC 2 0.520030 62033.710779 178562.160107 153736.110767 -19733.348608 -5092.700732 178562.160107 158828.811499 173469.459375 153736.110767 -19733.348608 -5092.700732 0.0 -19733.348608 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 174984.353215 -2505.718067 5241.580375 172248.490907 169742.77284 177490.071282 174984.353215 -2505.718067 5241.580375 0.0 -2505.718067 5241.580375
7 ABC 3 0.786106 41411.391808 166398.662065 150441.917802 -11294.754401 -2506.826825 -2155.163038 166398.662065 155092.084712 163880.012288 152644.372649 164267.144932 152960.567579 161748.495155 150441.917802 -11306.577353 -2518.649777 -2202.454847 0.0 70.937714 70.937714 -70.937714 -11297.429163 -2491.883383 -2167.431718
──────── Test point index: 1264 | True SalePrice: 148000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1264 1081 6 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1264 ===
f_AB(S)   = 145130.74561556676
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 145712.97733736245
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 2377.991105433466
|f_AB(S)-f_A|  = 24942.587507898686
|psi_ABC(A)|   = 1896.6802152623907
|f_ABC(S)-f_A| = 20685.684728124266
|ReSHAP_AB(A)|  = 2377.9911054334425
|ReSHAP_ABC(A)| = 1247.0125154495254

P   = 0.0036481163438926295
R   = 0.035054748918619885
P/R = 0.10406910494101057
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 133118.384102 -45489.336541 178607.720643 133118.384102 -45489.336541 -45489.336541
2 B 1 0.653707 52691.751258 167830.652805 164717.893260 -3112.759544 167830.652805 164717.89326 -3112.759544 -3112.759544
3 C 1 0.292963 75290.806037 177835.958675 142556.194914 -35279.763761 177835.958675 142556.194914 -35279.763761 -35279.763761
4 AB 2 0.749226 44839.624931 170073.333123 145130.745616 -22564.596402 -2377.991105 170073.333123 147508.736721 167695.342018 145130.745616 -22564.596402 -2377.991105 -0.0 -22564.596402 -2377.991105
5 AC 2 0.520030 62033.710779 178562.160107 138225.255003 -52482.169016 12145.263912 178562.160107 126079.991091 190707.424019 138225.255003 -52482.169016 12145.263912 -0.0 -52482.169016 12145.263912
6 BC 2 0.753073 44494.356934 172248.490907 157242.455164 -2505.718067 -12500.317675 172248.490907 169742.77284 159748.173232 157242.455164 -2505.718067 -12500.317675 0.0 -2505.718067 -12500.317675
7 ABC 3 0.786106 41411.391808 166398.662065 145712.977337 -24973.10116 -1896.680215 6184.096647 166398.662065 139754.727406 163880.012288 138376.19551 171481.984153 146935.777369 168963.334376 145712.977337 -23250.357038 -1222.800031 7336.781828 -1295.849746 -2253.45974 -155.731866 155.731866 -27566.320088 -1247.012515 8127.647875
──────── Test point index: 218 | True SalePrice: 131500 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
218 1039 5 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 218 ===
f_AB(S)   = 107176.57745843756
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 119708.9276712361
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 38661.53659815233
|f_AB(S)-f_A|  = 62896.755665027886
|psi_ABC(A)|   = 24779.540650246054
|f_ABC(S)-f_A| = 46689.73439425061
|ReSHAP_AB(A)|  = 38519.77847573285
|ReSHAP_ABC(A)| = 25032.992693790053

P   = 0.08395471163642132
R   = 0.07627245724810727
P/R = 1.1007212126826382
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 128479.541298 -50128.179345 178607.720643 128479.541298 -50128.179345 -50128.179345
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 107176.577458 -24235.219067 -38661.536598 170073.333123 145207.678416 130781.360885 107176.577458 -24865.654708 -39291.972239 1260.871281 -24376.977189 -38519.778476
5 AC 2 0.520030 62033.710779 178562.160107 124254.325377 -57834.11632 3526.28159 178562.160107 120728.043787 182088.441697 124254.325377 -57834.11632 3526.28159 -0.0 -57834.11632 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 127216.694590 -41402.427667 -3629.36865 172248.490907 130846.06324 168619.122257 127216.69459 -41402.427667 -3629.36865 0.0 -41402.427667 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 119708.927671 -23825.195101 -24779.54065 1915.001357 166398.662065 137805.852244 136833.552157 117565.399606 167874.564543 139913.471234 138977.080222 119708.927671 -19268.152551 -20204.543563 2143.528066 -8692.940757 0.0 -35.909075 -631.716513 -23833.989922 -25032.992694 2177.248222
──────── Test point index: 2028 | True SalePrice: 113000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2028 1173 5 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2028 ===
f_AB(S)   = 113257.17791230907
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 120158.52372389584
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 39291.97223878466
|f_AB(S)-f_A|  = 56816.15521115638
|psi_ABC(A)|   = 25743.745016412668
|f_ABC(S)-f_A| = 46240.13834159088
|ReSHAP_AB(A)|  = 39291.97223878467
|ReSHAP_ABC(A)| = 26461.136035322765

P   = 0.13482315345253515
R   = 0.11930868821433682
P/R = 1.1300363407761784
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 143279.658814 -35328.061828 178607.720643 143279.658814 -35328.061828 -35328.061828
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 113257.177912 -17524.182972 -39291.972239 170073.333123 152549.150151 130781.360885 113257.177912 -17524.182972 -39291.972239 -0.0 -17524.182972 -39291.972239
5 AC 2 0.520030 62033.710779 178562.160107 132710.603501 -40758.855874 -5092.700732 178562.160107 137803.304233 173469.459375 132710.603501 -40758.855874 -5092.700732 0.0 -40758.855874 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 136087.643616 -41402.427667 5241.580375 172248.490907 130846.06324 177490.071282 136087.643616 -41402.427667 5241.580375 0.0 -41402.427667 5241.580375
7 ABC 3 0.786106 41411.391808 166398.662065 120158.523724 -17804.029613 -25743.745016 -2692.363712 166398.662065 144527.420559 136833.552157 123254.234225 164267.144932 141923.404059 133737.841655 120158.523724 -13579.317931 -21764.880335 -3095.710501 -8764.422942 0.0 491.694002 472.499366 -16775.212788 -26461.136035 -3003.789518
──────── Test point index: 2726 | True SalePrice: 164000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2726 1000 5 4
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2726 ===
f_AB(S)   = 106239.2144026331
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 127054.3198117648
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 38061.869627284745
|f_AB(S)-f_A|  = 63834.11872083235
|psi_ABC(A)|   = 24722.075950636547
|f_ABC(S)-f_A| = 39344.342253721916
|ReSHAP_AB(A)|  = 37833.833599257654
|ReSHAP_ABC(A)| = 23866.669978646314

P   = -0.032089324923349816
R   = -0.013920122674971225
P/R = 2.3052472792533165
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 124172.044409 -54435.676234 178607.720643 124172.044409 -54435.676234 -54435.676234
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 121774.215974 -56061.742701 177835.958675 121774.215974 -56061.742701 -56061.742701
4 AB 2 0.749226 44839.624931 170073.333123 106239.214403 -25772.249094 -38061.869627 170073.333123 143070.981418 130781.360885 106239.214403 -27002.351705 -39291.972239 2460.205223 -26000.285122 -37833.833599
5 AC 2 0.520030 62033.710779 178562.160107 136522.624668 -62803.781673 20764.246234 178562.160107 115758.378434 199326.406341 136522.624668 -62803.781673 20764.246234 0.0 -62803.781673 20764.246234
6 BC 2 0.753073 44494.356934 172248.490907 113421.452957 -39429.099458 -19397.938492 172248.490907 130846.06324 150877.224206 113421.452957 -41402.427667 -21371.266701 3946.656417 -38799.408066 -20027.629884
7 ABC 3 0.786106 41411.391808 166398.662065 127054.319812 -25441.390537 -24722.075951 10819.124234 166398.662065 136114.23757 136833.552157 115909.694007 175089.403763 148736.24277 149455.557357 127054.319812 -22401.237545 -21681.922958 11144.625804 -3951.923449 1477.379395 1477.379395 -5408.642897 -24891.151901 -23866.669979 9413.479626
──────── Test point index: 1756 | True SalePrice: 170000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1756 1430 7 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1756 ===
f_AB(S)   = 201165.42552177262
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 193725.43338474646
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34535.99002791772
|f_AB(S)-f_A|  = 31092.09239830717
|psi_ABC(A)|   = 37222.94482024236
|f_ABC(S)-f_A| = 27326.771319259744
|ReSHAP_AB(A)|  = 34535.99002791772
|ReSHAP_ABC(A)| = 36661.958308339345

P   = -0.25137764730521317
R   = -0.23084882209754443
P/R = 1.0889275718244487
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 171664.958827 -6942.761816 178607.720643 171664.958827 -6942.761816 -6942.761816
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 217665.882703 39829.924028 177835.958675 217665.882703 39829.924028 39829.924028
4 AB 2 0.749226 44839.624931 170073.333123 201165.425522 -3443.89763 34535.990028 170073.333123 166629.435494 204609.323151 201165.425522 -3443.89763 34535.990028 0.0 -3443.89763 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 156840.441586 -8010.035466 -13711.683054 178562.160107 170552.124641 164850.477053 156840.441586 -8010.035466 -13711.683054 -0.0 -8010.035466 -13711.683054
6 BC 2 0.753073 44494.356934 172248.490907 224013.701667 37021.836446 14743.374314 172248.490907 208639.482439 186361.020308 224013.701667 36390.991532 14112.529401 1261.689827 37300.119113 14465.091647
7 ABC 3 0.786106 41411.391808 166398.662065 193725.433385 -4584.993431 37222.94482 -5311.18007 166398.662065 161809.168037 203189.349614 198608.856782 160659.725322 156070.231293 198305.926217 193725.433385 -4580.492833 37655.202092 -4883.423397 -9.001196 -0.0 -855.513347 -0.0 -4579.246264 36661.958308 -4755.940726
──────── Test point index: 856 | True SalePrice: 136500 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
856 864 5 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 856 ===
f_AB(S)   = 107787.13159503626
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 111693.78305927281
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 33562.38806101406
|f_AB(S)-f_A|  = 62286.201528429185
|psi_ABC(A)|   = 25002.446446461316
|f_ABC(S)-f_A| = 54704.8790062139
|ReSHAP_AB(A)|  = 33186.459097906736
|ReSHAP_ABC(A)| = 24345.013684069007

P   = 0.08179911488246894
R   = 0.08778141389467187
P/R = 0.931850049494748
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 109151.029616 -69456.691027 178607.720643 109151.029616 -69456.691027 -69456.691027
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 107787.131595 -28723.813467 -33562.388061 170073.333123 135619.935478 130781.360885 107787.131595 -34453.397645 -39291.972239 11459.168356 -29099.742431 -33186.459098
5 AC 2 0.520030 62033.710779 178562.160107 101954.544944 -80133.896753 3526.28159 178562.160107 98428.263354 182088.441697 101954.544944 -80133.896753 3526.28159 -0.0 -80133.896753 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 127216.694590 -41402.427667 -3629.36865 172248.490907 130846.06324 168619.122257 127216.69459 -41402.427667 -3629.36865 0.0 -41402.427667 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 111693.783059 -31495.512764 -25002.446446 1793.080204 166398.662065 130340.476707 136833.552157 109933.982121 167874.564543 132484.023037 138977.080222 111693.783059 -27283.297163 -20790.239977 1759.800938 -8107.244343 383.727127 383.745391 -1051.37098 -32090.993352 -24345.013684 1731.12803
──────── Test point index: 2718 | True SalePrice: 157000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2718 1949 5 10
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2718 ===
f_AB(S)   = 155771.96945309758
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 145186.30058425185
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 39291.97223878465
|f_AB(S)-f_A|  = 14301.363670367864
|psi_ABC(A)|   = 36344.98386964207
|f_ABC(S)-f_A| = 21212.361481234868
|ReSHAP_AB(A)|  = 39291.97223878468
|ReSHAP_ABC(A)| = 18568.224961291697

P   = 1.0340413923431484
R   = 1.8720791055923924
P/R = 0.5523491978806851
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 228987.802044 50380.081401 178607.720643 228987.802044 50380.081401 50380.081401
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 267739.007895 89903.049221 177835.958675 267739.007895 89903.049221 89903.049221
4 AB 2 0.749226 44839.624931 170073.333123 155771.969453 24990.608568 -39291.972239 170073.333123 195063.941692 130781.360885 155771.969453 24990.608568 -39291.972239 -0.0 24990.608568 -39291.972239
5 AC 2 0.520030 62033.710779 178562.160107 205737.254341 58124.741932 -30949.647698 178562.160107 236686.902039 147612.512409 205737.254341 58124.741932 -30949.647698 -0.0 58124.741932 -30949.647698
6 BC 2 0.753073 44494.356934 172248.490907 162700.891581 -41500.204664 31952.605337 172248.490907 130846.06324 204298.873241 162700.891581 -41402.427667 32050.382334 -195.553994 -42268.164497 32720.565171
7 ABC 3 0.786106 41411.391808 166398.662065 145186.300584 29285.643537 -36344.98387 -14153.021148 166398.662065 199702.2794 136833.552157 158975.894873 153444.869037 186748.503435 118355.975866 145186.300584 26830.324718 -41562.202851 -13789.594288 6473.30968 -4687.982002 835.818324 4687.964938 4419.065389 -18568.224961 -7063.201908
──────── Test point index: 283 | True SalePrice: 80000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
283 924 4 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 283 ===
f_AB(S)   = 117530.76394515927
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 98441.79387364765
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 47129.36868455017
|f_AB(S)-f_A|  = 52542.56917830618
|psi_ABC(A)|   = 44722.58369672697
|f_ABC(S)-f_A| = 67956.86819183906
|ReSHAP_AB(A)|  = 36804.230366747215
|ReSHAP_ABC(A)| = 45771.02939251234

P   = 0.23887247310595738
R   = 0.026934405277491802
P/R = 8.868674494386378
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 115777.947907 -62829.772736 178607.720643 115777.947907 -62829.772736 -62829.772736
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.292963 75290.806037 177835.958675 142556.194914 -35279.763761 177835.958675 142556.194914 -35279.763761 -35279.763761
4 AB 2 0.749226 44839.624931 170073.333123 117530.763945 -5413.200494 -47129.368685 170073.333123 138907.161628 97190.993438 117530.763945 -31166.171495 -72882.339686 51505.942003 -15738.338812 -36804.230367
5 AC 2 0.520030 62033.710779 178562.160107 118219.166272 -72488.257747 12145.263912 178562.160107 106073.90236 190707.424019 118219.166272 -72488.257747 12145.263912 -0.0 -72488.257747 12145.263912
6 BC 2 0.753073 44494.356934 172248.490907 94438.680856 -70322.415853 -7487.394197 172248.490907 96913.151575 159748.173232 94438.680856 -75335.339331 -12500.317675 10025.846956 -66736.319204 -11073.490847
7 ABC 3 0.786106 41411.391808 166398.662065 98441.793874 -29646.56364 -44722.583697 6412.279145 166398.662065 132887.733815 117851.808625 91630.95083 171481.984153 140270.500986 125154.386063 98441.793874 -26712.592189 -41828.707112 6810.843044 -4498.890978 491.734394 571.924127 -2791.179477 -28996.911658 -45771.029393 6811.072858
──────── Test point index: 2525 | True SalePrice: 262500 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2525 2156 7 9
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2525 ===
f_AB(S)   = 240940.86193751023
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 243558.36729405666
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 34535.99002791771
|f_AB(S)-f_A|  = 70867.52881404478
|psi_ABC(A)|   = 37439.84715857739
|f_ABC(S)-f_A| = 77159.70522856995
|ReSHAP_AB(A)|  = 34535.99002791771
|ReSHAP_ABC(A)| = 36369.8900913013

P   = 0.002106275650754219
R   = 0.015973060964494856
P/R = 0.13186424664853394
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 251850.670148 73242.949505 178607.720643 251850.670148 73242.949505 73242.949505
2 B 1 0.653707 52691.751258 167830.652805 213037.816252 45207.163448 167830.652805 213037.816252 45207.163448 45207.163448
3 C 1 0.292963 75290.806037 177835.958675 242702.445299 64866.486624 177835.958675 242702.445299 64866.486624 64866.486624
4 AB 2 0.749226 44839.624931 170073.333123 240940.861938 36331.538786 34535.990028 170073.333123 206404.87191 204609.323151 240940.861938 36331.538786 34535.990028 -0.0 36331.538786 34535.990028
5 AC 2 0.520030 62033.710779 178562.160107 240733.691232 84502.196501 -22330.665376 178562.160107 263064.356608 156231.494731 240733.691232 84502.196501 -22330.665376 0.0 84502.196501 -22330.665376
6 BC 2 0.753073 44494.356934 172248.490907 239102.818452 40130.920326 26723.40722 172248.490907 208639.482439 195231.969333 239102.818452 36390.991532 22983.478426 7479.857587 40975.443979 25878.883566
7 ABC 3 0.786106 41411.391808 166398.662065 243558.367294 48369.575573 37439.847159 -8649.717503 166398.662065 214815.71688 203189.349614 251511.445945 157052.305711 205469.360526 195236.270963 243558.367294 48322.096331 38089.006768 -7953.078651 94.958484 -0.0 -1393.277703 0.0 48375.198222 36369.890091 -7585.383085
──────── Test point index: 527 | True SalePrice: 264500 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
527 1836 8 8
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 527 ===
f_AB(S)   = 260322.97027053632
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 266847.9688075247
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 71449.97116126893
|f_AB(S)-f_A|  = 90249.63714707087
|psi_ABC(A)|   = 80731.76810072012
|f_ABC(S)-f_A| = 100449.306742038
|ReSHAP_AB(A)|  = 71449.97116126891
|ReSHAP_ABC(A)| = 80262.21386185226

P   = -0.012013959013118525
R   = -0.007339419644980305
P/R = 1.6369085832740613
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 216507.105929 37899.385286 178607.720643 216507.105929 37899.385286 37899.385286
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.292963 75290.806037 177835.958675 217665.882703 39829.924028 177835.958675 217665.882703 39829.924028 39829.924028
4 AB 2 0.749226 44839.624931 170073.333123 260322.970271 18799.665986 71449.971161 170073.333123 188872.999109 241523.304285 260322.970271 18799.665986 71449.971161 -0.0 18799.665986 71449.971161
5 AC 2 0.520030 62033.710779 178562.160107 208575.932191 43725.455138 -13711.683054 178562.160107 222287.615245 164850.477053 208575.932191 43725.455138 -13711.683054 -0.0 43725.455138 -13711.683054
6 BC 2 0.753073 44494.356934 172248.490907 279912.601119 87183.460064 20480.650149 172248.490907 253063.830223 186361.020308 279912.601119 80815.339316 14112.529401 12736.241496 91658.137032 16005.973181
7 ABC 3 0.786106 41411.391808 166398.662065 266847.968808 25028.718712 80731.768101 -5311.18007 166398.662065 191451.948795 246727.241511 271731.392204 160659.725322 185713.012051 241843.818114 266847.968808 25004.150694 81134.956756 -4883.423397 49.136035 0.0 -855.513347 -0.0 25015.726115 80262.213862 -4828.633235
──────── Test point index: 2421 | True SalePrice: 215000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
2421 1456 8 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 2421 ===
f_AB(S)   = 239503.87132015018
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 242227.56361795284
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 71449.97116126893
|f_AB(S)-f_A|  = 69430.53819668473
|psi_ABC(A)|   = 80490.09300124651
|f_ABC(S)-f_A| = 75828.90155246612
|ReSHAP_AB(A)|  = 71449.97116126891
|ReSHAP_ABC(A)| = 80321.0873809726

P   = -0.032384194134503774
R   = -0.030155418340396034
P/R = 1.0739096294055415
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 174536.623420 -4071.097223 178607.720643 174536.62342 -4071.097223 -4071.097223
2 B 1 0.653707 52691.751258 167830.652805 261357.739244 93527.08644 167830.652805 261357.739244 93527.08644 93527.08644
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 239503.871320 -2019.432965 71449.971161 170073.333123 168053.900159 241523.304285 239503.87132 -2019.432965 71449.971161 -0.0 -2019.432965 71449.971161
5 AC 2 0.520030 62033.710779 178562.160107 168772.534144 -4696.925231 -5092.700732 178562.160107 173865.234876 173469.459375 168772.534144 -4696.925231 -5092.700732 0.0 -4696.925231 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 262265.809925 82795.538979 7221.780039 172248.490907 253063.830223 177490.071282 262265.809925 80815.339316 5241.580375 3960.399327 84534.517466 5482.801552
7 ABC 3 0.786106 41411.391808 166398.662065 242227.563618 -2688.548811 80490.093001 -1972.642638 166398.662065 163707.474194 246727.241511 244041.331761 164267.144932 161575.957061 244913.473368 242227.563618 -2685.90975 80651.606557 -1813.768143 -5.278122 -0.0 -317.74899 0.0 -2685.727919 80321.087381 -1806.45791
──────── Test point index: 1509 | True SalePrice: 135000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1509 1470 4 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1509 ===
f_AB(S)   = 96953.97560639406
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 113785.61513301407
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 72374.64183671019
|f_AB(S)-f_A|  = 73119.35751707139
|psi_ABC(A)|   = 48679.680110798254
|f_ABC(S)-f_A| = 52613.046932472644
|ReSHAP_AB(A)|  = 71884.09782231449
|ReSHAP_ABC(A)| = 48590.738630354404

P   = 0.06457536138276887
R   = 0.05955703286402403
P/R = 1.0842608887216107
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 176082.904354 -2524.816288 178607.720643 176082.904354 -2524.816288 -2524.816288
2 B 1 0.653707 52691.751258 167830.652805 111235.094150 -56595.558654 167830.652805 111235.09415 -56595.558654 -56595.558654
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 96953.975606 -744.71568 -72374.641837 170073.333123 168820.919594 97190.993438 96953.975606 -1252.41353 -72882.339686 1015.395698 -1235.259695 -71884.097822
5 AC 2 0.520030 62033.710779 178562.160107 170556.516579 -2912.942796 -5092.700732 178562.160107 175649.217311 173469.459375 170556.516579 -2912.942796 -5092.700732 0.0 -2912.942796 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 100697.287222 -76064.061696 4512.858011 172248.490907 96913.151575 177490.071282 100697.287222 -75335.339331 5241.580375 -1457.444729 -76901.771134 5350.567449
7 ABC 3 0.786106 41411.391808 166398.662065 113785.615133 -1319.753004 -48679.680111 -2613.613817 166398.662065 164729.639048 117851.808625 116881.325634 164267.144932 162598.121914 114756.098124 113785.615133 -970.482991 -48812.506781 -3095.710501 -698.540027 -0.0 964.193368 0.0 -984.100518 -48590.73863 -3038.207784
──────── Test point index: 1662 | True SalePrice: 139000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1662 1109 5 5
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1662 ===
f_AB(S)   = 109992.72067384052
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 127919.94551729194
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 39171.013577986414
|f_AB(S)-f_A|  = 60080.612449624925
|psi_ABC(A)|   = 24583.400009280875
|f_ABC(S)-f_A| = 38478.716548194774
|ReSHAP_AB(A)|  = 39134.395812194125
|ReSHAP_ABC(A)| = 25975.294604852686

P   = 0.0130912165063638
R   = -0.023691364215125188
P/R = -0.5525733506729015
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 136210.945971 -42396.774672 178607.720643 136210.945971 -42396.774672 -42396.774672
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 142556.194914 -35279.763761 177835.958675 142556.194914 -35279.763761 -35279.763761
4 AB 2 0.749226 44839.624931 170073.333123 109992.720674 -20909.598872 -39171.013578 170073.333123 149042.775591 130781.360885 109992.720674 -21030.557532 -39291.972239 241.917322 -20946.216637 -39134.395812
5 AC 2 0.520030 62033.710779 178562.160107 141793.219873 -48914.204146 12145.263912 178562.160107 129647.955961 190707.424019 141793.219873 -48914.204146 12145.263912 -0.0 -48914.204146 12145.263912
6 BC 2 0.753073 44494.356934 172248.490907 120357.952063 -40396.324417 -11494.214426 172248.490907 130846.06324 159748.173232 120357.952063 -41402.427667 -12500.317675 2012.206498 -39856.861973 -12033.676871
7 ABC 3 0.786106 41411.391808 166398.662065 127919.945517 -20460.061729 -24583.400009 6564.74519 166398.662065 141111.510222 136833.552157 120537.178885 171481.984153 148185.037286 144216.318789 127919.945517 -16296.373272 -20265.091769 7382.766633 -7000.573595 0.0 -309.239568 -1990.204977 -20475.758535 -25975.294605 7972.336592
──────── Test point index: 594 | True SalePrice: 143000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
594 1568 5 7
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 594 ===
f_AB(S)   = 134898.08340021042
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 136927.84965523236
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 39291.972238784634
|f_AB(S)-f_A|  = 35175.24972325502
|psi_ABC(A)|   = 31195.268071890794
|f_ABC(S)-f_A| = 29470.81241025435
|ReSHAP_AB(A)|  = 39291.97223878463
|ReSHAP_ABC(A)| = 29401.04614456375

P   = 0.05852061662125463
R   = 0.11940193547837386
P/R = 0.4901144724898862
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 186906.870896 8299.150254 178607.720643 186906.870896 8299.150254 8299.150254
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 192629.320106 14793.361432 177835.958675 192629.320106 14793.361432 14793.361432
4 AB 2 0.749226 44839.624931 170073.333123 134898.083400 4116.722516 -39291.972239 170073.333123 174190.055639 130781.360885 134898.0834 4116.722516 -39291.972239 0.0 4116.722516 -39291.972239
5 AC 2 0.520030 62033.710779 178562.160107 183044.393621 9574.934246 -5092.700732 178562.160107 188137.094353 173469.459375 183044.393621 9574.934246 -5092.700732 -0.0 9574.934246 -5092.700732
6 BC 2 0.753073 44494.356934 172248.490907 136087.643616 -41402.427667 5241.580375 172248.490907 130846.06324 177490.071282 136087.643616 -41402.427667 5241.580375 0.0 -41402.427667 5241.580375
7 ABC 3 0.786106 41411.391808 166398.662065 136927.849655 4338.069479 -31195.268072 -2613.613817 166398.662065 171884.793024 136833.552157 140023.560157 164267.144932 169753.27589 133737.841655 136927.849655 3190.008 -32825.426235 -3095.710501 2296.122958 0.0 964.193368 0.0 2942.849328 -29401.046145 -3012.615594
──────── Test point index: 1362 | True SalePrice: 120000 ────────
Gr Liv Area Overall Qual TotRms AbvGrd
1362 1308 5 6
=== Subset predictions + SHAP + μ + ReSHAP — MLP | Test idx 1362 ===
f_AB(S)   = 120653.43674994627
f_AB(A)   = 170073.33312346545
f_ABC(S)  = 131129.0509004248
f_ABC(A)  = 166398.66206548671
|psi_AB(A)|    = 39291.972238784634
|f_AB(S)-f_A|  = 49419.89637351918
|psi_ABC(A)|   = 26453.143050704122
|f_ABC(S)-f_A| = 35269.61116506191
|ReSHAP_AB(A)|  = 39291.97223878463
|ReSHAP_ABC(A)| = 28215.330790399315

P   = 0.04503733211890504
R   = -0.004926012570327654
P/R = -9.142756230504174
Feature subset n_features R2 (subset test) RMSE (subset test) Baseline f(∅) Prediction f(S) φ A φ B φ C f(∅) f(A) f(B) f(AB) f(C) f(AC) f(BC) f(ABC) μ(A) μ(B) μ(C) μ(AB) μ(AC) μ(BC) μ(ABC) ReSHAP A ReSHAP B ReSHAP C
0 nothing 0 -0.015283 90222.372400 178582.207765 178582.207765 178582.207765
1 A 1 0.526650 61604.437299 178607.720643 158190.224969 -20417.495674 178607.720643 158190.224969 -20417.495674 -20417.495674
2 B 1 0.653707 52691.751258 167830.652805 116397.970268 -51432.682536 167830.652805 116397.970268 -51432.682536 -51432.682536
3 C 1 0.292963 75290.806037 177835.958675 167592.757510 -10243.201165 177835.958675 167592.75751 -10243.201165 -10243.201165
4 AB 2 0.749226 44839.624931 170073.333123 120653.436750 -10127.924135 -39291.972239 170073.333123 159945.408989 130781.360885 120653.43675 -10127.924135 -39291.972239 0.0 -10127.924135 -39291.972239
5 AC 2 0.520030 62033.710779 178562.160107 158532.273586 -23556.168111 3526.28159 178562.160107 155005.991996 182088.441697 158532.273586 -23556.168111 3526.28159 -0.0 -23556.168111 3526.28159
6 BC 2 0.753073 44494.356934 172248.490907 127216.694590 -41402.427667 -3629.36865 172248.490907 130846.06324 168619.122257 127216.69459 -41402.427667 -3629.36865 0.0 -41402.427667 -3629.36865
7 ABC 3 0.786106 41411.391808 166398.662065 131129.050900 -10641.902178 -26453.143051 1825.434063 166398.662065 152931.449448 136833.552157 128985.522835 167874.564543 154501.664677 138977.080222 131129.0509 -7848.029322 -23372.613777 2143.528066 -5524.870543 0.0 -573.312836 -94.312752 -9261.599058 -28215.33079 2207.318683
================ Overall Summary (averaged over test points) ================
mean(|P/R|) n_points
Model
MLP 2.528795 100