{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# The Value of ABBR as a Metric \n",
    "\n",
    "This notebook explains why the Average Black-Box Ranking (ABBR) is a suitable metric for evaluating how well a rule or rule list explains the confident predictions of a black-box model. In particular, we show that ABBR is a better metric than _consistency_, which is a more traditional metric for evaluating rules against 0/1 predictions, ignoring the underlying confidence of the predictions. \n",
    "\n",
    "To show the value of ABBR, we show that consistency is not a very robust metric when the confidence threshold is high. Here, we define robustness as the abilty to generalize to unseen data, i.e. from training set to testing set. We set up our experiment as follows: \n",
    "\n",
    "1. Given a dataseet, we divide it into a 70% training set and a 30% testing set. \n",
    "2. We train a black-box model (e.g., a random forest) on the training set. \n",
    "3. We generate predicted probabilities on both the training set and testing set. We normalize the probabilities to be quantiles (i.e., rankings), so the data point with the highest probability is assigned a value of 1, and the data point with the lowest probability is assigned a prediction of 0. \n",
    "4. We generate random rules of up to 3 conditions on the training set with support at least 10\\% of the dataset. For each rule, we calculate the ABBR of the rule on the training set, as well as the consistency of the rule on the training set, with a cutoff of say $0.9$. \n",
    "5. Out of these rules, select the top 10 rules based on ABBR. Select the top 10 rules based on consistency with the 0.9 threshold.\n",
    "6. Evaluate the consistency of both sets of top 10 rules on the testing set for consistency with the 0.9 threshold. \n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 5,
   "metadata": {},
   "outputs": [],
   "source": [
    "from datasets import Recidivism\n",
    "\n",
    "recidivism = Recidivism()\n",
    "\n",
    "X_train = recidivism.get_X_train()\n",
    "X_test = recidivism.get_X_test()\n",
    "y_train_quantile = recidivism.get_y_train_quantile()\n",
    "y_test_quantile = recidivism.get_y_test_quantile()\n"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": []
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.10.12"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 2
}
