# Multi-Level Strategic Classification: Incentivizing Improvement through Promotion and Relegation Dynamics

This repository provides the numerical implementation and experimental framework for the paper "Multi-Level Strategic Classification: Incentivizing Improvement through Promotion and Relegation Dynamics." 

The project explores how a multi-level classifier system can be designed to incentivize honest skill improvement over strategic gaming (cheating). By modeling the inter-temporal incentives driven by an agent’s farsightedness, skill retention, and the "leg-up" effect of higher organizational levels, we demonstrate that a principal can effectively drive a population toward high qualification standards.

## Repo structure:
multi_stage_strategic_classification/
├── data/
│   └── fico/
├── experiments/
│   ├── experiment_1.py
│   ├── experiment_2.py
│   ├── experiment_3.py
│   ├── experiment_4.py
│   ├── experiment_5.py
│   └── experiments_6_7.ipynb
├── functions/
│   ├── value_iter.py
│   ├── greedy_search.py
│   └── get_agent_step.py
├── plotting/
│   ├── experiment_1_plot.py
│   ├── experiment_2_plot.py
│   ├── experiment_3_plot.py
│   ├── experiment_5_plot.py
├── plots/
├── requirements.txt
└── README.md

## Functions:
value_iter.py: Solves for the fixed point $W(l, x)$ using Value Iteration. The solve_fixed_point function utilizes a vectorized Bellman Operator (T_operator_optimized) to iterate until convergence. It accounts for costs of improvement, gaming, and level-based rewards.

greedy_search.py: Constructs the sequence of thresholds iteratively until a target attribute is reached or the system hits a feasibility limit. The find_optimal_thresholds_greedy function uses a binary search pattern to find the maximum feasible next threshold. It utilizes check_single_step to verify that the incentive gap remains bridgeable according to the value function.

get_agent_step.py: Provides a vectorized determination of agent actions for an entire batch. get_agent_step calculates the optimal target attribute, the next level, and the resulting effort variables. It processes agents group by group based on their current level to optimize branch selection (Demote, Stay, or Promote) against improvement costs.

## Experiments:
experiment_1.py: Performs a sensitivity analysis on the core model parameters to observe how they influence the formation of the optimal threshold sequence.

experiment_2.py: Tests how the interplay between farsightedness (beta) and retention (gamma) determines the limits of incentivizability.

experiment_3.py: Finds maximum incentivizable attribute as we sweep gaming cost. We sweep the Gaming Cost (c_minus) to empirically demonstrate the bound where c^- is too low for any improvement incentivization. 

experiment_4.py: Generates a table of example optimal setups using full CMA-ES optimization and agent simulation.

experiment_5.py: Solves the Principal's utility maximization problem to determine the optimal organizational structure (number of levels L, thresholds mu, and reward rate r) across various environmental settings.

experiments_6_7.ipynb: Example scenarios that tracks the specific trajectory of an agent starting from zero qualification.

## Data:
The data folder contains .pkl files of the generated data from each experiment. This data will be plotted by the scripts in the plotting folder. 

## Plotting:
experiment_1_plot.py: Plots how the internal structure of the threshold sequence (mu_2, mu_3, mu_4, mu_5) evolves as we vary environmental parameters

experiment_2_plot.py: Generates 2D heatmaps of the feasibility (Max Levels) and system reach (Max Attribute).

experiment_3_plot.py: Plots the max incentivizable attribute as we sweep gaming cost.

experiment_5_plot.py: Creates time-series trajectory plots for population attributes, levels, and the improvement fraction, using FICO data.


## Plots:
Contains the plots for each experiment, generated by the scripts in the plotting folder.
