# Q-learning-experiments
Q-learning experiments

Glossary:

  files "main_": used to produce figures in the paper.
  files "aux_" : auxiliary files called by main files.
  files "fun_" : minor functions performing specific routines, mostly unrelated to Q-learning (editing plots etc). For these files, we don't provide a description here.

Main files:

The files that are used to run the simulations and produce the figures in the paper start with the indication "main_". The next indicator ""game" or "linenet" indicates whether the file refers to the 2 player zero sum game or in the line network zero-sum game. These files are:

  2 Player zero-sum games

  1. main_game_qlearning: runs the q-learning dynamics in one out of several predefined games ('aux_define_game') for arbitrary exploration policies drawn from CLR-1 and ETE            schemes ('aux_exploration_rates'), with three different implementations ('aux_discrete_batch', 'aux_discrete_step', 'aux_continuous_ode'). In the experiments, we use              aux_discrete_step.
  2. main_game_surfaces: is called by main_game_qlearning to produce the plots in Figures 1 (main part) and 6 (supplementary).
  3. main_plots_trajectories_2s: runs the Q-learning dynamics in 2x2 games (hardcoded) and generates the plots in Figure 2.
  4. main_plots_trajectories_3s: runs the Q-learning dynamics in 2x3 games (scalable to 2xn) and generates the plots in Figures 7 and 8.

  n Player network games

  1. main_linenet_surfaces: produces the surfaces in Figures 4 and 9. The last panel in Figure 4 and the panels in Figure 9 are obtained according the algorithm in Appendix C4.
  2. main_linenet_boxplots: produces the boxplots of Figure 5.
  3. main_linenet_eqselect(_averages): produce the plots in Figure 9 (supplementary).
