The Fully-Connected-on-QuPairs.py file shows the code for training the MLP model with reinforcement learning. It has a model initialization part and the training part. If we would like to train the model we will need to comment the initialization codes.

This code is only applied with R40.txt success rate matrix and X40std9.txt fidelity matrix for the table 1 result in the main paper. It can be also used for other input environment but the model definition for the MLP layer input and output needs to be adjusted accordingly.