# Knowledge-based Policy Recycling (KPR)
Due to the 100 MB size limit on supplementary material, we are unable to include the Robot Foodcourt (RoFoCo) environment, the datasets, or the trained models. We will release them upon publication.

`policy_traqin.py` trains the main policy and the auxiliary task policies. It includes implementations from CleanRL and envpool, as cited in the paper.

`VAE.py` trains a simple VAE to reconstruct states; the trained VAE is then used to extract latent state representations.
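
As a rough illustration of the objective such a VAE is typically trained with, the sketch below computes a reconstruction term plus a KL term for a single state vector. It is written in plain Python so that it does not assume any particular framework, and it is not taken from `VAE.py`: the function names (`reparameterize`, `kl_to_standard_normal`, `vae_loss`), the squared-error reconstruction term, and the `beta` weight are all assumptions made for this example.

```python
import math
import random


def reparameterize(mu, log_var, rng):
    """Sample z = mu + sigma * eps with eps ~ N(0, 1), where sigma = exp(0.5 * log_var)."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0) for m, lv in zip(mu, log_var)]


def kl_to_standard_normal(mu, log_var):
    """KL divergence between the diagonal Gaussian N(mu, sigma^2) and N(0, I)."""
    return -0.5 * sum(1.0 + lv - m * m - math.exp(lv) for m, lv in zip(mu, log_var))


def vae_loss(state, reconstruction, mu, log_var, beta=1.0):
    """Squared reconstruction error plus beta-weighted KL term (hypothetical weighting)."""
    recon = sum((s - r) ** 2 for s, r in zip(state, reconstruction))
    return recon + beta * kl_to_standard_normal(mu, log_var)


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical encoder outputs for one 2-dimensional latent code.
    mu, log_var = [0.3, -0.1], [-1.0, -0.5]
    z = reparameterize(mu, log_var, rng)
    print("sampled latent:", z)
    # Hypothetical state and decoder output, used only to exercise the loss.
    print("loss:", vae_loss([1.0, 2.0], [0.9, 2.1], mu, log_var))
```

In this sketch the latent state representation would be the encoder mean `mu` (or a sample `z`); which of the two the released code uses is not stated here.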

`MLP.py` trains the MLP fusion network, and `GNN.py` trains the GNN-pooling-based fusion network.

`fusion_adv.py` generates the dataset for the VAE, MLP, and GNN, and tests the trained models under various adversarial attacks.