# A NOVEL AUTOENCODER BASED APPROACH FOR COUNTERFACTUAL ESTIMATION USING SPARSITY CONSTRAINTS

This repository contains the code to reproduce all experiments in the paper except those on the real-world dataset.

The TimeSeriesCFs folder contains the file Models.py, which holds the code for CSAE, CVAE, the LSTM model (named forecast_model), and the event predictor model used for the effectiveness metric. It also contains the Rosseman and Synthetic folders.

To reproduce the results on the semi-synthetic dataset, download the Rossmann Store Sales dataset from "https://www.kaggle.com/competitions/rossmann-store-sales/data?select=train.csv" (only "train.csv") and put it in the rosseman_data folder. Example.ipynb and Example.html then show all the results for the configuration where the actual data has an event (e=1). Example.ipynb also indicates what must be modified to obtain the results for the configuration where the actual data has no event (e=0).

The Synthetic folder contains the code to generate the synthetic datasets, together with Example.ipynb and Example.html. As in the semi-synthetic case, these show the results for the configuration where the actual data has an event (e=1) and include instructions to reproduce the other case.
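Before opening the Rosseman notebook, it can help to confirm that train.csv was placed correctly. The snippet below is a minimal sketch, not part of the repository: the helper name check_train_csv and the set of expected columns are assumptions made for illustration, and the path to rosseman_data must be passed explicitly because it depends on where the repository was cloned.

```python
import csv
from pathlib import Path

# Assumption: a few columns we expect in Kaggle's Rossmann train.csv.
# Adjust this set if the downloaded file differs.
EXPECTED_COLUMNS = {"Store", "Date", "Sales"}


def check_train_csv(data_dir):
    """Return the path to train.csv inside data_dir.

    Raises FileNotFoundError if the file is missing and ValueError if the
    header lacks any of the expected columns. (Hypothetical helper.)
    """
    path = Path(data_dir) / "train.csv"
    if not path.is_file():
        raise FileNotFoundError(
            f"{path} not found: download train.csv from Kaggle and put it in {data_dir}"
        )
    # Read only the header row; the full file is loaded later by the notebook.
    with path.open(newline="", encoding="utf-8") as f:
        header = next(csv.reader(f), [])
    missing = EXPECTED_COLUMNS - set(header)
    if missing:
        raise ValueError(f"{path} is missing expected columns: {sorted(missing)}")
    return path
```

For example, calling check_train_csv("rosseman_data") from the directory that contains the rosseman_data folder returns the file path when the download is in place, and otherwise raises an error describing what is missing.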

The ImageCFs folder contains all the files with the models, the code for generating the color MNIST dataset, and some utilities for the data augmentation used to train the predictive models (named oracles) for the effectiveness metrics. Example.ipynb and Example.html report the metrics for CVAE and CSAE.
