# Supplementary Materials for E^2M: Double Bounded α-Divergence Optimization for Tensor-based Discrete Density Estimation

## Appendix
This appendix provides a detailed description of the experimental setup, environment, additional numerical results, theoretical remarks, and proofs of theorems and propositions. In the following, we describe how we can reproduce the numerical results. 

## Source Code

To ensure reproducibility, we provide the source code for all experiments. Our code works with Python 3.12.3. 
The proposed algorithms for sparse data are available in `methods/emmix/sparse_em_mix_all.py`.
The proposed algorithms for dense data are available in `methods/emmix/dense_em_mix_all.py`.
Datasets used in the experiments need to be stored in the directories `real_data/` and `syn_data/`.
Results will be saved in the `results/` directory. 
Details for the setting of the experiments can be modified in the files given in `config/`.
Please refer to Section D for the datasets and implementation details.


###  Required packages

The following packages are needed for the proposed methods
```
numpy, itertools, functools, collections, COO
```
The following packages are needed to run baselines
```
Tensorly, nn-fac, torch
```
The following packages are needed for dataset loading and preprocessing
```
pmlb, ucimlrepo, pands, pickle
```
### E2M algorithm for dense random tensor

Run the following code in `src/` for a (rank two) mixture of CP, Tucker, TT, and background term.
```
$python3
>>> import sys
>>> sys.path.append("methods/emmix/")
>>> import numpy as np
>>> import dense_em_mix_all as emmix
>>> T = np.random.rand(10,10,10);
>>> T = T / np.sum(T); # input random normalized tensor
>>> rankcp = 2; ranktucker = 2; ranktt = 2; alpha = 0.5; model = [1,1,1,1]
>>> factors, hist, P, _ = emmix.EMCPTuckerTrain(T, [rankcp, ranktucker, ranktrain], alpha)
```
The four binary numbers in the `model` argument represent the CP, Tucker, TT, and the background term, respectively. For example, `model = [1,0,0,1]` means invoking a CP decomposition with a background term. When `alpha=1.0`, it optimizes the KL divergence from dense tensor `T`. The obtained factors are returned in `factors` and the loss curve is stored in `hist`. The full low-rank mixture tensor is returned as `P`. The code for sparse data can be found later.


### Data set preparation for sparse setting

Run the following commands in `src/` to download real datasets to generate training, validation, and test data as numpy files.
```
$cd src
$python3 loader/prepro.py
```
The resulting files will be saved in `real_data/{dataset_name}/`.  
Dataset size information can be found in `loader/dataset_info.py`.

### E2M-based Non-negative Tensor Mixture Learning for sparse data

The proposed method decomposes sparse tensors stored in COO format. In preparation for running the proposed method, the following steps are performed in `src/`:
```
$python3
>>> import sys
>>> import numpy as np
>>> sys.path.append("../real_data/")
>>> sys.path.append("mehtods/emmix")
>>> sys.path.append("loader")
>>> import sp_tensor, dataset_info, sparse_em_mix_all
>>> dataset_name = "SolarFlare"
>>> coords = np.load(f"../real_data/{dataset_name}/X_train_coords.npy")
>>> values = np.load(f"../real_data/{dataset_name}/X_train_values.npy")
>>> T = sp_tensor.Sp_tensor(coords, values, dataset_info.tensor_sizes[dataset_name], normalize=True)
```
Set `normalize=True` to treat the tensor as a discrete probability distribution.  
You can change `dataset_name` as needed (see `dataset_info.py` for available options).

To load real datasets more conveniently:
```
>>> import reader
>>> tvt = "train" # or "valid", "test"
>>> T = reader.load_data_real(dataset_name, tvt="train")
```

To perform E2MCPTTB decomposition with Hellinger distance (alpha=0.5):
```
>>> Rcp = 2; Rtrain = 2; alpha = 0.5
>>> model = [1,0,1,1]
>>> factors, hist, P, _ = sparse_em_mix_all.EMMix_sparse(T, [Rcp, 0, Rtrain], model=model, alpha=alpha )
```
`Rcp` is the rank of the CP decomposition, and `Rtrain` is the rank of the Train decomposition.

To compute the reconstruction at specific indices:
```
>>> import utils_mix_sparse as ums
>>> idx1 = [2, 0, 0, 1, 1, 1, 1, 1, 0, 1, 1, 0, 1]
>>> idx2 = [1, 0, 0, 1, 1, 0, 1, 1, 1, 1, 0, 1, 1]
>>> idxs = np.array( [idx1, idx2] )
>>> ums.get_vals_from_mixture(idxs, factors)
```

### For Experiments in Section 4

#### Convergence performance
Open `src/exp_losscurve.ipynb` and run each cell. Results will be saved in `../results/exp_loss/`. To visualize the figures, open and run `src/plot_losscurve.ipynb`.

#### Sensitivity of noise depending alpha 
Open `src/loader/syn_data_maker.ipynb` and run each cell to generate synthetic datasets (saved to `syn_data/anom/`). Then, open and run `src/exp_syn_anom.ipynb` to reproduce the results shown in Figure 9

#### Classification and density estimation
Open `src/config/rank_choise.ipynb` and run each cell. Then, the rank for all methods is saved in `src/config/ranks/`.  
To run an experiment, use the following command:
```
$python3 exp_em_alpha.py SolarFlare cptrain --alpha 1.0 --learn_noise True --rank_id 0
```
Changing `rank_id` = 1, 2, ..., then the number of parameters of the model increases. The evaluation code will be automatically run. You can see the results in `f../results/exp_dde/{datasetname}/cptrain/` as `.txt` file.

#### Running the Baselines KLTT and BM
Move to `methods/baselines/` and clone the repository for baselines and change the directory name `tnfp`
```
$cd methods/baselines/
$git clone https://github.com/glivan/tensor_networks_for_probabilistic_modeling
```
For evaluation, add the following `cross` method in the class `TN` in `methods/baselines/tfnp/tensornetworks/MPSClass.py`.
```
    def cross(self, coord, values, w=None):
        distance=0
        cross=0
        epsilon=10**(-10)
        if w is not None:
            self.w=self._padding_function(w)
        self.norm=self._computenorm()

        for n in range(len(coord)):
            loglikelihood = np.log(max(self._probability(coord[n,:])/self.norm,10** (-50)))
            cross -= values[n] * loglikelihood
        return cross

```
To run the code in Python 3.12.3, change all `xrange` to `range` in the cloned files.

Run `exp_bs.py` as follows:
```
$python3 exp_bs.py SolarFlare BM  --rank_id 0 --lr_id 0
$python3 exp_bs.py SolarFlare MPS --rank_id 0 --lr_id 0
```
`BM` and `MPS` correspond to BornMachine(BM) and Matrix Product States(KLTT), respectively.

The evaluation code will be automatically run. 
We can open the results by:
```
$python3
>>> imoprt numpy as np
>>> utils_exp as ue
>>> dataset_name = "SolarFlare"
>>> ue.pickle_load(f"results/exp_dde/BM/{dataset_name}_lrs.pkl") # Results for learning rate tuning.
>>> ue.pickle_load(f"results/exp_dde/BM/{dataset_name}.pkl") # Results for train and validation
>>> ue.pickle_load(f"results/exp_dde/BM/{dataset_name}_test.pkl") # Results for test dataset
```

### License
This source code is released under the MIT License.
/).