This directory contains the code for "Capturing the Denoising Effect of PCA via Compression Ratio."

## Data Availability

For our real-world experiments, we use the following datasets from Duo et al. (doi: 10.12688/f1000research.15666.3): 
- Koh
- Kumar
- SimKumar4easy
- SimKumar4hard
- SimKumar8hard
- Trapnell
- Zhengmix4eq
- Zhengmix4uneq
- Zhengmix8eq

The SingleCellExperiment objects for these datasets are available [here](https://bioconductor.org/packages/release/data/experiment/html/DuoClustering2018.html). The experiments.ipynb notebook expects these datasets in h5ad format in the following directory structure: 
    
    ```
    data/
    ├── Koh
    │   ├── koh.h5ad
    ├── Kumar
    │   ├── kumar.h5ad
    ├── simkumar
    │   ├── simkumar4easy.h5ad
    │   ├── simkumar4hard.h5ad
    │   ├── simkumar8hard.h5ad
    ├── Trapnell
    │   ├── trapnell.h5ad
    ├── zheng
    │   ├── zhengmix4eq.h5ad
    │   ├── zhengmix4uneq.h5ad
    │   ├── zhengmix8eq.h5ad
    ```