# CoLT
This repository contains the supplementary code for the paper: Radiograph Representation Learning by Completing Multi-modal Cloze Tests.

Some code in this repository is borrowed from [MAE](https://github.com/facebookresearch/mae) and [huggingface](https://huggingface.co).

## Getting started
### 1 Requirements
OS: Ubuntu 18.04 LTS.

Language: Python 3.8.11

If you are using conda, you can set up the environment as follows:

      conda env create -f environment.yaml
      pip install -r requirements.txt

### 2 Data preparation
- We use MIMIC-CXR-JPG for pre-training. More information about this dataset is available at [Johnson et al. MIMIC-CXR-JPG](https://physionet.org/content/mimic-cxr-jpg/2.0.0/).
- The dataset directory specified in run.sh should contain the MIMIC-CXR-JPG dataset. You also need to prepare a file "training.csv" and put it in the dataset directory.
- Each line of "training.csv" has two columns: (1) the path to an image and (2) the text of the corresponding report.
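The two-column layout described above could be produced with a short script like the following. This is only a minimal sketch, not part of the repository: the function name `write_training_csv` is hypothetical, and it assumes a comma-delimited file with no header row, the image path first and the report text second. Collapsing each report onto a single line is also an assumption. Check the data-loading code in this repository for the exact format it expects (delimiter, header, relative vs. absolute paths) before relying on it.

```python
import csv
from typing import Iterable, Tuple


def write_training_csv(pairs: Iterable[Tuple[str, str]], out_path: str) -> int:
    """Write (image_path, report_text) rows to a CSV file; return the row count.

    Hypothetical helper: assumes no header row and comma-separated columns.
    """
    count = 0
    # newline="" lets the csv module control line endings and quote
    # fields that contain commas or quotation marks.
    with open(out_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        for image_path, report_text in pairs:
            # Assumption: collapse all whitespace so each report stays on one line.
            one_line_report = " ".join(report_text.split())
            writer.writerow([image_path, one_line_report])
            count += 1
    return count
```

For example, `write_training_csv([("path/to/image.jpg", "report text")], "training.csv")` would write a single row; the path and report shown here are placeholders, not real dataset entries.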

  

### 3 Start Pre-training

- Set the data path, GPU IDs, batch size, output directory, and other parameters in [run.sh](/config.yaml).

- Start training by running:

      chmod a+x run.sh
      ./run.sh


