# Data preparation for the Microsoft dataset

The data can be obtained from Srinivasavaradhan et al. (2021).

1. Run `prepare_microsoft_data.py` with the `cluster_case` parameter set to `SC` and to `LC`:
    - `SC` = small clusters
    - `LC` = larger clusters
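
The step above can be sketched as a small Python driver that runs the script once per cluster case. This is only an illustration: it assumes `prepare_microsoft_data.py` accepts a `--cluster_case` command-line flag, which this README does not confirm, and the helper names `build_command` and `run_all` are hypothetical. If `cluster_case` is set inside the script instead, edit it there and run the script twice.

```python
import subprocess
import sys

# The two cluster cases named in the step above.
CLUSTER_CASES = {"SC": "small clusters", "LC": "larger clusters"}


def build_command(cluster_case, script="prepare_microsoft_data.py"):
    """Return the argv list for one run of the preparation script.

    ASSUMPTION: the script takes `--cluster_case <value>` on the command
    line. Adjust this if the parameter is passed differently.
    """
    if cluster_case not in CLUSTER_CASES:
        raise ValueError(f"unknown cluster_case: {cluster_case!r}")
    return [sys.executable, script, "--cluster_case", cluster_case]


def run_all():
    """Run the script once per cluster case, stopping on the first failure."""
    for case in CLUSTER_CASES:
        subprocess.run(build_command(case), check=True)
```

Calling `run_all()` from the directory that contains the script would then process `SC` followed by `LC`.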