To load our sample dataset, use `load_from_disk` from the Hugging Face `datasets` library:

    from datasets import load_from_disk
    data = load_from_disk('pick-double-caption_sample_250.hf')
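If the load fails, the most likely cause is a wrong path. The following is a minimal sketch, not part of the original instructions, that checks for the dataset directory before loading and then prints a summary of what was loaded. The helper name `load_sample` is hypothetical, and the sketch assumes the `.hf` directory sits in the current working directory.

```python
from pathlib import Path


def load_sample(path="pick-double-caption_sample_250.hf"):
    """Load the sample dataset, or return None if the directory is missing.

    `load_sample` is a hypothetical helper for illustration only.
    """
    if not Path(path).is_dir():
        return None
    # Imported here so the path check runs even if `datasets` is not installed yet.
    from datasets import load_from_disk

    return load_from_disk(path)


data = load_sample()
if data is None:
    print("Dataset directory not found; check the path.")
else:
    # Printing the loaded object shows its features (columns) and row count.
    print(data)
```

Printing the loaded object is a quick way to confirm the column names before indexing into it.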