# Dataset Introduction

There are there datasets here for evaluation.

c4_validation contains 10000 samples from c4(realnewslike) validation set.

c4_small contains 200 samples from c4(en) validation set.

openwebtext_eval contains 3769 samples from openwebtext.