To produce the data, first generate instruction-tuning data using generate-data.py. 
Then concatenate the data. An example of how to concatenate four documents is given by concatenate-350K.py. 

30 examples of concatenated data at 350K, 650K, and 1M context length are provided. 