Evaluating with bsz 1 tgt_len 256 ext_len 0 mem_len 0 clamp_len -1 using a sliding window.
245568 positions evaluated.
Time : 3482.72s, 14.20ms/segment
====================================================================================================
| test loss  3.75 | test ppl    42.609 
====================================================================================================
