Evaluating with bsz 1 tgt_len 256 ext_len 0 mem_len 0 clamp_len -1 using a sliding window.
245568 positions evaluated.
Time : 4086.30s, 16.66ms/segment
====================================================================================================
| test loss  3.77 | test ppl    43.337 
====================================================================================================
