Evaluating with bsz 1 tgt_len 256 ext_len 0 mem_len 0 clamp_len -1 using a sliding window.
245568 positions evaluated.
Time : 4006.82s, 16.33ms/segment
====================================================================================================
| test loss  3.89 | test ppl    48.999 
====================================================================================================
