Evaluating with bsz 1 tgt_len 256 ext_len 0 mem_len 0 clamp_len -1 using a sliding window.
245568 positions evaluated.
Time : 3658.60s, 14.91ms/segment
====================================================================================================
| test loss  4.00 | test ppl    54.732 
====================================================================================================
