====> Epoch:   1 Average train loss: -47209.5690 Average bpd: 5.543
====> Epoch:   2 Average train loss: -41231.0712 Average bpd: 4.841
====> [eval] Epoch:   2 Average bpd: 4.648
====> [test] Epoch:   2 Average bpd: 4.648
Best val_bpd: 4.6476198332999585
Best test_bpd: 4.647584846171234
====> Epoch:   3 Average train loss: -38452.7597 Average bpd: 4.515
====> Epoch:   4 Average train loss: -36633.3848 Average bpd: 4.301
====> [eval] Epoch:   4 Average bpd: 4.207
====> [test] Epoch:   4 Average bpd: 4.207
Best val_bpd: 4.207059110279229
Best test_bpd: 4.206984012656407
====> Epoch:   5 Average train loss: -35477.5095 Average bpd: 4.165
====> Epoch:   6 Average train loss: -34723.4304 Average bpd: 4.077
====> [eval] Epoch:   6 Average bpd: 4.036
====> [test] Epoch:   6 Average bpd: 4.036
Best val_bpd: 4.035503529700514
Best test_bpd: 4.035510892950901
====> Epoch:   7 Average train loss: -34206.2342 Average bpd: 4.016
====> Epoch:   8 Average train loss: -33828.7449 Average bpd: 3.972
====> [eval] Epoch:   8 Average bpd: 3.944
====> [test] Epoch:   8 Average bpd: 3.944
Best val_bpd: 3.9436690017323968
Best test_bpd: 3.943764879894871
====> Epoch:   9 Average train loss: -33540.3933 Average bpd: 3.938
====> Epoch:  10 Average train loss: -33326.2264 Average bpd: 3.913
====> [eval] Epoch:  10 Average bpd: 3.899
====> [test] Epoch:  10 Average bpd: 3.899
Best val_bpd: 3.899292435334255
Best test_bpd: 3.8992714719483432
====> Epoch:  11 Average train loss: -33046.7076 Average bpd: 3.880
====> Epoch:  12 Average train loss: -32869.9667 Average bpd: 3.859
====> [eval] Epoch:  12 Average bpd: 3.869
====> [test] Epoch:  12 Average bpd: 3.868
Best val_bpd: 3.8685092027415475
Best test_bpd: 3.8684316079570613
====> Epoch:  13 Average train loss: -32715.0233 Average bpd: 3.841
====> Epoch:  14 Average train loss: -32589.7746 Average bpd: 3.826
====> [eval] Epoch:  14 Average bpd: 3.822
====> [test] Epoch:  14 Average bpd: 3.822
Best val_bpd: 3.8222945621098483
Best test_bpd: 3.822303400168353
====> Epoch:  15 Average train loss: -32488.6276 Average bpd: 3.814
====> Epoch:  16 Average train loss: -32398.9922 Average bpd: 3.804
====> [eval] Epoch:  16 Average bpd: 3.813
====> [test] Epoch:  16 Average bpd: 3.813
Best val_bpd: 3.8125946424617045
Best test_bpd: 3.812624825306827
====> Epoch:  17 Average train loss: -32332.5888 Average bpd: 3.796
====> Epoch:  18 Average train loss: -32266.4119 Average bpd: 3.788
====> [eval] Epoch:  18 Average bpd: 3.786
====> [test] Epoch:  18 Average bpd: 3.787
Best val_bpd: 3.7864540590677973
Best test_bpd: 3.7865051705059263
====> Epoch:  19 Average train loss: -32210.4766 Average bpd: 3.782
====> Epoch:  20 Average train loss: -32160.2612 Average bpd: 3.776
====> [eval] Epoch:  20 Average bpd: 3.791
====> [test] Epoch:  20 Average bpd: 3.791
Best val_bpd: 3.7864540590677973
Best test_bpd: 3.7865051705059263
====> Epoch:  21 Average train loss: -32117.6893 Average bpd: 3.771
====> Epoch:  22 Average train loss: -32075.5159 Average bpd: 3.766
====> [eval] Epoch:  22 Average bpd: 3.767
====> [test] Epoch:  22 Average bpd: 3.767
Best val_bpd: 3.767477837850716
Best test_bpd: 3.7674045814625567
====> Epoch:  23 Average train loss: -32041.9553 Average bpd: 3.762
====> Epoch:  24 Average train loss: -32006.7737 Average bpd: 3.758
====> [eval] Epoch:  24 Average bpd: 3.766
====> [test] Epoch:  24 Average bpd: 3.766
Best val_bpd: 3.766150901120312
Best test_bpd: 3.766192045051771
====> Epoch:  25 Average train loss: -31979.4928 Average bpd: 3.755
====> Epoch:  26 Average train loss: -31952.4695 Average bpd: 3.751
====> [eval] Epoch:  26 Average bpd: 3.760
====> [test] Epoch:  26 Average bpd: 3.760
Best val_bpd: 3.759724178552062
Best test_bpd: 3.759794172334645
====> Epoch:  27 Average train loss: -31919.0733 Average bpd: 3.748
====> Epoch:  28 Average train loss: -31898.0836 Average bpd: 3.745
====> [eval] Epoch:  28 Average bpd: 3.750
====> [test] Epoch:  28 Average bpd: 3.750
Best val_bpd: 3.7500456884707156
Best test_bpd: 3.750066201530736
====> Epoch:  29 Average train loss: -32271.1425 Average bpd: 3.789
====> Epoch:  30 Average train loss: -31859.2445 Average bpd: 3.740
====> [eval] Epoch:  30 Average bpd: 3.746
====> [test] Epoch:  30 Average bpd: 3.746
Best val_bpd: 3.7463529650012903
Best test_bpd: 3.746348904727995
====> Epoch:  31 Average train loss: -31841.8906 Average bpd: 3.738
====> Epoch:  32 Average train loss: -31826.4660 Average bpd: 3.737
====> [eval] Epoch:  32 Average bpd: 3.734
====> [test] Epoch:  32 Average bpd: 3.734
Best val_bpd: 3.7343776919805736
Best test_bpd: 3.7343915075531786
====> Epoch:  33 Average train loss: -31803.6794 Average bpd: 3.734
====> Epoch:  34 Average train loss: -31785.3758 Average bpd: 3.732
====> [eval] Epoch:  34 Average bpd: 3.743
====> [test] Epoch:  34 Average bpd: 3.743
Best val_bpd: 3.7343776919805736
Best test_bpd: 3.7343915075531786
====> Epoch:  35 Average train loss: -31776.5343 Average bpd: 3.731
====> Epoch:  36 Average train loss: -31756.4673 Average bpd: 3.728
====> [eval] Epoch:  36 Average bpd: 3.743
====> [test] Epoch:  36 Average bpd: 3.743
Best val_bpd: 3.7343776919805736
Best test_bpd: 3.7343915075531786
====> Epoch:  37 Average train loss: -31742.7583 Average bpd: 3.727
====> Epoch:  38 Average train loss: -31733.3497 Average bpd: 3.726
====> [eval] Epoch:  38 Average bpd: 3.739
====> [test] Epoch:  38 Average bpd: 3.739
Best val_bpd: 3.7343776919805736
Best test_bpd: 3.7343915075531786
====> Epoch:  39 Average train loss: -31714.8883 Average bpd: 3.724
====> Epoch:  40 Average train loss: -31706.0197 Average bpd: 3.723
====> [eval] Epoch:  40 Average bpd: 3.720
====> [test] Epoch:  40 Average bpd: 3.720
Best val_bpd: 3.720436176032044
Best test_bpd: 3.720495464316676
====> Epoch:  41 Average train loss: -31692.8035 Average bpd: 3.721
====> Epoch:  42 Average train loss: -31683.5733 Average bpd: 3.720
====> [eval] Epoch:  42 Average bpd: 3.723
====> [test] Epoch:  42 Average bpd: 3.723
Best val_bpd: 3.720436176032044
Best test_bpd: 3.720495464316676
====> Epoch:  43 Average train loss: -31668.7957 Average bpd: 3.718
====> Epoch:  44 Average train loss: -31659.3560 Average bpd: 3.717
====> [eval] Epoch:  44 Average bpd: 3.726
====> [test] Epoch:  44 Average bpd: 3.726
Best val_bpd: 3.720436176032044
Best test_bpd: 3.720495464316676
====> Epoch:  45 Average train loss: -31648.3132 Average bpd: 3.716
====> Epoch:  46 Average train loss: -31642.3793 Average bpd: 3.715
====> [eval] Epoch:  46 Average bpd: 3.720
====> [test] Epoch:  46 Average bpd: 3.720
Best val_bpd: 3.7201810467060485
Best test_bpd: 3.720248861014082
====> Epoch:  47 Average train loss: -31631.8596 Average bpd: 3.714
====> Epoch:  48 Average train loss: -31628.3436 Average bpd: 3.713
====> [eval] Epoch:  48 Average bpd: 3.718
====> [test] Epoch:  48 Average bpd: 3.718
Best val_bpd: 3.7180321448437192
Best test_bpd: 3.7179866206763617
====> Epoch:  49 Average train loss: -31613.1593 Average bpd: 3.712
====> Epoch:  50 Average train loss: -31608.0093 Average bpd: 3.711
====> [eval] Epoch:  50 Average bpd: 3.715
====> [test] Epoch:  50 Average bpd: 3.715
Best val_bpd: 3.7145498190624586
Best test_bpd: 3.7146048126254017
====> Epoch:  51 Average train loss: -31604.6018 Average bpd: 3.711
====> Epoch:  52 Average train loss: -31591.9122 Average bpd: 3.709
====> [eval] Epoch:  52 Average bpd: 3.716
====> [test] Epoch:  52 Average bpd: 3.716
Best val_bpd: 3.7145498190624586
Best test_bpd: 3.7146048126254017
====> Epoch:  53 Average train loss: -31583.3460 Average bpd: 3.708
