====> Epoch:   1 Average train loss: -12466.6726 Average bpd: 5.855
====> Epoch:   2 Average train loss: -10995.8586 Average bpd: 5.164
====> [eval] Epoch:   2 Average bpd: 4.930
====> [test] Epoch:   2 Average bpd: 4.930
Best val_bpd: 4.930290538988138
Best test_bpd: 4.930220644356865
====> Epoch:   3 Average train loss: -10260.6747 Average bpd: 4.819
====> Epoch:   4 Average train loss: -9798.3382 Average bpd: 4.602
====> [eval] Epoch:   4 Average bpd: 4.508
====> [test] Epoch:   4 Average bpd: 4.508
Best val_bpd: 4.50772396753235
Best test_bpd: 4.507612856156345
====> Epoch:   5 Average train loss: -9495.2453 Average bpd: 4.459
====> Epoch:   6 Average train loss: -9295.2145 Average bpd: 4.365
====> [eval] Epoch:   6 Average bpd: 4.330
====> [test] Epoch:   6 Average bpd: 4.330
Best val_bpd: 4.329541047530525
Best test_bpd: 4.329511366558847
====> Epoch:   7 Average train loss: -9158.7716 Average bpd: 4.301
====> Epoch:   8 Average train loss: -9061.5665 Average bpd: 4.256
====> [eval] Epoch:   8 Average bpd: 4.214
====> [test] Epoch:   8 Average bpd: 4.214
Best val_bpd: 4.213667966409883
Best test_bpd: 4.213698214189339
====> Epoch:   9 Average train loss: -8991.6343 Average bpd: 4.223
====> Epoch:  10 Average train loss: -8937.5494 Average bpd: 4.197
====> [eval] Epoch:  10 Average bpd: 4.167
====> [test] Epoch:  10 Average bpd: 4.167
Best val_bpd: 4.167312862503325
Best test_bpd: 4.167477331129277
====> Epoch:  11 Average train loss: -8872.7213 Average bpd: 4.167
====> Epoch:  12 Average train loss: -8824.7534 Average bpd: 4.144
====> [eval] Epoch:  12 Average bpd: 4.129
====> [test] Epoch:  12 Average bpd: 4.129
Best val_bpd: 4.128730463936672
Best test_bpd: 4.12897453554104
====> Epoch:  13 Average train loss: -8787.2868 Average bpd: 4.127
====> Epoch:  14 Average train loss: -8757.7934 Average bpd: 4.113
====> [eval] Epoch:  14 Average bpd: 4.101
====> [test] Epoch:  14 Average bpd: 4.102
Best val_bpd: 4.101477838781818
Best test_bpd: 4.1016590911505695
====> Epoch:  15 Average train loss: -8733.5112 Average bpd: 4.101
====> Epoch:  16 Average train loss: -8714.4804 Average bpd: 4.093
====> [eval] Epoch:  16 Average bpd: 4.078
====> [test] Epoch:  16 Average bpd: 4.078
Best val_bpd: 4.078133685470723
Best test_bpd: 4.077841619535242
====> Epoch:  17 Average train loss: -8698.1023 Average bpd: 4.085
====> Epoch:  18 Average train loss: -8683.2496 Average bpd: 4.078
====> [eval] Epoch:  18 Average bpd: 4.066
====> [test] Epoch:  18 Average bpd: 4.066
Best val_bpd: 4.066114009681731
Best test_bpd: 4.0662378912840405
====> Epoch:  19 Average train loss: -8671.4648 Average bpd: 4.072
====> Epoch:  20 Average train loss: -8660.9239 Average bpd: 4.067
====> [eval] Epoch:  20 Average bpd: 4.062
====> [test] Epoch:  20 Average bpd: 4.062
Best val_bpd: 4.06154833303755
Best test_bpd: 4.061545628353126
====> Epoch:  21 Average train loss: -8651.3967 Average bpd: 4.063
====> Epoch:  22 Average train loss: -8643.0316 Average bpd: 4.059
====> [eval] Epoch:  22 Average bpd: 4.048
====> [test] Epoch:  22 Average bpd: 4.049
Best val_bpd: 4.048444557112164
Best test_bpd: 4.048624525931763
====> Epoch:  23 Average train loss: -8635.6045 Average bpd: 4.056
====> Epoch:  24 Average train loss: -8628.0812 Average bpd: 4.052
====> [eval] Epoch:  24 Average bpd: 4.046
====> [test] Epoch:  24 Average bpd: 4.046
Best val_bpd: 4.046131326074811
Best test_bpd: 4.045992326462521
====> Epoch:  25 Average train loss: -8622.0886 Average bpd: 4.049
====> Epoch:  26 Average train loss: -8615.8983 Average bpd: 4.046
====> [eval] Epoch:  26 Average bpd: 4.037
====> [test] Epoch:  26 Average bpd: 4.038
Best val_bpd: 4.037310861855582
Best test_bpd: 4.037501123175364
====> Epoch:  27 Average train loss: -8610.5731 Average bpd: 4.044
====> Epoch:  28 Average train loss: -8605.2692 Average bpd: 4.041
====> [eval] Epoch:  28 Average bpd: 4.043
====> [test] Epoch:  28 Average bpd: 4.043
Best val_bpd: 4.037310861855582
Best test_bpd: 4.037501123175364
====> Epoch:  29 Average train loss: -8601.0947 Average bpd: 4.039
====> Epoch:  30 Average train loss: -8596.5247 Average bpd: 4.037
====> [eval] Epoch:  30 Average bpd: 4.032
====> [test] Epoch:  30 Average bpd: 4.032
Best val_bpd: 4.0320391273425615
Best test_bpd: 4.031671274973574
====> Epoch:  31 Average train loss: -8592.4928 Average bpd: 4.035
====> Epoch:  32 Average train loss: -8589.2320 Average bpd: 4.034
====> [eval] Epoch:  32 Average bpd: 4.024
====> [test] Epoch:  32 Average bpd: 4.024
Best val_bpd: 4.023671332925915
Best test_bpd: 4.024019083315896
====> Epoch:  33 Average train loss: -8585.0863 Average bpd: 4.032
====> Epoch:  34 Average train loss: -8581.8514 Average bpd: 4.030
====> [eval] Epoch:  34 Average bpd: 4.036
====> [test] Epoch:  34 Average bpd: 4.036
Best val_bpd: 4.023671332925915
Best test_bpd: 4.024019083315896
====> Epoch:  35 Average train loss: -8578.5045 Average bpd: 4.029
====> Epoch:  36 Average train loss: -8575.9874 Average bpd: 4.028
====> [eval] Epoch:  36 Average bpd: 4.027
====> [test] Epoch:  36 Average bpd: 4.027
Best val_bpd: 4.023671332925915
Best test_bpd: 4.024019083315896
====> Epoch:  37 Average train loss: -8572.8285 Average bpd: 4.026
====> Epoch:  38 Average train loss: -8570.4452 Average bpd: 4.025
====> [eval] Epoch:  38 Average bpd: 4.030
====> [test] Epoch:  38 Average bpd: 4.030
Best val_bpd: 4.023671332925915
Best test_bpd: 4.024019083315896
====> Epoch:  39 Average train loss: -8567.4840 Average bpd: 4.024
====> Epoch:  40 Average train loss: -8564.8999 Average bpd: 4.022
====> [eval] Epoch:  40 Average bpd: 4.021
====> [test] Epoch:  40 Average bpd: 4.021
Best val_bpd: 4.020547441819567
Best test_bpd: 4.020749854816085
====> Epoch:  41 Average train loss: -8562.8681 Average bpd: 4.021
====> Epoch:  42 Average train loss: -8560.5574 Average bpd: 4.020
====> [eval] Epoch:  42 Average bpd: 4.018
====> [test] Epoch:  42 Average bpd: 4.018
Best val_bpd: 4.018265659500352
Best test_bpd: 4.018445468684773
====> Epoch:  43 Average train loss: -8558.7987 Average bpd: 4.019
====> Epoch:  44 Average train loss: -8557.1401 Average bpd: 4.019
====> [eval] Epoch:  44 Average bpd: 4.015
====> [test] Epoch:  44 Average bpd: 4.015
Best val_bpd: 4.015120852657772
Best test_bpd: 4.015222454716128
====> Epoch:  45 Average train loss: -8555.5648 Average bpd: 4.018
====> Epoch:  46 Average train loss: -8552.6117 Average bpd: 4.017
====> [eval] Epoch:  46 Average bpd: 4.016
====> [test] Epoch:  46 Average bpd: 4.016
Best val_bpd: 4.015120852657772
Best test_bpd: 4.015222454716128
====> Epoch:  47 Average train loss: -8551.1597 Average bpd: 4.016
====> Epoch:  48 Average train loss: -8549.3532 Average bpd: 4.015
====> [eval] Epoch:  48 Average bpd: 4.010
====> [test] Epoch:  48 Average bpd: 4.010
Best val_bpd: 4.009956292147009
Best test_bpd: 4.010003709380782
====> Epoch:  49 Average train loss: -8548.0522 Average bpd: 4.014
====> Epoch:  50 Average train loss: -8546.3178 Average bpd: 4.014
====> [eval] Epoch:  50 Average bpd: 4.013
====> [test] Epoch:  50 Average bpd: 4.013
Best val_bpd: 4.009956292147009
Best test_bpd: 4.010003709380782
====> Epoch:  51 Average train loss: -8545.4282 Average bpd: 4.013
====> Epoch:  52 Average train loss: -8543.5705 Average bpd: 4.012
====> [eval] Epoch:  52 Average bpd: 4.015
====> [test] Epoch:  52 Average bpd: 4.015
Best val_bpd: 4.009956292147009
Best test_bpd: 4.010003709380782
====> Epoch:  53 Average train loss: -8542.2954 Average bpd: 4.012
====> Epoch:  54 Average train loss: -8541.1085 Average bpd: 4.011
====> [eval] Epoch:  54 Average bpd: 4.038
====> [test] Epoch:  54 Average bpd: 4.038
Best val_bpd: 4.009956292147009
Best test_bpd: 4.010003709380782
====> Epoch:  55 Average train loss: -8539.4141 Average bpd: 4.010
====> Epoch:  56 Average train loss: -8538.1438 Average bpd: 4.010
====> [eval] Epoch:  56 Average bpd: 4.006
====> [test] Epoch:  56 Average bpd: 4.006
Best val_bpd: 4.006156530683853
Best test_bpd: 4.006139038431329
====> Epoch:  57 Average train loss: -8536.7299 Average bpd: 4.009
====> Epoch:  58 Average train loss: -8535.2658 Average bpd: 4.008
====> [eval] Epoch:  58 Average bpd: 4.003
====> [test] Epoch:  58 Average bpd: 4.003
Best val_bpd: 4.003120074381254
Best test_bpd: 4.003035258711585
====> Epoch:  59 Average train loss: -8534.7694 Average bpd: 4.008
====> Epoch:  60 Average train loss: -8533.1108 Average bpd: 4.007
====> [eval] Epoch:  60 Average bpd: 4.009
====> [test] Epoch:  60 Average bpd: 4.008
Best val_bpd: 4.003120074381254
Best test_bpd: 4.003035258711585
====> Epoch:  61 Average train loss: -8532.7082 Average bpd: 4.007
====> Epoch:  62 Average train loss: -8531.5565 Average bpd: 4.007
====> [eval] Epoch:  62 Average bpd: 4.002
====> [test] Epoch:  62 Average bpd: 4.002
Best val_bpd: 4.001788692003689
Best test_bpd: 4.002005644148922
====> Epoch:  63 Average train loss: -8530.8120 Average bpd: 4.006
====> Epoch:  64 Average train loss: -8529.7057 Average bpd: 4.006
====> [eval] Epoch:  64 Average bpd: 3.998
====> [test] Epoch:  64 Average bpd: 3.998
Best val_bpd: 3.997576402370626
Best test_bpd: 3.9976714058809506
====> Epoch:  65 Average train loss: -8528.7576 Average bpd: 4.005
====> Epoch:  66 Average train loss: -8528.4713 Average bpd: 4.005
====> [eval] Epoch:  66 Average bpd: 4.000
====> [test] Epoch:  66 Average bpd: 4.000
Best val_bpd: 3.997576402370626
Best test_bpd: 3.9976714058809506
====> Epoch:  67 Average train loss: -8526.9993 Average bpd: 4.005
====> Epoch:  68 Average train loss: -8525.8969 Average bpd: 4.004
====> [eval] Epoch:  68 Average bpd: 3.998
====> [test] Epoch:  68 Average bpd: 3.998
Best val_bpd: 3.997576402370626
Best test_bpd: 3.9976714058809506
====> Epoch:  69 Average train loss: -8525.5198 Average bpd: 4.004
====> Epoch:  70 Average train loss: -8523.7516 Average bpd: 4.003
====> [eval] Epoch:  70 Average bpd: 4.002
====> [test] Epoch:  70 Average bpd: 4.002
Best val_bpd: 3.997576402370626
Best test_bpd: 3.9976714058809506
====> Epoch:  71 Average train loss: -8522.8975 Average bpd: 4.003
====> Epoch:  72 Average train loss: -8522.0536 Average bpd: 4.002
====> [eval] Epoch:  72 Average bpd: 3.998
====> [test] Epoch:  72 Average bpd: 3.998
Best val_bpd: 3.997576402370626
Best test_bpd: 3.9976714058809506
====> Epoch:  73 Average train loss: -8521.3494 Average bpd: 4.002
