Namespace(batch_size=16, bucket_mult=100, bucket_num=10, bucket_ratio=0.0, bucket_type='fixed', clip=None, dropout=0.0, epochs=3, gpu=0, lm_model='standard_lstm_lm_200', log_interval=30, lr=0.005, no_pretrained=True, save_prefix='imdb_lstm_200', use_mean_pool=True, valid_ratio=0.1)
Use gpu0
Tokenize using spaCy...
Done! Tokenizing Time=8.51s, #Sentences=22500
Done! Tokenizing Time=1.36s, #Sentences=2500
Done! Tokenizing Time=8.90s, #Sentences=25000
Use FixedBucketSampler
FixedBucketSampler:
  sample_num=22500, batch_num=1412
  key=[59, 108, 157, 206, 255, 304, 353, 402, 451, 500]
  cnt=[541, 1794, 4583, 4609, 2737, 1848, 1322, 1052, 780, 3234]
  batch_size=[16, 16, 16, 16, 16, 16, 16, 16, 16, 16]
SentimentNet(
  (embedding): HybridSequential(
    (0): Embedding(33278 -> 200, float32)
  )
  (encoder): LSTM(200 -> 200, TNC, num_layers=2)
  (agg_layer): AggregationLayer(
  )
  (output): HybridSequential(
    (0): Dropout(p = 0.0, axes=())
    (1): Dense(None -> 1, linear)
  )
)
[22:26:59] src/storage/storage.cc:129: Using GPUPooledRoundedStorageManager.
[Epoch 0 Batch 30/1412] avg loss 0.0428669, throughput 106.378K wps
[Epoch 0 Batch 60/1412] avg loss 0.0570866, throughput 168.379K wps
[Epoch 0 Batch 90/1412] avg loss 0.0439298, throughput 156.44K wps
[Epoch 0 Batch 120/1412] avg loss 0.0443242, throughput 159.624K wps
[Epoch 0 Batch 150/1412] avg loss 0.0449614, throughput 152.496K wps
[Epoch 0 Batch 180/1412] avg loss 0.043559, throughput 152.676K wps
[Epoch 0 Batch 210/1412] avg loss 0.0432507, throughput 130.492K wps
[Epoch 0 Batch 240/1412] avg loss 0.0433857, throughput 152.49K wps
[Epoch 0 Batch 270/1412] avg loss 0.0433795, throughput 138.597K wps
[Epoch 0 Batch 300/1412] avg loss 0.0445076, throughput 142.928K wps
[Epoch 0 Batch 330/1412] avg loss 0.0427613, throughput 145.381K wps
[Epoch 0 Batch 360/1412] avg loss 0.0430574, throughput 127.357K wps
[Epoch 0 Batch 390/1412] avg loss 0.0413071, throughput 136.346K wps
[Epoch 0 Batch 420/1412] avg loss 0.0389483, throughput 151.582K wps
[Epoch 0 Batch 450/1412] avg loss 0.0370766, throughput 153.719K wps
[Epoch 0 Batch 480/1412] avg loss 0.0359703, throughput 101.234K wps
[Epoch 0 Batch 510/1412] avg loss 0.0341675, throughput 150.443K wps
[Epoch 0 Batch 540/1412] avg loss 0.0308559, throughput 158.564K wps
[Epoch 0 Batch 570/1412] avg loss 0.0322593, throughput 150.255K wps
[Epoch 0 Batch 600/1412] avg loss 0.0449493, throughput 136.193K wps
[Epoch 0 Batch 630/1412] avg loss 0.0400247, throughput 138.369K wps
[Epoch 0 Batch 660/1412] avg loss 0.0393196, throughput 162.616K wps
[Epoch 0 Batch 690/1412] avg loss 0.0382806, throughput 161.626K wps
[Epoch 0 Batch 720/1412] avg loss 0.0381384, throughput 150.525K wps
[Epoch 0 Batch 750/1412] avg loss 0.031074, throughput 144.569K wps
[Epoch 0 Batch 780/1412] avg loss 0.0291415, throughput 166.854K wps
[Epoch 0 Batch 810/1412] avg loss 0.0294662, throughput 156.329K wps
[Epoch 0 Batch 840/1412] avg loss 0.0297857, throughput 140.029K wps
[Epoch 0 Batch 870/1412] avg loss 0.0263667, throughput 147.856K wps
[Epoch 0 Batch 900/1412] avg loss 0.031217, throughput 173.415K wps
[Epoch 0 Batch 930/1412] avg loss 0.0288161, throughput 150.079K wps
[Epoch 0 Batch 960/1412] avg loss 0.0286358, throughput 159.721K wps
[Epoch 0 Batch 990/1412] avg loss 0.0283354, throughput 166.265K wps
[Epoch 0 Batch 1020/1412] avg loss 0.0261076, throughput 159.984K wps
[Epoch 0 Batch 1050/1412] avg loss 0.0241801, throughput 126.988K wps
[Epoch 0 Batch 1080/1412] avg loss 0.0307288, throughput 129.859K wps
[Epoch 0 Batch 1110/1412] avg loss 0.0267675, throughput 150.202K wps
[Epoch 0 Batch 1140/1412] avg loss 0.0276033, throughput 144.298K wps
[Epoch 0 Batch 1170/1412] avg loss 0.0224557, throughput 149.781K wps
[Epoch 0 Batch 1200/1412] avg loss 0.0258165, throughput 143.115K wps
[Epoch 0 Batch 1230/1412] avg loss 0.0224211, throughput 155.295K wps
[Epoch 0 Batch 1260/1412] avg loss 0.0237606, throughput 145.458K wps
[Epoch 0 Batch 1290/1412] avg loss 0.0237062, throughput 144.302K wps
[Epoch 0 Batch 1320/1412] avg loss 0.0273534, throughput 146.953K wps
[Epoch 0 Batch 1350/1412] avg loss 0.0191257, throughput 152.756K wps
[Epoch 0 Batch 1380/1412] avg loss 0.0249543, throughput 152.931K wps
[Epoch 0 Batch 1410/1412] avg loss 0.0228795, throughput 169.814K wps
Begin Testing...
[Batch 30/157] elapsed 0.57 s
[Batch 60/157] elapsed 0.46 s
[Batch 90/157] elapsed 0.37 s
[Batch 120/157] elapsed 0.27 s
[Batch 150/157] elapsed 0.25 s
Begin Testing...
[Batch 30/1563] elapsed 0.71 s
[Batch 60/1563] elapsed 0.65 s
[Batch 90/1563] elapsed 0.71 s
[Batch 120/1563] elapsed 0.71 s
[Batch 150/1563] elapsed 0.64 s
[Batch 180/1563] elapsed 0.62 s
[Batch 210/1563] elapsed 0.66 s
[Batch 240/1563] elapsed 0.61 s
[Batch 270/1563] elapsed 0.62 s
[Batch 300/1563] elapsed 0.59 s
[Batch 330/1563] elapsed 0.49 s
[Batch 360/1563] elapsed 0.54 s
[Batch 390/1563] elapsed 0.52 s
[Batch 420/1563] elapsed 0.35 s
[Batch 450/1563] elapsed 0.42 s
[Batch 480/1563] elapsed 0.45 s
[Batch 510/1563] elapsed 0.45 s
[Batch 540/1563] elapsed 0.42 s
[Batch 570/1563] elapsed 0.43 s
[Batch 600/1563] elapsed 0.41 s
[Batch 630/1563] elapsed 0.41 s
[Batch 660/1563] elapsed 0.40 s
[Batch 690/1563] elapsed 0.40 s
[Batch 720/1563] elapsed 0.38 s
[Batch 750/1563] elapsed 0.37 s
[Batch 780/1563] elapsed 0.36 s
[Batch 810/1563] elapsed 0.35 s
[Batch 840/1563] elapsed 0.36 s
[Batch 870/1563] elapsed 0.35 s
[Batch 900/1563] elapsed 0.32 s
[Batch 930/1563] elapsed 0.35 s
[Batch 960/1563] elapsed 0.28 s
[Batch 990/1563] elapsed 0.28 s
[Batch 1020/1563] elapsed 0.29 s
[Batch 1050/1563] elapsed 0.33 s
[Batch 1080/1563] elapsed 0.31 s
[Batch 1110/1563] elapsed 0.29 s
[Batch 1140/1563] elapsed 0.27 s
[Batch 1170/1563] elapsed 0.26 s
[Batch 1200/1563] elapsed 0.27 s
[Batch 1230/1563] elapsed 0.27 s
[Batch 1260/1563] elapsed 0.27 s
[Batch 1290/1563] elapsed 0.25 s
[Batch 1320/1563] elapsed 0.27 s
[Batch 1350/1563] elapsed 0.26 s
[Batch 1380/1563] elapsed 0.20 s
[Batch 1410/1563] elapsed 0.21 s
[Batch 1440/1563] elapsed 0.18 s
[Batch 1470/1563] elapsed 0.15 s
[Batch 1500/1563] elapsed 0.14 s
[Batch 1530/1563] elapsed 0.19 s
[Batch 1560/1563] elapsed 0.19 s
[Epoch 0] train avg loss 0.0339896, valid acc 0.8576, valid avg loss 0.365598, test acc 0.8383, test avg loss 0.386616, throughput 146.557K wps
Observed Improvement.
[Epoch 1 Batch 30/1412] avg loss 0.0164784, throughput 156.317K wps
[Epoch 1 Batch 60/1412] avg loss 0.0195543, throughput 194.057K wps
[Epoch 1 Batch 90/1412] avg loss 0.016039, throughput 161.211K wps
[Epoch 1 Batch 120/1412] avg loss 0.0180766, throughput 143.514K wps
[Epoch 1 Batch 150/1412] avg loss 0.0157969, throughput 132.522K wps
[Epoch 1 Batch 180/1412] avg loss 0.0183245, throughput 183.884K wps
[Epoch 1 Batch 210/1412] avg loss 0.0162369, throughput 154.258K wps
[Epoch 1 Batch 240/1412] avg loss 0.0172209, throughput 138.354K wps
[Epoch 1 Batch 270/1412] avg loss 0.0184088, throughput 143.766K wps
[Epoch 1 Batch 300/1412] avg loss 0.0153024, throughput 148.588K wps
[Epoch 1 Batch 330/1412] avg loss 0.0138965, throughput 154.2K wps
[Epoch 1 Batch 360/1412] avg loss 0.0164958, throughput 141.821K wps
[Epoch 1 Batch 390/1412] avg loss 0.0180795, throughput 137.314K wps
[Epoch 1 Batch 420/1412] avg loss 0.0179689, throughput 133.119K wps
[Epoch 1 Batch 450/1412] avg loss 0.0191183, throughput 127.396K wps
[Epoch 1 Batch 480/1412] avg loss 0.0160748, throughput 148.973K wps
[Epoch 1 Batch 510/1412] avg loss 0.0144047, throughput 167.895K wps
[Epoch 1 Batch 540/1412] avg loss 0.0147949, throughput 158.629K wps
[Epoch 1 Batch 570/1412] avg loss 0.0167364, throughput 165.848K wps
[Epoch 1 Batch 600/1412] avg loss 0.0153618, throughput 153.916K wps
[Epoch 1 Batch 630/1412] avg loss 0.0153536, throughput 145.819K wps
[Epoch 1 Batch 660/1412] avg loss 0.0158145, throughput 133.36K wps
[Epoch 1 Batch 690/1412] avg loss 0.0168222, throughput 125.331K wps
[Epoch 1 Batch 720/1412] avg loss 0.0188277, throughput 135.199K wps
[Epoch 1 Batch 750/1412] avg loss 0.0185904, throughput 138.009K wps
[Epoch 1 Batch 780/1412] avg loss 0.0172805, throughput 129.293K wps
[Epoch 1 Batch 810/1412] avg loss 0.0163323, throughput 146.436K wps
[Epoch 1 Batch 840/1412] avg loss 0.0198024, throughput 146.528K wps
[Epoch 1 Batch 870/1412] avg loss 0.016999, throughput 179.667K wps
[Epoch 1 Batch 900/1412] avg loss 0.0152223, throughput 166.778K wps
[Epoch 1 Batch 930/1412] avg loss 0.0202609, throughput 157.282K wps
[Epoch 1 Batch 960/1412] avg loss 0.0169, throughput 146.135K wps
[Epoch 1 Batch 990/1412] avg loss 0.0155935, throughput 147.879K wps
[Epoch 1 Batch 1020/1412] avg loss 0.0181931, throughput 187.976K wps
[Epoch 1 Batch 1050/1412] avg loss 0.0182481, throughput 142.725K wps
[Epoch 1 Batch 1080/1412] avg loss 0.0168207, throughput 148.122K wps
[Epoch 1 Batch 1110/1412] avg loss 0.0161095, throughput 165.792K wps
[Epoch 1 Batch 1140/1412] avg loss 0.0139229, throughput 149.435K wps
[Epoch 1 Batch 1170/1412] avg loss 0.0152732, throughput 127.445K wps
[Epoch 1 Batch 1200/1412] avg loss 0.0154432, throughput 173.635K wps
[Epoch 1 Batch 1230/1412] avg loss 0.0157494, throughput 135.453K wps
[Epoch 1 Batch 1260/1412] avg loss 0.0161098, throughput 155.932K wps
[Epoch 1 Batch 1290/1412] avg loss 0.0161368, throughput 142.513K wps
[Epoch 1 Batch 1320/1412] avg loss 0.0151052, throughput 157.653K wps
[Epoch 1 Batch 1350/1412] avg loss 0.0150935, throughput 142.506K wps
[Epoch 1 Batch 1380/1412] avg loss 0.0154601, throughput 160.373K wps
[Epoch 1 Batch 1410/1412] avg loss 0.0163084, throughput 161.016K wps
Begin Testing...
[Batch 30/157] elapsed 0.71 s
[Batch 60/157] elapsed 0.49 s
[Batch 90/157] elapsed 0.39 s
[Batch 120/157] elapsed 0.33 s
[Batch 150/157] elapsed 0.27 s
Begin Testing...
[Batch 30/1563] elapsed 0.77 s
[Batch 60/1563] elapsed 0.71 s
[Batch 90/1563] elapsed 0.74 s
[Batch 120/1563] elapsed 0.73 s
[Batch 150/1563] elapsed 0.69 s
[Batch 180/1563] elapsed 0.62 s
[Batch 210/1563] elapsed 0.58 s
[Batch 240/1563] elapsed 0.65 s
[Batch 270/1563] elapsed 0.54 s
[Batch 300/1563] elapsed 0.51 s
[Batch 330/1563] elapsed 0.48 s
[Batch 360/1563] elapsed 0.49 s
[Batch 390/1563] elapsed 0.39 s
[Batch 420/1563] elapsed 0.40 s
[Batch 450/1563] elapsed 0.44 s
[Batch 480/1563] elapsed 0.44 s
[Batch 510/1563] elapsed 0.35 s
[Batch 540/1563] elapsed 0.34 s
[Batch 570/1563] elapsed 0.36 s
[Batch 600/1563] elapsed 0.30 s
[Batch 630/1563] elapsed 0.30 s
[Batch 660/1563] elapsed 0.30 s
[Batch 690/1563] elapsed 0.27 s
[Batch 720/1563] elapsed 0.27 s
[Batch 750/1563] elapsed 0.30 s
[Batch 780/1563] elapsed 0.33 s
[Batch 810/1563] elapsed 0.32 s
[Batch 840/1563] elapsed 0.29 s
[Batch 870/1563] elapsed 0.24 s
[Batch 900/1563] elapsed 0.23 s
[Batch 930/1563] elapsed 0.23 s
[Batch 960/1563] elapsed 0.23 s
[Batch 990/1563] elapsed 0.27 s
[Batch 1020/1563] elapsed 0.29 s
[Batch 1050/1563] elapsed 0.25 s
[Batch 1080/1563] elapsed 0.22 s
[Batch 1110/1563] elapsed 0.23 s
[Batch 1140/1563] elapsed 0.23 s
[Batch 1170/1563] elapsed 0.22 s
[Batch 1200/1563] elapsed 0.21 s
[Batch 1230/1563] elapsed 0.25 s
[Batch 1260/1563] elapsed 0.27 s
[Batch 1290/1563] elapsed 0.26 s
[Batch 1320/1563] elapsed 0.23 s
[Batch 1350/1563] elapsed 0.22 s
[Batch 1380/1563] elapsed 0.22 s
[Batch 1410/1563] elapsed 0.23 s
[Batch 1440/1563] elapsed 0.22 s
[Batch 1470/1563] elapsed 0.19 s
[Batch 1500/1563] elapsed 0.17 s
[Batch 1530/1563] elapsed 0.14 s
[Batch 1560/1563] elapsed 0.13 s
[Epoch 1] train avg loss 0.0166272, valid acc 0.8852, valid avg loss 0.301081, test acc 0.8560, test avg loss 0.340057, throughput 149.671K wps
Observed Improvement.
[Epoch 2 Batch 30/1412] avg loss 0.00652912, throughput 168.676K wps
[Epoch 2 Batch 60/1412] avg loss 0.00728209, throughput 172.449K wps
[Epoch 2 Batch 90/1412] avg loss 0.0103104, throughput 174.691K wps
[Epoch 2 Batch 120/1412] avg loss 0.00847687, throughput 160.652K wps
[Epoch 2 Batch 150/1412] avg loss 0.0113538, throughput 170.461K wps
[Epoch 2 Batch 180/1412] avg loss 0.00837356, throughput 165.28K wps
[Epoch 2 Batch 210/1412] avg loss 0.00791863, throughput 173.254K wps
[Epoch 2 Batch 240/1412] avg loss 0.00816121, throughput 151.527K wps
[Epoch 2 Batch 270/1412] avg loss 0.00737816, throughput 153.706K wps
[Epoch 2 Batch 300/1412] avg loss 0.010716, throughput 171.591K wps
[Epoch 2 Batch 330/1412] avg loss 0.0114099, throughput 159.642K wps
[Epoch 2 Batch 360/1412] avg loss 0.00711972, throughput 159.317K wps
[Epoch 2 Batch 390/1412] avg loss 0.00753915, throughput 175.116K wps
[Epoch 2 Batch 420/1412] avg loss 0.010602, throughput 162.14K wps
[Epoch 2 Batch 450/1412] avg loss 0.0103454, throughput 174.899K wps
[Epoch 2 Batch 480/1412] avg loss 0.0115595, throughput 141.081K wps
[Epoch 2 Batch 510/1412] avg loss 0.00997689, throughput 168.937K wps
[Epoch 2 Batch 540/1412] avg loss 0.00886734, throughput 172.366K wps
[Epoch 2 Batch 570/1412] avg loss 0.00999336, throughput 175.769K wps
[Epoch 2 Batch 600/1412] avg loss 0.00907662, throughput 164.702K wps
[Epoch 2 Batch 630/1412] avg loss 0.00943337, throughput 177.321K wps
[Epoch 2 Batch 660/1412] avg loss 0.00882155, throughput 151.687K wps
[Epoch 2 Batch 690/1412] avg loss 0.0102559, throughput 157.662K wps
[Epoch 2 Batch 720/1412] avg loss 0.00911931, throughput 161.575K wps
[Epoch 2 Batch 750/1412] avg loss 0.0093397, throughput 150.315K wps
[Epoch 2 Batch 780/1412] avg loss 0.0117298, throughput 139.682K wps
[Epoch 2 Batch 810/1412] avg loss 0.00990772, throughput 154.829K wps
[Epoch 2 Batch 840/1412] avg loss 0.0114896, throughput 157.851K wps
[Epoch 2 Batch 870/1412] avg loss 0.0131169, throughput 164.421K wps
[Epoch 2 Batch 900/1412] avg loss 0.0117314, throughput 155.224K wps
[Epoch 2 Batch 930/1412] avg loss 0.00785715, throughput 157.559K wps
[Epoch 2 Batch 960/1412] avg loss 0.00978283, throughput 153.974K wps
[Epoch 2 Batch 990/1412] avg loss 0.00991021, throughput 142.764K wps
[Epoch 2 Batch 1020/1412] avg loss 0.0104802, throughput 171.805K wps
[Epoch 2 Batch 1050/1412] avg loss 0.00971933, throughput 167.467K wps
[Epoch 2 Batch 1080/1412] avg loss 0.00954951, throughput 165.625K wps
[Epoch 2 Batch 1110/1412] avg loss 0.0132563, throughput 152.675K wps
[Epoch 2 Batch 1140/1412] avg loss 0.00928522, throughput 169.422K wps
[Epoch 2 Batch 1170/1412] avg loss 0.00886832, throughput 146.858K wps
[Epoch 2 Batch 1200/1412] avg loss 0.0138692, throughput 184.118K wps
[Epoch 2 Batch 1230/1412] avg loss 0.00932438, throughput 161.471K wps
[Epoch 2 Batch 1260/1412] avg loss 0.0133991, throughput 163.4K wps
[Epoch 2 Batch 1290/1412] avg loss 0.00901605, throughput 168.981K wps
[Epoch 2 Batch 1320/1412] avg loss 0.00925615, throughput 159.753K wps
[Epoch 2 Batch 1350/1412] avg loss 0.00751305, throughput 141.891K wps
[Epoch 2 Batch 1380/1412] avg loss 0.0137352, throughput 142.459K wps
[Epoch 2 Batch 1410/1412] avg loss 0.0101656, throughput 168.485K wps
Begin Testing...
[Batch 30/157] elapsed 0.49 s
[Batch 60/157] elapsed 0.39 s
[Batch 90/157] elapsed 0.30 s
[Batch 120/157] elapsed 0.22 s
[Batch 150/157] elapsed 0.18 s
Begin Testing...
[Batch 30/1563] elapsed 0.59 s
[Batch 60/1563] elapsed 0.58 s
[Batch 90/1563] elapsed 0.55 s
[Batch 120/1563] elapsed 0.58 s
[Batch 150/1563] elapsed 0.62 s
[Batch 180/1563] elapsed 0.63 s
[Batch 210/1563] elapsed 0.56 s
[Batch 240/1563] elapsed 0.64 s
[Batch 270/1563] elapsed 0.59 s
[Batch 300/1563] elapsed 0.52 s
[Batch 330/1563] elapsed 0.53 s
[Batch 360/1563] elapsed 0.53 s
[Batch 390/1563] elapsed 0.52 s
[Batch 420/1563] elapsed 0.43 s
[Batch 450/1563] elapsed 0.41 s
[Batch 480/1563] elapsed 0.45 s
[Batch 510/1563] elapsed 0.41 s
[Batch 540/1563] elapsed 0.40 s
[Batch 570/1563] elapsed 0.40 s
[Batch 600/1563] elapsed 0.35 s
[Batch 630/1563] elapsed 0.37 s
[Batch 660/1563] elapsed 0.31 s
[Batch 690/1563] elapsed 0.29 s
[Batch 720/1563] elapsed 0.26 s
[Batch 750/1563] elapsed 0.26 s
[Batch 780/1563] elapsed 0.26 s
[Batch 810/1563] elapsed 0.27 s
[Batch 840/1563] elapsed 0.26 s
[Batch 870/1563] elapsed 0.27 s
[Batch 900/1563] elapsed 0.25 s
[Batch 930/1563] elapsed 0.25 s
[Batch 960/1563] elapsed 0.26 s
[Batch 990/1563] elapsed 0.26 s
[Batch 1020/1563] elapsed 0.25 s
[Batch 1050/1563] elapsed 0.24 s
[Batch 1080/1563] elapsed 0.26 s
[Batch 1110/1563] elapsed 0.23 s
[Batch 1140/1563] elapsed 0.26 s
[Batch 1170/1563] elapsed 0.30 s
[Batch 1200/1563] elapsed 0.27 s
[Batch 1230/1563] elapsed 0.27 s
[Batch 1260/1563] elapsed 0.26 s
[Batch 1290/1563] elapsed 0.23 s
[Batch 1320/1563] elapsed 0.21 s
[Batch 1350/1563] elapsed 0.23 s
[Batch 1380/1563] elapsed 0.22 s
[Batch 1410/1563] elapsed 0.20 s
[Batch 1440/1563] elapsed 0.18 s
[Batch 1470/1563] elapsed 0.17 s
[Batch 1500/1563] elapsed 0.15 s
[Batch 1530/1563] elapsed 0.16 s
[Batch 1560/1563] elapsed 0.14 s
[Epoch 2] train avg loss 0.00986556, valid acc 0.8720, valid avg loss 0.373289, test acc 0.8450, test avg loss 0.43176, throughput 161.126K wps
No Improvement.
Begin Testing...
[Batch 30/157] elapsed 0.60 s
[Batch 60/157] elapsed 0.49 s
[Batch 90/157] elapsed 0.34 s
[Batch 120/157] elapsed 0.25 s
[Batch 150/157] elapsed 0.25 s
Begin Testing...
[Batch 30/1563] elapsed 0.70 s
[Batch 60/1563] elapsed 0.66 s
[Batch 90/1563] elapsed 0.64 s
[Batch 120/1563] elapsed 0.63 s
[Batch 150/1563] elapsed 0.62 s
[Batch 180/1563] elapsed 0.65 s
[Batch 210/1563] elapsed 0.64 s
[Batch 240/1563] elapsed 0.56 s
[Batch 270/1563] elapsed 0.56 s
[Batch 300/1563] elapsed 0.50 s
[Batch 330/1563] elapsed 0.52 s
[Batch 360/1563] elapsed 0.54 s
[Batch 390/1563] elapsed 0.51 s
[Batch 420/1563] elapsed 0.44 s
[Batch 450/1563] elapsed 0.47 s
[Batch 480/1563] elapsed 0.41 s
[Batch 510/1563] elapsed 0.37 s
[Batch 540/1563] elapsed 0.42 s
[Batch 570/1563] elapsed 0.40 s
[Batch 600/1563] elapsed 0.38 s
[Batch 630/1563] elapsed 0.40 s
[Batch 660/1563] elapsed 0.38 s
[Batch 690/1563] elapsed 0.35 s
[Batch 720/1563] elapsed 0.30 s
[Batch 750/1563] elapsed 0.31 s
[Batch 780/1563] elapsed 0.30 s
[Batch 810/1563] elapsed 0.27 s
[Batch 840/1563] elapsed 0.29 s
[Batch 870/1563] elapsed 0.26 s
[Batch 900/1563] elapsed 0.28 s
[Batch 930/1563] elapsed 0.28 s
[Batch 960/1563] elapsed 0.24 s
[Batch 990/1563] elapsed 0.22 s
[Batch 1020/1563] elapsed 0.24 s
[Batch 1050/1563] elapsed 0.24 s
[Batch 1080/1563] elapsed 0.24 s
[Batch 1110/1563] elapsed 0.26 s
[Batch 1140/1563] elapsed 0.26 s
[Batch 1170/1563] elapsed 0.23 s
[Batch 1200/1563] elapsed 0.24 s
[Batch 1230/1563] elapsed 0.24 s
[Batch 1260/1563] elapsed 0.22 s
[Batch 1290/1563] elapsed 0.21 s
[Batch 1320/1563] elapsed 0.19 s
[Batch 1350/1563] elapsed 0.19 s
[Batch 1380/1563] elapsed 0.19 s
[Batch 1410/1563] elapsed 0.20 s
[Batch 1440/1563] elapsed 0.19 s
[Batch 1470/1563] elapsed 0.17 s
[Batch 1500/1563] elapsed 0.16 s
[Batch 1530/1563] elapsed 0.15 s
[Batch 1560/1563] elapsed 0.13 s
Best validation loss 0.301081, validation acc 0.8852
Best test loss 0.340057, test acc 0.8560
Total time cost 192.25s