Permalink
Find file Copy path
Fetching contributors…
Cannot retrieve contributors at this time
4795 lines (4794 sloc) 284 KB
Namespace(batch_size=50, data_name='Subj', dropout=0.5, epochs=60, gpu=0, log_interval=30, lr=0.0001, model_mode='static', save_prefix='sa-model')
Use gpu0
3413
120
Done! Tokenizing Time=1.09s, #Sentences=10000
SentimentNet(
(embedding): Embedding(21326 -> 300, float32)
(encoder): ConvolutionalEncoder(
(_convs): HybridConcurrent(
(0): HybridSequential(
(0): Conv1D(300 -> 100, kernel_size=(3,), stride=(1,))
(1): Activation(relu)
(2): HybridLambda(<lambda>)
)
(1): HybridSequential(
(0): Conv1D(300 -> 100, kernel_size=(4,), stride=(1,))
(1): Activation(relu)
(2): HybridLambda(<lambda>)
)
(2): HybridSequential(
(0): Conv1D(300 -> 100, kernel_size=(5,), stride=(1,))
(1): Activation(relu)
(2): HybridLambda(<lambda>)
)
)
)
(output): HybridSequential(
(0): Dropout(p = 0.5, axes=())
(1): Dense(None -> 2, linear)
)
)
9000 1000
[Epoch 0 Batch 30/162] avg loss 0.0148733, throughput 5.51171K wps
[Epoch 0 Batch 60/162] avg loss 0.0140187, throughput 13.1587K wps
[Epoch 0 Batch 90/162] avg loss 0.0135814, throughput 13.3718K wps
[Epoch 0 Batch 120/162] avg loss 0.0134636, throughput 13.2887K wps
[Epoch 0 Batch 150/162] avg loss 0.0128888, throughput 13.2788K wps
Begin Testing...
[Epoch 0] train avg loss 0.013672, test acc 0.6933, test avg loss 0.600131, throughput 10.5282K wps
Observed Improvement.
Begin Testing...
[Epoch 1 Batch 30/162] avg loss 0.0125733, throughput 13.5494K wps
[Epoch 1 Batch 60/162] avg loss 0.0120451, throughput 13.3227K wps
[Epoch 1 Batch 90/162] avg loss 0.011544, throughput 13.2973K wps
[Epoch 1 Batch 120/162] avg loss 0.0116538, throughput 13.3526K wps
[Epoch 1 Batch 150/162] avg loss 0.0118374, throughput 13.3344K wps
Begin Testing...
[Epoch 1] train avg loss 0.0118729, test acc 0.7444, test avg loss 0.553187, throughput 13.3686K wps
Observed Improvement.
Begin Testing...
[Epoch 2 Batch 30/162] avg loss 0.0115701, throughput 13.6526K wps
[Epoch 2 Batch 60/162] avg loss 0.0107224, throughput 13.2366K wps
[Epoch 2 Batch 90/162] avg loss 0.0108318, throughput 13.3321K wps
[Epoch 2 Batch 120/162] avg loss 0.0106604, throughput 13.317K wps
[Epoch 2 Batch 150/162] avg loss 0.0103517, throughput 13.2779K wps
Begin Testing...
[Epoch 2] train avg loss 0.0107903, test acc 0.8133, test avg loss 0.502979, throughput 13.3616K wps
Observed Improvement.
Begin Testing...
[Epoch 3 Batch 30/162] avg loss 0.0104061, throughput 13.6768K wps
[Epoch 3 Batch 60/162] avg loss 0.00986498, throughput 13.3074K wps
[Epoch 3 Batch 90/162] avg loss 0.00979749, throughput 13.3173K wps
[Epoch 3 Batch 120/162] avg loss 0.00956013, throughput 13.3374K wps
[Epoch 3 Batch 150/162] avg loss 0.00959471, throughput 13.271K wps
Begin Testing...
[Epoch 3] train avg loss 0.00983826, test acc 0.8422, test avg loss 0.456663, throughput 13.371K wps
Observed Improvement.
Begin Testing...
[Epoch 4 Batch 30/162] avg loss 0.00932655, throughput 13.6255K wps
[Epoch 4 Batch 60/162] avg loss 0.00909974, throughput 13.3037K wps
[Epoch 4 Batch 90/162] avg loss 0.00880083, throughput 13.3359K wps
[Epoch 4 Batch 120/162] avg loss 0.00860752, throughput 13.3183K wps
[Epoch 4 Batch 150/162] avg loss 0.00848703, throughput 13.3269K wps
Begin Testing...
[Epoch 4] train avg loss 0.00882322, test acc 0.8467, test avg loss 0.420219, throughput 13.3754K wps
Observed Improvement.
Begin Testing...
[Epoch 5 Batch 30/162] avg loss 0.00823989, throughput 13.6312K wps
[Epoch 5 Batch 60/162] avg loss 0.0082774, throughput 13.1895K wps
[Epoch 5 Batch 90/162] avg loss 0.00811325, throughput 13.3593K wps
[Epoch 5 Batch 120/162] avg loss 0.00776606, throughput 13.2783K wps
[Epoch 5 Batch 150/162] avg loss 0.00782421, throughput 13.3582K wps
Begin Testing...
[Epoch 5] train avg loss 0.00799923, test acc 0.8811, test avg loss 0.376663, throughput 13.3554K wps
Observed Improvement.
Begin Testing...
[Epoch 6 Batch 30/162] avg loss 0.00744783, throughput 13.6384K wps
[Epoch 6 Batch 60/162] avg loss 0.0074406, throughput 13.2999K wps
[Epoch 6 Batch 90/162] avg loss 0.00732351, throughput 13.3604K wps
[Epoch 6 Batch 120/162] avg loss 0.00704622, throughput 13.3806K wps
[Epoch 6 Batch 150/162] avg loss 0.0071423, throughput 13.3224K wps
Begin Testing...
[Epoch 6] train avg loss 0.00723326, test acc 0.8878, test avg loss 0.341924, throughput 13.394K wps
Observed Improvement.
Begin Testing...
[Epoch 7 Batch 30/162] avg loss 0.00651205, throughput 13.599K wps
[Epoch 7 Batch 60/162] avg loss 0.00678835, throughput 13.0821K wps
[Epoch 7 Batch 90/162] avg loss 0.00690226, throughput 13.2235K wps
[Epoch 7 Batch 120/162] avg loss 0.00620876, throughput 13.2475K wps
[Epoch 7 Batch 150/162] avg loss 0.00666906, throughput 13.3037K wps
Begin Testing...
[Epoch 7] train avg loss 0.00658197, test acc 0.8889, test avg loss 0.318546, throughput 13.2896K wps
Observed Improvement.
Begin Testing...
[Epoch 8 Batch 30/162] avg loss 0.00614777, throughput 13.4501K wps
[Epoch 8 Batch 60/162] avg loss 0.00614122, throughput 13.1986K wps
[Epoch 8 Batch 90/162] avg loss 0.00628948, throughput 13.2612K wps
[Epoch 8 Batch 120/162] avg loss 0.0060885, throughput 13.2727K wps
[Epoch 8 Batch 150/162] avg loss 0.00605033, throughput 13.2824K wps
Begin Testing...
[Epoch 8] train avg loss 0.0061366, test acc 0.8956, test avg loss 0.305506, throughput 13.2894K wps
Observed Improvement.
Begin Testing...
[Epoch 9 Batch 30/162] avg loss 0.00577385, throughput 13.5678K wps
[Epoch 9 Batch 60/162] avg loss 0.00570415, throughput 13.2671K wps
[Epoch 9 Batch 90/162] avg loss 0.00590363, throughput 13.275K wps
[Epoch 9 Batch 120/162] avg loss 0.00584293, throughput 13.3132K wps
[Epoch 9 Batch 150/162] avg loss 0.00568439, throughput 13.2065K wps
Begin Testing...
[Epoch 9] train avg loss 0.00579051, test acc 0.8967, test avg loss 0.285447, throughput 13.3099K wps
Observed Improvement.
Begin Testing...
[Epoch 10 Batch 30/162] avg loss 0.00536096, throughput 13.5426K wps
[Epoch 10 Batch 60/162] avg loss 0.005637, throughput 13.0881K wps
[Epoch 10 Batch 90/162] avg loss 0.00566644, throughput 13.313K wps
[Epoch 10 Batch 120/162] avg loss 0.00564111, throughput 13.3091K wps
[Epoch 10 Batch 150/162] avg loss 0.00547661, throughput 13.1845K wps
Begin Testing...
[Epoch 10] train avg loss 0.00552717, test acc 0.8967, test avg loss 0.274789, throughput 13.2826K wps
Observed Improvement.
Begin Testing...
[Epoch 11 Batch 30/162] avg loss 0.00512118, throughput 13.5686K wps
[Epoch 11 Batch 60/162] avg loss 0.00521603, throughput 13.2957K wps
[Epoch 11 Batch 90/162] avg loss 0.00513222, throughput 13.2175K wps
[Epoch 11 Batch 120/162] avg loss 0.00552246, throughput 13.2252K wps
[Epoch 11 Batch 150/162] avg loss 0.00524076, throughput 13.2062K wps
Begin Testing...
[Epoch 11] train avg loss 0.00519457, test acc 0.9000, test avg loss 0.269614, throughput 13.2901K wps
Observed Improvement.
Begin Testing...
[Epoch 12 Batch 30/162] avg loss 0.00516317, throughput 13.4057K wps
[Epoch 12 Batch 60/162] avg loss 0.00485779, throughput 13.0995K wps
[Epoch 12 Batch 90/162] avg loss 0.00490939, throughput 13.1938K wps
[Epoch 12 Batch 120/162] avg loss 0.00514623, throughput 13.0752K wps
[Epoch 12 Batch 150/162] avg loss 0.00504575, throughput 13.1788K wps
Begin Testing...
[Epoch 12] train avg loss 0.00504136, test acc 0.9022, test avg loss 0.259176, throughput 13.1914K wps
Observed Improvement.
Begin Testing...
[Epoch 13 Batch 30/162] avg loss 0.00462842, throughput 13.4647K wps
[Epoch 13 Batch 60/162] avg loss 0.00493083, throughput 13.1709K wps
[Epoch 13 Batch 90/162] avg loss 0.0045591, throughput 13.3237K wps
[Epoch 13 Batch 120/162] avg loss 0.00476632, throughput 13.2249K wps
[Epoch 13 Batch 150/162] avg loss 0.00519548, throughput 13.1971K wps
Begin Testing...
[Epoch 13] train avg loss 0.00484922, test acc 0.9022, test avg loss 0.251473, throughput 13.2639K wps
Observed Improvement.
Begin Testing...
[Epoch 14 Batch 30/162] avg loss 0.00450569, throughput 13.5856K wps
[Epoch 14 Batch 60/162] avg loss 0.00484012, throughput 13.2K wps
[Epoch 14 Batch 90/162] avg loss 0.00474914, throughput 13.2082K wps
[Epoch 14 Batch 120/162] avg loss 0.00458483, throughput 13.1978K wps
[Epoch 14 Batch 150/162] avg loss 0.00424398, throughput 13.2459K wps
Begin Testing...
[Epoch 14] train avg loss 0.00461127, test acc 0.9056, test avg loss 0.246114, throughput 13.273K wps
Observed Improvement.
Begin Testing...
[Epoch 15 Batch 30/162] avg loss 0.00447352, throughput 13.5897K wps
[Epoch 15 Batch 60/162] avg loss 0.00445302, throughput 13.1872K wps
[Epoch 15 Batch 90/162] avg loss 0.00449065, throughput 13.2899K wps
[Epoch 15 Batch 120/162] avg loss 0.00444741, throughput 13.1898K wps
[Epoch 15 Batch 150/162] avg loss 0.00452134, throughput 13.1339K wps
Begin Testing...
[Epoch 15] train avg loss 0.00447327, test acc 0.9089, test avg loss 0.240588, throughput 13.2609K wps
Observed Improvement.
Begin Testing...
[Epoch 16 Batch 30/162] avg loss 0.00413716, throughput 13.4402K wps
[Epoch 16 Batch 60/162] avg loss 0.00463497, throughput 13.1543K wps
[Epoch 16 Batch 90/162] avg loss 0.00427522, throughput 13.0865K wps
[Epoch 16 Batch 120/162] avg loss 0.00412066, throughput 13.0899K wps
[Epoch 16 Batch 150/162] avg loss 0.00429683, throughput 13.1352K wps
Begin Testing...
[Epoch 16] train avg loss 0.00431762, test acc 0.9089, test avg loss 0.237355, throughput 13.1695K wps
Observed Improvement.
Begin Testing...
[Epoch 17 Batch 30/162] avg loss 0.00422938, throughput 13.4677K wps
[Epoch 17 Batch 60/162] avg loss 0.00379366, throughput 13.1093K wps
[Epoch 17 Batch 90/162] avg loss 0.00427682, throughput 13.147K wps
[Epoch 17 Batch 120/162] avg loss 0.00438692, throughput 13.076K wps
[Epoch 17 Batch 150/162] avg loss 0.00391251, throughput 13.1531K wps
Begin Testing...
[Epoch 17] train avg loss 0.00412787, test acc 0.9100, test avg loss 0.234954, throughput 13.1853K wps
Observed Improvement.
Begin Testing...
[Epoch 18 Batch 30/162] avg loss 0.00402005, throughput 13.3341K wps
[Epoch 18 Batch 60/162] avg loss 0.00391572, throughput 13.1176K wps
[Epoch 18 Batch 90/162] avg loss 0.00375563, throughput 13.1178K wps
[Epoch 18 Batch 120/162] avg loss 0.00408373, throughput 13.1016K wps
[Epoch 18 Batch 150/162] avg loss 0.00390885, throughput 13.1443K wps
Begin Testing...
[Epoch 18] train avg loss 0.00394367, test acc 0.9089, test avg loss 0.230655, throughput 13.154K wps
[Epoch 19 Batch 30/162] avg loss 0.00365169, throughput 13.5001K wps
[Epoch 19 Batch 60/162] avg loss 0.00374245, throughput 13.0873K wps
[Epoch 19 Batch 90/162] avg loss 0.00409526, throughput 13.1598K wps
[Epoch 19 Batch 120/162] avg loss 0.00411976, throughput 13.1412K wps
[Epoch 19 Batch 150/162] avg loss 0.00387941, throughput 13.0822K wps
Begin Testing...
[Epoch 19] train avg loss 0.00386757, test acc 0.9111, test avg loss 0.227747, throughput 13.1973K wps
Observed Improvement.
Begin Testing...
[Epoch 20 Batch 30/162] avg loss 0.00352057, throughput 13.448K wps
[Epoch 20 Batch 60/162] avg loss 0.00386471, throughput 13.0387K wps
[Epoch 20 Batch 90/162] avg loss 0.00378309, throughput 13.1161K wps
[Epoch 20 Batch 120/162] avg loss 0.00355035, throughput 13.051K wps
[Epoch 20 Batch 150/162] avg loss 0.00375013, throughput 12.9947K wps
Begin Testing...
[Epoch 20] train avg loss 0.00370937, test acc 0.9133, test avg loss 0.22368, throughput 13.1153K wps
Observed Improvement.
Begin Testing...
[Epoch 21 Batch 30/162] avg loss 0.00350253, throughput 13.4534K wps
[Epoch 21 Batch 60/162] avg loss 0.00375662, throughput 12.9801K wps
[Epoch 21 Batch 90/162] avg loss 0.00347035, throughput 13.1362K wps
[Epoch 21 Batch 120/162] avg loss 0.00347907, throughput 13.1279K wps
[Epoch 21 Batch 150/162] avg loss 0.00358526, throughput 13.1011K wps
Begin Testing...
[Epoch 21] train avg loss 0.00355795, test acc 0.9144, test avg loss 0.220432, throughput 13.1585K wps
Observed Improvement.
Begin Testing...
[Epoch 22 Batch 30/162] avg loss 0.00350364, throughput 13.568K wps
[Epoch 22 Batch 60/162] avg loss 0.00368956, throughput 13.1628K wps
[Epoch 22 Batch 90/162] avg loss 0.00358279, throughput 13.138K wps
[Epoch 22 Batch 120/162] avg loss 0.00345167, throughput 13.1374K wps
[Epoch 22 Batch 150/162] avg loss 0.00333584, throughput 13.1669K wps
Begin Testing...
[Epoch 22] train avg loss 0.00349016, test acc 0.9133, test avg loss 0.219737, throughput 13.226K wps
[Epoch 23 Batch 30/162] avg loss 0.00341313, throughput 13.4863K wps
[Epoch 23 Batch 60/162] avg loss 0.0036231, throughput 13.1523K wps
[Epoch 23 Batch 90/162] avg loss 0.0033311, throughput 13.1816K wps
[Epoch 23 Batch 120/162] avg loss 0.00308959, throughput 13.1814K wps
[Epoch 23 Batch 150/162] avg loss 0.00346653, throughput 13.1896K wps
Begin Testing...
[Epoch 23] train avg loss 0.00337469, test acc 0.9178, test avg loss 0.216994, throughput 13.2296K wps
Observed Improvement.
Begin Testing...
[Epoch 24 Batch 30/162] avg loss 0.00298983, throughput 13.4736K wps
[Epoch 24 Batch 60/162] avg loss 0.00306925, throughput 13.157K wps
[Epoch 24 Batch 90/162] avg loss 0.00326855, throughput 13.1293K wps
[Epoch 24 Batch 120/162] avg loss 0.00306909, throughput 13.0949K wps
[Epoch 24 Batch 150/162] avg loss 0.00342294, throughput 13.1152K wps
Begin Testing...
[Epoch 24] train avg loss 0.00320864, test acc 0.9156, test avg loss 0.213515, throughput 13.1869K wps
[Epoch 25 Batch 30/162] avg loss 0.00296156, throughput 13.5017K wps
[Epoch 25 Batch 60/162] avg loss 0.00326144, throughput 13.1125K wps
[Epoch 25 Batch 90/162] avg loss 0.00299427, throughput 13.1706K wps
[Epoch 25 Batch 120/162] avg loss 0.00323567, throughput 13.0883K wps
[Epoch 25 Batch 150/162] avg loss 0.00349913, throughput 13.1266K wps
Begin Testing...
[Epoch 25] train avg loss 0.00318204, test acc 0.9133, test avg loss 0.216489, throughput 13.1873K wps
[Epoch 26 Batch 30/162] avg loss 0.00303202, throughput 13.4093K wps
[Epoch 26 Batch 60/162] avg loss 0.00332668, throughput 13.1154K wps
[Epoch 26 Batch 90/162] avg loss 0.00276401, throughput 13.1077K wps
[Epoch 26 Batch 120/162] avg loss 0.00316659, throughput 13.1068K wps
[Epoch 26 Batch 150/162] avg loss 0.00299795, throughput 13.0741K wps
Begin Testing...
[Epoch 26] train avg loss 0.00305356, test acc 0.9200, test avg loss 0.210835, throughput 13.1535K wps
Observed Improvement.
Begin Testing...
[Epoch 27 Batch 30/162] avg loss 0.00284135, throughput 13.3916K wps
[Epoch 27 Batch 60/162] avg loss 0.00289317, throughput 13.0645K wps
[Epoch 27 Batch 90/162] avg loss 0.00312041, throughput 13.0864K wps
[Epoch 27 Batch 120/162] avg loss 0.00283447, throughput 13.0812K wps
[Epoch 27 Batch 150/162] avg loss 0.00300325, throughput 13.1395K wps
Begin Testing...
[Epoch 27] train avg loss 0.00295153, test acc 0.9167, test avg loss 0.210709, throughput 13.1508K wps
[Epoch 28 Batch 30/162] avg loss 0.0027969, throughput 13.254K wps
[Epoch 28 Batch 60/162] avg loss 0.00304851, throughput 13.0727K wps
[Epoch 28 Batch 90/162] avg loss 0.00288398, throughput 13.0839K wps
[Epoch 28 Batch 120/162] avg loss 0.00283085, throughput 13.0557K wps
[Epoch 28 Batch 150/162] avg loss 0.00275658, throughput 12.9974K wps
Begin Testing...
[Epoch 28] train avg loss 0.00285652, test acc 0.9167, test avg loss 0.205983, throughput 13.0845K wps
[Epoch 29 Batch 30/162] avg loss 0.00283865, throughput 13.2546K wps
[Epoch 29 Batch 60/162] avg loss 0.00279888, throughput 13.012K wps
[Epoch 29 Batch 90/162] avg loss 0.00255884, throughput 13.0538K wps
[Epoch 29 Batch 120/162] avg loss 0.00278198, throughput 13.0961K wps
[Epoch 29 Batch 150/162] avg loss 0.00267321, throughput 13.0567K wps
Begin Testing...
[Epoch 29] train avg loss 0.00275968, test acc 0.9244, test avg loss 0.203459, throughput 13.0954K wps
Observed Improvement.
Begin Testing...
[Epoch 30 Batch 30/162] avg loss 0.00267031, throughput 13.4334K wps
[Epoch 30 Batch 60/162] avg loss 0.00242314, throughput 13.0816K wps
[Epoch 30 Batch 90/162] avg loss 0.00274833, throughput 13.1619K wps
[Epoch 30 Batch 120/162] avg loss 0.00256968, throughput 13.1617K wps
[Epoch 30 Batch 150/162] avg loss 0.00271866, throughput 13.1508K wps
Begin Testing...
[Epoch 30] train avg loss 0.0026766, test acc 0.9244, test avg loss 0.200777, throughput 13.1936K wps
Observed Improvement.
Begin Testing...
[Epoch 31 Batch 30/162] avg loss 0.00280746, throughput 13.4676K wps
[Epoch 31 Batch 60/162] avg loss 0.00250381, throughput 13.0944K wps
[Epoch 31 Batch 90/162] avg loss 0.00244303, throughput 13.1283K wps
[Epoch 31 Batch 120/162] avg loss 0.00261631, throughput 13.1443K wps
[Epoch 31 Batch 150/162] avg loss 0.00258874, throughput 13.1051K wps
Begin Testing...
[Epoch 31] train avg loss 0.00258264, test acc 0.9222, test avg loss 0.201894, throughput 13.1805K wps
[Epoch 32 Batch 30/162] avg loss 0.00239544, throughput 13.4251K wps
[Epoch 32 Batch 60/162] avg loss 0.00242165, throughput 13.0724K wps
[Epoch 32 Batch 90/162] avg loss 0.00253808, throughput 13.172K wps
[Epoch 32 Batch 120/162] avg loss 0.00235758, throughput 13.0454K wps
[Epoch 32 Batch 150/162] avg loss 0.00251205, throughput 13.0667K wps
Begin Testing...
[Epoch 32] train avg loss 0.00247463, test acc 0.9211, test avg loss 0.200797, throughput 13.1521K wps
[Epoch 33 Batch 30/162] avg loss 0.00217709, throughput 13.4281K wps
[Epoch 33 Batch 60/162] avg loss 0.00246339, throughput 13.056K wps
[Epoch 33 Batch 90/162] avg loss 0.00240603, throughput 13.0837K wps
[Epoch 33 Batch 120/162] avg loss 0.00245679, throughput 13.1473K wps
[Epoch 33 Batch 150/162] avg loss 0.00254246, throughput 13.0463K wps
Begin Testing...
[Epoch 33] train avg loss 0.00243782, test acc 0.9211, test avg loss 0.200857, throughput 13.1434K wps
[Epoch 34 Batch 30/162] avg loss 0.00250386, throughput 13.3377K wps
[Epoch 34 Batch 60/162] avg loss 0.00223558, throughput 12.9046K wps
[Epoch 34 Batch 90/162] avg loss 0.00229405, throughput 13.0251K wps
[Epoch 34 Batch 120/162] avg loss 0.00259608, throughput 13.0523K wps
[Epoch 34 Batch 150/162] avg loss 0.0021924, throughput 13.0902K wps
Begin Testing...
[Epoch 34] train avg loss 0.0023721, test acc 0.9222, test avg loss 0.197416, throughput 13.0824K wps
[Epoch 35 Batch 30/162] avg loss 0.002352, throughput 13.3477K wps
[Epoch 35 Batch 60/162] avg loss 0.00214414, throughput 12.9674K wps
[Epoch 35 Batch 90/162] avg loss 0.00230061, throughput 13.0016K wps
[Epoch 35 Batch 120/162] avg loss 0.00248154, throughput 13.0695K wps
[Epoch 35 Batch 150/162] avg loss 0.00223823, throughput 13.0676K wps
Begin Testing...
[Epoch 35] train avg loss 0.00229282, test acc 0.9244, test avg loss 0.198818, throughput 13.0814K wps
Observed Improvement.
Begin Testing...
[Epoch 36 Batch 30/162] avg loss 0.00207446, throughput 13.2336K wps
[Epoch 36 Batch 60/162] avg loss 0.00230276, throughput 13.0416K wps
[Epoch 36 Batch 90/162] avg loss 0.00224566, throughput 13.0679K wps
[Epoch 36 Batch 120/162] avg loss 0.00224707, throughput 13.118K wps
[Epoch 36 Batch 150/162] avg loss 0.00228941, throughput 13.0763K wps
Begin Testing...
[Epoch 36] train avg loss 0.00219725, test acc 0.9200, test avg loss 0.19734, throughput 13.1006K wps
[Epoch 37 Batch 30/162] avg loss 0.00207439, throughput 13.3369K wps
[Epoch 37 Batch 60/162] avg loss 0.00195904, throughput 13.0776K wps
[Epoch 37 Batch 90/162] avg loss 0.00217484, throughput 13.0217K wps
[Epoch 37 Batch 120/162] avg loss 0.00200286, throughput 13.0281K wps
[Epoch 37 Batch 150/162] avg loss 0.00232435, throughput 12.9862K wps
Begin Testing...
[Epoch 37] train avg loss 0.00209625, test acc 0.9244, test avg loss 0.198049, throughput 13.0834K wps
Observed Improvement.
Begin Testing...
[Epoch 38 Batch 30/162] avg loss 0.00217362, throughput 13.2963K wps
[Epoch 38 Batch 60/162] avg loss 0.00201886, throughput 12.9511K wps
[Epoch 38 Batch 90/162] avg loss 0.00231133, throughput 13.0329K wps
[Epoch 38 Batch 120/162] avg loss 0.0020087, throughput 13.0964K wps
[Epoch 38 Batch 150/162] avg loss 0.00182542, throughput 13.0356K wps
Begin Testing...
[Epoch 38] train avg loss 0.0021018, test acc 0.9211, test avg loss 0.197529, throughput 13.081K wps
[Epoch 39 Batch 30/162] avg loss 0.00175132, throughput 13.3177K wps
[Epoch 39 Batch 60/162] avg loss 0.00206885, throughput 12.9525K wps
[Epoch 39 Batch 90/162] avg loss 0.00192996, throughput 13.0705K wps
[Epoch 39 Batch 120/162] avg loss 0.00208113, throughput 13.0273K wps
[Epoch 39 Batch 150/162] avg loss 0.00203645, throughput 12.9488K wps
Begin Testing...
[Epoch 39] train avg loss 0.00198324, test acc 0.9256, test avg loss 0.195105, throughput 13.0635K wps
Observed Improvement.
Begin Testing...
[Epoch 40 Batch 30/162] avg loss 0.00203055, throughput 13.4027K wps
[Epoch 40 Batch 60/162] avg loss 0.00178149, throughput 13.0521K wps
[Epoch 40 Batch 90/162] avg loss 0.00190662, throughput 13.0951K wps
[Epoch 40 Batch 120/162] avg loss 0.00233152, throughput 13.0534K wps
[Epoch 40 Batch 150/162] avg loss 0.00194632, throughput 13.0341K wps
Begin Testing...
[Epoch 40] train avg loss 0.00197573, test acc 0.9244, test avg loss 0.196926, throughput 13.1144K wps
[Epoch 41 Batch 30/162] avg loss 0.00174941, throughput 13.2329K wps
[Epoch 41 Batch 60/162] avg loss 0.00181567, throughput 12.9062K wps
[Epoch 41 Batch 90/162] avg loss 0.00210591, throughput 12.8869K wps
[Epoch 41 Batch 120/162] avg loss 0.00188155, throughput 13.0972K wps
[Epoch 41 Batch 150/162] avg loss 0.00192983, throughput 12.9168K wps
Begin Testing...
[Epoch 41] train avg loss 0.00189778, test acc 0.9244, test avg loss 0.194945, throughput 13.0139K wps
[Epoch 42 Batch 30/162] avg loss 0.00188569, throughput 13.3327K wps
[Epoch 42 Batch 60/162] avg loss 0.00195483, throughput 12.9241K wps
[Epoch 42 Batch 90/162] avg loss 0.0017058, throughput 13.0298K wps
[Epoch 42 Batch 120/162] avg loss 0.0017511, throughput 13.0302K wps
[Epoch 42 Batch 150/162] avg loss 0.00192395, throughput 12.9769K wps
Begin Testing...
[Epoch 42] train avg loss 0.00182934, test acc 0.9244, test avg loss 0.194051, throughput 13.0354K wps
[Epoch 43 Batch 30/162] avg loss 0.00159785, throughput 13.2476K wps
[Epoch 43 Batch 60/162] avg loss 0.0018506, throughput 12.9153K wps
[Epoch 43 Batch 90/162] avg loss 0.00182889, throughput 13.0445K wps
[Epoch 43 Batch 120/162] avg loss 0.0016574, throughput 12.9194K wps
[Epoch 43 Batch 150/162] avg loss 0.0018352, throughput 12.9193K wps
Begin Testing...
[Epoch 43] train avg loss 0.00177938, test acc 0.9256, test avg loss 0.194374, throughput 13.0004K wps
Observed Improvement.
Begin Testing...
[Epoch 44 Batch 30/162] avg loss 0.00183226, throughput 13.3643K wps
[Epoch 44 Batch 60/162] avg loss 0.00134163, throughput 12.9326K wps
[Epoch 44 Batch 90/162] avg loss 0.00189961, throughput 12.9921K wps
[Epoch 44 Batch 120/162] avg loss 0.00174602, throughput 13.0005K wps
[Epoch 44 Batch 150/162] avg loss 0.00160431, throughput 12.9789K wps
Begin Testing...
[Epoch 44] train avg loss 0.00166612, test acc 0.9211, test avg loss 0.193132, throughput 13.0485K wps
[Epoch 45 Batch 30/162] avg loss 0.00167047, throughput 13.3029K wps
[Epoch 45 Batch 60/162] avg loss 0.00154697, throughput 12.9375K wps
[Epoch 45 Batch 90/162] avg loss 0.00172115, throughput 12.9317K wps
[Epoch 45 Batch 120/162] avg loss 0.00165001, throughput 12.9251K wps
[Epoch 45 Batch 150/162] avg loss 0.00167615, throughput 12.9742K wps
Begin Testing...
[Epoch 45] train avg loss 0.0016655, test acc 0.9256, test avg loss 0.194076, throughput 13.0074K wps
Observed Improvement.
Begin Testing...
[Epoch 46 Batch 30/162] avg loss 0.00182863, throughput 13.1811K wps
[Epoch 46 Batch 60/162] avg loss 0.00142345, throughput 12.8412K wps
[Epoch 46 Batch 90/162] avg loss 0.0015641, throughput 12.9738K wps
[Epoch 46 Batch 120/162] avg loss 0.0014819, throughput 12.9354K wps
[Epoch 46 Batch 150/162] avg loss 0.00162996, throughput 12.9436K wps
Begin Testing...
[Epoch 46] train avg loss 0.00158193, test acc 0.9278, test avg loss 0.193132, throughput 12.9742K wps
Observed Improvement.
Begin Testing...
[Epoch 47 Batch 30/162] avg loss 0.00180526, throughput 13.2657K wps
[Epoch 47 Batch 60/162] avg loss 0.00157225, throughput 12.9214K wps
[Epoch 47 Batch 90/162] avg loss 0.00141955, throughput 13.0019K wps
[Epoch 47 Batch 120/162] avg loss 0.00156289, throughput 13.0028K wps
[Epoch 47 Batch 150/162] avg loss 0.00161085, throughput 12.9608K wps
Begin Testing...
[Epoch 47] train avg loss 0.00160321, test acc 0.9244, test avg loss 0.19183, throughput 13.021K wps
[Epoch 48 Batch 30/162] avg loss 0.00153697, throughput 13.3229K wps
[Epoch 48 Batch 60/162] avg loss 0.00160671, throughput 13.0026K wps
[Epoch 48 Batch 90/162] avg loss 0.00136559, throughput 13.0109K wps
[Epoch 48 Batch 120/162] avg loss 0.0015183, throughput 13.0323K wps
[Epoch 48 Batch 150/162] avg loss 0.00156814, throughput 12.9658K wps
Begin Testing...
[Epoch 48] train avg loss 0.00150495, test acc 0.9289, test avg loss 0.1913, throughput 13.0559K wps
Observed Improvement.
Begin Testing...
[Epoch 49 Batch 30/162] avg loss 0.001577, throughput 13.3786K wps
[Epoch 49 Batch 60/162] avg loss 0.00149335, throughput 12.9874K wps
[Epoch 49 Batch 90/162] avg loss 0.00150394, throughput 12.9087K wps
[Epoch 49 Batch 120/162] avg loss 0.00126085, throughput 12.9062K wps
[Epoch 49 Batch 150/162] avg loss 0.00152473, throughput 12.9598K wps
Begin Testing...
[Epoch 49] train avg loss 0.00146032, test acc 0.9256, test avg loss 0.194352, throughput 13.0184K wps
[Epoch 50 Batch 30/162] avg loss 0.00128775, throughput 13.3341K wps
[Epoch 50 Batch 60/162] avg loss 0.00137412, throughput 13.0036K wps
[Epoch 50 Batch 90/162] avg loss 0.00138601, throughput 12.9493K wps
[Epoch 50 Batch 120/162] avg loss 0.00130751, throughput 12.9511K wps
[Epoch 50 Batch 150/162] avg loss 0.00147874, throughput 12.9563K wps
Begin Testing...
[Epoch 50] train avg loss 0.0013718, test acc 0.9267, test avg loss 0.192832, throughput 13.0221K wps
[Epoch 51 Batch 30/162] avg loss 0.00136882, throughput 13.3302K wps
[Epoch 51 Batch 60/162] avg loss 0.00139357, throughput 12.89K wps
[Epoch 51 Batch 90/162] avg loss 0.00128853, throughput 12.8881K wps
[Epoch 51 Batch 120/162] avg loss 0.00134575, throughput 12.9379K wps
[Epoch 51 Batch 150/162] avg loss 0.00128021, throughput 12.913K wps
Begin Testing...
[Epoch 51] train avg loss 0.00135632, test acc 0.9278, test avg loss 0.192661, throughput 12.9861K wps
[Epoch 52 Batch 30/162] avg loss 0.00127746, throughput 13.2955K wps
[Epoch 52 Batch 60/162] avg loss 0.00122428, throughput 12.9986K wps
[Epoch 52 Batch 90/162] avg loss 0.00128933, throughput 12.9907K wps
[Epoch 52 Batch 120/162] avg loss 0.00165778, throughput 13.0021K wps
[Epoch 52 Batch 150/162] avg loss 0.00131463, throughput 12.8976K wps
Begin Testing...
[Epoch 52] train avg loss 0.00134131, test acc 0.9278, test avg loss 0.190324, throughput 13.0212K wps
[Epoch 53 Batch 30/162] avg loss 0.00105276, throughput 13.295K wps
[Epoch 53 Batch 60/162] avg loss 0.00124275, throughput 12.8201K wps
[Epoch 53 Batch 90/162] avg loss 0.00129056, throughput 12.9242K wps
[Epoch 53 Batch 120/162] avg loss 0.00139937, throughput 12.9556K wps
[Epoch 53 Batch 150/162] avg loss 0.00124667, throughput 12.9138K wps
Begin Testing...
[Epoch 53] train avg loss 0.00126753, test acc 0.9289, test avg loss 0.189767, throughput 12.9839K wps
Observed Improvement.
Begin Testing...
[Epoch 54 Batch 30/162] avg loss 0.00110248, throughput 13.2544K wps
[Epoch 54 Batch 60/162] avg loss 0.00122235, throughput 12.8182K wps
[Epoch 54 Batch 90/162] avg loss 0.001233, throughput 12.8728K wps
[Epoch 54 Batch 120/162] avg loss 0.00135027, throughput 12.8584K wps
[Epoch 54 Batch 150/162] avg loss 0.00142639, throughput 12.9305K wps
Begin Testing...
[Epoch 54] train avg loss 0.00126234, test acc 0.9267, test avg loss 0.188275, throughput 12.9402K wps
[Epoch 55 Batch 30/162] avg loss 0.00128115, throughput 13.1496K wps
[Epoch 55 Batch 60/162] avg loss 0.00120673, throughput 12.9544K wps
[Epoch 55 Batch 90/162] avg loss 0.00145009, throughput 12.9649K wps
[Epoch 55 Batch 120/162] avg loss 0.00115725, throughput 12.9934K wps
[Epoch 55 Batch 150/162] avg loss 0.00111452, throughput 12.994K wps
Begin Testing...
[Epoch 55] train avg loss 0.0012198, test acc 0.9278, test avg loss 0.192551, throughput 13.0079K wps
[Epoch 56 Batch 30/162] avg loss 0.00117197, throughput 13.4167K wps
[Epoch 56 Batch 60/162] avg loss 0.00115605, throughput 12.868K wps
[Epoch 56 Batch 90/162] avg loss 0.00119042, throughput 13.0401K wps
[Epoch 56 Batch 120/162] avg loss 0.00109707, throughput 12.9553K wps
[Epoch 56 Batch 150/162] avg loss 0.00105548, throughput 12.9878K wps
Begin Testing...
[Epoch 56] train avg loss 0.00112486, test acc 0.9300, test avg loss 0.18883, throughput 13.0436K wps
Observed Improvement.
Begin Testing...
[Epoch 57 Batch 30/162] avg loss 0.00121302, throughput 13.3745K wps
[Epoch 57 Batch 60/162] avg loss 0.000928482, throughput 12.8926K wps
[Epoch 57 Batch 90/162] avg loss 0.00103487, throughput 13.0056K wps
[Epoch 57 Batch 120/162] avg loss 0.00130843, throughput 12.954K wps
[Epoch 57 Batch 150/162] avg loss 0.00107645, throughput 12.8858K wps
Begin Testing...
[Epoch 57] train avg loss 0.00110351, test acc 0.9267, test avg loss 0.189444, throughput 13.0173K wps
[Epoch 58 Batch 30/162] avg loss 0.000962082, throughput 13.3109K wps
[Epoch 58 Batch 60/162] avg loss 0.0010485, throughput 12.8136K wps
[Epoch 58 Batch 90/162] avg loss 0.00101269, throughput 12.9898K wps
[Epoch 58 Batch 120/162] avg loss 0.00119668, throughput 12.9619K wps
[Epoch 58 Batch 150/162] avg loss 0.00114594, throughput 12.9323K wps
Begin Testing...
[Epoch 58] train avg loss 0.00107507, test acc 0.9289, test avg loss 0.191987, throughput 12.996K wps
[Epoch 59 Batch 30/162] avg loss 0.000912776, throughput 13.1843K wps
[Epoch 59 Batch 60/162] avg loss 0.000923365, throughput 12.8419K wps
[Epoch 59 Batch 90/162] avg loss 0.00107308, throughput 12.8416K wps
[Epoch 59 Batch 120/162] avg loss 0.00106039, throughput 12.9625K wps
[Epoch 59 Batch 150/162] avg loss 0.00108295, throughput 12.9964K wps
Begin Testing...
[Epoch 59] train avg loss 0.00103534, test acc 0.9278, test avg loss 0.191567, throughput 12.9603K wps
Test loss 0.214006, test acc 0.9130
Total time cost 166.70s
9000 1000
[Epoch 0 Batch 30/162] avg loss 0.0155103, throughput 11.628K wps
[Epoch 0 Batch 60/162] avg loss 0.0143324, throughput 12.7925K wps
[Epoch 0 Batch 90/162] avg loss 0.0138853, throughput 12.9237K wps
[Epoch 0 Batch 120/162] avg loss 0.0135737, throughput 12.9166K wps
[Epoch 0 Batch 150/162] avg loss 0.0132296, throughput 12.9319K wps
Begin Testing...
[Epoch 0] train avg loss 0.0140203, test acc 0.6822, test avg loss 0.599257, throughput 12.6372K wps
Observed Improvement.
Begin Testing...
[Epoch 1 Batch 30/162] avg loss 0.0126301, throughput 13.3852K wps
[Epoch 1 Batch 60/162] avg loss 0.0124942, throughput 12.8168K wps
[Epoch 1 Batch 90/162] avg loss 0.0121147, throughput 12.901K wps
[Epoch 1 Batch 120/162] avg loss 0.0116805, throughput 12.9429K wps
[Epoch 1 Batch 150/162] avg loss 0.0116224, throughput 12.983K wps
Begin Testing...
[Epoch 1] train avg loss 0.0120682, test acc 0.7433, test avg loss 0.557227, throughput 13.0011K wps
Observed Improvement.
Begin Testing...
[Epoch 2 Batch 30/162] avg loss 0.0112527, throughput 13.235K wps
[Epoch 2 Batch 60/162] avg loss 0.011005, throughput 12.7974K wps
[Epoch 2 Batch 90/162] avg loss 0.0108896, throughput 12.9409K wps
[Epoch 2 Batch 120/162] avg loss 0.0106114, throughput 12.8414K wps
[Epoch 2 Batch 150/162] avg loss 0.0107636, throughput 12.8762K wps
Begin Testing...
[Epoch 2] train avg loss 0.0108951, test acc 0.8044, test avg loss 0.508071, throughput 12.9271K wps
Observed Improvement.
Begin Testing...
[Epoch 3 Batch 30/162] avg loss 0.0101826, throughput 13.2703K wps
[Epoch 3 Batch 60/162] avg loss 0.0100996, throughput 12.9003K wps
[Epoch 3 Batch 90/162] avg loss 0.00984951, throughput 12.953K wps
[Epoch 3 Batch 120/162] avg loss 0.0098496, throughput 12.9724K wps
[Epoch 3 Batch 150/162] avg loss 0.00972931, throughput 13.0084K wps
Begin Testing...
[Epoch 3] train avg loss 0.0098897, test acc 0.8200, test avg loss 0.465245, throughput 13.0134K wps
Observed Improvement.
Begin Testing...
[Epoch 4 Batch 30/162] avg loss 0.00943229, throughput 13.3368K wps
[Epoch 4 Batch 60/162] avg loss 0.00894265, throughput 12.8904K wps
[Epoch 4 Batch 90/162] avg loss 0.00903182, throughput 12.9642K wps
[Epoch 4 Batch 120/162] avg loss 0.00880629, throughput 12.9244K wps
[Epoch 4 Batch 150/162] avg loss 0.00880136, throughput 12.9515K wps
Begin Testing...
[Epoch 4] train avg loss 0.00898577, test acc 0.8733, test avg loss 0.413622, throughput 13.0082K wps
Observed Improvement.
Begin Testing...
[Epoch 5 Batch 30/162] avg loss 0.00827037, throughput 13.2979K wps
[Epoch 5 Batch 60/162] avg loss 0.00825306, throughput 12.9028K wps
[Epoch 5 Batch 90/162] avg loss 0.00788509, throughput 12.9888K wps
[Epoch 5 Batch 120/162] avg loss 0.00807377, throughput 12.9734K wps
[Epoch 5 Batch 150/162] avg loss 0.00784665, throughput 12.9575K wps
Begin Testing...
[Epoch 5] train avg loss 0.00803662, test acc 0.8889, test avg loss 0.373767, throughput 13.0182K wps
Observed Improvement.
Begin Testing...
[Epoch 6 Batch 30/162] avg loss 0.00774638, throughput 13.3451K wps
[Epoch 6 Batch 60/162] avg loss 0.0073327, throughput 12.8694K wps
[Epoch 6 Batch 90/162] avg loss 0.00732456, throughput 12.8855K wps
[Epoch 6 Batch 120/162] avg loss 0.00706886, throughput 12.9347K wps
[Epoch 6 Batch 150/162] avg loss 0.00704164, throughput 12.9391K wps
Begin Testing...
[Epoch 6] train avg loss 0.00730228, test acc 0.8978, test avg loss 0.341632, throughput 12.9877K wps
Observed Improvement.
Begin Testing...
[Epoch 7 Batch 30/162] avg loss 0.00708363, throughput 13.2291K wps
[Epoch 7 Batch 60/162] avg loss 0.0065777, throughput 12.8199K wps
[Epoch 7 Batch 90/162] avg loss 0.006976, throughput 12.8486K wps
[Epoch 7 Batch 120/162] avg loss 0.00645988, throughput 12.8381K wps
[Epoch 7 Batch 150/162] avg loss 0.00634877, throughput 12.9334K wps
Begin Testing...
[Epoch 7] train avg loss 0.00668994, test acc 0.9011, test avg loss 0.317666, throughput 12.9371K wps
Observed Improvement.
Begin Testing...
[Epoch 8 Batch 30/162] avg loss 0.00647673, throughput 13.2088K wps
[Epoch 8 Batch 60/162] avg loss 0.00616782, throughput 12.8792K wps
[Epoch 8 Batch 90/162] avg loss 0.00625375, throughput 12.886K wps
[Epoch 8 Batch 120/162] avg loss 0.00620168, throughput 12.9549K wps
[Epoch 8 Batch 150/162] avg loss 0.00631827, throughput 12.9536K wps
Begin Testing...
[Epoch 8] train avg loss 0.00624568, test acc 0.9078, test avg loss 0.301862, throughput 12.9715K wps
Observed Improvement.
Begin Testing...
[Epoch 9 Batch 30/162] avg loss 0.0061124, throughput 13.2877K wps
[Epoch 9 Batch 60/162] avg loss 0.00585537, throughput 12.8463K wps
[Epoch 9 Batch 90/162] avg loss 0.00587118, throughput 12.8925K wps
[Epoch 9 Batch 120/162] avg loss 0.00559313, throughput 12.8891K wps
[Epoch 9 Batch 150/162] avg loss 0.00553164, throughput 12.9837K wps
Begin Testing...
[Epoch 9] train avg loss 0.00577637, test acc 0.9044, test avg loss 0.285586, throughput 12.9687K wps
[Epoch 10 Batch 30/162] avg loss 0.00538402, throughput 13.3501K wps
[Epoch 10 Batch 60/162] avg loss 0.0056709, throughput 12.8322K wps
[Epoch 10 Batch 90/162] avg loss 0.00576944, throughput 12.845K wps
[Epoch 10 Batch 120/162] avg loss 0.00560697, throughput 12.8345K wps
[Epoch 10 Batch 150/162] avg loss 0.00515467, throughput 12.8383K wps
Begin Testing...
[Epoch 10] train avg loss 0.00548611, test acc 0.9078, test avg loss 0.275408, throughput 12.9291K wps
Observed Improvement.
Begin Testing...
[Epoch 11 Batch 30/162] avg loss 0.00572205, throughput 13.2919K wps
[Epoch 11 Batch 60/162] avg loss 0.00546853, throughput 12.8748K wps
[Epoch 11 Batch 90/162] avg loss 0.00513806, throughput 12.9295K wps
[Epoch 11 Batch 120/162] avg loss 0.00490708, throughput 12.9947K wps
[Epoch 11 Batch 150/162] avg loss 0.00505018, throughput 12.9139K wps
Begin Testing...
[Epoch 11] train avg loss 0.00522856, test acc 0.9089, test avg loss 0.268236, throughput 12.9997K wps
Observed Improvement.
Begin Testing...
[Epoch 12 Batch 30/162] avg loss 0.00501332, throughput 13.3503K wps
[Epoch 12 Batch 60/162] avg loss 0.00487508, throughput 12.9315K wps
[Epoch 12 Batch 90/162] avg loss 0.00498641, throughput 12.9403K wps
[Epoch 12 Batch 120/162] avg loss 0.00485481, throughput 12.9834K wps
[Epoch 12 Batch 150/162] avg loss 0.00491879, throughput 13.0104K wps
Begin Testing...
[Epoch 12] train avg loss 0.00493371, test acc 0.9144, test avg loss 0.271962, throughput 13.0351K wps
Observed Improvement.
Begin Testing...
[Epoch 13 Batch 30/162] avg loss 0.00499496, throughput 13.346K wps
[Epoch 13 Batch 60/162] avg loss 0.00454967, throughput 12.8697K wps
[Epoch 13 Batch 90/162] avg loss 0.00424905, throughput 12.9889K wps
[Epoch 13 Batch 120/162] avg loss 0.00483515, throughput 12.9796K wps
[Epoch 13 Batch 150/162] avg loss 0.00512365, throughput 12.9761K wps
Begin Testing...
[Epoch 13] train avg loss 0.00477728, test acc 0.9133, test avg loss 0.255357, throughput 13.0239K wps
[Epoch 14 Batch 30/162] avg loss 0.00474327, throughput 13.2123K wps
[Epoch 14 Batch 60/162] avg loss 0.00447736, throughput 12.7876K wps
[Epoch 14 Batch 90/162] avg loss 0.00486136, throughput 12.9356K wps
[Epoch 14 Batch 120/162] avg loss 0.0045171, throughput 12.9042K wps
[Epoch 14 Batch 150/162] avg loss 0.00439132, throughput 12.9339K wps
Begin Testing...
[Epoch 14] train avg loss 0.00460523, test acc 0.9133, test avg loss 0.25135, throughput 12.9457K wps
[Epoch 15 Batch 30/162] avg loss 0.00419593, throughput 13.2331K wps
[Epoch 15 Batch 60/162] avg loss 0.00444564, throughput 12.8434K wps
[Epoch 15 Batch 90/162] avg loss 0.00470124, throughput 12.8288K wps
[Epoch 15 Batch 120/162] avg loss 0.00447044, throughput 12.8503K wps
[Epoch 15 Batch 150/162] avg loss 0.00431206, throughput 12.8385K wps
Begin Testing...
[Epoch 15] train avg loss 0.00440829, test acc 0.9122, test avg loss 0.244747, throughput 12.9102K wps
[Epoch 16 Batch 30/162] avg loss 0.0039861, throughput 13.1358K wps
[Epoch 16 Batch 60/162] avg loss 0.00420786, throughput 12.9098K wps
[Epoch 16 Batch 90/162] avg loss 0.00433169, throughput 12.8991K wps
[Epoch 16 Batch 120/162] avg loss 0.00412649, throughput 12.9291K wps
[Epoch 16 Batch 150/162] avg loss 0.00436309, throughput 12.8833K wps
Begin Testing...
[Epoch 16] train avg loss 0.00422052, test acc 0.9167, test avg loss 0.244005, throughput 12.948K wps
Observed Improvement.
Begin Testing...
[Epoch 17 Batch 30/162] avg loss 0.00428547, throughput 13.3412K wps
[Epoch 17 Batch 60/162] avg loss 0.00394612, throughput 12.9402K wps
[Epoch 17 Batch 90/162] avg loss 0.00410132, throughput 12.928K wps
[Epoch 17 Batch 120/162] avg loss 0.00377166, throughput 12.9361K wps
[Epoch 17 Batch 150/162] avg loss 0.00440707, throughput 12.9145K wps
Begin Testing...
[Epoch 17] train avg loss 0.00407349, test acc 0.9211, test avg loss 0.240777, throughput 13.0055K wps
Observed Improvement.
Begin Testing...
[Epoch 18 Batch 30/162] avg loss 0.00398562, throughput 13.3164K wps
[Epoch 18 Batch 60/162] avg loss 0.00411145, throughput 12.8787K wps
[Epoch 18 Batch 90/162] avg loss 0.00399971, throughput 12.9394K wps
[Epoch 18 Batch 120/162] avg loss 0.0035585, throughput 12.8598K wps
[Epoch 18 Batch 150/162] avg loss 0.00383836, throughput 12.9056K wps
Begin Testing...
[Epoch 18] train avg loss 0.00387432, test acc 0.9211, test avg loss 0.238244, throughput 12.9686K wps
Observed Improvement.
Begin Testing...
[Epoch 19 Batch 30/162] avg loss 0.00353656, throughput 13.1885K wps
[Epoch 19 Batch 60/162] avg loss 0.00357508, throughput 12.7735K wps
[Epoch 19 Batch 90/162] avg loss 0.00396596, throughput 12.8238K wps
[Epoch 19 Batch 120/162] avg loss 0.00386241, throughput 12.8751K wps
[Epoch 19 Batch 150/162] avg loss 0.00402019, throughput 12.8814K wps
Begin Testing...
[Epoch 19] train avg loss 0.00380366, test acc 0.9167, test avg loss 0.234567, throughput 12.9074K wps
[Epoch 20 Batch 30/162] avg loss 0.00375281, throughput 13.2939K wps
[Epoch 20 Batch 60/162] avg loss 0.00348771, throughput 13.0111K wps
[Epoch 20 Batch 90/162] avg loss 0.00361895, throughput 12.9749K wps
[Epoch 20 Batch 120/162] avg loss 0.00397159, throughput 13.0061K wps
[Epoch 20 Batch 150/162] avg loss 0.00379593, throughput 13.017K wps
Begin Testing...
[Epoch 20] train avg loss 0.00368083, test acc 0.9167, test avg loss 0.230379, throughput 13.0545K wps
[Epoch 21 Batch 30/162] avg loss 0.00338053, throughput 13.3161K wps
[Epoch 21 Batch 60/162] avg loss 0.003641, throughput 12.8435K wps
[Epoch 21 Batch 90/162] avg loss 0.00357036, throughput 12.981K wps
[Epoch 21 Batch 120/162] avg loss 0.00347, throughput 12.9919K wps
[Epoch 21 Batch 150/162] avg loss 0.00353543, throughput 12.9987K wps
Begin Testing...
[Epoch 21] train avg loss 0.00353082, test acc 0.9111, test avg loss 0.229974, throughput 13.0217K wps
[Epoch 22 Batch 30/162] avg loss 0.00340427, throughput 13.3134K wps
[Epoch 22 Batch 60/162] avg loss 0.00367953, throughput 12.8707K wps
[Epoch 22 Batch 90/162] avg loss 0.00330967, throughput 12.9368K wps
[Epoch 22 Batch 120/162] avg loss 0.00348383, throughput 12.9885K wps
[Epoch 22 Batch 150/162] avg loss 0.00340157, throughput 12.9506K wps
Begin Testing...
[Epoch 22] train avg loss 0.00346667, test acc 0.9178, test avg loss 0.228448, throughput 13.0061K wps
[Epoch 23 Batch 30/162] avg loss 0.00320636, throughput 13.2141K wps
[Epoch 23 Batch 60/162] avg loss 0.00346128, throughput 12.8649K wps
[Epoch 23 Batch 90/162] avg loss 0.00344341, throughput 13.0116K wps
[Epoch 23 Batch 120/162] avg loss 0.00327983, throughput 12.9428K wps
[Epoch 23 Batch 150/162] avg loss 0.00366943, throughput 12.9523K wps
Begin Testing...
[Epoch 23] train avg loss 0.00340663, test acc 0.9133, test avg loss 0.225317, throughput 12.9914K wps
[Epoch 24 Batch 30/162] avg loss 0.00349787, throughput 13.1984K wps
[Epoch 24 Batch 60/162] avg loss 0.00313777, throughput 12.9158K wps
[Epoch 24 Batch 90/162] avg loss 0.0030592, throughput 12.912K wps
[Epoch 24 Batch 120/162] avg loss 0.00304955, throughput 12.9468K wps
[Epoch 24 Batch 150/162] avg loss 0.00294511, throughput 12.9085K wps
Begin Testing...
[Epoch 24] train avg loss 0.00317235, test acc 0.9244, test avg loss 0.225674, throughput 12.9685K wps
Observed Improvement.
Begin Testing...
[Epoch 25 Batch 30/162] avg loss 0.0031326, throughput 13.3206K wps
[Epoch 25 Batch 60/162] avg loss 0.00325972, throughput 12.7334K wps
[Epoch 25 Batch 90/162] avg loss 0.0032552, throughput 12.8329K wps
[Epoch 25 Batch 120/162] avg loss 0.00297884, throughput 12.8704K wps
[Epoch 25 Batch 150/162] avg loss 0.00310807, throughput 12.8821K wps
Begin Testing...
[Epoch 25] train avg loss 0.00316286, test acc 0.9244, test avg loss 0.229713, throughput 12.9295K wps
Observed Improvement.
Begin Testing...
[Epoch 26 Batch 30/162] avg loss 0.0029335, throughput 13.2548K wps
[Epoch 26 Batch 60/162] avg loss 0.00308937, throughput 12.8823K wps
[Epoch 26 Batch 90/162] avg loss 0.00321212, throughput 12.8932K wps
[Epoch 26 Batch 120/162] avg loss 0.00308024, throughput 12.9382K wps
[Epoch 26 Batch 150/162] avg loss 0.0027113, throughput 12.9599K wps
Begin Testing...
[Epoch 26] train avg loss 0.00298192, test acc 0.9167, test avg loss 0.218924, throughput 12.9811K wps
[Epoch 27 Batch 30/162] avg loss 0.0028027, throughput 13.24K wps
[Epoch 27 Batch 60/162] avg loss 0.00285632, throughput 12.8604K wps
[Epoch 27 Batch 90/162] avg loss 0.00275009, throughput 12.9199K wps
[Epoch 27 Batch 120/162] avg loss 0.00329384, throughput 12.9719K wps
[Epoch 27 Batch 150/162] avg loss 0.00270272, throughput 12.8775K wps
Begin Testing...
[Epoch 27] train avg loss 0.00289289, test acc 0.9211, test avg loss 0.221917, throughput 12.9715K wps
[Epoch 28 Batch 30/162] avg loss 0.00300902, throughput 13.2332K wps
[Epoch 28 Batch 60/162] avg loss 0.00307074, throughput 12.9404K wps
[Epoch 28 Batch 90/162] avg loss 0.00275986, throughput 12.8936K wps
[Epoch 28 Batch 120/162] avg loss 0.00265558, throughput 12.8758K wps
[Epoch 28 Batch 150/162] avg loss 0.00289027, throughput 12.846K wps
Begin Testing...
[Epoch 28] train avg loss 0.00287805, test acc 0.9200, test avg loss 0.220553, throughput 12.9487K wps
[Epoch 29 Batch 30/162] avg loss 0.00261401, throughput 13.263K wps
[Epoch 29 Batch 60/162] avg loss 0.00246364, throughput 12.7179K wps
[Epoch 29 Batch 90/162] avg loss 0.00285205, throughput 12.8594K wps
[Epoch 29 Batch 120/162] avg loss 0.00290761, throughput 12.8759K wps
[Epoch 29 Batch 150/162] avg loss 0.00258643, throughput 12.9732K wps
Begin Testing...
[Epoch 29] train avg loss 0.00269121, test acc 0.9222, test avg loss 0.216961, throughput 12.9353K wps
[Epoch 30 Batch 30/162] avg loss 0.00256404, throughput 13.3133K wps
[Epoch 30 Batch 60/162] avg loss 0.00273195, throughput 12.9824K wps
[Epoch 30 Batch 90/162] avg loss 0.00255373, throughput 12.9447K wps
[Epoch 30 Batch 120/162] avg loss 0.0025917, throughput 12.9933K wps
[Epoch 30 Batch 150/162] avg loss 0.00248845, throughput 12.9657K wps
Begin Testing...
[Epoch 30] train avg loss 0.00260008, test acc 0.9189, test avg loss 0.215106, throughput 13.0323K wps
[Epoch 31 Batch 30/162] avg loss 0.00239108, throughput 13.2871K wps
[Epoch 31 Batch 60/162] avg loss 0.00241507, throughput 12.9038K wps
[Epoch 31 Batch 90/162] avg loss 0.00238647, throughput 12.983K wps
[Epoch 31 Batch 120/162] avg loss 0.00253818, throughput 12.9534K wps
[Epoch 31 Batch 150/162] avg loss 0.00281458, throughput 12.9862K wps
Begin Testing...
[Epoch 31] train avg loss 0.00252689, test acc 0.9289, test avg loss 0.222823, throughput 13.0209K wps
Observed Improvement.
Begin Testing...
[Epoch 32 Batch 30/162] avg loss 0.00262677, throughput 13.3196K wps
[Epoch 32 Batch 60/162] avg loss 0.0023233, throughput 12.89K wps
[Epoch 32 Batch 90/162] avg loss 0.00250999, throughput 12.9308K wps
[Epoch 32 Batch 120/162] avg loss 0.00268307, throughput 12.9507K wps
[Epoch 32 Batch 150/162] avg loss 0.00227971, throughput 12.958K wps
Begin Testing...
[Epoch 32] train avg loss 0.00251202, test acc 0.9156, test avg loss 0.214413, throughput 13.0016K wps
[Epoch 33 Batch 30/162] avg loss 0.0023849, throughput 13.3523K wps
[Epoch 33 Batch 60/162] avg loss 0.00245457, throughput 12.887K wps
[Epoch 33 Batch 90/162] avg loss 0.00232364, throughput 12.9847K wps
[Epoch 33 Batch 120/162] avg loss 0.00233137, throughput 12.9958K wps
[Epoch 33 Batch 150/162] avg loss 0.00216803, throughput 12.8853K wps
Begin Testing...
[Epoch 33] train avg loss 0.00235764, test acc 0.9189, test avg loss 0.215195, throughput 13.0013K wps
[Epoch 34 Batch 30/162] avg loss 0.00223987, throughput 13.195K wps
[Epoch 34 Batch 60/162] avg loss 0.00213421, throughput 12.7115K wps
[Epoch 34 Batch 90/162] avg loss 0.00241251, throughput 12.9143K wps
[Epoch 34 Batch 120/162] avg loss 0.00233138, throughput 12.965K wps
[Epoch 34 Batch 150/162] avg loss 0.00234978, throughput 12.9638K wps
Begin Testing...
[Epoch 34] train avg loss 0.00229416, test acc 0.9244, test avg loss 0.220714, throughput 12.9454K wps
[Epoch 35 Batch 30/162] avg loss 0.00209728, throughput 13.1912K wps
[Epoch 35 Batch 60/162] avg loss 0.00228713, throughput 12.9029K wps
[Epoch 35 Batch 90/162] avg loss 0.00236973, throughput 12.9315K wps
[Epoch 35 Batch 120/162] avg loss 0.00245452, throughput 12.9381K wps
[Epoch 35 Batch 150/162] avg loss 0.0020143, throughput 12.9372K wps
Begin Testing...
[Epoch 35] train avg loss 0.00229265, test acc 0.9267, test avg loss 0.213632, throughput 12.9743K wps
[Epoch 36 Batch 30/162] avg loss 0.00223495, throughput 13.2851K wps
[Epoch 36 Batch 60/162] avg loss 0.00209258, throughput 12.8383K wps
[Epoch 36 Batch 90/162] avg loss 0.00210351, throughput 12.9392K wps
[Epoch 36 Batch 120/162] avg loss 0.0022988, throughput 12.8971K wps
[Epoch 36 Batch 150/162] avg loss 0.0022426, throughput 12.9408K wps
Begin Testing...
[Epoch 36] train avg loss 0.00219593, test acc 0.9300, test avg loss 0.215252, throughput 12.9786K wps
Observed Improvement.
Begin Testing...
[Epoch 37 Batch 30/162] avg loss 0.00219521, throughput 13.1987K wps
[Epoch 37 Batch 60/162] avg loss 0.00186233, throughput 12.8164K wps
[Epoch 37 Batch 90/162] avg loss 0.00218772, throughput 12.8349K wps
[Epoch 37 Batch 120/162] avg loss 0.0021108, throughput 12.8528K wps
[Epoch 37 Batch 150/162] avg loss 0.00198065, throughput 12.8826K wps
Begin Testing...
[Epoch 37] train avg loss 0.0020624, test acc 0.9167, test avg loss 0.211354, throughput 12.9107K wps
[Epoch 38 Batch 30/162] avg loss 0.00210871, throughput 13.2705K wps
[Epoch 38 Batch 60/162] avg loss 0.00195535, throughput 12.8232K wps
[Epoch 38 Batch 90/162] avg loss 0.00206652, throughput 12.9732K wps
[Epoch 38 Batch 120/162] avg loss 0.00206605, throughput 12.8823K wps
[Epoch 38 Batch 150/162] avg loss 0.00182621, throughput 12.9793K wps
Begin Testing...
[Epoch 38] train avg loss 0.00203193, test acc 0.9267, test avg loss 0.212469, throughput 12.9808K wps
[Epoch 39 Batch 30/162] avg loss 0.00197146, throughput 13.3726K wps
[Epoch 39 Batch 60/162] avg loss 0.00180539, throughput 12.802K wps
[Epoch 39 Batch 90/162] avg loss 0.00176991, throughput 12.9099K wps
[Epoch 39 Batch 120/162] avg loss 0.00204293, throughput 12.9238K wps
[Epoch 39 Batch 150/162] avg loss 0.00193355, throughput 12.956K wps
Begin Testing...
[Epoch 39] train avg loss 0.00191962, test acc 0.9211, test avg loss 0.225445, throughput 12.9894K wps
[Epoch 40 Batch 30/162] avg loss 0.00196863, throughput 13.2665K wps
[Epoch 40 Batch 60/162] avg loss 0.00187401, throughput 12.8664K wps
[Epoch 40 Batch 90/162] avg loss 0.00193701, throughput 12.969K wps
[Epoch 40 Batch 120/162] avg loss 0.00191874, throughput 12.9688K wps
[Epoch 40 Batch 150/162] avg loss 0.00186422, throughput 12.9398K wps
Begin Testing...
[Epoch 40] train avg loss 0.0019046, test acc 0.9200, test avg loss 0.210158, throughput 12.9933K wps
[Epoch 41 Batch 30/162] avg loss 0.00160456, throughput 13.2634K wps
[Epoch 41 Batch 60/162] avg loss 0.0016595, throughput 12.983K wps
[Epoch 41 Batch 90/162] avg loss 0.00209243, throughput 12.9423K wps
[Epoch 41 Batch 120/162] avg loss 0.00205596, throughput 12.9861K wps
[Epoch 41 Batch 150/162] avg loss 0.0021462, throughput 12.956K wps
Begin Testing...
[Epoch 41] train avg loss 0.00189947, test acc 0.9233, test avg loss 0.210732, throughput 13.0208K wps
[Epoch 42 Batch 30/162] avg loss 0.00193422, throughput 13.243K wps
[Epoch 42 Batch 60/162] avg loss 0.00168109, throughput 12.8514K wps
[Epoch 42 Batch 90/162] avg loss 0.00161881, throughput 12.9287K wps
[Epoch 42 Batch 120/162] avg loss 0.00189888, throughput 12.9061K wps
[Epoch 42 Batch 150/162] avg loss 0.0017288, throughput 12.8916K wps
Begin Testing...
[Epoch 42] train avg loss 0.00178561, test acc 0.9267, test avg loss 0.212879, throughput 12.9585K wps
[Epoch 43 Batch 30/162] avg loss 0.00155564, throughput 13.2571K wps
[Epoch 43 Batch 60/162] avg loss 0.00181142, throughput 12.8307K wps
[Epoch 43 Batch 90/162] avg loss 0.00167164, throughput 12.973K wps
[Epoch 43 Batch 120/162] avg loss 0.00180191, throughput 12.9054K wps
[Epoch 43 Batch 150/162] avg loss 0.00181967, throughput 12.8776K wps
Begin Testing...
[Epoch 43] train avg loss 0.00175264, test acc 0.9244, test avg loss 0.214973, throughput 12.9568K wps
[Epoch 44 Batch 30/162] avg loss 0.00156178, throughput 13.1619K wps
[Epoch 44 Batch 60/162] avg loss 0.00183152, throughput 12.7281K wps
[Epoch 44 Batch 90/162] avg loss 0.0018236, throughput 12.918K wps
[Epoch 44 Batch 120/162] avg loss 0.00180279, throughput 12.9369K wps
[Epoch 44 Batch 150/162] avg loss 0.0015171, throughput 12.9392K wps
Begin Testing...
[Epoch 44] train avg loss 0.00170494, test acc 0.9167, test avg loss 0.209228, throughput 12.9324K wps
[Epoch 45 Batch 30/162] avg loss 0.00154385, throughput 13.1798K wps
[Epoch 45 Batch 60/162] avg loss 0.00165788, throughput 12.8999K wps
[Epoch 45 Batch 90/162] avg loss 0.0015923, throughput 12.8992K wps
[Epoch 45 Batch 120/162] avg loss 0.00169379, throughput 12.961K wps
[Epoch 45 Batch 150/162] avg loss 0.00156847, throughput 12.9263K wps
Begin Testing...
[Epoch 45] train avg loss 0.0016068, test acc 0.9256, test avg loss 0.211047, throughput 12.9672K wps
[Epoch 46 Batch 30/162] avg loss 0.00160788, throughput 13.1246K wps
[Epoch 46 Batch 60/162] avg loss 0.00158965, throughput 12.8673K wps
[Epoch 46 Batch 90/162] avg loss 0.00163326, throughput 12.9407K wps
[Epoch 46 Batch 120/162] avg loss 0.00164885, throughput 12.9193K wps
[Epoch 46 Batch 150/162] avg loss 0.00141688, throughput 12.9323K wps
Begin Testing...
[Epoch 46] train avg loss 0.00159313, test acc 0.9200, test avg loss 0.205645, throughput 12.9535K wps
[Epoch 47 Batch 30/162] avg loss 0.00134969, throughput 13.0902K wps
[Epoch 47 Batch 60/162] avg loss 0.00186414, throughput 12.8083K wps
[Epoch 47 Batch 90/162] avg loss 0.00158966, throughput 12.8144K wps
[Epoch 47 Batch 120/162] avg loss 0.00155811, throughput 12.8498K wps
[Epoch 47 Batch 150/162] avg loss 0.00142612, throughput 12.864K wps
Begin Testing...
[Epoch 47] train avg loss 0.00153884, test acc 0.9222, test avg loss 0.209728, throughput 12.8829K wps
[Epoch 48 Batch 30/162] avg loss 0.0014055, throughput 13.1977K wps
[Epoch 48 Batch 60/162] avg loss 0.00167883, throughput 12.8413K wps
[Epoch 48 Batch 90/162] avg loss 0.00145687, throughput 12.9459K wps
[Epoch 48 Batch 120/162] avg loss 0.00119519, throughput 12.9498K wps
[Epoch 48 Batch 150/162] avg loss 0.00150897, throughput 12.9678K wps
Begin Testing...
[Epoch 48] train avg loss 0.00147833, test acc 0.9133, test avg loss 0.205051, throughput 12.9783K wps
[Epoch 49 Batch 30/162] avg loss 0.00143017, throughput 13.2997K wps
[Epoch 49 Batch 60/162] avg loss 0.00168045, throughput 12.8422K wps
[Epoch 49 Batch 90/162] avg loss 0.00129402, throughput 12.9346K wps
[Epoch 49 Batch 120/162] avg loss 0.00142765, throughput 12.9412K wps
[Epoch 49 Batch 150/162] avg loss 0.00131301, throughput 12.9803K wps
Begin Testing...
[Epoch 49] train avg loss 0.00145965, test acc 0.9222, test avg loss 0.208444, throughput 12.9978K wps
[Epoch 50 Batch 30/162] avg loss 0.00131949, throughput 13.2332K wps
[Epoch 50 Batch 60/162] avg loss 0.00134158, throughput 13.0041K wps
[Epoch 50 Batch 90/162] avg loss 0.00139952, throughput 12.954K wps
[Epoch 50 Batch 120/162] avg loss 0.0013874, throughput 12.9618K wps
[Epoch 50 Batch 150/162] avg loss 0.00158841, throughput 12.9768K wps
Begin Testing...
[Epoch 50] train avg loss 0.00138587, test acc 0.9233, test avg loss 0.210756, throughput 13.0181K wps
[Epoch 51 Batch 30/162] avg loss 0.00137903, throughput 13.248K wps
[Epoch 51 Batch 60/162] avg loss 0.00118669, throughput 12.8914K wps
[Epoch 51 Batch 90/162] avg loss 0.00154764, throughput 12.9205K wps
[Epoch 51 Batch 120/162] avg loss 0.0012744, throughput 12.9545K wps
[Epoch 51 Batch 150/162] avg loss 0.00133426, throughput 12.873K wps
Begin Testing...
[Epoch 51] train avg loss 0.0013603, test acc 0.9211, test avg loss 0.206715, throughput 12.9676K wps
[Epoch 52 Batch 30/162] avg loss 0.00142497, throughput 13.2136K wps
[Epoch 52 Batch 60/162] avg loss 0.00118887, throughput 12.8748K wps
[Epoch 52 Batch 90/162] avg loss 0.00122483, throughput 12.9468K wps
[Epoch 52 Batch 120/162] avg loss 0.00147589, throughput 12.966K wps
[Epoch 52 Batch 150/162] avg loss 0.00133067, throughput 12.9509K wps
Begin Testing...
[Epoch 52] train avg loss 0.00132756, test acc 0.9244, test avg loss 0.209276, throughput 12.9904K wps
[Epoch 53 Batch 30/162] avg loss 0.00123694, throughput 13.144K wps
[Epoch 53 Batch 60/162] avg loss 0.00115648, throughput 12.6722K wps
[Epoch 53 Batch 90/162] avg loss 0.00124761, throughput 12.7773K wps
[Epoch 53 Batch 120/162] avg loss 0.00113395, throughput 12.8402K wps
[Epoch 53 Batch 150/162] avg loss 0.00113021, throughput 12.874K wps
Begin Testing...
[Epoch 53] train avg loss 0.00118559, test acc 0.9244, test avg loss 0.213194, throughput 12.8606K wps
[Epoch 54 Batch 30/162] avg loss 0.00128978, throughput 13.1158K wps
[Epoch 54 Batch 60/162] avg loss 0.0010675, throughput 12.9522K wps
[Epoch 54 Batch 90/162] avg loss 0.00117141, throughput 12.967K wps
[Epoch 54 Batch 120/162] avg loss 0.00117796, throughput 12.9098K wps
[Epoch 54 Batch 150/162] avg loss 0.00132106, throughput 12.9K wps
Begin Testing...
[Epoch 54] train avg loss 0.00119968, test acc 0.9156, test avg loss 0.206917, throughput 12.9629K wps
[Epoch 55 Batch 30/162] avg loss 0.00119383, throughput 13.2113K wps
[Epoch 55 Batch 60/162] avg loss 0.00124433, throughput 12.8173K wps
[Epoch 55 Batch 90/162] avg loss 0.00111638, throughput 12.9353K wps
[Epoch 55 Batch 120/162] avg loss 0.00118322, throughput 12.8957K wps
[Epoch 55 Batch 150/162] avg loss 0.00110615, throughput 12.9101K wps
Begin Testing...
[Epoch 55] train avg loss 0.00117165, test acc 0.9222, test avg loss 0.208834, throughput 12.9485K wps
[Epoch 56 Batch 30/162] avg loss 0.00118151, throughput 13.1137K wps
[Epoch 56 Batch 60/162] avg loss 0.00113473, throughput 12.8993K wps
[Epoch 56 Batch 90/162] avg loss 0.0011659, throughput 12.8749K wps
[Epoch 56 Batch 120/162] avg loss 0.00103964, throughput 12.8379K wps
[Epoch 56 Batch 150/162] avg loss 0.00113213, throughput 12.8296K wps
Begin Testing...
[Epoch 56] train avg loss 0.00113275, test acc 0.9244, test avg loss 0.211899, throughput 12.9086K wps
[Epoch 57 Batch 30/162] avg loss 0.00099364, throughput 13.1893K wps
[Epoch 57 Batch 60/162] avg loss 0.00098865, throughput 12.7942K wps
[Epoch 57 Batch 90/162] avg loss 0.00121184, throughput 12.8421K wps
[Epoch 57 Batch 120/162] avg loss 0.00107826, throughput 12.8504K wps
[Epoch 57 Batch 150/162] avg loss 0.0010855, throughput 12.8371K wps
Begin Testing...
[Epoch 57] train avg loss 0.0010653, test acc 0.9233, test avg loss 0.211234, throughput 12.9003K wps
[Epoch 58 Batch 30/162] avg loss 0.00112553, throughput 13.1803K wps
[Epoch 58 Batch 60/162] avg loss 0.00097019, throughput 12.8855K wps
[Epoch 58 Batch 90/162] avg loss 0.00112915, throughput 12.9881K wps
[Epoch 58 Batch 120/162] avg loss 0.00097894, throughput 12.8543K wps
[Epoch 58 Batch 150/162] avg loss 0.00100527, throughput 12.8947K wps
Begin Testing...
[Epoch 58] train avg loss 0.00103924, test acc 0.9189, test avg loss 0.206104, throughput 12.952K wps
[Epoch 59 Batch 30/162] avg loss 0.00111611, throughput 13.2769K wps
[Epoch 59 Batch 60/162] avg loss 0.00101195, throughput 12.775K wps
[Epoch 59 Batch 90/162] avg loss 0.0010719, throughput 12.9067K wps
[Epoch 59 Batch 120/162] avg loss 0.00117748, throughput 12.9318K wps
[Epoch 59 Batch 150/162] avg loss 0.00100235, throughput 12.9321K wps
Begin Testing...
[Epoch 59] train avg loss 0.00105103, test acc 0.9200, test avg loss 0.21575, throughput 12.9551K wps
Test loss 0.223393, test acc 0.9160
Total time cost 163.99s
9000 1000
[Epoch 0 Batch 30/162] avg loss 0.0153622, throughput 11.601K wps
[Epoch 0 Batch 60/162] avg loss 0.014385, throughput 12.8273K wps
[Epoch 0 Batch 90/162] avg loss 0.0134586, throughput 12.8739K wps
[Epoch 0 Batch 120/162] avg loss 0.0132014, throughput 12.8983K wps
[Epoch 0 Batch 150/162] avg loss 0.012889, throughput 12.8945K wps
Begin Testing...
[Epoch 0] train avg loss 0.0137508, test acc 0.6911, test avg loss 0.596689, throughput 12.6145K wps
Observed Improvement.
Begin Testing...
[Epoch 1 Batch 30/162] avg loss 0.0125742, throughput 13.1275K wps
[Epoch 1 Batch 60/162] avg loss 0.0123434, throughput 12.8047K wps
[Epoch 1 Batch 90/162] avg loss 0.011781, throughput 12.8799K wps
[Epoch 1 Batch 120/162] avg loss 0.0120958, throughput 12.8399K wps
[Epoch 1 Batch 150/162] avg loss 0.0117518, throughput 12.8251K wps
Begin Testing...
[Epoch 1] train avg loss 0.0120804, test acc 0.7533, test avg loss 0.549325, throughput 12.891K wps
Observed Improvement.
Begin Testing...
[Epoch 2 Batch 30/162] avg loss 0.0113483, throughput 13.2359K wps
[Epoch 2 Batch 60/162] avg loss 0.0112953, throughput 12.8023K wps
[Epoch 2 Batch 90/162] avg loss 0.0109872, throughput 12.8476K wps
[Epoch 2 Batch 120/162] avg loss 0.0108312, throughput 12.8602K wps
[Epoch 2 Batch 150/162] avg loss 0.0106782, throughput 12.8329K wps
Begin Testing...
[Epoch 2] train avg loss 0.0109886, test acc 0.8200, test avg loss 0.504867, throughput 12.9125K wps
Observed Improvement.
Begin Testing...
[Epoch 3 Batch 30/162] avg loss 0.0101951, throughput 13.2373K wps
[Epoch 3 Batch 60/162] avg loss 0.010231, throughput 12.7643K wps
[Epoch 3 Batch 90/162] avg loss 0.0097362, throughput 12.7824K wps
[Epoch 3 Batch 120/162] avg loss 0.0100657, throughput 12.9238K wps
[Epoch 3 Batch 150/162] avg loss 0.00949614, throughput 12.9235K wps
Begin Testing...
[Epoch 3] train avg loss 0.00989993, test acc 0.8456, test avg loss 0.462955, throughput 12.9222K wps
Observed Improvement.
Begin Testing...
[Epoch 4 Batch 30/162] avg loss 0.00942658, throughput 13.3111K wps
[Epoch 4 Batch 60/162] avg loss 0.00884946, throughput 12.8319K wps
[Epoch 4 Batch 90/162] avg loss 0.00911789, throughput 12.9076K wps
[Epoch 4 Batch 120/162] avg loss 0.00875191, throughput 12.8723K wps
[Epoch 4 Batch 150/162] avg loss 0.00894574, throughput 12.9546K wps
Begin Testing...
[Epoch 4] train avg loss 0.00901507, test acc 0.8756, test avg loss 0.42069, throughput 12.9652K wps
Observed Improvement.
Begin Testing...
[Epoch 5 Batch 30/162] avg loss 0.00820447, throughput 13.2283K wps
[Epoch 5 Batch 60/162] avg loss 0.00842208, throughput 12.828K wps
[Epoch 5 Batch 90/162] avg loss 0.00828734, throughput 12.7911K wps
[Epoch 5 Batch 120/162] avg loss 0.00796562, throughput 12.7469K wps
[Epoch 5 Batch 150/162] avg loss 0.00792878, throughput 12.8199K wps
Begin Testing...
[Epoch 5] train avg loss 0.0081433, test acc 0.8733, test avg loss 0.387449, throughput 12.88K wps
[Epoch 6 Batch 30/162] avg loss 0.00782505, throughput 13.1289K wps
[Epoch 6 Batch 60/162] avg loss 0.00739161, throughput 12.6926K wps
[Epoch 6 Batch 90/162] avg loss 0.00715992, throughput 12.8751K wps
[Epoch 6 Batch 120/162] avg loss 0.00734613, throughput 12.8401K wps
[Epoch 6 Batch 150/162] avg loss 0.0073381, throughput 12.9107K wps
Begin Testing...
[Epoch 6] train avg loss 0.00741106, test acc 0.8867, test avg loss 0.355497, throughput 12.8934K wps
Observed Improvement.
Begin Testing...
[Epoch 7 Batch 30/162] avg loss 0.00699688, throughput 13.2019K wps
[Epoch 7 Batch 60/162] avg loss 0.0072943, throughput 12.7448K wps
[Epoch 7 Batch 90/162] avg loss 0.00655082, throughput 12.8159K wps
[Epoch 7 Batch 120/162] avg loss 0.00645104, throughput 12.8333K wps
[Epoch 7 Batch 150/162] avg loss 0.006671, throughput 12.9881K wps
Begin Testing...
[Epoch 7] train avg loss 0.00678768, test acc 0.8944, test avg loss 0.328133, throughput 12.9202K wps
Observed Improvement.
Begin Testing...
[Epoch 8 Batch 30/162] avg loss 0.00647422, throughput 13.3121K wps
[Epoch 8 Batch 60/162] avg loss 0.00632978, throughput 12.8002K wps
[Epoch 8 Batch 90/162] avg loss 0.00659252, throughput 12.8923K wps
[Epoch 8 Batch 120/162] avg loss 0.00609988, throughput 12.7994K wps
[Epoch 8 Batch 150/162] avg loss 0.00640158, throughput 12.9622K wps
Begin Testing...
[Epoch 8] train avg loss 0.00635582, test acc 0.8978, test avg loss 0.309114, throughput 12.9533K wps
Observed Improvement.
Begin Testing...
[Epoch 9 Batch 30/162] avg loss 0.00613149, throughput 13.2595K wps
[Epoch 9 Batch 60/162] avg loss 0.00598176, throughput 12.8042K wps
[Epoch 9 Batch 90/162] avg loss 0.00565862, throughput 12.9695K wps
[Epoch 9 Batch 120/162] avg loss 0.0056951, throughput 12.863K wps
[Epoch 9 Batch 150/162] avg loss 0.00606518, throughput 12.8107K wps
Begin Testing...
[Epoch 9] train avg loss 0.00589483, test acc 0.8956, test avg loss 0.29418, throughput 12.922K wps
[Epoch 10 Batch 30/162] avg loss 0.00546888, throughput 13.1181K wps
[Epoch 10 Batch 60/162] avg loss 0.00585724, throughput 12.7973K wps
[Epoch 10 Batch 90/162] avg loss 0.00545427, throughput 12.9449K wps
[Epoch 10 Batch 120/162] avg loss 0.00569079, throughput 12.8972K wps
[Epoch 10 Batch 150/162] avg loss 0.00554782, throughput 12.8988K wps
Begin Testing...
[Epoch 10] train avg loss 0.00560417, test acc 0.9022, test avg loss 0.281643, throughput 12.9204K wps
Observed Improvement.
Begin Testing...
[Epoch 11 Batch 30/162] avg loss 0.00538754, throughput 13.2645K wps
[Epoch 11 Batch 60/162] avg loss 0.00556826, throughput 12.7676K wps
[Epoch 11 Batch 90/162] avg loss 0.00529092, throughput 12.7848K wps
[Epoch 11 Batch 120/162] avg loss 0.00530952, throughput 12.8544K wps
[Epoch 11 Batch 150/162] avg loss 0.0051578, throughput 12.8784K wps
Begin Testing...
[Epoch 11] train avg loss 0.00532846, test acc 0.9033, test avg loss 0.27251, throughput 12.9101K wps
Observed Improvement.
Begin Testing...
[Epoch 12 Batch 30/162] avg loss 0.00497038, throughput 13.1737K wps
[Epoch 12 Batch 60/162] avg loss 0.00519901, throughput 12.6909K wps
[Epoch 12 Batch 90/162] avg loss 0.00514457, throughput 12.9132K wps
[Epoch 12 Batch 120/162] avg loss 0.00500543, throughput 12.8046K wps
[Epoch 12 Batch 150/162] avg loss 0.00517069, throughput 12.8684K wps
Begin Testing...
[Epoch 12] train avg loss 0.00509048, test acc 0.9044, test avg loss 0.264245, throughput 12.8865K wps
Observed Improvement.
Begin Testing...
[Epoch 13 Batch 30/162] avg loss 0.0046554, throughput 13.1424K wps
[Epoch 13 Batch 60/162] avg loss 0.00507277, throughput 12.8481K wps
[Epoch 13 Batch 90/162] avg loss 0.00453658, throughput 12.956K wps
[Epoch 13 Batch 120/162] avg loss 0.00517908, throughput 12.928K wps
[Epoch 13 Batch 150/162] avg loss 0.00481283, throughput 12.9258K wps
Begin Testing...
[Epoch 13] train avg loss 0.00485205, test acc 0.9067, test avg loss 0.263626, throughput 12.9543K wps
Observed Improvement.
Begin Testing...
[Epoch 14 Batch 30/162] avg loss 0.00475809, throughput 13.268K wps
[Epoch 14 Batch 60/162] avg loss 0.00451627, throughput 12.7279K wps
[Epoch 14 Batch 90/162] avg loss 0.00497568, throughput 12.8434K wps
[Epoch 14 Batch 120/162] avg loss 0.00459468, throughput 12.875K wps
[Epoch 14 Batch 150/162] avg loss 0.00443517, throughput 12.7479K wps
Begin Testing...
[Epoch 14] train avg loss 0.00463365, test acc 0.9033, test avg loss 0.251052, throughput 12.8718K wps
[Epoch 15 Batch 30/162] avg loss 0.00451158, throughput 13.1594K wps
[Epoch 15 Batch 60/162] avg loss 0.00438656, throughput 12.7446K wps
[Epoch 15 Batch 90/162] avg loss 0.00454034, throughput 12.8819K wps
[Epoch 15 Batch 120/162] avg loss 0.00459276, throughput 12.8214K wps
[Epoch 15 Batch 150/162] avg loss 0.00458082, throughput 12.8671K wps
Begin Testing...
[Epoch 15] train avg loss 0.00447913, test acc 0.9089, test avg loss 0.250995, throughput 12.9002K wps
Observed Improvement.
Begin Testing...
[Epoch 16 Batch 30/162] avg loss 0.00425103, throughput 13.2978K wps
[Epoch 16 Batch 60/162] avg loss 0.00437298, throughput 12.809K wps
[Epoch 16 Batch 90/162] avg loss 0.00430823, throughput 12.9435K wps
[Epoch 16 Batch 120/162] avg loss 0.00402052, throughput 12.9789K wps
[Epoch 16 Batch 150/162] avg loss 0.0045041, throughput 12.9482K wps
Begin Testing...
[Epoch 16] train avg loss 0.00426992, test acc 0.9033, test avg loss 0.243536, throughput 12.9909K wps
[Epoch 17 Batch 30/162] avg loss 0.00422474, throughput 13.336K wps
[Epoch 17 Batch 60/162] avg loss 0.0039713, throughput 12.839K wps
[Epoch 17 Batch 90/162] avg loss 0.00419909, throughput 12.9643K wps
[Epoch 17 Batch 120/162] avg loss 0.00405687, throughput 12.8479K wps
[Epoch 17 Batch 150/162] avg loss 0.00422941, throughput 12.9608K wps
Begin Testing...
[Epoch 17] train avg loss 0.0041351, test acc 0.9033, test avg loss 0.240845, throughput 12.9883K wps
[Epoch 18 Batch 30/162] avg loss 0.00410636, throughput 13.2171K wps
[Epoch 18 Batch 60/162] avg loss 0.00432847, throughput 12.7939K wps
[Epoch 18 Batch 90/162] avg loss 0.00395017, throughput 12.8956K wps
[Epoch 18 Batch 120/162] avg loss 0.0038198, throughput 12.9327K wps
[Epoch 18 Batch 150/162] avg loss 0.00403096, throughput 12.9518K wps
Begin Testing...
[Epoch 18] train avg loss 0.00405638, test acc 0.9089, test avg loss 0.235752, throughput 12.9547K wps
Observed Improvement.
Begin Testing...
[Epoch 19 Batch 30/162] avg loss 0.00408525, throughput 13.1983K wps
[Epoch 19 Batch 60/162] avg loss 0.00383756, throughput 12.8153K wps
[Epoch 19 Batch 90/162] avg loss 0.00401665, throughput 12.9213K wps
[Epoch 19 Batch 120/162] avg loss 0.00388121, throughput 12.9175K wps
[Epoch 19 Batch 150/162] avg loss 0.00390497, throughput 12.9436K wps
Begin Testing...
[Epoch 19] train avg loss 0.00391513, test acc 0.9078, test avg loss 0.231062, throughput 12.9546K wps
[Epoch 20 Batch 30/162] avg loss 0.0037982, throughput 13.204K wps
[Epoch 20 Batch 60/162] avg loss 0.00364371, throughput 12.7271K wps
[Epoch 20 Batch 90/162] avg loss 0.00351355, throughput 12.8183K wps
[Epoch 20 Batch 120/162] avg loss 0.00388793, throughput 12.8149K wps
[Epoch 20 Batch 150/162] avg loss 0.00377038, throughput 12.8964K wps
Begin Testing...
[Epoch 20] train avg loss 0.00372188, test acc 0.9100, test avg loss 0.228913, throughput 12.8899K wps
Observed Improvement.
Begin Testing...
[Epoch 21 Batch 30/162] avg loss 0.00390266, throughput 13.3163K wps
[Epoch 21 Batch 60/162] avg loss 0.00360351, throughput 12.8532K wps
[Epoch 21 Batch 90/162] avg loss 0.00347353, throughput 12.9325K wps
[Epoch 21 Batch 120/162] avg loss 0.00361854, throughput 12.9044K wps
[Epoch 21 Batch 150/162] avg loss 0.00345025, throughput 12.9091K wps
Begin Testing...
[Epoch 21] train avg loss 0.00361084, test acc 0.9178, test avg loss 0.224292, throughput 12.9794K wps
Observed Improvement.
Begin Testing...
[Epoch 22 Batch 30/162] avg loss 0.00377556, throughput 13.2904K wps
[Epoch 22 Batch 60/162] avg loss 0.00361895, throughput 12.8284K wps
[Epoch 22 Batch 90/162] avg loss 0.00361788, throughput 12.9739K wps
[Epoch 22 Batch 120/162] avg loss 0.00329748, throughput 12.9742K wps
[Epoch 22 Batch 150/162] avg loss 0.00343128, throughput 12.9431K wps
Begin Testing...
[Epoch 22] train avg loss 0.00353683, test acc 0.9167, test avg loss 0.223511, throughput 12.9955K wps
[Epoch 23 Batch 30/162] avg loss 0.0037979, throughput 13.1892K wps
[Epoch 23 Batch 60/162] avg loss 0.00341751, throughput 12.8203K wps
[Epoch 23 Batch 90/162] avg loss 0.00311145, throughput 12.8341K wps
[Epoch 23 Batch 120/162] avg loss 0.00348151, throughput 12.847K wps
[Epoch 23 Batch 150/162] avg loss 0.0031566, throughput 12.8542K wps
Begin Testing...
[Epoch 23] train avg loss 0.00340507, test acc 0.9178, test avg loss 0.21879, throughput 12.9048K wps
Observed Improvement.
Begin Testing...
[Epoch 24 Batch 30/162] avg loss 0.00323043, throughput 13.2176K wps
[Epoch 24 Batch 60/162] avg loss 0.00362376, throughput 12.8586K wps
[Epoch 24 Batch 90/162] avg loss 0.00325001, throughput 12.8852K wps
[Epoch 24 Batch 120/162] avg loss 0.00311582, throughput 12.9265K wps
[Epoch 24 Batch 150/162] avg loss 0.00303106, throughput 12.8822K wps
Begin Testing...
[Epoch 24] train avg loss 0.00326045, test acc 0.9200, test avg loss 0.218053, throughput 12.949K wps
Observed Improvement.
Begin Testing...
[Epoch 25 Batch 30/162] avg loss 0.00326237, throughput 13.2336K wps
[Epoch 25 Batch 60/162] avg loss 0.00323153, throughput 12.8916K wps
[Epoch 25 Batch 90/162] avg loss 0.00307716, throughput 12.9568K wps
[Epoch 25 Batch 120/162] avg loss 0.00330636, throughput 12.8262K wps
[Epoch 25 Batch 150/162] avg loss 0.00317599, throughput 12.8001K wps
Begin Testing...
[Epoch 25] train avg loss 0.00320777, test acc 0.9167, test avg loss 0.218016, throughput 12.9408K wps
[Epoch 26 Batch 30/162] avg loss 0.00293242, throughput 13.2919K wps
[Epoch 26 Batch 60/162] avg loss 0.00308211, throughput 12.7428K wps
[Epoch 26 Batch 90/162] avg loss 0.0031104, throughput 12.8459K wps
[Epoch 26 Batch 120/162] avg loss 0.00310369, throughput 12.9183K wps
[Epoch 26 Batch 150/162] avg loss 0.00276652, throughput 12.8179K wps
Begin Testing...
[Epoch 26] train avg loss 0.00301303, test acc 0.9233, test avg loss 0.214978, throughput 12.9184K wps
Observed Improvement.
Begin Testing...
[Epoch 27 Batch 30/162] avg loss 0.0030057, throughput 13.2478K wps
[Epoch 27 Batch 60/162] avg loss 0.00277389, throughput 12.7382K wps
[Epoch 27 Batch 90/162] avg loss 0.00309105, throughput 12.9106K wps
[Epoch 27 Batch 120/162] avg loss 0.00290783, throughput 12.9143K wps
[Epoch 27 Batch 150/162] avg loss 0.00290076, throughput 12.9196K wps
Begin Testing...
[Epoch 27] train avg loss 0.0029035, test acc 0.9222, test avg loss 0.214492, throughput 12.9417K wps
[Epoch 28 Batch 30/162] avg loss 0.0031412, throughput 13.075K wps
[Epoch 28 Batch 60/162] avg loss 0.00282931, throughput 12.8274K wps
[Epoch 28 Batch 90/162] avg loss 0.0027633, throughput 12.9322K wps
[Epoch 28 Batch 120/162] avg loss 0.00301644, throughput 12.9451K wps
[Epoch 28 Batch 150/162] avg loss 0.00302787, throughput 12.9644K wps
Begin Testing...
[Epoch 28] train avg loss 0.00294045, test acc 0.9222, test avg loss 0.21218, throughput 12.9463K wps
[Epoch 29 Batch 30/162] avg loss 0.00281821, throughput 13.1554K wps
[Epoch 29 Batch 60/162] avg loss 0.00274033, throughput 12.7528K wps
[Epoch 29 Batch 90/162] avg loss 0.00259959, throughput 12.8229K wps
[Epoch 29 Batch 120/162] avg loss 0.00278855, throughput 12.8459K wps
[Epoch 29 Batch 150/162] avg loss 0.00274003, throughput 12.8561K wps
Begin Testing...
[Epoch 29] train avg loss 0.00274462, test acc 0.9244, test avg loss 0.21024, throughput 12.8906K wps
Observed Improvement.
Begin Testing...
[Epoch 30 Batch 30/162] avg loss 0.00275021, throughput 13.2986K wps
[Epoch 30 Batch 60/162] avg loss 0.00269488, throughput 12.8688K wps
[Epoch 30 Batch 90/162] avg loss 0.00241315, throughput 12.9447K wps
[Epoch 30 Batch 120/162] avg loss 0.00279496, throughput 12.9159K wps
[Epoch 30 Batch 150/162] avg loss 0.00259193, throughput 12.9402K wps
Begin Testing...
[Epoch 30] train avg loss 0.00266116, test acc 0.9267, test avg loss 0.211799, throughput 12.9876K wps
Observed Improvement.
Begin Testing...
[Epoch 31 Batch 30/162] avg loss 0.00228959, throughput 13.2962K wps
[Epoch 31 Batch 60/162] avg loss 0.00275544, throughput 12.8617K wps
[Epoch 31 Batch 90/162] avg loss 0.0028051, throughput 12.8861K wps
[Epoch 31 Batch 120/162] avg loss 0.00272212, throughput 12.8775K wps
[Epoch 31 Batch 150/162] avg loss 0.00239727, throughput 12.8625K wps
Begin Testing...
[Epoch 31] train avg loss 0.00258846, test acc 0.9211, test avg loss 0.214279, throughput 12.9508K wps
[Epoch 32 Batch 30/162] avg loss 0.00232007, throughput 13.1657K wps
[Epoch 32 Batch 60/162] avg loss 0.00233264, throughput 12.9377K wps
[Epoch 32 Batch 90/162] avg loss 0.00247008, throughput 12.8771K wps
[Epoch 32 Batch 120/162] avg loss 0.00282822, throughput 12.8353K wps
[Epoch 32 Batch 150/162] avg loss 0.0027559, throughput 12.8787K wps
Begin Testing...
[Epoch 32] train avg loss 0.00253363, test acc 0.9222, test avg loss 0.20657, throughput 12.9289K wps
[Epoch 33 Batch 30/162] avg loss 0.00254128, throughput 13.1171K wps
[Epoch 33 Batch 60/162] avg loss 0.00241394, throughput 12.689K wps
[Epoch 33 Batch 90/162] avg loss 0.00242941, throughput 12.8455K wps
[Epoch 33 Batch 120/162] avg loss 0.0023113, throughput 12.9016K wps
[Epoch 33 Batch 150/162] avg loss 0.00258931, throughput 12.9371K wps
Begin Testing...
[Epoch 33] train avg loss 0.00246311, test acc 0.9233, test avg loss 0.206798, throughput 12.8989K wps
[Epoch 34 Batch 30/162] avg loss 0.00244655, throughput 13.1513K wps
[Epoch 34 Batch 60/162] avg loss 0.00234095, throughput 12.8148K wps
[Epoch 34 Batch 90/162] avg loss 0.00229434, throughput 12.9521K wps
[Epoch 34 Batch 120/162] avg loss 0.00260119, throughput 12.9618K wps
[Epoch 34 Batch 150/162] avg loss 0.00224266, throughput 12.8624K wps
Begin Testing...
[Epoch 34] train avg loss 0.0023856, test acc 0.9278, test avg loss 0.205137, throughput 12.9468K wps
Observed Improvement.
Begin Testing...
[Epoch 35 Batch 30/162] avg loss 0.00244204, throughput 13.1873K wps
[Epoch 35 Batch 60/162] avg loss 0.00210554, throughput 12.9164K wps
[Epoch 35 Batch 90/162] avg loss 0.00231938, throughput 12.9681K wps
[Epoch 35 Batch 120/162] avg loss 0.00221766, throughput 12.9136K wps
[Epoch 35 Batch 150/162] avg loss 0.00249169, throughput 12.8941K wps
Begin Testing...
[Epoch 35] train avg loss 0.00231174, test acc 0.9233, test avg loss 0.206752, throughput 12.9715K wps
[Epoch 36 Batch 30/162] avg loss 0.00224865, throughput 13.2124K wps
[Epoch 36 Batch 60/162] avg loss 0.0021355, throughput 12.799K wps
[Epoch 36 Batch 90/162] avg loss 0.00238779, throughput 12.9406K wps
[Epoch 36 Batch 120/162] avg loss 0.0022556, throughput 12.9498K wps
[Epoch 36 Batch 150/162] avg loss 0.00210934, throughput 12.9202K wps
Begin Testing...
[Epoch 36] train avg loss 0.00222544, test acc 0.9222, test avg loss 0.205615, throughput 12.959K wps
[Epoch 37 Batch 30/162] avg loss 0.0018667, throughput 13.1684K wps
[Epoch 37 Batch 60/162] avg loss 0.00224339, throughput 12.8776K wps
[Epoch 37 Batch 90/162] avg loss 0.00208419, throughput 12.8567K wps
[Epoch 37 Batch 120/162] avg loss 0.00231683, throughput 12.7829K wps
[Epoch 37 Batch 150/162] avg loss 0.00203128, throughput 12.6942K wps
Begin Testing...
[Epoch 37] train avg loss 0.00210079, test acc 0.9267, test avg loss 0.205685, throughput 12.8775K wps
[Epoch 38 Batch 30/162] avg loss 0.00193241, throughput 13.2023K wps
[Epoch 38 Batch 60/162] avg loss 0.00227314, throughput 12.8463K wps
[Epoch 38 Batch 90/162] avg loss 0.00195975, throughput 12.8449K wps
[Epoch 38 Batch 120/162] avg loss 0.00216486, throughput 12.847K wps
[Epoch 38 Batch 150/162] avg loss 0.00211001, throughput 12.8942K wps
Begin Testing...
[Epoch 38] train avg loss 0.00209866, test acc 0.9267, test avg loss 0.203235, throughput 12.9189K wps
[Epoch 39 Batch 30/162] avg loss 0.00217403, throughput 13.2707K wps
[Epoch 39 Batch 60/162] avg loss 0.00181696, throughput 12.7884K wps
[Epoch 39 Batch 90/162] avg loss 0.00184459, throughput 12.8101K wps
[Epoch 39 Batch 120/162] avg loss 0.00186026, throughput 12.94K wps
[Epoch 39 Batch 150/162] avg loss 0.00189495, throughput 12.9442K wps
Begin Testing...
[Epoch 39] train avg loss 0.00193812, test acc 0.9256, test avg loss 0.199893, throughput 12.9496K wps
[Epoch 40 Batch 30/162] avg loss 0.00206014, throughput 13.1945K wps
[Epoch 40 Batch 60/162] avg loss 0.00192651, throughput 12.6802K wps
[Epoch 40 Batch 90/162] avg loss 0.00204826, throughput 12.8979K wps
[Epoch 40 Batch 120/162] avg loss 0.00196595, throughput 12.9026K wps
[Epoch 40 Batch 150/162] avg loss 0.00198187, throughput 12.897K wps
Begin Testing...
[Epoch 40] train avg loss 0.00197724, test acc 0.9289, test avg loss 0.204375, throughput 12.9103K wps
Observed Improvement.
Begin Testing...
[Epoch 41 Batch 30/162] avg loss 0.00163632, throughput 13.1785K wps
[Epoch 41 Batch 60/162] avg loss 0.00195405, throughput 12.744K wps
[Epoch 41 Batch 90/162] avg loss 0.00199575, throughput 12.9232K wps
[Epoch 41 Batch 120/162] avg loss 0.0019775, throughput 12.8972K wps
[Epoch 41 Batch 150/162] avg loss 0.00165073, throughput 12.8732K wps
Begin Testing...
[Epoch 41] train avg loss 0.00185613, test acc 0.9222, test avg loss 0.201861, throughput 12.9164K wps
[Epoch 42 Batch 30/162] avg loss 0.00192403, throughput 13.0466K wps
[Epoch 42 Batch 60/162] avg loss 0.00181839, throughput 12.8343K wps
[Epoch 42 Batch 90/162] avg loss 0.00179637, throughput 12.8874K wps
[Epoch 42 Batch 120/162] avg loss 0.00179258, throughput 12.8677K wps
[Epoch 42 Batch 150/162] avg loss 0.0022473, throughput 12.8788K wps
Begin Testing...
[Epoch 42] train avg loss 0.00189332, test acc 0.9267, test avg loss 0.199636, throughput 12.905K wps
[Epoch 43 Batch 30/162] avg loss 0.00181496, throughput 13.2687K wps
[Epoch 43 Batch 60/162] avg loss 0.00177622, throughput 12.8028K wps
[Epoch 43 Batch 90/162] avg loss 0.00166161, throughput 12.9492K wps
[Epoch 43 Batch 120/162] avg loss 0.00195295, throughput 12.9303K wps
[Epoch 43 Batch 150/162] avg loss 0.00171792, throughput 12.923K wps
Begin Testing...
[Epoch 43] train avg loss 0.00176239, test acc 0.9278, test avg loss 0.20087, throughput 12.9705K wps
[Epoch 44 Batch 30/162] avg loss 0.00176924, throughput 13.2256K wps
[Epoch 44 Batch 60/162] avg loss 0.0015758, throughput 12.8175K wps
[Epoch 44 Batch 90/162] avg loss 0.00166414, throughput 12.9607K wps
[Epoch 44 Batch 120/162] avg loss 0.00175178, throughput 12.9909K wps
[Epoch 44 Batch 150/162] avg loss 0.00158731, throughput 12.899K wps
Begin Testing...
[Epoch 44] train avg loss 0.00166708, test acc 0.9278, test avg loss 0.199919, throughput 12.9694K wps
[Epoch 45 Batch 30/162] avg loss 0.00158863, throughput 13.3213K wps
[Epoch 45 Batch 60/162] avg loss 0.00163387, throughput 12.83K wps
[Epoch 45 Batch 90/162] avg loss 0.00147835, throughput 12.8458K wps
[Epoch 45 Batch 120/162] avg loss 0.00169034, throughput 12.9902K wps
[Epoch 45 Batch 150/162] avg loss 0.00181108, throughput 12.8041K wps
Begin Testing...
[Epoch 45] train avg loss 0.00163793, test acc 0.9267, test avg loss 0.200298, throughput 12.9397K wps
[Epoch 46 Batch 30/162] avg loss 0.0016626, throughput 13.1977K wps
[Epoch 46 Batch 60/162] avg loss 0.00137863, throughput 12.8352K wps
[Epoch 46 Batch 90/162] avg loss 0.00170256, throughput 12.8954K wps
[Epoch 46 Batch 120/162] avg loss 0.00142418, throughput 12.9248K wps
[Epoch 46 Batch 150/162] avg loss 0.00161353, throughput 12.938K wps
Begin Testing...
[Epoch 46] train avg loss 0.00156082, test acc 0.9289, test avg loss 0.198001, throughput 12.9526K wps
Observed Improvement.
Begin Testing...
[Epoch 47 Batch 30/162] avg loss 0.00151935, throughput 13.2868K wps
[Epoch 47 Batch 60/162] avg loss 0.00167139, throughput 12.7092K wps
[Epoch 47 Batch 90/162] avg loss 0.00164763, throughput 12.8463K wps
[Epoch 47 Batch 120/162] avg loss 0.00157127, throughput 12.8365K wps
[Epoch 47 Batch 150/162] avg loss 0.00152932, throughput 12.8485K wps
Begin Testing...
[Epoch 47] train avg loss 0.00158228, test acc 0.9289, test avg loss 0.198167, throughput 12.9061K wps
Observed Improvement.
Begin Testing...
[Epoch 48 Batch 30/162] avg loss 0.00147789, throughput 13.215K wps
[Epoch 48 Batch 60/162] avg loss 0.00136266, throughput 12.8885K wps
[Epoch 48 Batch 90/162] avg loss 0.0013857, throughput 12.9907K wps
[Epoch 48 Batch 120/162] avg loss 0.00152097, throughput 12.9313K wps
[Epoch 48 Batch 150/162] avg loss 0.00141843, throughput 12.9141K wps
Begin Testing...
[Epoch 48] train avg loss 0.00144022, test acc 0.9333, test avg loss 0.204746, throughput 12.9839K wps
Observed Improvement.
Begin Testing...
[Epoch 49 Batch 30/162] avg loss 0.00151686, throughput 13.2042K wps
[Epoch 49 Batch 60/162] avg loss 0.00144728, throughput 12.8388K wps
[Epoch 49 Batch 90/162] avg loss 0.00135875, throughput 12.9454K wps
[Epoch 49 Batch 120/162] avg loss 0.00141783, throughput 12.9159K wps
[Epoch 49 Batch 150/162] avg loss 0.00160125, throughput 12.9066K wps
Begin Testing...
[Epoch 49] train avg loss 0.00147439, test acc 0.9278, test avg loss 0.198256, throughput 12.9541K wps
[Epoch 50 Batch 30/162] avg loss 0.00147252, throughput 13.1548K wps
[Epoch 50 Batch 60/162] avg loss 0.00144505, throughput 12.7023K wps
[Epoch 50 Batch 90/162] avg loss 0.00143536, throughput 12.8349K wps
[Epoch 50 Batch 120/162] avg loss 0.00140501, throughput 12.8266K wps
[Epoch 50 Batch 150/162] avg loss 0.00142053, throughput 12.7878K wps
Begin Testing...
[Epoch 50] train avg loss 0.00143396, test acc 0.9322, test avg loss 0.197724, throughput 12.8531K wps
[Epoch 51 Batch 30/162] avg loss 0.00136157, throughput 13.1025K wps
[Epoch 51 Batch 60/162] avg loss 0.0013634, throughput 12.6956K wps
[Epoch 51 Batch 90/162] avg loss 0.00140553, throughput 12.8137K wps
[Epoch 51 Batch 120/162] avg loss 0.0013172, throughput 12.8401K wps
[Epoch 51 Batch 150/162] avg loss 0.00118839, throughput 12.9519K wps
Begin Testing...
[Epoch 51] train avg loss 0.0013389, test acc 0.9267, test avg loss 0.202011, throughput 12.8821K wps
[Epoch 52 Batch 30/162] avg loss 0.00116603, throughput 13.1087K wps
[Epoch 52 Batch 60/162] avg loss 0.00136629, throughput 12.814K wps
[Epoch 52 Batch 90/162] avg loss 0.00119531, throughput 12.9864K wps
[Epoch 52 Batch 120/162] avg loss 0.00119328, throughput 13.0109K wps
[Epoch 52 Batch 150/162] avg loss 0.00149753, throughput 13.002K wps
Begin Testing...
[Epoch 52] train avg loss 0.00127095, test acc 0.9300, test avg loss 0.198414, throughput 12.9807K wps
[Epoch 53 Batch 30/162] avg loss 0.00136011, throughput 13.2016K wps
[Epoch 53 Batch 60/162] avg loss 0.00104206, throughput 12.7636K wps
[Epoch 53 Batch 90/162] avg loss 0.00133726, throughput 12.9152K wps
[Epoch 53 Batch 120/162] avg loss 0.00118667, throughput 12.9329K wps
[Epoch 53 Batch 150/162] avg loss 0.00138709, throughput 12.8216K wps
Begin Testing...
[Epoch 53] train avg loss 0.00125684, test acc 0.9311, test avg loss 0.202554, throughput 12.927K wps
[Epoch 54 Batch 30/162] avg loss 0.00116788, throughput 13.1125K wps
[Epoch 54 Batch 60/162] avg loss 0.00121599, throughput 12.8094K wps
[Epoch 54 Batch 90/162] avg loss 0.00123742, throughput 12.9459K wps
[Epoch 54 Batch 120/162] avg loss 0.00123656, throughput 12.946K wps
[Epoch 54 Batch 150/162] avg loss 0.00118796, throughput 12.9579K wps
Begin Testing...
[Epoch 54] train avg loss 0.00120004, test acc 0.9333, test avg loss 0.200044, throughput 12.9516K wps
Observed Improvement.
Begin Testing...
[Epoch 55 Batch 30/162] avg loss 0.00128043, throughput 13.1604K wps
[Epoch 55 Batch 60/162] avg loss 0.0011125, throughput 12.7524K wps
[Epoch 55 Batch 90/162] avg loss 0.00101943, throughput 12.9069K wps
[Epoch 55 Batch 120/162] avg loss 0.00119308, throughput 12.8891K wps
[Epoch 55 Batch 150/162] avg loss 0.00111921, throughput 12.8508K wps
Begin Testing...
[Epoch 55] train avg loss 0.00114662, test acc 0.9322, test avg loss 0.198585, throughput 12.9093K wps
[Epoch 56 Batch 30/162] avg loss 0.000999227, throughput 13.1496K wps
[Epoch 56 Batch 60/162] avg loss 0.00125503, throughput 12.8827K wps
[Epoch 56 Batch 90/162] avg loss 0.0011069, throughput 12.8698K wps
[Epoch 56 Batch 120/162] avg loss 0.00104799, throughput 12.8585K wps
[Epoch 56 Batch 150/162] avg loss 0.0012914, throughput 12.8481K wps
Begin Testing...
[Epoch 56] train avg loss 0.00114276, test acc 0.9311, test avg loss 0.199107, throughput 12.9154K wps
[Epoch 57 Batch 30/162] avg loss 0.00113428, throughput 13.2299K wps
[Epoch 57 Batch 60/162] avg loss 0.00116903, throughput 12.7096K wps
[Epoch 57 Batch 90/162] avg loss 0.00112673, throughput 12.8935K wps
[Epoch 57 Batch 120/162] avg loss 0.00102015, throughput 12.908K wps
[Epoch 57 Batch 150/162] avg loss 0.00108893, throughput 12.9446K wps
Begin Testing...
[Epoch 57] train avg loss 0.00110243, test acc 0.9333, test avg loss 0.199045, throughput 12.9363K wps
Observed Improvement.
Begin Testing...
[Epoch 58 Batch 30/162] avg loss 0.00109292, throughput 13.1756K wps
[Epoch 58 Batch 60/162] avg loss 0.00102914, throughput 12.7598K wps
[Epoch 58 Batch 90/162] avg loss 0.00112214, throughput 12.9473K wps
[Epoch 58 Batch 120/162] avg loss 0.00101939, throughput 12.9727K wps
[Epoch 58 Batch 150/162] avg loss 0.00118979, throughput 12.9885K wps
Begin Testing...
[Epoch 58] train avg loss 0.00110797, test acc 0.9300, test avg loss 0.19965, throughput 12.9689K wps
[Epoch 59 Batch 30/162] avg loss 0.0009824, throughput 13.1321K wps
[Epoch 59 Batch 60/162] avg loss 0.00117416, throughput 12.7964K wps
[Epoch 59 Batch 90/162] avg loss 0.00109098, throughput 12.9311K wps
[Epoch 59 Batch 120/162] avg loss 0.00116722, throughput 12.8248K wps
[Epoch 59 Batch 150/162] avg loss 0.000879585, throughput 12.8582K wps
Begin Testing...
[Epoch 59] train avg loss 0.00105404, test acc 0.9344, test avg loss 0.20041, throughput 12.9046K wps
Observed Improvement.
Begin Testing...
Test loss 0.191088, test acc 0.9260
Total time cost 166.24s
9000 1000
[Epoch 0 Batch 30/162] avg loss 0.0154124, throughput 11.4364K wps
[Epoch 0 Batch 60/162] avg loss 0.0144583, throughput 12.6825K wps
[Epoch 0 Batch 90/162] avg loss 0.013975, throughput 12.8154K wps
[Epoch 0 Batch 120/162] avg loss 0.013514, throughput 12.7655K wps
[Epoch 0 Batch 150/162] avg loss 0.0129058, throughput 12.6825K wps
Begin Testing...
[Epoch 0] train avg loss 0.0139811, test acc 0.7044, test avg loss 0.582906, throughput 12.4907K wps
Observed Improvement.
Begin Testing...
[Epoch 1 Batch 30/162] avg loss 0.012718, throughput 13.207K wps
[Epoch 1 Batch 60/162] avg loss 0.0121836, throughput 12.8051K wps
[Epoch 1 Batch 90/162] avg loss 0.0122058, throughput 12.8381K wps
[Epoch 1 Batch 120/162] avg loss 0.0120963, throughput 12.8792K wps
[Epoch 1 Batch 150/162] avg loss 0.0116751, throughput 12.825K wps
Begin Testing...
[Epoch 1] train avg loss 0.012122, test acc 0.7600, test avg loss 0.535268, throughput 12.8973K wps
Observed Improvement.
Begin Testing...
[Epoch 2 Batch 30/162] avg loss 0.0114958, throughput 13.2767K wps
[Epoch 2 Batch 60/162] avg loss 0.011074, throughput 12.7984K wps
[Epoch 2 Batch 90/162] avg loss 0.0109981, throughput 12.8477K wps
[Epoch 2 Batch 120/162] avg loss 0.0107219, throughput 12.8311K wps
[Epoch 2 Batch 150/162] avg loss 0.0105755, throughput 12.9867K wps
Begin Testing...
[Epoch 2] train avg loss 0.0109254, test acc 0.8022, test avg loss 0.498708, throughput 12.9433K wps
Observed Improvement.
Begin Testing...
[Epoch 3 Batch 30/162] avg loss 0.0104566, throughput 13.256K wps
[Epoch 3 Batch 60/162] avg loss 0.0102154, throughput 12.8124K wps
[Epoch 3 Batch 90/162] avg loss 0.0101055, throughput 12.9108K wps
[Epoch 3 Batch 120/162] avg loss 0.00976297, throughput 12.8411K wps
[Epoch 3 Batch 150/162] avg loss 0.00933808, throughput 12.8115K wps
Begin Testing...
[Epoch 3] train avg loss 0.00995058, test acc 0.8356, test avg loss 0.451832, throughput 12.9117K wps
Observed Improvement.
Begin Testing...
[Epoch 4 Batch 30/162] avg loss 0.00944099, throughput 13.0858K wps
[Epoch 4 Batch 60/162] avg loss 0.00913296, throughput 12.9049K wps
[Epoch 4 Batch 90/162] avg loss 0.00887773, throughput 12.9278K wps
[Epoch 4 Batch 120/162] avg loss 0.00902196, throughput 12.9779K wps
[Epoch 4 Batch 150/162] avg loss 0.00863788, throughput 12.9638K wps
Begin Testing...
[Epoch 4] train avg loss 0.00900352, test acc 0.8611, test avg loss 0.412499, throughput 12.9699K wps
Observed Improvement.
Begin Testing...
[Epoch 5 Batch 30/162] avg loss 0.00813969, throughput 13.2107K wps
[Epoch 5 Batch 60/162] avg loss 0.00837637, throughput 12.7778K wps
[Epoch 5 Batch 90/162] avg loss 0.00784618, throughput 12.9315K wps
[Epoch 5 Batch 120/162] avg loss 0.00801148, throughput 12.9485K wps
[Epoch 5 Batch 150/162] avg loss 0.00785141, throughput 12.8741K wps
Begin Testing...
[Epoch 5] train avg loss 0.00803299, test acc 0.8800, test avg loss 0.375021, throughput 12.9354K wps
Observed Improvement.
Begin Testing...
[Epoch 6 Batch 30/162] avg loss 0.00742477, throughput 13.1788K wps
[Epoch 6 Batch 60/162] avg loss 0.0072805, throughput 12.7634K wps
[Epoch 6 Batch 90/162] avg loss 0.00724345, throughput 12.8295K wps
[Epoch 6 Batch 120/162] avg loss 0.00743816, throughput 12.8679K wps
[Epoch 6 Batch 150/162] avg loss 0.00712403, throughput 12.9167K wps
Begin Testing...
[Epoch 6] train avg loss 0.0072851, test acc 0.8944, test avg loss 0.341956, throughput 12.9099K wps
Observed Improvement.
Begin Testing...
[Epoch 7 Batch 30/162] avg loss 0.00673765, throughput 13.1262K wps
[Epoch 7 Batch 60/162] avg loss 0.00661355, throughput 12.7552K wps
[Epoch 7 Batch 90/162] avg loss 0.00652508, throughput 12.9064K wps
[Epoch 7 Batch 120/162] avg loss 0.00684932, throughput 12.9251K wps
[Epoch 7 Batch 150/162] avg loss 0.00682898, throughput 12.9438K wps
Begin Testing...
[Epoch 7] train avg loss 0.00667621, test acc 0.8978, test avg loss 0.319628, throughput 12.9319K wps
Observed Improvement.
Begin Testing...
[Epoch 8 Batch 30/162] avg loss 0.00627905, throughput 13.2255K wps
[Epoch 8 Batch 60/162] avg loss 0.00619718, throughput 12.7687K wps
[Epoch 8 Batch 90/162] avg loss 0.00629303, throughput 12.8582K wps
[Epoch 8 Batch 120/162] avg loss 0.00621512, throughput 12.8927K wps
[Epoch 8 Batch 150/162] avg loss 0.00612452, throughput 12.9357K wps
Begin Testing...
[Epoch 8] train avg loss 0.00620354, test acc 0.8978, test avg loss 0.301062, throughput 12.9289K wps
Observed Improvement.
Begin Testing...
[Epoch 9 Batch 30/162] avg loss 0.00605736, throughput 13.233K wps
[Epoch 9 Batch 60/162] avg loss 0.00580604, throughput 12.7185K wps
[Epoch 9 Batch 90/162] avg loss 0.00619987, throughput 12.7882K wps
[Epoch 9 Batch 120/162] avg loss 0.00584352, throughput 12.7895K wps
[Epoch 9 Batch 150/162] avg loss 0.00557635, throughput 12.775K wps
Begin Testing...
[Epoch 9] train avg loss 0.00588467, test acc 0.9067, test avg loss 0.283894, throughput 12.8486K wps
Observed Improvement.
Begin Testing...
[Epoch 10 Batch 30/162] avg loss 0.00581648, throughput 13.2473K wps
[Epoch 10 Batch 60/162] avg loss 0.00520015, throughput 12.8107K wps
[Epoch 10 Batch 90/162] avg loss 0.00594127, throughput 12.9108K wps
[Epoch 10 Batch 120/162] avg loss 0.00553871, throughput 12.9391K wps
[Epoch 10 Batch 150/162] avg loss 0.00546701, throughput 12.8885K wps
Begin Testing...
[Epoch 10] train avg loss 0.0055763, test acc 0.9044, test avg loss 0.27265, throughput 12.9517K wps
[Epoch 11 Batch 30/162] avg loss 0.005323, throughput 13.2248K wps
[Epoch 11 Batch 60/162] avg loss 0.00506914, throughput 12.851K wps
[Epoch 11 Batch 90/162] avg loss 0.00492948, throughput 12.8858K wps
[Epoch 11 Batch 120/162] avg loss 0.00560097, throughput 12.9002K wps
[Epoch 11 Batch 150/162] avg loss 0.00551367, throughput 12.8787K wps
Begin Testing...
[Epoch 11] train avg loss 0.00529714, test acc 0.9100, test avg loss 0.263911, throughput 12.9351K wps
Observed Improvement.
Begin Testing...
[Epoch 12 Batch 30/162] avg loss 0.00492719, throughput 13.2072K wps
[Epoch 12 Batch 60/162] avg loss 0.00506281, throughput 12.8566K wps
[Epoch 12 Batch 90/162] avg loss 0.00536453, throughput 12.8642K wps
[Epoch 12 Batch 120/162] avg loss 0.00474933, throughput 12.9298K wps
[Epoch 12 Batch 150/162] avg loss 0.00505316, throughput 12.7728K wps
Begin Testing...
[Epoch 12] train avg loss 0.00502593, test acc 0.9078, test avg loss 0.255571, throughput 12.911K wps
[Epoch 13 Batch 30/162] avg loss 0.00500866, throughput 13.181K wps
[Epoch 13 Batch 60/162] avg loss 0.00491718, throughput 12.7459K wps
[Epoch 13 Batch 90/162] avg loss 0.00474686, throughput 12.9293K wps
[Epoch 13 Batch 120/162] avg loss 0.00487448, throughput 12.8701K wps
[Epoch 13 Batch 150/162] avg loss 0.00490907, throughput 12.8711K wps
Begin Testing...
[Epoch 13] train avg loss 0.00486575, test acc 0.9078, test avg loss 0.25006, throughput 12.9153K wps
[Epoch 14 Batch 30/162] avg loss 0.00456499, throughput 13.0816K wps
[Epoch 14 Batch 60/162] avg loss 0.00493447, throughput 12.8057K wps
[Epoch 14 Batch 90/162] avg loss 0.00456071, throughput 12.7375K wps
[Epoch 14 Batch 120/162] avg loss 0.00422157, throughput 12.7072K wps
[Epoch 14 Batch 150/162] avg loss 0.0042824, throughput 12.8848K wps
Begin Testing...
[Epoch 14] train avg loss 0.00454854, test acc 0.9144, test avg loss 0.242696, throughput 12.8394K wps
Observed Improvement.
Begin Testing...
[Epoch 15 Batch 30/162] avg loss 0.00455523, throughput 13.2392K wps
[Epoch 15 Batch 60/162] avg loss 0.00411217, throughput 12.8076K wps
[Epoch 15 Batch 90/162] avg loss 0.0045128, throughput 12.8096K wps
[Epoch 15 Batch 120/162] avg loss 0.00462408, throughput 12.9203K wps
[Epoch 15 Batch 150/162] avg loss 0.00445115, throughput 12.9108K wps
Begin Testing...
[Epoch 15] train avg loss 0.0044229, test acc 0.9133, test avg loss 0.237293, throughput 12.9343K wps
[Epoch 16 Batch 30/162] avg loss 0.00451498, throughput 13.2119K wps
[Epoch 16 Batch 60/162] avg loss 0.00405962, throughput 12.7693K wps
[Epoch 16 Batch 90/162] avg loss 0.00436664, throughput 12.8162K wps
[Epoch 16 Batch 120/162] avg loss 0.00458533, throughput 12.819K wps
[Epoch 16 Batch 150/162] avg loss 0.00402301, throughput 12.9671K wps
Begin Testing...
[Epoch 16] train avg loss 0.00425863, test acc 0.9111, test avg loss 0.231855, throughput 12.9136K wps
[Epoch 17 Batch 30/162] avg loss 0.00373754, throughput 13.2779K wps
[Epoch 17 Batch 60/162] avg loss 0.00425514, throughput 12.7876K wps
[Epoch 17 Batch 90/162] avg loss 0.00411229, throughput 12.9699K wps
[Epoch 17 Batch 120/162] avg loss 0.00413288, throughput 12.8936K wps
[Epoch 17 Batch 150/162] avg loss 0.0039497, throughput 12.8804K wps
Begin Testing...
[Epoch 17] train avg loss 0.00407906, test acc 0.9200, test avg loss 0.225528, throughput 12.9522K wps
Observed Improvement.
Begin Testing...
[Epoch 18 Batch 30/162] avg loss 0.00403469, throughput 13.1634K wps
[Epoch 18 Batch 60/162] avg loss 0.00409846, throughput 12.8136K wps
[Epoch 18 Batch 90/162] avg loss 0.00393286, throughput 12.9256K wps
[Epoch 18 Batch 120/162] avg loss 0.00386911, throughput 12.9595K wps
[Epoch 18 Batch 150/162] avg loss 0.00408288, throughput 12.9385K wps
Begin Testing...
[Epoch 18] train avg loss 0.00399475, test acc 0.9167, test avg loss 0.221801, throughput 12.9628K wps
[Epoch 19 Batch 30/162] avg loss 0.00391885, throughput 13.1915K wps
[Epoch 19 Batch 60/162] avg loss 0.00378946, throughput 12.7478K wps
[Epoch 19 Batch 90/162] avg loss 0.0043364, throughput 12.8592K wps
[Epoch 19 Batch 120/162] avg loss 0.003669, throughput 12.9168K wps
[Epoch 19 Batch 150/162] avg loss 0.00381725, throughput 12.8879K wps
Begin Testing...
[Epoch 19] train avg loss 0.00389717, test acc 0.9178, test avg loss 0.219941, throughput 12.8995K wps
[Epoch 20 Batch 30/162] avg loss 0.0036591, throughput 13.3036K wps
[Epoch 20 Batch 60/162] avg loss 0.00408274, throughput 12.8783K wps
[Epoch 20 Batch 90/162] avg loss 0.00364305, throughput 12.9291K wps
[Epoch 20 Batch 120/162] avg loss 0.00352528, throughput 12.9409K wps
[Epoch 20 Batch 150/162] avg loss 0.00408446, throughput 12.9696K wps
Begin Testing...
[Epoch 20] train avg loss 0.00378106, test acc 0.9256, test avg loss 0.217819, throughput 12.9996K wps
Observed Improvement.
Begin Testing...
[Epoch 21 Batch 30/162] avg loss 0.00356494, throughput 13.3128K wps
[Epoch 21 Batch 60/162] avg loss 0.0037148, throughput 12.7707K wps
[Epoch 21 Batch 90/162] avg loss 0.00360369, throughput 12.8406K wps
[Epoch 21 Batch 120/162] avg loss 0.00351423, throughput 12.9297K wps
[Epoch 21 Batch 150/162] avg loss 0.00358607, throughput 12.9374K wps
Begin Testing...
[Epoch 21] train avg loss 0.00361953, test acc 0.9256, test avg loss 0.211837, throughput 12.9495K wps
Observed Improvement.
Begin Testing...
[Epoch 22 Batch 30/162] avg loss 0.00366161, throughput 13.1381K wps
[Epoch 22 Batch 60/162] avg loss 0.00329349, throughput 12.8014K wps
[Epoch 22 Batch 90/162] avg loss 0.00351937, throughput 12.9422K wps
[Epoch 22 Batch 120/162] avg loss 0.00389112, throughput 12.9262K wps
[Epoch 22 Batch 150/162] avg loss 0.00335618, throughput 12.8676K wps
Begin Testing...
[Epoch 22] train avg loss 0.00350922, test acc 0.9278, test avg loss 0.210106, throughput 12.9341K wps
Observed Improvement.
Begin Testing...
[Epoch 23 Batch 30/162] avg loss 0.00349177, throughput 13.2091K wps
[Epoch 23 Batch 60/162] avg loss 0.00340724, throughput 12.7659K wps
[Epoch 23 Batch 90/162] avg loss 0.00342344, throughput 12.7955K wps
[Epoch 23 Batch 120/162] avg loss 0.0034234, throughput 12.9101K wps
[Epoch 23 Batch 150/162] avg loss 0.00365495, throughput 12.9275K wps
Begin Testing...
[Epoch 23] train avg loss 0.00348113, test acc 0.9289, test avg loss 0.207376, throughput 12.913K wps
Observed Improvement.
Begin Testing...
[Epoch 24 Batch 30/162] avg loss 0.00326723, throughput 13.1831K wps
[Epoch 24 Batch 60/162] avg loss 0.00304645, throughput 12.8061K wps
[Epoch 24 Batch 90/162] avg loss 0.00314118, throughput 12.8393K wps
[Epoch 24 Batch 120/162] avg loss 0.00303102, throughput 12.8983K wps
[Epoch 24 Batch 150/162] avg loss 0.00364947, throughput 12.9129K wps
Begin Testing...
[Epoch 24] train avg loss 0.00324196, test acc 0.9244, test avg loss 0.205821, throughput 12.92K wps
[Epoch 25 Batch 30/162] avg loss 0.00296198, throughput 13.237K wps
[Epoch 25 Batch 60/162] avg loss 0.00334718, throughput 12.7869K wps
[Epoch 25 Batch 90/162] avg loss 0.00313713, throughput 12.9429K wps
[Epoch 25 Batch 120/162] avg loss 0.00318242, throughput 12.9464K wps
[Epoch 25 Batch 150/162] avg loss 0.0032405, throughput 12.8939K wps
Begin Testing...
[Epoch 25] train avg loss 0.00315219, test acc 0.9256, test avg loss 0.20143, throughput 12.9511K wps
[Epoch 26 Batch 30/162] avg loss 0.00280696, throughput 13.2429K wps
[Epoch 26 Batch 60/162] avg loss 0.00311104, throughput 12.8117K wps
[Epoch 26 Batch 90/162] avg loss 0.00327772, throughput 12.9783K wps
[Epoch 26 Batch 120/162] avg loss 0.00321617, throughput 12.9613K wps
[Epoch 26 Batch 150/162] avg loss 0.00304127, throughput 12.8572K wps
Begin Testing...
[Epoch 26] train avg loss 0.00307284, test acc 0.9267, test avg loss 0.199163, throughput 12.9706K wps
[Epoch 27 Batch 30/162] avg loss 0.00289696, throughput 13.187K wps
[Epoch 27 Batch 60/162] avg loss 0.00304004, throughput 12.7983K wps
[Epoch 27 Batch 90/162] avg loss 0.0028331, throughput 12.8762K wps
[Epoch 27 Batch 120/162] avg loss 0.00307786, throughput 12.8588K wps
[Epoch 27 Batch 150/162] avg loss 0.00303711, throughput 12.7465K wps
Begin Testing...
[Epoch 27] train avg loss 0.00294209, test acc 0.9267, test avg loss 0.198607, throughput 12.8896K wps
[Epoch 28 Batch 30/162] avg loss 0.00268706, throughput 13.1712K wps
[Epoch 28 Batch 60/162] avg loss 0.0028204, throughput 12.7986K wps
[Epoch 28 Batch 90/162] avg loss 0.00276921, throughput 12.9278K wps
[Epoch 28 Batch 120/162] avg loss 0.00291788, throughput 12.867K wps
[Epoch 28 Batch 150/162] avg loss 0.00302299, throughput 12.8023K wps
Begin Testing...
[Epoch 28] train avg loss 0.00284704, test acc 0.9278, test avg loss 0.197589, throughput 12.9029K wps
[Epoch 29 Batch 30/162] avg loss 0.00292313, throughput 13.2033K wps
[Epoch 29 Batch 60/162] avg loss 0.00298793, throughput 12.8219K wps
[Epoch 29 Batch 90/162] avg loss 0.0024871, throughput 12.9129K wps
[Epoch 29 Batch 120/162] avg loss 0.00303846, throughput 12.7982K wps
[Epoch 29 Batch 150/162] avg loss 0.00270256, throughput 12.8108K wps
Begin Testing...
[Epoch 29] train avg loss 0.00279268, test acc 0.9322, test avg loss 0.19655, throughput 12.8945K wps
Observed Improvement.
Begin Testing...
[Epoch 30 Batch 30/162] avg loss 0.00253485, throughput 13.2406K wps
[Epoch 30 Batch 60/162] avg loss 0.0027805, throughput 12.8352K wps
[Epoch 30 Batch 90/162] avg loss 0.00296212, throughput 12.9894K wps
[Epoch 30 Batch 120/162] avg loss 0.00277444, throughput 12.9639K wps
[Epoch 30 Batch 150/162] avg loss 0.00275938, throughput 12.7818K wps
Begin Testing...
[Epoch 30] train avg loss 0.00273275, test acc 0.9300, test avg loss 0.193451, throughput 12.9581K wps
[Epoch 31 Batch 30/162] avg loss 0.00257005, throughput 13.1531K wps
[Epoch 31 Batch 60/162] avg loss 0.00265951, throughput 12.8378K wps
[Epoch 31 Batch 90/162] avg loss 0.00259486, throughput 12.9475K wps
[Epoch 31 Batch 120/162] avg loss 0.00261716, throughput 12.8384K wps
[Epoch 31 Batch 150/162] avg loss 0.00275721, throughput 12.893K wps
Begin Testing...
[Epoch 31] train avg loss 0.00263815, test acc 0.9344, test avg loss 0.190207, throughput 12.9395K wps
Observed Improvement.
Begin Testing...
[Epoch 32 Batch 30/162] avg loss 0.0026457, throughput 13.2103K wps
[Epoch 32 Batch 60/162] avg loss 0.00271003, throughput 12.7689K wps
[Epoch 32 Batch 90/162] avg loss 0.00263716, throughput 12.9145K wps
[Epoch 32 Batch 120/162] avg loss 0.00264217, throughput 12.829K wps
[Epoch 32 Batch 150/162] avg loss 0.00233483, throughput 12.8471K wps
Begin Testing...
[Epoch 32] train avg loss 0.00256866, test acc 0.9356, test avg loss 0.188998, throughput 12.9095K wps
Observed Improvement.
Begin Testing...
[Epoch 33 Batch 30/162] avg loss 0.00217452, throughput 13.2498K wps
[Epoch 33 Batch 60/162] avg loss 0.00242158, throughput 12.8238K wps
[Epoch 33 Batch 90/162] avg loss 0.00253199, throughput 12.9167K wps
[Epoch 33 Batch 120/162] avg loss 0.00252115, throughput 12.9425K wps
[Epoch 33 Batch 150/162] avg loss 0.00259342, throughput 12.8693K wps
Begin Testing...
[Epoch 33] train avg loss 0.00246145, test acc 0.9311, test avg loss 0.19072, throughput 12.9533K wps
[Epoch 34 Batch 30/162] avg loss 0.00247446, throughput 13.153K wps
[Epoch 34 Batch 60/162] avg loss 0.00232555, throughput 12.8128K wps
[Epoch 34 Batch 90/162] avg loss 0.00238663, throughput 12.9302K wps
[Epoch 34 Batch 120/162] avg loss 0.00228899, throughput 12.9469K wps
[Epoch 34 Batch 150/162] avg loss 0.00239036, throughput 12.9324K wps
Begin Testing...
[Epoch 34] train avg loss 0.00241033, test acc 0.9311, test avg loss 0.187055, throughput 12.9502K wps
[Epoch 35 Batch 30/162] avg loss 0.00232352, throughput 13.1614K wps
[Epoch 35 Batch 60/162] avg loss 0.00213969, throughput 12.8848K wps
[Epoch 35 Batch 90/162] avg loss 0.00242195, throughput 12.8946K wps
[Epoch 35 Batch 120/162] avg loss 0.00227907, throughput 12.8865K wps
[Epoch 35 Batch 150/162] avg loss 0.00224904, throughput 12.8427K wps
Begin Testing...
[Epoch 35] train avg loss 0.00227381, test acc 0.9322, test avg loss 0.186363, throughput 12.927K wps
[Epoch 36 Batch 30/162] avg loss 0.00239099, throughput 13.0807K wps
[Epoch 36 Batch 60/162] avg loss 0.002131, throughput 12.6533K wps
[Epoch 36 Batch 90/162] avg loss 0.00227417, throughput 12.8616K wps
[Epoch 36 Batch 120/162] avg loss 0.00217614, throughput 12.8495K wps
[Epoch 36 Batch 150/162] avg loss 0.00218816, throughput 12.9152K wps
Begin Testing...
[Epoch 36] train avg loss 0.00223353, test acc 0.9333, test avg loss 0.186881, throughput 12.8705K wps
[Epoch 37 Batch 30/162] avg loss 0.0021741, throughput 13.1444K wps
[Epoch 37 Batch 60/162] avg loss 0.00202013, throughput 12.7047K wps
[Epoch 37 Batch 90/162] avg loss 0.00228219, throughput 12.8266K wps
[Epoch 37 Batch 120/162] avg loss 0.00234206, throughput 12.8154K wps
[Epoch 37 Batch 150/162] avg loss 0.00197243, throughput 12.9048K wps
Begin Testing...
[Epoch 37] train avg loss 0.00217667, test acc 0.9289, test avg loss 0.193015, throughput 12.8831K wps
[Epoch 38 Batch 30/162] avg loss 0.00207762, throughput 13.2276K wps
[Epoch 38 Batch 60/162] avg loss 0.00188452, throughput 12.8283K wps
[Epoch 38 Batch 90/162] avg loss 0.00205697, throughput 12.954K wps
[Epoch 38 Batch 120/162] avg loss 0.00226737, throughput 12.8076K wps
[Epoch 38 Batch 150/162] avg loss 0.00211643, throughput 12.8481K wps
Begin Testing...
[Epoch 38] train avg loss 0.00206424, test acc 0.9356, test avg loss 0.182838, throughput 12.9296K wps
Observed Improvement.
Begin Testing...
[Epoch 39 Batch 30/162] avg loss 0.001866, throughput 13.2608K wps
[Epoch 39 Batch 60/162] avg loss 0.00182284, throughput 12.8285K wps
[Epoch 39 Batch 90/162] avg loss 0.00222442, throughput 12.9264K wps
[Epoch 39 Batch 120/162] avg loss 0.00218069, throughput 12.9553K wps
[Epoch 39 Batch 150/162] avg loss 0.00226755, throughput 12.932K wps
Begin Testing...
[Epoch 39] train avg loss 0.002064, test acc 0.9300, test avg loss 0.188412, throughput 12.9595K wps
[Epoch 40 Batch 30/162] avg loss 0.00203668, throughput 13.2432K wps
[Epoch 40 Batch 60/162] avg loss 0.00207954, throughput 12.8514K wps
[Epoch 40 Batch 90/162] avg loss 0.0019544, throughput 12.9197K wps
[Epoch 40 Batch 120/162] avg loss 0.00182716, throughput 12.9293K wps
[Epoch 40 Batch 150/162] avg loss 0.0021082, throughput 12.9441K wps
Begin Testing...
[Epoch 40] train avg loss 0.00199642, test acc 0.9333, test avg loss 0.18096, throughput 12.9664K wps
[Epoch 41 Batch 30/162] avg loss 0.00200704, throughput 13.1282K wps
[Epoch 41 Batch 60/162] avg loss 0.00202306, throughput 12.7025K wps
[Epoch 41 Batch 90/162] avg loss 0.00186586, throughput 12.8139K wps
[Epoch 41 Batch 120/162] avg loss 0.00198555, throughput 12.919K wps
[Epoch 41 Batch 150/162] avg loss 0.0016887, throughput 12.8274K wps
Begin Testing...
[Epoch 41] train avg loss 0.00192175, test acc 0.9300, test avg loss 0.189206, throughput 12.8751K wps
[Epoch 42 Batch 30/162] avg loss 0.00186812, throughput 13.248K wps
[Epoch 42 Batch 60/162] avg loss 0.00193672, throughput 12.7843K wps
[Epoch 42 Batch 90/162] avg loss 0.00162406, throughput 12.7823K wps
[Epoch 42 Batch 120/162] avg loss 0.00197133, throughput 12.9323K wps
[Epoch 42 Batch 150/162] avg loss 0.00167214, throughput 12.9581K wps
Begin Testing...
[Epoch 42] train avg loss 0.00182322, test acc 0.9367, test avg loss 0.180101, throughput 12.9368K wps
Observed Improvement.
Begin Testing...
[Epoch 43 Batch 30/162] avg loss 0.00171356, throughput 13.2425K wps
[Epoch 43 Batch 60/162] avg loss 0.00177655, throughput 12.7758K wps
[Epoch 43 Batch 90/162] avg loss 0.00179822, throughput 12.7622K wps
[Epoch 43 Batch 120/162] avg loss 0.00173989, throughput 12.953K wps
[Epoch 43 Batch 150/162] avg loss 0.00158384, throughput 12.9513K wps
Begin Testing...
[Epoch 43] train avg loss 0.00175, test acc 0.9356, test avg loss 0.178574, throughput 12.934K wps
[Epoch 44 Batch 30/162] avg loss 0.00166479, throughput 13.0449K wps
[Epoch 44 Batch 60/162] avg loss 0.00154581, throughput 12.7765K wps
[Epoch 44 Batch 90/162] avg loss 0.00171931, throughput 12.8503K wps
[Epoch 44 Batch 120/162] avg loss 0.0019975, throughput 12.845K wps
[Epoch 44 Batch 150/162] avg loss 0.00175151, throughput 12.8424K wps
Begin Testing...
[Epoch 44] train avg loss 0.00174377, test acc 0.9367, test avg loss 0.178229, throughput 12.8671K wps
Observed Improvement.
Begin Testing...
[Epoch 45 Batch 30/162] avg loss 0.00146867, throughput 13.3259K wps
[Epoch 45 Batch 60/162] avg loss 0.00151006, throughput 12.8335K wps
[Epoch 45 Batch 90/162] avg loss 0.00174439, throughput 12.8889K wps
[Epoch 45 Batch 120/162] avg loss 0.00177672, throughput 12.8723K wps
[Epoch 45 Batch 150/162] avg loss 0.00179288, throughput 12.9123K wps
Begin Testing...
[Epoch 45] train avg loss 0.00168638, test acc 0.9322, test avg loss 0.183086, throughput 12.9663K wps
[Epoch 46 Batch 30/162] avg loss 0.00146003, throughput 13.2776K wps
[Epoch 46 Batch 60/162] avg loss 0.00158346, throughput 12.8445K wps
[Epoch 46 Batch 90/162] avg loss 0.00167015, throughput 12.9792K wps
[Epoch 46 Batch 120/162] avg loss 0.00156763, throughput 12.9784K wps
[Epoch 46 Batch 150/162] avg loss 0.00163541, throughput 12.9804K wps
Begin Testing...
[Epoch 46] train avg loss 0.00159548, test acc 0.9367, test avg loss 0.177103, throughput 13.0066K wps
Observed Improvement.
Begin Testing...
[Epoch 47 Batch 30/162] avg loss 0.00156376, throughput 13.305K wps
[Epoch 47 Batch 60/162] avg loss 0.00138612, throughput 12.8713K wps
[Epoch 47 Batch 90/162] avg loss 0.00172607, throughput 12.9193K wps
[Epoch 47 Batch 120/162] avg loss 0.00171183, throughput 12.8394K wps
[Epoch 47 Batch 150/162] avg loss 0.00157747, throughput 12.9748K wps
Begin Testing...
[Epoch 47] train avg loss 0.00159111, test acc 0.9367, test avg loss 0.176579, throughput 12.9763K wps
Observed Improvement.
Begin Testing...
[Epoch 48 Batch 30/162] avg loss 0.00149089, throughput 13.1472K wps
[Epoch 48 Batch 60/162] avg loss 0.00156911, throughput 12.8166K wps
[Epoch 48 Batch 90/162] avg loss 0.00139566, throughput 12.9233K wps
[Epoch 48 Batch 120/162] avg loss 0.00145726, throughput 12.9507K wps
[Epoch 48 Batch 150/162] avg loss 0.00180333, throughput 12.9029K wps
Begin Testing...
[Epoch 48] train avg loss 0.0015325, test acc 0.9389, test avg loss 0.174505, throughput 12.9436K wps
Observed Improvement.
Begin Testing...
[Epoch 49 Batch 30/162] avg loss 0.00138575, throughput 13.2529K wps
[Epoch 49 Batch 60/162] avg loss 0.00153231, throughput 12.693K wps
[Epoch 49 Batch 90/162] avg loss 0.00136665, throughput 12.6952K wps
[Epoch 49 Batch 120/162] avg loss 0.00156238, throughput 12.8963K wps
[Epoch 49 Batch 150/162] avg loss 0.00141639, throughput 12.9278K wps
Begin Testing...
[Epoch 49] train avg loss 0.00148466, test acc 0.9356, test avg loss 0.182979, throughput 12.8911K wps
[Epoch 50 Batch 30/162] avg loss 0.00148343, throughput 13.2122K wps
[Epoch 50 Batch 60/162] avg loss 0.00133362, throughput 12.7236K wps
[Epoch 50 Batch 90/162] avg loss 0.00152342, throughput 12.9316K wps
[Epoch 50 Batch 120/162] avg loss 0.00151049, throughput 12.9376K wps
[Epoch 50 Batch 150/162] avg loss 0.00133714, throughput 12.7873K wps
Begin Testing...
[Epoch 50] train avg loss 0.00144159, test acc 0.9344, test avg loss 0.17817, throughput 12.9188K wps
[Epoch 51 Batch 30/162] avg loss 0.00150321, throughput 13.1343K wps
[Epoch 51 Batch 60/162] avg loss 0.0013233, throughput 12.7821K wps
[Epoch 51 Batch 90/162] avg loss 0.00145713, throughput 12.9135K wps
[Epoch 51 Batch 120/162] avg loss 0.0014767, throughput 12.9053K wps
[Epoch 51 Batch 150/162] avg loss 0.00133852, throughput 12.9267K wps
Begin Testing...
[Epoch 51] train avg loss 0.00141768, test acc 0.9356, test avg loss 0.174112, throughput 12.9277K wps
[Epoch 52 Batch 30/162] avg loss 0.00120728, throughput 13.2471K wps
[Epoch 52 Batch 60/162] avg loss 0.00137455, throughput 12.7925K wps
[Epoch 52 Batch 90/162] avg loss 0.00137649, throughput 12.7107K wps
[Epoch 52 Batch 120/162] avg loss 0.00130152, throughput 12.8158K wps
[Epoch 52 Batch 150/162] avg loss 0.00124294, throughput 12.8277K wps
Begin Testing...
[Epoch 52] train avg loss 0.00132588, test acc 0.9422, test avg loss 0.171871, throughput 12.8734K wps
Observed Improvement.
Begin Testing...
[Epoch 53 Batch 30/162] avg loss 0.00128936, throughput 13.1778K wps
[Epoch 53 Batch 60/162] avg loss 0.00135402, throughput 12.85K wps
[Epoch 53 Batch 90/162] avg loss 0.00150549, throughput 12.8945K wps
[Epoch 53 Batch 120/162] avg loss 0.00114607, throughput 12.8799K wps
[Epoch 53 Batch 150/162] avg loss 0.00145232, throughput 12.9876K wps
Begin Testing...
[Epoch 53] train avg loss 0.00135172, test acc 0.9400, test avg loss 0.174087, throughput 12.9566K wps
[Epoch 54 Batch 30/162] avg loss 0.00121463, throughput 13.2948K wps
[Epoch 54 Batch 60/162] avg loss 0.00119, throughput 12.7928K wps
[Epoch 54 Batch 90/162] avg loss 0.00112901, throughput 12.9576K wps
[Epoch 54 Batch 120/162] avg loss 0.00128594, throughput 12.9846K wps
[Epoch 54 Batch 150/162] avg loss 0.00127843, throughput 12.9599K wps
Begin Testing...
[Epoch 54] train avg loss 0.00124604, test acc 0.9367, test avg loss 0.176768, throughput 12.9943K wps
[Epoch 55 Batch 30/162] avg loss 0.00111534, throughput 13.2863K wps
[Epoch 55 Batch 60/162] avg loss 0.00106857, throughput 12.7789K wps
[Epoch 55 Batch 90/162] avg loss 0.00135343, throughput 12.7934K wps
[Epoch 55 Batch 120/162] avg loss 0.00129431, throughput 12.8942K wps
[Epoch 55 Batch 150/162] avg loss 0.00122653, throughput 12.9428K wps
Begin Testing...
[Epoch 55] train avg loss 0.00121957, test acc 0.9411, test avg loss 0.174642, throughput 12.9359K wps
[Epoch 56 Batch 30/162] avg loss 0.00104784, throughput 13.2981K wps
[Epoch 56 Batch 60/162] avg loss 0.00124367, throughput 12.7623K wps
[Epoch 56 Batch 90/162] avg loss 0.00120021, throughput 12.8779K wps
[Epoch 56 Batch 120/162] avg loss 0.00102788, throughput 12.7826K wps
[Epoch 56 Batch 150/162] avg loss 0.00131849, throughput 12.816K wps
Begin Testing...
[Epoch 56] train avg loss 0.0011642, test acc 0.9378, test avg loss 0.171918, throughput 12.9045K wps
[Epoch 57 Batch 30/162] avg loss 0.00106245, throughput 13.2152K wps
[Epoch 57 Batch 60/162] avg loss 0.00104331, throughput 12.7223K wps
[Epoch 57 Batch 90/162] avg loss 0.00110307, throughput 12.9159K wps
[Epoch 57 Batch 120/162] avg loss 0.00120931, throughput 12.8368K wps
[Epoch 57 Batch 150/162] avg loss 0.00128864, throughput 12.8454K wps
Begin Testing...
[Epoch 57] train avg loss 0.00115621, test acc 0.9378, test avg loss 0.172679, throughput 12.9023K wps
[Epoch 58 Batch 30/162] avg loss 0.00129978, throughput 13.1368K wps
[Epoch 58 Batch 60/162] avg loss 0.000978952, throughput 12.8134K wps
[Epoch 58 Batch 90/162] avg loss 0.00105669, throughput 12.9828K wps
[Epoch 58 Batch 120/162] avg loss 0.00101782, throughput 12.9401K wps
[Epoch 58 Batch 150/162] avg loss 0.000995074, throughput 12.9736K wps
Begin Testing...
[Epoch 58] train avg loss 0.00107456, test acc 0.9378, test avg loss 0.1739, throughput 12.9638K wps
[Epoch 59 Batch 30/162] avg loss 0.00107556, throughput 13.2083K wps
[Epoch 59 Batch 60/162] avg loss 0.00116163, throughput 12.7829K wps
[Epoch 59 Batch 90/162] avg loss 0.00113906, throughput 12.9031K wps
[Epoch 59 Batch 120/162] avg loss 0.000935151, throughput 12.9427K wps
[Epoch 59 Batch 150/162] avg loss 0.00116168, throughput 12.8222K wps
Begin Testing...
[Epoch 59] train avg loss 0.00107473, test acc 0.9378, test avg loss 0.175147, throughput 12.9313K wps
Test loss 0.182272, test acc 0.9230
Total time cost 165.95s
9000 1000
[Epoch 0 Batch 30/162] avg loss 0.0147894, throughput 11.3768K wps
[Epoch 0 Batch 60/162] avg loss 0.0145386, throughput 12.6915K wps
[Epoch 0 Batch 90/162] avg loss 0.0141428, throughput 12.8321K wps
[Epoch 0 Batch 120/162] avg loss 0.0133375, throughput 12.8739K wps
[Epoch 0 Batch 150/162] avg loss 0.0132325, throughput 12.8849K wps
Begin Testing...
[Epoch 0] train avg loss 0.0139316, test acc 0.6700, test avg loss 0.599284, throughput 12.514K wps
Observed Improvement.
Begin Testing...
[Epoch 1 Batch 30/162] avg loss 0.0127048, throughput 13.1792K wps
[Epoch 1 Batch 60/162] avg loss 0.0125815, throughput 12.7209K wps
[Epoch 1 Batch 90/162] avg loss 0.0119799, throughput 12.8039K wps
[Epoch 1 Batch 120/162] avg loss 0.0121128, throughput 12.8229K wps
[Epoch 1 Batch 150/162] avg loss 0.0117425, throughput 12.84K wps
Begin Testing...
[Epoch 1] train avg loss 0.0121902, test acc 0.7656, test avg loss 0.548155, throughput 12.8702K wps
Observed Improvement.
Begin Testing...
[Epoch 2 Batch 30/162] avg loss 0.0113661, throughput 13.1265K wps
[Epoch 2 Batch 60/162] avg loss 0.0110451, throughput 12.8428K wps
[Epoch 2 Batch 90/162] avg loss 0.0111549, throughput 12.8503K wps
[Epoch 2 Batch 120/162] avg loss 0.0106219, throughput 12.7882K wps
[Epoch 2 Batch 150/162] avg loss 0.0104499, throughput 12.7921K wps
Begin Testing...
[Epoch 2] train avg loss 0.0108703, test acc 0.8078, test avg loss 0.506892, throughput 12.8714K wps
Observed Improvement.
Begin Testing...
[Epoch 3 Batch 30/162] avg loss 0.0100313, throughput 13.2472K wps
[Epoch 3 Batch 60/162] avg loss 0.0101017, throughput 12.7698K wps
[Epoch 3 Batch 90/162] avg loss 0.00965496, throughput 12.9069K wps
[Epoch 3 Batch 120/162] avg loss 0.00964892, throughput 12.8163K wps
[Epoch 3 Batch 150/162] avg loss 0.00979167, throughput 12.8865K wps
Begin Testing...
[Epoch 3] train avg loss 0.00981742, test acc 0.8011, test avg loss 0.480311, throughput 12.9216K wps
[Epoch 4 Batch 30/162] avg loss 0.00928901, throughput 13.2669K wps
[Epoch 4 Batch 60/162] avg loss 0.00930165, throughput 12.8808K wps
[Epoch 4 Batch 90/162] avg loss 0.00897835, throughput 12.9687K wps
[Epoch 4 Batch 120/162] avg loss 0.00837207, throughput 12.849K wps
[Epoch 4 Batch 150/162] avg loss 0.00872198, throughput 12.8319K wps
Begin Testing...
[Epoch 4] train avg loss 0.00888657, test acc 0.8633, test avg loss 0.428135, throughput 12.9517K wps
Observed Improvement.
Begin Testing...
[Epoch 5 Batch 30/162] avg loss 0.00832889, throughput 13.0908K wps
[Epoch 5 Batch 60/162] avg loss 0.00802155, throughput 12.8677K wps
[Epoch 5 Batch 90/162] avg loss 0.00804988, throughput 12.9218K wps
[Epoch 5 Batch 120/162] avg loss 0.00768564, throughput 12.8989K wps
[Epoch 5 Batch 150/162] avg loss 0.00807934, throughput 12.8858K wps
Begin Testing...
[Epoch 5] train avg loss 0.00799825, test acc 0.8744, test avg loss 0.394357, throughput 12.9307K wps
Observed Improvement.
Begin Testing...
[Epoch 6 Batch 30/162] avg loss 0.00758611, throughput 13.1061K wps
[Epoch 6 Batch 60/162] avg loss 0.00754877, throughput 12.654K wps
[Epoch 6 Batch 90/162] avg loss 0.00739994, throughput 12.8717K wps
[Epoch 6 Batch 120/162] avg loss 0.00709879, throughput 12.836K wps
[Epoch 6 Batch 150/162] avg loss 0.00713771, throughput 12.9331K wps
Begin Testing...
[Epoch 6] train avg loss 0.00731652, test acc 0.8789, test avg loss 0.365889, throughput 12.8802K wps
Observed Improvement.
Begin Testing...
[Epoch 7 Batch 30/162] avg loss 0.00689232, throughput 13.0898K wps
[Epoch 7 Batch 60/162] avg loss 0.00701886, throughput 12.8919K wps
[Epoch 7 Batch 90/162] avg loss 0.00684973, throughput 12.9201K wps
[Epoch 7 Batch 120/162] avg loss 0.00679455, throughput 12.9403K wps
[Epoch 7 Batch 150/162] avg loss 0.00630942, throughput 12.9212K wps
Begin Testing...
[Epoch 7] train avg loss 0.00677067, test acc 0.8689, test avg loss 0.347001, throughput 12.9543K wps
[Epoch 8 Batch 30/162] avg loss 0.00624485, throughput 13.1456K wps
[Epoch 8 Batch 60/162] avg loss 0.006309, throughput 12.7821K wps
[Epoch 8 Batch 90/162] avg loss 0.00647443, throughput 12.8333K wps
[Epoch 8 Batch 120/162] avg loss 0.00602939, throughput 12.8057K wps
[Epoch 8 Batch 150/162] avg loss 0.00601579, throughput 12.864K wps
Begin Testing...
[Epoch 8] train avg loss 0.00622616, test acc 0.8822, test avg loss 0.327074, throughput 12.8782K wps
Observed Improvement.
Begin Testing...
[Epoch 9 Batch 30/162] avg loss 0.00610792, throughput 12.9203K wps
[Epoch 9 Batch 60/162] avg loss 0.00574064, throughput 12.7442K wps
[Epoch 9 Batch 90/162] avg loss 0.0059141, throughput 12.863K wps
[Epoch 9 Batch 120/162] avg loss 0.00565824, throughput 12.8215K wps
[Epoch 9 Batch 150/162] avg loss 0.00556383, throughput 12.8431K wps
Begin Testing...
[Epoch 9] train avg loss 0.00580045, test acc 0.8800, test avg loss 0.310755, throughput 12.8396K wps
[Epoch 10 Batch 30/162] avg loss 0.00540622, throughput 13.1073K wps
[Epoch 10 Batch 60/162] avg loss 0.00558259, throughput 12.7238K wps
[Epoch 10 Batch 90/162] avg loss 0.00534175, throughput 12.9214K wps
[Epoch 10 Batch 120/162] avg loss 0.00580873, throughput 12.9416K wps
[Epoch 10 Batch 150/162] avg loss 0.00533329, throughput 12.8483K wps
Begin Testing...
[Epoch 10] train avg loss 0.00549328, test acc 0.8778, test avg loss 0.300897, throughput 12.9037K wps
[Epoch 11 Batch 30/162] avg loss 0.00491087, throughput 13.2129K wps
[Epoch 11 Batch 60/162] avg loss 0.00557605, throughput 12.8208K wps
[Epoch 11 Batch 90/162] avg loss 0.00520117, throughput 12.9454K wps
[Epoch 11 Batch 120/162] avg loss 0.00503171, throughput 12.9274K wps
[Epoch 11 Batch 150/162] avg loss 0.00517502, throughput 12.795K wps
Begin Testing...
[Epoch 11] train avg loss 0.00520839, test acc 0.8833, test avg loss 0.292783, throughput 12.9247K wps
Observed Improvement.
Begin Testing...
[Epoch 12 Batch 30/162] avg loss 0.0050169, throughput 13.2626K wps
[Epoch 12 Batch 60/162] avg loss 0.00481419, throughput 12.7933K wps
[Epoch 12 Batch 90/162] avg loss 0.0049543, throughput 12.828K wps
[Epoch 12 Batch 120/162] avg loss 0.00522858, throughput 12.7698K wps
[Epoch 12 Batch 150/162] avg loss 0.00493577, throughput 12.8789K wps
Begin Testing...
[Epoch 12] train avg loss 0.00499348, test acc 0.8822, test avg loss 0.285904, throughput 12.8897K wps
[Epoch 13 Batch 30/162] avg loss 0.00506319, throughput 13.1911K wps
[Epoch 13 Batch 60/162] avg loss 0.00485752, throughput 12.7664K wps
[Epoch 13 Batch 90/162] avg loss 0.00460425, throughput 12.8276K wps
[Epoch 13 Batch 120/162] avg loss 0.00451365, throughput 12.8951K wps
[Epoch 13 Batch 150/162] avg loss 0.00507895, throughput 12.7412K wps
Begin Testing...
[Epoch 13] train avg loss 0.00482706, test acc 0.8833, test avg loss 0.278535, throughput 12.8827K wps
Observed Improvement.
Begin Testing...
[Epoch 14 Batch 30/162] avg loss 0.00476101, throughput 13.2087K wps
[Epoch 14 Batch 60/162] avg loss 0.00431883, throughput 12.6985K wps
[Epoch 14 Batch 90/162] avg loss 0.00438749, throughput 12.7527K wps
[Epoch 14 Batch 120/162] avg loss 0.00466911, throughput 12.8399K wps
[Epoch 14 Batch 150/162] avg loss 0.00485171, throughput 12.8268K wps
Begin Testing...
[Epoch 14] train avg loss 0.00459838, test acc 0.8878, test avg loss 0.271725, throughput 12.854K wps
Observed Improvement.
Begin Testing...
[Epoch 15 Batch 30/162] avg loss 0.00440878, throughput 13.2864K wps
[Epoch 15 Batch 60/162] avg loss 0.00458891, throughput 12.7643K wps
[Epoch 15 Batch 90/162] avg loss 0.00429244, throughput 12.9159K wps
[Epoch 15 Batch 120/162] avg loss 0.0046041, throughput 12.9326K wps
[Epoch 15 Batch 150/162] avg loss 0.00450126, throughput 12.7775K wps
Begin Testing...
[Epoch 15] train avg loss 0.00447549, test acc 0.8889, test avg loss 0.266949, throughput 12.9313K wps
Observed Improvement.
Begin Testing...
[Epoch 16 Batch 30/162] avg loss 0.00419989, throughput 13.2603K wps
[Epoch 16 Batch 60/162] avg loss 0.00411221, throughput 12.8052K wps
[Epoch 16 Batch 90/162] avg loss 0.00424415, throughput 12.9644K wps
[Epoch 16 Batch 120/162] avg loss 0.00442564, throughput 12.9546K wps
[Epoch 16 Batch 150/162] avg loss 0.00418791, throughput 12.8574K wps
Begin Testing...
[Epoch 16] train avg loss 0.00423601, test acc 0.8989, test avg loss 0.263803, throughput 12.9637K wps
Observed Improvement.
Begin Testing...
[Epoch 17 Batch 30/162] avg loss 0.00407149, throughput 13.226K wps
[Epoch 17 Batch 60/162] avg loss 0.00438714, throughput 12.7138K wps
[Epoch 17 Batch 90/162] avg loss 0.00400908, throughput 12.8461K wps
[Epoch 17 Batch 120/162] avg loss 0.00385795, throughput 12.8792K wps
[Epoch 17 Batch 150/162] avg loss 0.00424537, throughput 12.8376K wps
Begin Testing...
[Epoch 17] train avg loss 0.00411659, test acc 0.8989, test avg loss 0.260709, throughput 12.897K wps
Observed Improvement.
Begin Testing...
[Epoch 18 Batch 30/162] avg loss 0.00390065, throughput 12.9747K wps
[Epoch 18 Batch 60/162] avg loss 0.00377163, throughput 12.7403K wps
[Epoch 18 Batch 90/162] avg loss 0.00407277, throughput 12.8986K wps
[Epoch 18 Batch 120/162] avg loss 0.00375149, throughput 12.9438K wps
[Epoch 18 Batch 150/162] avg loss 0.00389829, throughput 12.9352K wps
Begin Testing...
[Epoch 18] train avg loss 0.00388189, test acc 0.8978, test avg loss 0.254669, throughput 12.9007K wps
[Epoch 19 Batch 30/162] avg loss 0.0038475, throughput 13.1603K wps
[Epoch 19 Batch 60/162] avg loss 0.00394879, throughput 12.8305K wps
[Epoch 19 Batch 90/162] avg loss 0.00380953, throughput 12.9604K wps
[Epoch 19 Batch 120/162] avg loss 0.00393252, throughput 12.9813K wps
[Epoch 19 Batch 150/162] avg loss 0.00389858, throughput 12.969K wps
Begin Testing...
[Epoch 19] train avg loss 0.00387899, test acc 0.9011, test avg loss 0.253479, throughput 12.9796K wps
Observed Improvement.
Begin Testing...
[Epoch 20 Batch 30/162] avg loss 0.00341074, throughput 13.2785K wps
[Epoch 20 Batch 60/162] avg loss 0.00394659, throughput 12.8421K wps
[Epoch 20 Batch 90/162] avg loss 0.00376869, throughput 12.9949K wps
[Epoch 20 Batch 120/162] avg loss 0.00411518, throughput 12.8717K wps
[Epoch 20 Batch 150/162] avg loss 0.00339335, throughput 12.9928K wps
Begin Testing...
[Epoch 20] train avg loss 0.00372282, test acc 0.8867, test avg loss 0.255464, throughput 12.9811K wps
[Epoch 21 Batch 30/162] avg loss 0.00399687, throughput 13.1533K wps
[Epoch 21 Batch 60/162] avg loss 0.00354432, throughput 12.8619K wps
[Epoch 21 Batch 90/162] avg loss 0.00360915, throughput 12.9533K wps
[Epoch 21 Batch 120/162] avg loss 0.00349049, throughput 12.9258K wps
[Epoch 21 Batch 150/162] avg loss 0.00333684, throughput 12.967K wps
Begin Testing...
[Epoch 21] train avg loss 0.00360518, test acc 0.8922, test avg loss 0.248873, throughput 12.9727K wps
[Epoch 22 Batch 30/162] avg loss 0.00325084, throughput 13.1747K wps
[Epoch 22 Batch 60/162] avg loss 0.00336893, throughput 12.7974K wps
[Epoch 22 Batch 90/162] avg loss 0.00346738, throughput 12.8771K wps
[Epoch 22 Batch 120/162] avg loss 0.0038513, throughput 12.8485K wps
[Epoch 22 Batch 150/162] avg loss 0.00322117, throughput 12.866K wps
Begin Testing...
[Epoch 22] train avg loss 0.00343378, test acc 0.8933, test avg loss 0.245106, throughput 12.9052K wps
[Epoch 23 Batch 30/162] avg loss 0.0034283, throughput 13.3328K wps
[Epoch 23 Batch 60/162] avg loss 0.00374982, throughput 12.7963K wps
[Epoch 23 Batch 90/162] avg loss 0.00354311, throughput 12.9694K wps
[Epoch 23 Batch 120/162] avg loss 0.00280443, throughput 12.9512K wps
[Epoch 23 Batch 150/162] avg loss 0.00338136, throughput 12.9276K wps
Begin Testing...
[Epoch 23] train avg loss 0.00338606, test acc 0.9033, test avg loss 0.243062, throughput 12.9781K wps
Observed Improvement.
Begin Testing...
[Epoch 24 Batch 30/162] avg loss 0.00298628, throughput 13.3063K wps
[Epoch 24 Batch 60/162] avg loss 0.00296363, throughput 12.8583K wps
[Epoch 24 Batch 90/162] avg loss 0.00335792, throughput 12.9439K wps
[Epoch 24 Batch 120/162] avg loss 0.00360906, throughput 12.9365K wps
[Epoch 24 Batch 150/162] avg loss 0.00328852, throughput 12.9335K wps
Begin Testing...
[Epoch 24] train avg loss 0.00327677, test acc 0.9056, test avg loss 0.239214, throughput 12.9898K wps
Observed Improvement.
Begin Testing...
[Epoch 25 Batch 30/162] avg loss 0.00353633, throughput 13.3035K wps
[Epoch 25 Batch 60/162] avg loss 0.00299301, throughput 12.8527K wps
[Epoch 25 Batch 90/162] avg loss 0.0030472, throughput 12.9735K wps
[Epoch 25 Batch 120/162] avg loss 0.00305399, throughput 12.8242K wps
[Epoch 25 Batch 150/162] avg loss 0.00293608, throughput 12.8696K wps
Begin Testing...
[Epoch 25] train avg loss 0.0031352, test acc 0.9078, test avg loss 0.238363, throughput 12.956K wps
Observed Improvement.
Begin Testing...
[Epoch 26 Batch 30/162] avg loss 0.0031205, throughput 13.1951K wps
[Epoch 26 Batch 60/162] avg loss 0.00327874, throughput 12.7747K wps
[Epoch 26 Batch 90/162] avg loss 0.00294515, throughput 12.8718K wps
[Epoch 26 Batch 120/162] avg loss 0.00296193, throughput 12.9075K wps
[Epoch 26 Batch 150/162] avg loss 0.00318524, throughput 12.9336K wps
Begin Testing...
[Epoch 26] train avg loss 0.00306878, test acc 0.9044, test avg loss 0.235229, throughput 12.939K wps
[Epoch 27 Batch 30/162] avg loss 0.00292699, throughput 13.2362K wps
[Epoch 27 Batch 60/162] avg loss 0.00306623, throughput 12.8943K wps
[Epoch 27 Batch 90/162] avg loss 0.00270018, throughput 12.9526K wps
[Epoch 27 Batch 120/162] avg loss 0.00285918, throughput 12.9897K wps
[Epoch 27 Batch 150/162] avg loss 0.00287112, throughput 12.9701K wps
Begin Testing...
[Epoch 27] train avg loss 0.00290983, test acc 0.9044, test avg loss 0.235388, throughput 13.0055K wps
[Epoch 28 Batch 30/162] avg loss 0.00254964, throughput 13.2286K wps
[Epoch 28 Batch 60/162] avg loss 0.00298987, throughput 12.847K wps
[Epoch 28 Batch 90/162] avg loss 0.00269629, throughput 12.8819K wps
[Epoch 28 Batch 120/162] avg loss 0.0027521, throughput 12.8177K wps
[Epoch 28 Batch 150/162] avg loss 0.00286216, throughput 12.9721K wps
Begin Testing...
[Epoch 28] train avg loss 0.00280871, test acc 0.8978, test avg loss 0.233143, throughput 12.9494K wps
[Epoch 29 Batch 30/162] avg loss 0.00270326, throughput 13.2127K wps
[Epoch 29 Batch 60/162] avg loss 0.00280888, throughput 12.8264K wps
[Epoch 29 Batch 90/162] avg loss 0.00279016, throughput 12.8033K wps
[Epoch 29 Batch 120/162] avg loss 0.00277208, throughput 12.9012K wps
[Epoch 29 Batch 150/162] avg loss 0.00295352, throughput 12.9431K wps
Begin Testing...
[Epoch 29] train avg loss 0.0028158, test acc 0.9067, test avg loss 0.233806, throughput 12.9376K wps
[Epoch 30 Batch 30/162] avg loss 0.00220843, throughput 13.2864K wps
[Epoch 30 Batch 60/162] avg loss 0.00264272, throughput 12.7684K wps
[Epoch 30 Batch 90/162] avg loss 0.00283844, throughput 12.9161K wps
[Epoch 30 Batch 120/162] avg loss 0.0026494, throughput 12.9835K wps
[Epoch 30 Batch 150/162] avg loss 0.00250368, throughput 12.7865K wps
Begin Testing...
[Epoch 30] train avg loss 0.00259066, test acc 0.9078, test avg loss 0.230741, throughput 12.9244K wps
Observed Improvement.
Begin Testing...
[Epoch 31 Batch 30/162] avg loss 0.0025393, throughput 13.1297K wps
[Epoch 31 Batch 60/162] avg loss 0.00278459, throughput 12.6756K wps
[Epoch 31 Batch 90/162] avg loss 0.00255304, throughput 12.9796K wps
[Epoch 31 Batch 120/162] avg loss 0.00267885, throughput 12.9691K wps
[Epoch 31 Batch 150/162] avg loss 0.00261154, throughput 12.9293K wps
Begin Testing...
[Epoch 31] train avg loss 0.00262752, test acc 0.9133, test avg loss 0.227732, throughput 12.9357K wps
Observed Improvement.
Begin Testing...
[Epoch 32 Batch 30/162] avg loss 0.00268347, throughput 13.1502K wps
[Epoch 32 Batch 60/162] avg loss 0.00273265, throughput 12.7685K wps
[Epoch 32 Batch 90/162] avg loss 0.00232908, throughput 12.7741K wps
[Epoch 32 Batch 120/162] avg loss 0.00240007, throughput 12.9339K wps
[Epoch 32 Batch 150/162] avg loss 0.00265278, throughput 12.8435K wps
Begin Testing...
[Epoch 32] train avg loss 0.00253331, test acc 0.9122, test avg loss 0.228631, throughput 12.8934K wps
[Epoch 33 Batch 30/162] avg loss 0.00252871, throughput 13.2016K wps
[Epoch 33 Batch 60/162] avg loss 0.00218104, throughput 12.8039K wps
[Epoch 33 Batch 90/162] avg loss 0.00220367, throughput 12.9056K wps
[Epoch 33 Batch 120/162] avg loss 0.00250903, throughput 12.8974K wps
[Epoch 33 Batch 150/162] avg loss 0.00262288, throughput 12.8748K wps
Begin Testing...
[Epoch 33] train avg loss 0.00240646, test acc 0.9111, test avg loss 0.227161, throughput 12.9241K wps
[Epoch 34 Batch 30/162] avg loss 0.00231273, throughput 13.2244K wps
[Epoch 34 Batch 60/162] avg loss 0.00241651, throughput 12.7642K wps
[Epoch 34 Batch 90/162] avg loss 0.00201885, throughput 12.8639K wps
[Epoch 34 Batch 120/162] avg loss 0.00225805, throughput 12.871K wps
[Epoch 34 Batch 150/162] avg loss 0.00242081, throughput 12.82K wps
Begin Testing...
[Epoch 34] train avg loss 0.00229882, test acc 0.9056, test avg loss 0.225084, throughput 12.8872K wps
[Epoch 35 Batch 30/162] avg loss 0.00213865, throughput 13.1919K wps
[Epoch 35 Batch 60/162] avg loss 0.00223769, throughput 12.8118K wps
[Epoch 35 Batch 90/162] avg loss 0.00234404, throughput 12.9813K wps
[Epoch 35 Batch 120/162] avg loss 0.00215955, throughput 12.9348K wps
[Epoch 35 Batch 150/162] avg loss 0.00239657, throughput 12.9159K wps
Begin Testing...
[Epoch 35] train avg loss 0.00226617, test acc 0.9056, test avg loss 0.225873, throughput 12.954K wps
[Epoch 36 Batch 30/162] avg loss 0.00229538, throughput 13.2956K wps
[Epoch 36 Batch 60/162] avg loss 0.0021329, throughput 12.8849K wps
[Epoch 36 Batch 90/162] avg loss 0.00207805, throughput 12.9339K wps
[Epoch 36 Batch 120/162] avg loss 0.00197752, throughput 12.8521K wps
[Epoch 36 Batch 150/162] avg loss 0.00207422, throughput 12.9933K wps
Begin Testing...
[Epoch 36] train avg loss 0.00213927, test acc 0.9144, test avg loss 0.22612, throughput 12.9899K wps
Observed Improvement.
Begin Testing...
[Epoch 37 Batch 30/162] avg loss 0.00205599, throughput 13.248K wps
[Epoch 37 Batch 60/162] avg loss 0.0022343, throughput 12.7945K wps
[Epoch 37 Batch 90/162] avg loss 0.00203067, throughput 12.9142K wps
[Epoch 37 Batch 120/162] avg loss 0.00200508, throughput 12.9238K wps
[Epoch 37 Batch 150/162] avg loss 0.00213049, throughput 12.8151K wps
Begin Testing...
[Epoch 37] train avg loss 0.00209195, test acc 0.9167, test avg loss 0.222637, throughput 12.935K wps
Observed Improvement.
Begin Testing...
[Epoch 38 Batch 30/162] avg loss 0.0021085, throughput 13.2049K wps
[Epoch 38 Batch 60/162] avg loss 0.00196564, throughput 12.769K wps
[Epoch 38 Batch 90/162] avg loss 0.00205148, throughput 12.9054K wps
[Epoch 38 Batch 120/162] avg loss 0.00208107, throughput 12.8816K wps
[Epoch 38 Batch 150/162] avg loss 0.00194324, throughput 12.9237K wps
Begin Testing...
[Epoch 38] train avg loss 0.00203963, test acc 0.9111, test avg loss 0.218805, throughput 12.932K wps
[Epoch 39 Batch 30/162] avg loss 0.0021251, throughput 13.2092K wps
[Epoch 39 Batch 60/162] avg loss 0.00203033, throughput 12.7459K wps
[Epoch 39 Batch 90/162] avg loss 0.00191168, throughput 12.8553K wps
[Epoch 39 Batch 120/162] avg loss 0.00186263, throughput 12.8238K wps
[Epoch 39 Batch 150/162] avg loss 0.00194805, throughput 12.9055K wps
Begin Testing...
[Epoch 39] train avg loss 0.00199273, test acc 0.9200, test avg loss 0.218344, throughput 12.9054K wps
Observed Improvement.
Begin Testing...
[Epoch 40 Batch 30/162] avg loss 0.0019199, throughput 13.2506K wps
[Epoch 40 Batch 60/162] avg loss 0.00190493, throughput 12.8427K wps
[Epoch 40 Batch 90/162] avg loss 0.00184729, throughput 12.9003K wps
[Epoch 40 Batch 120/162] avg loss 0.00196014, throughput 12.8823K wps
[Epoch 40 Batch 150/162] avg loss 0.0019899, throughput 12.8892K wps
Begin Testing...
[Epoch 40] train avg loss 0.00193229, test acc 0.9133, test avg loss 0.220432, throughput 12.9505K wps
[Epoch 41 Batch 30/162] avg loss 0.00171538, throughput 13.2559K wps
[Epoch 41 Batch 60/162] avg loss 0.00203242, throughput 12.8177K wps
[Epoch 41 Batch 90/162] avg loss 0.00178565, throughput 12.9539K wps
[Epoch 41 Batch 120/162] avg loss 0.0019151, throughput 12.8698K wps
[Epoch 41 Batch 150/162] avg loss 0.00176573, throughput 12.9205K wps
Begin Testing...
[Epoch 41] train avg loss 0.0018665, test acc 0.9178, test avg loss 0.22032, throughput 12.9645K wps
[Epoch 42 Batch 30/162] avg loss 0.00171102, throughput 13.2761K wps
[Epoch 42 Batch 60/162] avg loss 0.00161625, throughput 12.771K wps
[Epoch 42 Batch 90/162] avg loss 0.00181099, throughput 12.8538K wps
[Epoch 42 Batch 120/162] avg loss 0.00223985, throughput 12.8538K wps
[Epoch 42 Batch 150/162] avg loss 0.00189141, throughput 12.8371K wps
Begin Testing...
[Epoch 42] train avg loss 0.00186309, test acc 0.9211, test avg loss 0.218758, throughput 12.9072K wps
Observed Improvement.
Begin Testing...
[Epoch 43 Batch 30/162] avg loss 0.0018991, throughput 13.149K wps
[Epoch 43 Batch 60/162] avg loss 0.00177125, throughput 12.8662K wps
[Epoch 43 Batch 90/162] avg loss 0.00170566, throughput 12.85K wps
[Epoch 43 Batch 120/162] avg loss 0.00186311, throughput 12.9367K wps
[Epoch 43 Batch 150/162] avg loss 0.00165474, throughput 12.943K wps
Begin Testing...
[Epoch 43] train avg loss 0.00176844, test acc 0.9022, test avg loss 0.221249, throughput 12.945K wps
[Epoch 44 Batch 30/162] avg loss 0.00193766, throughput 13.216K wps
[Epoch 44 Batch 60/162] avg loss 0.00157092, throughput 12.8264K wps
[Epoch 44 Batch 90/162] avg loss 0.00172299, throughput 12.8299K wps
[Epoch 44 Batch 120/162] avg loss 0.00152508, throughput 12.9639K wps
[Epoch 44 Batch 150/162] avg loss 0.00188772, throughput 12.8642K wps
Begin Testing...
[Epoch 44] train avg loss 0.0017132, test acc 0.9178, test avg loss 0.215954, throughput 12.9421K wps
[Epoch 45 Batch 30/162] avg loss 0.00151996, throughput 13.2477K wps
[Epoch 45 Batch 60/162] avg loss 0.00175369, throughput 12.7897K wps
[Epoch 45 Batch 90/162] avg loss 0.00164175, throughput 12.9008K wps
[Epoch 45 Batch 120/162] avg loss 0.00150613, throughput 12.9512K wps
[Epoch 45 Batch 150/162] avg loss 0.00147048, throughput 12.951K wps
Begin Testing...
[Epoch 45] train avg loss 0.00158961, test acc 0.9189, test avg loss 0.21777, throughput 12.9703K wps
[Epoch 46 Batch 30/162] avg loss 0.00159848, throughput 13.0868K wps
[Epoch 46 Batch 60/162] avg loss 0.00145853, throughput 12.781K wps
[Epoch 46 Batch 90/162] avg loss 0.00160345, throughput 12.9119K wps
[Epoch 46 Batch 120/162] avg loss 0.00131987, throughput 12.9182K wps
[Epoch 46 Batch 150/162] avg loss 0.00164809, throughput 12.7334K wps
Begin Testing...
[Epoch 46] train avg loss 0.00152803, test acc 0.9022, test avg loss 0.221191, throughput 12.8879K wps
[Epoch 47 Batch 30/162] avg loss 0.00133427, throughput 13.1299K wps
[Epoch 47 Batch 60/162] avg loss 0.00162422, throughput 12.7991K wps
[Epoch 47 Batch 90/162] avg loss 0.00155508, throughput 12.9426K wps
[Epoch 47 Batch 120/162] avg loss 0.00154414, throughput 12.9209K wps
[Epoch 47 Batch 150/162] avg loss 0.00152931, throughput 12.7242K wps
Begin Testing...
[Epoch 47] train avg loss 0.00150646, test acc 0.9011, test avg loss 0.220538, throughput 12.9039K wps
[Epoch 48 Batch 30/162] avg loss 0.00142851, throughput 13.0645K wps
[Epoch 48 Batch 60/162] avg loss 0.00137031, throughput 12.787K wps
[Epoch 48 Batch 90/162] avg loss 0.00136826, throughput 12.873K wps
[Epoch 48 Batch 120/162] avg loss 0.00163319, throughput 12.7807K wps
[Epoch 48 Batch 150/162] avg loss 0.00131045, throughput 12.9067K wps
Begin Testing...
[Epoch 48] train avg loss 0.00145365, test acc 0.9200, test avg loss 0.217723, throughput 12.8817K wps
[Epoch 49 Batch 30/162] avg loss 0.00137544, throughput 13.1661K wps
[Epoch 49 Batch 60/162] avg loss 0.00147023, throughput 12.7006K wps
[Epoch 49 Batch 90/162] avg loss 0.00146915, throughput 12.7963K wps
[Epoch 49 Batch 120/162] avg loss 0.00156002, throughput 12.8903K wps
[Epoch 49 Batch 150/162] avg loss 0.00128981, throughput 12.9457K wps
Begin Testing...
[Epoch 49] train avg loss 0.00143378, test acc 0.9133, test avg loss 0.219189, throughput 12.9044K wps
[Epoch 50 Batch 30/162] avg loss 0.00128225, throughput 13.2974K wps
[Epoch 50 Batch 60/162] avg loss 0.00130793, throughput 12.7752K wps
[Epoch 50 Batch 90/162] avg loss 0.00130965, throughput 12.9272K wps
[Epoch 50 Batch 120/162] avg loss 0.00133538, throughput 12.8916K wps
[Epoch 50 Batch 150/162] avg loss 0.00137394, throughput 12.8806K wps
Begin Testing...
[Epoch 50] train avg loss 0.00134506, test acc 0.9189, test avg loss 0.217894, throughput 12.9483K wps
[Epoch 51 Batch 30/162] avg loss 0.00136689, throughput 13.1167K wps
[Epoch 51 Batch 60/162] avg loss 0.00135764, throughput 12.8457K wps
[Epoch 51 Batch 90/162] avg loss 0.00135961, throughput 12.9152K wps
[Epoch 51 Batch 120/162] avg loss 0.00124905, throughput 12.8435K wps
[Epoch 51 Batch 150/162] avg loss 0.00122249, throughput 12.8715K wps
Begin Testing...
[Epoch 51] train avg loss 0.00132945, test acc 0.9056, test avg loss 0.215784, throughput 12.9191K wps
[Epoch 52 Batch 30/162] avg loss 0.00136306, throughput 13.0787K wps
[Epoch 52 Batch 60/162] avg loss 0.00124602, throughput 12.8117K wps
[Epoch 52 Batch 90/162] avg loss 0.00131722, throughput 12.7912K wps
[Epoch 52 Batch 120/162] avg loss 0.0014036, throughput 12.8847K wps
[Epoch 52 Batch 150/162] avg loss 0.00129759, throughput 12.708K wps
Begin Testing...
[Epoch 52] train avg loss 0.00132456, test acc 0.9178, test avg loss 0.21549, throughput 12.8548K wps
[Epoch 53 Batch 30/162] avg loss 0.00130295, throughput 13.072K wps
[Epoch 53 Batch 60/162] avg loss 0.00118045, throughput 12.7451K wps
[Epoch 53 Batch 90/162] avg loss 0.00130155, throughput 12.8779K wps
[Epoch 53 Batch 120/162] avg loss 0.00112511, throughput 12.9034K wps
[Epoch 53 Batch 150/162] avg loss 0.00135842, throughput 12.9529K wps
Begin Testing...
[Epoch 53] train avg loss 0.00123612, test acc 0.9089, test avg loss 0.21977, throughput 12.9131K wps
[Epoch 54 Batch 30/162] avg loss 0.00109445, throughput 13.251K wps
[Epoch 54 Batch 60/162] avg loss 0.00126593, throughput 12.8554K wps
[Epoch 54 Batch 90/162] avg loss 0.00111982, throughput 12.9759K wps
[Epoch 54 Batch 120/162] avg loss 0.00142641, throughput 12.9568K wps
[Epoch 54 Batch 150/162] avg loss 0.00123036, throughput 12.9269K wps
Begin Testing...
[Epoch 54] train avg loss 0.00121862, test acc 0.9133, test avg loss 0.213123, throughput 12.9803K wps
[Epoch 55 Batch 30/162] avg loss 0.00101548, throughput 13.2079K wps
[Epoch 55 Batch 60/162] avg loss 0.00126153, throughput 12.854K wps
[Epoch 55 Batch 90/162] avg loss 0.00122748, throughput 12.9703K wps
[Epoch 55 Batch 120/162] avg loss 0.00103664, throughput 12.9585K wps
[Epoch 55 Batch 150/162] avg loss 0.00112999, throughput 12.9771K wps
Begin Testing...
[Epoch 55] train avg loss 0.00113094, test acc 0.9167, test avg loss 0.216749, throughput 12.9878K wps
[Epoch 56 Batch 30/162] avg loss 0.000873397, throughput 13.1893K wps
[Epoch 56 Batch 60/162] avg loss 0.00125796, throughput 12.8249K wps
[Epoch 56 Batch 90/162] avg loss 0.00109008, throughput 12.9904K wps
[Epoch 56 Batch 120/162] avg loss 0.00105737, throughput 12.8733K wps
[Epoch 56 Batch 150/162] avg loss 0.00119772, throughput 12.8871K wps
Begin Testing...
[Epoch 56] train avg loss 0.00109903, test acc 0.9167, test avg loss 0.21803, throughput 12.9503K wps
[Epoch 57 Batch 30/162] avg loss 0.00101282, throughput 13.2075K wps
[Epoch 57 Batch 60/162] avg loss 0.000937526, throughput 12.7762K wps
[Epoch 57 Batch 90/162] avg loss 0.00111532, throughput 12.9753K wps
[Epoch 57 Batch 120/162] avg loss 0.00102574, throughput 12.9502K wps
[Epoch 57 Batch 150/162] avg loss 0.00108159, throughput 12.9496K wps
Begin Testing...
[Epoch 57] train avg loss 0.00104801, test acc 0.9144, test avg loss 0.216217, throughput 12.9642K wps
[Epoch 58 Batch 30/162] avg loss 0.000998312, throughput 13.2503K wps
[Epoch 58 Batch 60/162] avg loss 0.00102101, throughput 12.8029K wps
[Epoch 58 Batch 90/162] avg loss 0.00109404, throughput 12.8755K wps
[Epoch 58 Batch 120/162] avg loss 0.00103756, throughput 12.785K wps
[Epoch 58 Batch 150/162] avg loss 0.00100858, throughput 12.7836K wps
Begin Testing...
[Epoch 58] train avg loss 0.00102547, test acc 0.9122, test avg loss 0.219053, throughput 12.8952K wps
[Epoch 59 Batch 30/162] avg loss 0.000936297, throughput 13.2002K wps
[Epoch 59 Batch 60/162] avg loss 0.00097648, throughput 12.8604K wps
[Epoch 59 Batch 90/162] avg loss 0.000829199, throughput 12.9159K wps
[Epoch 59 Batch 120/162] avg loss 0.00113897, throughput 12.903K wps
[Epoch 59 Batch 150/162] avg loss 0.000978946, throughput 12.8986K wps
Begin Testing...
[Epoch 59] train avg loss 0.000971147, test acc 0.9200, test avg loss 0.219197, throughput 12.9523K wps
Test loss 0.209778, test acc 0.9160
Total time cost 165.24s
9000 1000
[Epoch 0 Batch 30/162] avg loss 0.0153836, throughput 11.557K wps
[Epoch 0 Batch 60/162] avg loss 0.0141842, throughput 12.8046K wps
[Epoch 0 Batch 90/162] avg loss 0.0137805, throughput 12.9296K wps
[Epoch 0 Batch 120/162] avg loss 0.0132039, throughput 12.8898K wps
[Epoch 0 Batch 150/162] avg loss 0.0129823, throughput 12.912K wps
Begin Testing...
[Epoch 0] train avg loss 0.0138589, test acc 0.7200, test avg loss 0.587513, throughput 12.6192K wps
Observed Improvement.
Begin Testing...
[Epoch 1 Batch 30/162] avg loss 0.0124393, throughput 13.2019K wps
[Epoch 1 Batch 60/162] avg loss 0.011902, throughput 12.743K wps
[Epoch 1 Batch 90/162] avg loss 0.0120864, throughput 12.8888K wps
[Epoch 1 Batch 120/162] avg loss 0.0118902, throughput 12.7933K wps
[Epoch 1 Batch 150/162] avg loss 0.0114326, throughput 12.8349K wps
Begin Testing...
[Epoch 1] train avg loss 0.0119465, test acc 0.7622, test avg loss 0.549421, throughput 12.8802K wps
Observed Improvement.
Begin Testing...
[Epoch 2 Batch 30/162] avg loss 0.0110607, throughput 13.2312K wps
[Epoch 2 Batch 60/162] avg loss 0.0109249, throughput 12.7236K wps
[Epoch 2 Batch 90/162] avg loss 0.0106948, throughput 12.7566K wps
[Epoch 2 Batch 120/162] avg loss 0.0105493, throughput 12.7586K wps
[Epoch 2 Batch 150/162] avg loss 0.0104988, throughput 12.8966K wps
Begin Testing...
[Epoch 2] train avg loss 0.0107441, test acc 0.8000, test avg loss 0.502685, throughput 12.8674K wps
Observed Improvement.
Begin Testing...
[Epoch 3 Batch 30/162] avg loss 0.0100037, throughput 13.2442K wps
[Epoch 3 Batch 60/162] avg loss 0.00993075, throughput 12.8765K wps
[Epoch 3 Batch 90/162] avg loss 0.00934463, throughput 12.8126K wps
[Epoch 3 Batch 120/162] avg loss 0.00940692, throughput 12.8302K wps
[Epoch 3 Batch 150/162] avg loss 0.00937188, throughput 12.9266K wps
Begin Testing...
[Epoch 3] train avg loss 0.00956481, test acc 0.8522, test avg loss 0.452271, throughput 12.9279K wps
Observed Improvement.
Begin Testing...
[Epoch 4 Batch 30/162] avg loss 0.00904089, throughput 13.1881K wps
[Epoch 4 Batch 60/162] avg loss 0.00869165, throughput 12.7252K wps
[Epoch 4 Batch 90/162] avg loss 0.00869624, throughput 12.7486K wps
[Epoch 4 Batch 120/162] avg loss 0.00861975, throughput 12.8061K wps
[Epoch 4 Batch 150/162] avg loss 0.00818021, throughput 12.7959K wps
Begin Testing...
[Epoch 4] train avg loss 0.00861596, test acc 0.8656, test avg loss 0.408678, throughput 12.854K wps
Observed Improvement.
Begin Testing...
[Epoch 5 Batch 30/162] avg loss 0.00802728, throughput 13.2432K wps
[Epoch 5 Batch 60/162] avg loss 0.00760693, throughput 12.7795K wps
[Epoch 5 Batch 90/162] avg loss 0.00779407, throughput 12.9775K wps
[Epoch 5 Batch 120/162] avg loss 0.00761724, throughput 12.8324K wps
[Epoch 5 Batch 150/162] avg loss 0.00765705, throughput 12.8755K wps
Begin Testing...
[Epoch 5] train avg loss 0.0077202, test acc 0.8767, test avg loss 0.380541, throughput 12.9126K wps
Observed Improvement.
Begin Testing...
[Epoch 6 Batch 30/162] avg loss 0.00697729, throughput 13.2338K wps
[Epoch 6 Batch 60/162] avg loss 0.0073755, throughput 12.8158K wps
[Epoch 6 Batch 90/162] avg loss 0.00728557, throughput 12.8607K wps
[Epoch 6 Batch 120/162] avg loss 0.00681963, throughput 12.9186K wps
[Epoch 6 Batch 150/162] avg loss 0.00689101, throughput 12.827K wps
Begin Testing...
[Epoch 6] train avg loss 0.00705018, test acc 0.8856, test avg loss 0.350527, throughput 12.9127K wps
Observed Improvement.
Begin Testing...
[Epoch 7 Batch 30/162] avg loss 0.00645313, throughput 13.2355K wps
[Epoch 7 Batch 60/162] avg loss 0.00674562, throughput 12.7823K wps
[Epoch 7 Batch 90/162] avg loss 0.0063667, throughput 12.9205K wps
[Epoch 7 Batch 120/162] avg loss 0.00663505, throughput 12.8361K wps
[Epoch 7 Batch 150/162] avg loss 0.00646274, throughput 12.8505K wps
Begin Testing...
[Epoch 7] train avg loss 0.00651995, test acc 0.8833, test avg loss 0.334327, throughput 12.9097K wps
[Epoch 8 Batch 30/162] avg loss 0.00592036, throughput 13.1784K wps
[Epoch 8 Batch 60/162] avg loss 0.00599658, throughput 12.8244K wps
[Epoch 8 Batch 90/162] avg loss 0.0062655, throughput 12.8172K wps
[Epoch 8 Batch 120/162] avg loss 0.00612314, throughput 12.8796K wps
[Epoch 8 Batch 150/162] avg loss 0.00599427, throughput 12.9256K wps
Begin Testing...
[Epoch 8] train avg loss 0.00608174, test acc 0.8867, test avg loss 0.311663, throughput 12.9291K wps
Observed Improvement.
Begin Testing...
[Epoch 9 Batch 30/162] avg loss 0.0058705, throughput 13.1487K wps
[Epoch 9 Batch 60/162] avg loss 0.00603279, throughput 12.6705K wps
[Epoch 9 Batch 90/162] avg loss 0.00587176, throughput 12.6779K wps
[Epoch 9 Batch 120/162] avg loss 0.00575171, throughput 12.6588K wps
[Epoch 9 Batch 150/162] avg loss 0.0055254, throughput 12.954K wps
Begin Testing...
[Epoch 9] train avg loss 0.00574034, test acc 0.8911, test avg loss 0.298732, throughput 12.8319K wps
Observed Improvement.
Begin Testing...
[Epoch 10 Batch 30/162] avg loss 0.00530246, throughput 13.293K wps
[Epoch 10 Batch 60/162] avg loss 0.00545905, throughput 12.8K wps
[Epoch 10 Batch 90/162] avg loss 0.00562044, throughput 12.7655K wps
[Epoch 10 Batch 120/162] avg loss 0.00562727, throughput 12.7628K wps
[Epoch 10 Batch 150/162] avg loss 0.0052799, throughput 12.8003K wps
Begin Testing...
[Epoch 10] train avg loss 0.00545918, test acc 0.8867, test avg loss 0.291501, throughput 12.8685K wps
[Epoch 11 Batch 30/162] avg loss 0.00509359, throughput 13.1289K wps
[Epoch 11 Batch 60/162] avg loss 0.0053344, throughput 12.7859K wps
[Epoch 11 Batch 90/162] avg loss 0.00494471, throughput 12.8907K wps
[Epoch 11 Batch 120/162] avg loss 0.00546906, throughput 12.8879K wps
[Epoch 11 Batch 150/162] avg loss 0.00520305, throughput 12.9003K wps
Begin Testing...
[Epoch 11] train avg loss 0.00520927, test acc 0.8967, test avg loss 0.279425, throughput 12.9207K wps
Observed Improvement.
Begin Testing...
[Epoch 12 Batch 30/162] avg loss 0.00488667, throughput 13.2281K wps
[Epoch 12 Batch 60/162] avg loss 0.00517409, throughput 12.7842K wps
[Epoch 12 Batch 90/162] avg loss 0.00519745, throughput 12.8706K wps
[Epoch 12 Batch 120/162] avg loss 0.00480344, throughput 12.8157K wps
[Epoch 12 Batch 150/162] avg loss 0.00472163, throughput 12.8467K wps
Begin Testing...
[Epoch 12] train avg loss 0.00495959, test acc 0.8878, test avg loss 0.285481, throughput 12.9012K wps
[Epoch 13 Batch 30/162] avg loss 0.0046894, throughput 13.1872K wps
[Epoch 13 Batch 60/162] avg loss 0.00471307, throughput 12.7503K wps
[Epoch 13 Batch 90/162] avg loss 0.00493759, throughput 12.9248K wps
[Epoch 13 Batch 120/162] avg loss 0.00490552, throughput 12.83K wps
[Epoch 13 Batch 150/162] avg loss 0.00454848, throughput 12.9253K wps
Begin Testing...
[Epoch 13] train avg loss 0.00475156, test acc 0.8878, test avg loss 0.279893, throughput 12.9197K wps
[Epoch 14 Batch 30/162] avg loss 0.00463283, throughput 13.1507K wps
[Epoch 14 Batch 60/162] avg loss 0.00462873, throughput 12.7962K wps
[Epoch 14 Batch 90/162] avg loss 0.00461566, throughput 12.8558K wps
[Epoch 14 Batch 120/162] avg loss 0.004838, throughput 12.9059K wps
[Epoch 14 Batch 150/162] avg loss 0.00452778, throughput 12.9215K wps
Begin Testing...
[Epoch 14] train avg loss 0.00456817, test acc 0.8889, test avg loss 0.270566, throughput 12.9233K wps
[Epoch 15 Batch 30/162] avg loss 0.00441655, throughput 13.299K wps
[Epoch 15 Batch 60/162] avg loss 0.00456023, throughput 12.7789K wps
[Epoch 15 Batch 90/162] avg loss 0.00455329, throughput 12.9091K wps
[Epoch 15 Batch 120/162] avg loss 0.00423204, throughput 12.9238K wps
[Epoch 15 Batch 150/162] avg loss 0.00453872, throughput 12.9147K wps
Begin Testing...
[Epoch 15] train avg loss 0.00443408, test acc 0.8911, test avg loss 0.265747, throughput 12.9596K wps
[Epoch 16 Batch 30/162] avg loss 0.00436107, throughput 13.1985K wps
[Epoch 16 Batch 60/162] avg loss 0.00441771, throughput 12.7998K wps
[Epoch 16 Batch 90/162] avg loss 0.00423306, throughput 12.9686K wps
[Epoch 16 Batch 120/162] avg loss 0.00403068, throughput 12.8352K wps
[Epoch 16 Batch 150/162] avg loss 0.00407715, throughput 12.9157K wps
Begin Testing...
[Epoch 16] train avg loss 0.00424446, test acc 0.9000, test avg loss 0.255064, throughput 12.9386K wps
Observed Improvement.
Begin Testing...
[Epoch 17 Batch 30/162] avg loss 0.00422331, throughput 13.2792K wps
[Epoch 17 Batch 60/162] avg loss 0.00383541, throughput 12.8088K wps
[Epoch 17 Batch 90/162] avg loss 0.0041192, throughput 12.901K wps
[Epoch 17 Batch 120/162] avg loss 0.00430456, throughput 12.7934K wps
[Epoch 17 Batch 150/162] avg loss 0.00418026, throughput 12.9171K wps
Begin Testing...
[Epoch 17] train avg loss 0.00412977, test acc 0.8911, test avg loss 0.263517, throughput 12.9287K wps
[Epoch 18 Batch 30/162] avg loss 0.00405446, throughput 13.2359K wps
[Epoch 18 Batch 60/162] avg loss 0.00400614, throughput 12.7662K wps
[Epoch 18 Batch 90/162] avg loss 0.00396183, throughput 12.8034K wps
[Epoch 18 Batch 120/162] avg loss 0.00354427, throughput 12.8398K wps
[Epoch 18 Batch 150/162] avg loss 0.00371172, throughput 12.8094K wps
Begin Testing...
[Epoch 18] train avg loss 0.00389823, test acc 0.9011, test avg loss 0.251499, throughput 12.879K wps
Observed Improvement.
Begin Testing...
[Epoch 19 Batch 30/162] avg loss 0.00405125, throughput 13.2551K wps
[Epoch 19 Batch 60/162] avg loss 0.00367343, throughput 12.8064K wps
[Epoch 19 Batch 90/162] avg loss 0.00379684, throughput 12.9514K wps
[Epoch 19 Batch 120/162] avg loss 0.00428465, throughput 12.7752K wps
[Epoch 19 Batch 150/162] avg loss 0.00383296, throughput 12.9683K wps
Begin Testing...
[Epoch 19] train avg loss 0.00389581, test acc 0.9033, test avg loss 0.248202, throughput 12.9473K wps
Observed Improvement.
Begin Testing...
[Epoch 20 Batch 30/162] avg loss 0.00360568, throughput 13.1888K wps
[Epoch 20 Batch 60/162] avg loss 0.00348325, throughput 12.7999K wps
[Epoch 20 Batch 90/162] avg loss 0.00366182, throughput 12.8533K wps
[Epoch 20 Batch 120/162] avg loss 0.00380643, throughput 12.9283K wps
[Epoch 20 Batch 150/162] avg loss 0.00380186, throughput 12.8857K wps
Begin Testing...
[Epoch 20] train avg loss 0.00366235, test acc 0.9100, test avg loss 0.240876, throughput 12.9346K wps
Observed Improvement.
Begin Testing...
[Epoch 21 Batch 30/162] avg loss 0.00352937, throughput 13.2108K wps
[Epoch 21 Batch 60/162] avg loss 0.00327944, throughput 12.8086K wps
[Epoch 21 Batch 90/162] avg loss 0.0036352, throughput 12.8727K wps
[Epoch 21 Batch 120/162] avg loss 0.00365576, throughput 12.8042K wps
[Epoch 21 Batch 150/162] avg loss 0.00390396, throughput 12.8192K wps
Begin Testing...
[Epoch 21] train avg loss 0.00362099, test acc 0.9078, test avg loss 0.239965, throughput 12.8963K wps
[Epoch 22 Batch 30/162] avg loss 0.00320747, throughput 13.1552K wps
[Epoch 22 Batch 60/162] avg loss 0.00355437, throughput 12.7285K wps
[Epoch 22 Batch 90/162] avg loss 0.00337753, throughput 12.8755K wps
[Epoch 22 Batch 120/162] avg loss 0.00366971, throughput 12.9086K wps
[Epoch 22 Batch 150/162] avg loss 0.00323295, throughput 12.8434K wps
Begin Testing...
[Epoch 22] train avg loss 0.00342582, test acc 0.9078, test avg loss 0.235214, throughput 12.8991K wps
[Epoch 23 Batch 30/162] avg loss 0.00309858, throughput 13.2288K wps
[Epoch 23 Batch 60/162] avg loss 0.00384201, throughput 12.9096K wps
[Epoch 23 Batch 90/162] avg loss 0.00330818, throughput 12.8037K wps
[Epoch 23 Batch 120/162] avg loss 0.00322091, throughput 12.8133K wps
[Epoch 23 Batch 150/162] avg loss 0.00361659, throughput 12.738K wps
Begin Testing...
[Epoch 23] train avg loss 0.00338848, test acc 0.9067, test avg loss 0.236275, throughput 12.8994K wps
[Epoch 24 Batch 30/162] avg loss 0.00305783, throughput 13.1107K wps
[Epoch 24 Batch 60/162] avg loss 0.00356475, throughput 12.7982K wps
[Epoch 24 Batch 90/162] avg loss 0.00330263, throughput 12.8694K wps
[Epoch 24 Batch 120/162] avg loss 0.00324563, throughput 12.9416K wps
[Epoch 24 Batch 150/162] avg loss 0.00326938, throughput 12.83K wps
Begin Testing...
[Epoch 24] train avg loss 0.003257, test acc 0.9067, test avg loss 0.241796, throughput 12.912K wps
[Epoch 25 Batch 30/162] avg loss 0.00294926, throughput 13.2277K wps
[Epoch 25 Batch 60/162] avg loss 0.00317881, throughput 12.7103K wps
[Epoch 25 Batch 90/162] avg loss 0.00286962, throughput 12.9265K wps
[Epoch 25 Batch 120/162] avg loss 0.00346476, throughput 12.9201K wps
[Epoch 25 Batch 150/162] avg loss 0.00329168, throughput 12.9179K wps
Begin Testing...
[Epoch 25] train avg loss 0.00312977, test acc 0.9122, test avg loss 0.228541, throughput 12.9256K wps
Observed Improvement.
Begin Testing...
[Epoch 26 Batch 30/162] avg loss 0.00317392, throughput 13.2338K wps
[Epoch 26 Batch 60/162] avg loss 0.0025798, throughput 12.7456K wps
[Epoch 26 Batch 90/162] avg loss 0.00303141, throughput 12.9194K wps
[Epoch 26 Batch 120/162] avg loss 0.00315568, throughput 12.9566K wps
[Epoch 26 Batch 150/162] avg loss 0.00339337, throughput 12.8805K wps
Begin Testing...
[Epoch 26] train avg loss 0.00305474, test acc 0.9122, test avg loss 0.231149, throughput 12.9418K wps
Observed Improvement.
Begin Testing...
[Epoch 27 Batch 30/162] avg loss 0.00302697, throughput 13.1599K wps
[Epoch 27 Batch 60/162] avg loss 0.00288178, throughput 12.677K wps
[Epoch 27 Batch 90/162] avg loss 0.0032225, throughput 12.7834K wps
[Epoch 27 Batch 120/162] avg loss 0.00300118, throughput 12.861K wps
[Epoch 27 Batch 150/162] avg loss 0.00272306, throughput 12.8702K wps
Begin Testing...
[Epoch 27] train avg loss 0.00298829, test acc 0.9100, test avg loss 0.228699, throughput 12.8722K wps
[Epoch 28 Batch 30/162] avg loss 0.00262076, throughput 13.2281K wps
[Epoch 28 Batch 60/162] avg loss 0.0030431, throughput 12.7654K wps
[Epoch 28 Batch 90/162] avg loss 0.00302485, throughput 12.9251K wps
[Epoch 28 Batch 120/162] avg loss 0.00277322, throughput 12.8998K wps
[Epoch 28 Batch 150/162] avg loss 0.00277695, throughput 12.9515K wps
Begin Testing...
[Epoch 28] train avg loss 0.00287347, test acc 0.9133, test avg loss 0.229818, throughput 12.9466K wps
Observed Improvement.
Begin Testing...
[Epoch 29 Batch 30/162] avg loss 0.00292297, throughput 13.2115K wps
[Epoch 29 Batch 60/162] avg loss 0.00272093, throughput 12.7927K wps
[Epoch 29 Batch 90/162] avg loss 0.00286341, throughput 12.9085K wps
[Epoch 29 Batch 120/162] avg loss 0.00273458, throughput 12.7567K wps
[Epoch 29 Batch 150/162] avg loss 0.00272405, throughput 12.8773K wps
Begin Testing...
[Epoch 29] train avg loss 0.00275445, test acc 0.9122, test avg loss 0.227575, throughput 12.9095K wps
[Epoch 30 Batch 30/162] avg loss 0.00270253, throughput 13.1627K wps
[Epoch 30 Batch 60/162] avg loss 0.00248164, throughput 12.7894K wps
[Epoch 30 Batch 90/162] avg loss 0.00278389, throughput 12.7482K wps
[Epoch 30 Batch 120/162] avg loss 0.00251752, throughput 12.8556K wps
[Epoch 30 Batch 150/162] avg loss 0.00273145, throughput 12.9074K wps
Begin Testing...
[Epoch 30] train avg loss 0.00268781, test acc 0.9111, test avg loss 0.229768, throughput 12.8845K wps
[Epoch 31 Batch 30/162] avg loss 0.00259693, throughput 13.1643K wps
[Epoch 31 Batch 60/162] avg loss 0.00245178, throughput 12.7361K wps
[Epoch 31 Batch 90/162] avg loss 0.00268435, throughput 12.8823K wps
[Epoch 31 Batch 120/162] avg loss 0.00267022, throughput 12.8422K wps
[Epoch 31 Batch 150/162] avg loss 0.00277473, throughput 12.9553K wps
Begin Testing...
[Epoch 31] train avg loss 0.00262973, test acc 0.9122, test avg loss 0.230753, throughput 12.914K wps
[Epoch 32 Batch 30/162] avg loss 0.00246461, throughput 13.1597K wps
[Epoch 32 Batch 60/162] avg loss 0.00267552, throughput 12.8198K wps
[Epoch 32 Batch 90/162] avg loss 0.00255611, throughput 12.9096K wps
[Epoch 32 Batch 120/162] avg loss 0.00231388, throughput 12.9323K wps
[Epoch 32 Batch 150/162] avg loss 0.00266485, throughput 12.9256K wps
Begin Testing...
[Epoch 32] train avg loss 0.00253547, test acc 0.9156, test avg loss 0.221498, throughput 12.947K wps
Observed Improvement.
Begin Testing...
[Epoch 33 Batch 30/162] avg loss 0.00249809, throughput 13.1764K wps
[Epoch 33 Batch 60/162] avg loss 0.00261127, throughput 12.8671K wps
[Epoch 33 Batch 90/162] avg loss 0.00247485, throughput 12.9765K wps
[Epoch 33 Batch 120/162] avg loss 0.002399, throughput 12.9759K wps
[Epoch 33 Batch 150/162] avg loss 0.00227832, throughput 12.9615K wps
Begin Testing...
[Epoch 33] train avg loss 0.00242265, test acc 0.9100, test avg loss 0.228226, throughput 12.9838K wps
[Epoch 34 Batch 30/162] avg loss 0.00230638, throughput 13.2889K wps
[Epoch 34 Batch 60/162] avg loss 0.00230696, throughput 12.7716K wps
[Epoch 34 Batch 90/162] avg loss 0.00234386, throughput 12.9616K wps
[Epoch 34 Batch 120/162] avg loss 0.00240893, throughput 12.9608K wps
[Epoch 34 Batch 150/162] avg loss 0.00251906, throughput 12.9977K wps
Begin Testing...
[Epoch 34] train avg loss 0.00236057, test acc 0.9133, test avg loss 0.226439, throughput 12.9929K wps
[Epoch 35 Batch 30/162] avg loss 0.00217118, throughput 13.17K wps
[Epoch 35 Batch 60/162] avg loss 0.00232458, throughput 12.7683K wps
[Epoch 35 Batch 90/162] avg loss 0.00243667, throughput 12.8247K wps
[Epoch 35 Batch 120/162] avg loss 0.00229724, throughput 12.9369K wps
[Epoch 35 Batch 150/162] avg loss 0.00232529, throughput 12.9056K wps
Begin Testing...
[Epoch 35] train avg loss 0.00232181, test acc 0.9144, test avg loss 0.222887, throughput 12.9208K wps
[Epoch 36 Batch 30/162] avg loss 0.00202496, throughput 13.2415K wps
[Epoch 36 Batch 60/162] avg loss 0.00218925, throughput 12.8044K wps
[Epoch 36 Batch 90/162] avg loss 0.00216713, throughput 12.8692K wps
[Epoch 36 Batch 120/162] avg loss 0.00248574, throughput 12.8256K wps
[Epoch 36 Batch 150/162] avg loss 0.00234878, throughput 12.7636K wps
Begin Testing...
[Epoch 36] train avg loss 0.0022263, test acc 0.9144, test avg loss 0.225723, throughput 12.8824K wps
[Epoch 37 Batch 30/162] avg loss 0.00230993, throughput 13.3496K wps
[Epoch 37 Batch 60/162] avg loss 0.0022962, throughput 12.825K wps
[Epoch 37 Batch 90/162] avg loss 0.00212892, throughput 12.9155K wps
[Epoch 37 Batch 120/162] avg loss 0.00213869, throughput 12.8236K wps
[Epoch 37 Batch 150/162] avg loss 0.00189713, throughput 12.8211K wps
Begin Testing...
[Epoch 37] train avg loss 0.00214935, test acc 0.9133, test avg loss 0.217554, throughput 12.9302K wps
[Epoch 38 Batch 30/162] avg loss 0.00198918, throughput 13.2273K wps
[Epoch 38 Batch 60/162] avg loss 0.00221088, throughput 12.7891K wps
[Epoch 38 Batch 90/162] avg loss 0.00207095, throughput 12.7858K wps
[Epoch 38 Batch 120/162] avg loss 0.00207738, throughput 12.7709K wps
[Epoch 38 Batch 150/162] avg loss 0.00215, throughput 12.9775K wps
Begin Testing...
[Epoch 38] train avg loss 0.00210845, test acc 0.9133, test avg loss 0.223111, throughput 12.9118K wps
[Epoch 39 Batch 30/162] avg loss 0.0017446, throughput 13.1808K wps
[Epoch 39 Batch 60/162] avg loss 0.00223122, throughput 12.7482K wps
[Epoch 39 Batch 90/162] avg loss 0.00241558, throughput 12.8038K wps
[Epoch 39 Batch 120/162] avg loss 0.00195155, throughput 12.804K wps
[Epoch 39 Batch 150/162] avg loss 0.00201984, throughput 12.7238K wps
Begin Testing...
[Epoch 39] train avg loss 0.00206776, test acc 0.9167, test avg loss 0.21285, throughput 12.8545K wps
Observed Improvement.
Begin Testing...
[Epoch 40 Batch 30/162] avg loss 0.00228364, throughput 13.1911K wps
[Epoch 40 Batch 60/162] avg loss 0.00201882, throughput 12.7543K wps
[Epoch 40 Batch 90/162] avg loss 0.00185111, throughput 12.8407K wps
[Epoch 40 Batch 120/162] avg loss 0.0020731, throughput 12.8294K wps
[Epoch 40 Batch 150/162] avg loss 0.00180798, throughput 12.8724K wps
Begin Testing...
[Epoch 40] train avg loss 0.00198863, test acc 0.9167, test avg loss 0.214729, throughput 12.8937K wps
Observed Improvement.
Begin Testing...
[Epoch 41 Batch 30/162] avg loss 0.00176998, throughput 13.1736K wps
[Epoch 41 Batch 60/162] avg loss 0.00194046, throughput 12.7767K wps
[Epoch 41 Batch 90/162] avg loss 0.00199333, throughput 12.9099K wps
[Epoch 41 Batch 120/162] avg loss 0.00173222, throughput 12.8211K wps
[Epoch 41 Batch 150/162] avg loss 0.00181251, throughput 12.8635K wps
Begin Testing...
[Epoch 41] train avg loss 0.00186652, test acc 0.9156, test avg loss 0.213875, throughput 12.9163K wps
[Epoch 42 Batch 30/162] avg loss 0.00181048, throughput 13.2434K wps
[Epoch 42 Batch 60/162] avg loss 0.00199323, throughput 12.7734K wps
[Epoch 42 Batch 90/162] avg loss 0.00171963, throughput 13.0108K wps
[Epoch 42 Batch 120/162] avg loss 0.00190281, throughput 12.9865K wps
[Epoch 42 Batch 150/162] avg loss 0.00198961, throughput 13.0205K wps
Begin Testing...
[Epoch 42] train avg loss 0.00189382, test acc 0.9100, test avg loss 0.216834, throughput 13.0018K wps
[Epoch 43 Batch 30/162] avg loss 0.0018788, throughput 13.2174K wps
[Epoch 43 Batch 60/162] avg loss 0.00181222, throughput 12.9345K wps
[Epoch 43 Batch 90/162] avg loss 0.00208879, throughput 12.894K wps
[Epoch 43 Batch 120/162] avg loss 0.00165938, throughput 12.9718K wps
[Epoch 43 Batch 150/162] avg loss 0.00180443, throughput 12.902K wps
Begin Testing...
[Epoch 43] train avg loss 0.0018362, test acc 0.9156, test avg loss 0.212913, throughput 12.9693K wps
[Epoch 44 Batch 30/162] avg loss 0.00171305, throughput 13.1866K wps
[Epoch 44 Batch 60/162] avg loss 0.00177127, throughput 12.7808K wps
[Epoch 44 Batch 90/162] avg loss 0.00167524, throughput 12.9267K wps
[Epoch 44 Batch 120/162] avg loss 0.00180074, throughput 12.8526K wps
[Epoch 44 Batch 150/162] avg loss 0.00176759, throughput 12.8007K wps
Begin Testing...
[Epoch 44] train avg loss 0.00175573, test acc 0.9156, test avg loss 0.211741, throughput 12.9093K wps
[Epoch 45 Batch 30/162] avg loss 0.00169997, throughput 13.2735K wps
[Epoch 45 Batch 60/162] avg loss 0.00152501, throughput 12.7821K wps
[Epoch 45 Batch 90/162] avg loss 0.00160475, throughput 12.9369K wps
[Epoch 45 Batch 120/162] avg loss 0.00188231, throughput 12.8994K wps
[Epoch 45 Batch 150/162] avg loss 0.0018009, throughput 12.9007K wps
Begin Testing...
[Epoch 45] train avg loss 0.00170992, test acc 0.9178, test avg loss 0.209224, throughput 12.9545K wps
Observed Improvement.
Begin Testing...
[Epoch 46 Batch 30/162] avg loss 0.00166218, throughput 13.1747K wps
[Epoch 46 Batch 60/162] avg loss 0.00153385, throughput 12.8749K wps
[Epoch 46 Batch 90/162] avg loss 0.00163305, throughput 12.9203K wps
[Epoch 46 Batch 120/162] avg loss 0.00181805, throughput 12.945K wps
[Epoch 46 Batch 150/162] avg loss 0.00151999, throughput 12.9016K wps
Begin Testing...
[Epoch 46] train avg loss 0.00162071, test acc 0.9167, test avg loss 0.212165, throughput 12.9605K wps
[Epoch 47 Batch 30/162] avg loss 0.00148316, throughput 13.1707K wps
[Epoch 47 Batch 60/162] avg loss 0.00176782, throughput 12.8298K wps
[Epoch 47 Batch 90/162] avg loss 0.00125773, throughput 12.9533K wps
[Epoch 47 Batch 120/162] avg loss 0.00145301, throughput 12.9518K wps
[Epoch 47 Batch 150/162] avg loss 0.00180191, throughput 12.9458K wps
Begin Testing...
[Epoch 47] train avg loss 0.00157372, test acc 0.9144, test avg loss 0.211589, throughput 12.969K wps
[Epoch 48 Batch 30/162] avg loss 0.00146201, throughput 13.1566K wps
[Epoch 48 Batch 60/162] avg loss 0.0013506, throughput 12.8144K wps
[Epoch 48 Batch 90/162] avg loss 0.00159084, throughput 12.8814K wps
[Epoch 48 Batch 120/162] avg loss 0.00154974, throughput 12.9467K wps
[Epoch 48 Batch 150/162] avg loss 0.00154882, throughput 12.9245K wps
Begin Testing...
[Epoch 48] train avg loss 0.00151111, test acc 0.9122, test avg loss 0.219335, throughput 12.9357K wps
[Epoch 49 Batch 30/162] avg loss 0.00148262, throughput 13.1281K wps
[Epoch 49 Batch 60/162] avg loss 0.00154554, throughput 12.7703K wps
[Epoch 49 Batch 90/162] avg loss 0.00153032, throughput 12.8517K wps
[Epoch 49 Batch 120/162] avg loss 0.00147258, throughput 12.8828K wps
[Epoch 49 Batch 150/162] avg loss 0.00142194, throughput 12.8706K wps
Begin Testing...
[Epoch 49] train avg loss 0.0014763, test acc 0.9178, test avg loss 0.208922, throughput 12.8988K wps
Observed Improvement.
Begin Testing...
[Epoch 50 Batch 30/162] avg loss 0.00136816, throughput 13.2276K wps
[Epoch 50 Batch 60/162] avg loss 0.0013682, throughput 12.8363K wps
[Epoch 50 Batch 90/162] avg loss 0.00154885, throughput 12.9978K wps
[Epoch 50 Batch 120/162] avg loss 0.00135078, throughput 12.8836K wps
[Epoch 50 Batch 150/162] avg loss 0.00144378, throughput 12.8565K wps
Begin Testing...
[Epoch 50] train avg loss 0.00141186, test acc 0.9211, test avg loss 0.207711, throughput 12.9597K wps
Observed Improvement.
Begin Testing...
[Epoch 51 Batch 30/162] avg loss 0.00142461, throughput 13.3185K wps
[Epoch 51 Batch 60/162] avg loss 0.00152201, throughput 12.7893K wps
[Epoch 51 Batch 90/162] avg loss 0.00144157, throughput 12.8333K wps
[Epoch 51 Batch 120/162] avg loss 0.00142768, throughput 12.9427K wps
[Epoch 51 Batch 150/162] avg loss 0.00154222, throughput 12.9808K wps
Begin Testing...
[Epoch 51] train avg loss 0.00144449, test acc 0.9167, test avg loss 0.211429, throughput 12.9674K wps
[Epoch 52 Batch 30/162] avg loss 0.00123359, throughput 13.3008K wps
[Epoch 52 Batch 60/162] avg loss 0.00137112, throughput 12.8145K wps
[Epoch 52 Batch 90/162] avg loss 0.00131824, throughput 12.9844K wps
[Epoch 52 Batch 120/162] avg loss 0.00136434, throughput 12.9619K wps
[Epoch 52 Batch 150/162] avg loss 0.0013825, throughput 12.8966K wps
Begin Testing...
[Epoch 52] train avg loss 0.0013548, test acc 0.9167, test avg loss 0.215506, throughput 12.9736K wps
[Epoch 53 Batch 30/162] avg loss 0.00124871, throughput 13.2703K wps
[Epoch 53 Batch 60/162] avg loss 0.00123016, throughput 12.8708K wps
[Epoch 53 Batch 90/162] avg loss 0.00140613, throughput 12.8648K wps
[Epoch 53 Batch 120/162] avg loss 0.00121588, throughput 12.9369K wps
[Epoch 53 Batch 150/162] avg loss 0.00141055, throughput 12.8664K wps
Begin Testing...
[Epoch 53] train avg loss 0.0013077, test acc 0.9144, test avg loss 0.213055, throughput 12.953K wps
[Epoch 54 Batch 30/162] avg loss 0.0010874, throughput 13.2299K wps
[Epoch 54 Batch 60/162] avg loss 0.0012986, throughput 12.7825K wps
[Epoch 54 Batch 90/162] avg loss 0.00134364, throughput 12.9084K wps
[Epoch 54 Batch 120/162] avg loss 0.00117904, throughput 12.8589K wps
[Epoch 54 Batch 150/162] avg loss 0.00133452, throughput 12.9284K wps
Begin Testing...
[Epoch 54] train avg loss 0.00123156, test acc 0.9133, test avg loss 0.219972, throughput 12.9343K wps
[Epoch 55 Batch 30/162] avg loss 0.0011579, throughput 13.1779K wps
[Epoch 55 Batch 60/162] avg loss 0.00117664, throughput 12.8138K wps
[Epoch 55 Batch 90/162] avg loss 0.00120486, throughput 12.9643K wps
[Epoch 55 Batch 120/162] avg loss 0.00122123, throughput 12.941K wps
[Epoch 55 Batch 150/162] avg loss 0.00127236, throughput 12.9204K wps
Begin Testing...
[Epoch 55] train avg loss 0.00120895, test acc 0.9167, test avg loss 0.213901, throughput 12.9569K wps
[Epoch 56 Batch 30/162] avg loss 0.00112006, throughput 13.2177K wps
[Epoch 56 Batch 60/162] avg loss 0.00128244, throughput 12.8996K wps
[Epoch 56 Batch 90/162] avg loss 0.0014261, throughput 12.8982K wps
[Epoch 56 Batch 120/162] avg loss 0.00111007, throughput 12.9462K wps
[Epoch 56 Batch 150/162] avg loss 0.00108653, throughput 12.9392K wps
Begin Testing...
[Epoch 56] train avg loss 0.00121583, test acc 0.9167, test avg loss 0.212912, throughput 12.9647K wps
[Epoch 57 Batch 30/162] avg loss 0.00123399, throughput 13.2818K wps
[Epoch 57 Batch 60/162] avg loss 0.00121211, throughput 12.8229K wps
[Epoch 57 Batch 90/162] avg loss 0.00104968, throughput 12.9003K wps
[Epoch 57 Batch 120/162] avg loss 0.00112569, throughput 12.9742K wps
[Epoch 57 Batch 150/162] avg loss 0.00124093, throughput 12.9325K wps
Begin Testing...
[Epoch 57] train avg loss 0.00116784, test acc 0.9167, test avg loss 0.211687, throughput 12.9745K wps
[Epoch 58 Batch 30/162] avg loss 0.00105187, throughput 13.1413K wps
[Epoch 58 Batch 60/162] avg loss 0.00108279, throughput 12.8014K wps
[Epoch 58 Batch 90/162] avg loss 0.00107506, throughput 12.8833K wps
[Epoch 58 Batch 120/162] avg loss 0.000940412, throughput 12.864K wps
[Epoch 58 Batch 150/162] avg loss 0.00119183, throughput 12.8746K wps
Begin Testing...
[Epoch 58] train avg loss 0.00107722, test acc 0.9167, test avg loss 0.212334, throughput 12.9087K wps
[Epoch 59 Batch 30/162] avg loss 0.000999831, throughput 13.1951K wps
[Epoch 59 Batch 60/162] avg loss 0.000892022, throughput 12.8542K wps
[Epoch 59 Batch 90/162] avg loss 0.00121686, throughput 12.9656K wps
[Epoch 59 Batch 120/162] avg loss 0.00108563, throughput 13.0067K wps
[Epoch 59 Batch 150/162] avg loss 0.00113274, throughput 12.9833K wps
Begin Testing...
[Epoch 59] train avg loss 0.00107689, test acc 0.9156, test avg loss 0.211419, throughput 12.9959K wps
Test loss 0.176073, test acc 0.9250
Total time cost 165.22s
9000 1000
[Epoch 0 Batch 30/162] avg loss 0.0148768, throughput 11.4587K wps
[Epoch 0 Batch 60/162] avg loss 0.0145128, throughput 12.8137K wps
[Epoch 0 Batch 90/162] avg loss 0.0138466, throughput 12.8893K wps
[Epoch 0 Batch 120/162] avg loss 0.0131534, throughput 12.8443K wps
[Epoch 0 Batch 150/162] avg loss 0.0132686, throughput 12.8029K wps
Begin Testing...
[Epoch 0] train avg loss 0.0138345, test acc 0.7056, test avg loss 0.592439, throughput 12.5673K wps
Observed Improvement.
Begin Testing...
[Epoch 1 Batch 30/162] avg loss 0.0122869, throughput 13.2799K wps
[Epoch 1 Batch 60/162] avg loss 0.012117, throughput 12.7979K wps
[Epoch 1 Batch 90/162] avg loss 0.0122319, throughput 12.8469K wps
[Epoch 1 Batch 120/162] avg loss 0.0117353, throughput 13.0203K wps
[Epoch 1 Batch 150/162] avg loss 0.0111854, throughput 12.9973K wps
Begin Testing...
[Epoch 1] train avg loss 0.0119013, test acc 0.7578, test avg loss 0.538608, throughput 12.9867K wps
Observed Improvement.
Begin Testing...
[Epoch 2 Batch 30/162] avg loss 0.0110994, throughput 13.2296K wps
[Epoch 2 Batch 60/162] avg loss 0.0111503, throughput 12.7956K wps
[Epoch 2 Batch 90/162] avg loss 0.0108511, throughput 12.8106K wps
[Epoch 2 Batch 120/162] avg loss 0.0106141, throughput 12.8607K wps
[Epoch 2 Batch 150/162] avg loss 0.0102822, throughput 12.8863K wps
Begin Testing...
[Epoch 2] train avg loss 0.0107621, test acc 0.8022, test avg loss 0.494347, throughput 12.9108K wps
Observed Improvement.
Begin Testing...
[Epoch 3 Batch 30/162] avg loss 0.00999932, throughput 13.3195K wps
[Epoch 3 Batch 60/162] avg loss 0.00984923, throughput 12.7753K wps
[Epoch 3 Batch 90/162] avg loss 0.0096933, throughput 12.8965K wps
[Epoch 3 Batch 120/162] avg loss 0.00973883, throughput 12.8864K wps
[Epoch 3 Batch 150/162] avg loss 0.00950527, throughput 12.7716K wps
Begin Testing...
[Epoch 3] train avg loss 0.00970125, test acc 0.8400, test avg loss 0.452377, throughput 12.9234K wps
Observed Improvement.
Begin Testing...
[Epoch 4 Batch 30/162] avg loss 0.00900992, throughput 13.1721K wps
[Epoch 4 Batch 60/162] avg loss 0.00903706, throughput 12.7253K wps
[Epoch 4 Batch 90/162] avg loss 0.0086454, throughput 12.906K wps
[Epoch 4 Batch 120/162] avg loss 0.00867648, throughput 12.8311K wps
[Epoch 4 Batch 150/162] avg loss 0.00877801, throughput 12.8066K wps
Begin Testing...
[Epoch 4] train avg loss 0.00881679, test acc 0.8656, test avg loss 0.407576, throughput 12.8901K wps
Observed Improvement.
Begin Testing...
[Epoch 5 Batch 30/162] avg loss 0.00852832, throughput 13.1874K wps
[Epoch 5 Batch 60/162] avg loss 0.00808794, throughput 12.8124K wps
[Epoch 5 Batch 90/162] avg loss 0.00785204, throughput 12.9167K wps
[Epoch 5 Batch 120/162] avg loss 0.00749488, throughput 12.9045K wps
[Epoch 5 Batch 150/162] avg loss 0.00787181, throughput 12.8973K wps
Begin Testing...
[Epoch 5] train avg loss 0.007935, test acc 0.8678, test avg loss 0.380919, throughput 12.9399K wps
Observed Improvement.
Begin Testing...
[Epoch 6 Batch 30/162] avg loss 0.00753647, throughput 13.2236K wps
[Epoch 6 Batch 60/162] avg loss 0.00719818, throughput 12.7768K wps
[Epoch 6 Batch 90/162] avg loss 0.00730997, throughput 12.8985K wps
[Epoch 6 Batch 120/162] avg loss 0.0071237, throughput 12.8084K wps
[Epoch 6 Batch 150/162] avg loss 0.00690823, throughput 12.8148K wps
Begin Testing...
[Epoch 6] train avg loss 0.00719817, test acc 0.8811, test avg loss 0.341779, throughput 12.8859K wps
Observed Improvement.
Begin Testing...
[Epoch 7 Batch 30/162] avg loss 0.00676727, throughput 13.2214K wps
[Epoch 7 Batch 60/162] avg loss 0.00668205, throughput 12.7439K wps
[Epoch 7 Batch 90/162] avg loss 0.00664854, throughput 12.8487K wps
[Epoch 7 Batch 120/162] avg loss 0.00648581, throughput 12.875K wps
[Epoch 7 Batch 150/162] avg loss 0.00667051, throughput 12.8916K wps
Begin Testing...
[Epoch 7] train avg loss 0.0066596, test acc 0.8833, test avg loss 0.315456, throughput 12.9129K wps
Observed Improvement.
Begin Testing...
[Epoch 8 Batch 30/162] avg loss 0.00585434, throughput 13.2883K wps
[Epoch 8 Batch 60/162] avg loss 0.00661242, throughput 12.7973K wps
[Epoch 8 Batch 90/162] avg loss 0.00641973, throughput 12.8917K wps
[Epoch 8 Batch 120/162] avg loss 0.0061027, throughput 12.9922K wps
[Epoch 8 Batch 150/162] avg loss 0.00619786, throughput 12.9372K wps
Begin Testing...
[Epoch 8] train avg loss 0.00624027, test acc 0.8889, test avg loss 0.297945, throughput 12.9754K wps
Observed Improvement.
Begin Testing...
[Epoch 9 Batch 30/162] avg loss 0.00612882, throughput 13.2271K wps
[Epoch 9 Batch 60/162] avg loss 0.00554386, throughput 12.7953K wps
[Epoch 9 Batch 90/162] avg loss 0.00585424, throughput 12.836K wps
[Epoch 9 Batch 120/162] avg loss 0.00596589, throughput 12.8829K wps
[Epoch 9 Batch 150/162] avg loss 0.00554916, throughput 12.9295K wps
Begin Testing...
[Epoch 9] train avg loss 0.00579789, test acc 0.8878, test avg loss 0.2875, throughput 12.932K wps
[Epoch 10 Batch 30/162] avg loss 0.00538098, throughput 13.2316K wps
[Epoch 10 Batch 60/162] avg loss 0.00553926, throughput 12.7831K wps
[Epoch 10 Batch 90/162] avg loss 0.00550779, throughput 12.9912K wps
[Epoch 10 Batch 120/162] avg loss 0.00561132, throughput 12.9559K wps
[Epoch 10 Batch 150/162] avg loss 0.00582475, throughput 12.8583K wps
Begin Testing...
[Epoch 10] train avg loss 0.00558423, test acc 0.8967, test avg loss 0.272356, throughput 12.9486K wps
Observed Improvement.
Begin Testing...
[Epoch 11 Batch 30/162] avg loss 0.00525381, throughput 13.2461K wps
[Epoch 11 Batch 60/162] avg loss 0.00564898, throughput 12.8175K wps
[Epoch 11 Batch 90/162] avg loss 0.0048706, throughput 12.824K wps
[Epoch 11 Batch 120/162] avg loss 0.00565024, throughput 12.8615K wps
[Epoch 11 Batch 150/162] avg loss 0.00497878, throughput 12.929K wps
Begin Testing...
[Epoch 11] train avg loss 0.00522548, test acc 0.8911, test avg loss 0.264588, throughput 12.9196K wps
[Epoch 12 Batch 30/162] avg loss 0.00517249, throughput 13.2118K wps
[Epoch 12 Batch 60/162] avg loss 0.00525254, throughput 12.7525K wps
[Epoch 12 Batch 90/162] avg loss 0.00508385, throughput 12.9155K wps
[Epoch 12 Batch 120/162] avg loss 0.00518328, throughput 12.8845K wps
[Epoch 12 Batch 150/162] avg loss 0.00483317, throughput 12.8193K wps
Begin Testing...
[Epoch 12] train avg loss 0.00510633, test acc 0.9022, test avg loss 0.253233, throughput 12.9077K wps
Observed Improvement.
Begin Testing...
[Epoch 13 Batch 30/162] avg loss 0.00488302, throughput 13.1213K wps
[Epoch 13 Batch 60/162] avg loss 0.00491354, throughput 12.7561K wps
[Epoch 13 Batch 90/162] avg loss 0.00507471, throughput 12.8635K wps
[Epoch 13 Batch 120/162] avg loss 0.00440197, throughput 12.8186K wps
[Epoch 13 Batch 150/162] avg loss 0.00492465, throughput 12.9339K wps
Begin Testing...
[Epoch 13] train avg loss 0.00484771, test acc 0.9022, test avg loss 0.248518, throughput 12.898K wps
Observed Improvement.
Begin Testing...
[Epoch 14 Batch 30/162] avg loss 0.00487016, throughput 13.1863K wps
[Epoch 14 Batch 60/162] avg loss 0.00486024, throughput 12.7658K wps
[Epoch 14 Batch 90/162] avg loss 0.00448268, throughput 12.972K wps
[Epoch 14 Batch 120/162] avg loss 0.00462379, throughput 12.8079K wps
[Epoch 14 Batch 150/162] avg loss 0.00457268, throughput 12.8676K wps
Begin Testing...
[Epoch 14] train avg loss 0.00466716, test acc 0.9011, test avg loss 0.256687, throughput 12.9023K wps
[Epoch 15 Batch 30/162] avg loss 0.00439684, throughput 13.2729K wps
[Epoch 15 Batch 60/162] avg loss 0.00442508, throughput 12.8267K wps
[Epoch 15 Batch 90/162] avg loss 0.00414422, throughput 12.9294K wps
[Epoch 15 Batch 120/162] avg loss 0.00445422, throughput 12.935K wps
[Epoch 15 Batch 150/162] avg loss 0.00445228, throughput 12.9127K wps
Begin Testing...
[Epoch 15] train avg loss 0.00443403, test acc 0.9067, test avg loss 0.235613, throughput 12.9635K wps
Observed Improvement.
Begin Testing...
[Epoch 16 Batch 30/162] avg loss 0.00407811, throughput 13.1783K wps
[Epoch 16 Batch 60/162] avg loss 0.00417622, throughput 12.6942K wps
[Epoch 16 Batch 90/162] avg loss 0.00459639, throughput 12.8324K wps
[Epoch 16 Batch 120/162] avg loss 0.00414984, throughput 12.8782K wps
[Epoch 16 Batch 150/162] avg loss 0.00411651, throughput 12.8474K wps
Begin Testing...
[Epoch 16] train avg loss 0.00422485, test acc 0.9056, test avg loss 0.235045, throughput 12.8818K wps
[Epoch 17 Batch 30/162] avg loss 0.00400063, throughput 13.2289K wps
[Epoch 17 Batch 60/162] avg loss 0.00375764, throughput 12.8146K wps
[Epoch 17 Batch 90/162] avg loss 0.00418914, throughput 12.9707K wps
[Epoch 17 Batch 120/162] avg loss 0.00385229, throughput 12.9318K wps
[Epoch 17 Batch 150/162] avg loss 0.00435415, throughput 12.9502K wps
Begin Testing...
[Epoch 17] train avg loss 0.00406398, test acc 0.9122, test avg loss 0.229804, throughput 12.9748K wps
Observed Improvement.
Begin Testing...
[Epoch 18 Batch 30/162] avg loss 0.0038433, throughput 13.2594K wps
[Epoch 18 Batch 60/162] avg loss 0.0039518, throughput 12.9421K wps
[Epoch 18 Batch 90/162] avg loss 0.0040477, throughput 12.9203K wps
[Epoch 18 Batch 120/162] avg loss 0.00389849, throughput 12.9864K wps
[Epoch 18 Batch 150/162] avg loss 0.00428105, throughput 12.8274K wps
Begin Testing...
[Epoch 18] train avg loss 0.00401778, test acc 0.9122, test avg loss 0.225287, throughput 12.9844K wps
Observed Improvement.
Begin Testing...
[Epoch 19 Batch 30/162] avg loss 0.00402856, throughput 13.2125K wps
[Epoch 19 Batch 60/162] avg loss 0.00384465, throughput 12.7948K wps
[Epoch 19 Batch 90/162] avg loss 0.00391319, throughput 12.8422K wps
[Epoch 19 Batch 120/162] avg loss 0.00356406, throughput 12.5858K wps
[Epoch 19 Batch 150/162] avg loss 0.00389058, throughput 12.9863K wps
Begin Testing...
[Epoch 19] train avg loss 0.00385882, test acc 0.9156, test avg loss 0.233091, throughput 12.8886K wps
Observed Improvement.
Begin Testing...
[Epoch 20 Batch 30/162] avg loss 0.00403331, throughput 13.1324K wps
[Epoch 20 Batch 60/162] avg loss 0.00387855, throughput 12.7331K wps
[Epoch 20 Batch 90/162] avg loss 0.00341598, throughput 12.8319K wps
[Epoch 20 Batch 120/162] avg loss 0.00360782, throughput 12.8814K wps
[Epoch 20 Batch 150/162] avg loss 0.00353724, throughput 12.9221K wps
Begin Testing...
[Epoch 20] train avg loss 0.00373603, test acc 0.9144, test avg loss 0.220502, throughput 12.8966K wps
[Epoch 21 Batch 30/162] avg loss 0.00342243, throughput 13.1822K wps
[Epoch 21 Batch 60/162] avg loss 0.00384629, throughput 12.808K wps
[Epoch 21 Batch 90/162] avg loss 0.00370066, throughput 12.9192K wps
[Epoch 21 Batch 120/162] avg loss 0.00359601, throughput 12.9112K wps
[Epoch 21 Batch 150/162] avg loss 0.00362288, throughput 12.8474K wps
Begin Testing...
[Epoch 21] train avg loss 0.00362541, test acc 0.9178, test avg loss 0.215876, throughput 12.9265K wps
Observed Improvement.
Begin Testing...
[Epoch 22 Batch 30/162] avg loss 0.0036641, throughput 13.2897K wps
[Epoch 22 Batch 60/162] avg loss 0.00341692, throughput 12.7942K wps
[Epoch 22 Batch 90/162] avg loss 0.00341885, throughput 12.9347K wps
[Epoch 22 Batch 120/162] avg loss 0.00341948, throughput 12.8826K wps
[Epoch 22 Batch 150/162] avg loss 0.00374957, throughput 12.914K wps
Begin Testing...
[Epoch 22] train avg loss 0.00351106, test acc 0.9189, test avg loss 0.211415, throughput 12.9462K wps
Observed Improvement.
Begin Testing...
[Epoch 23 Batch 30/162] avg loss 0.00331712, throughput 13.2466K wps
[Epoch 23 Batch 60/162] avg loss 0.00338379, throughput 12.8608K wps
[Epoch 23 Batch 90/162] avg loss 0.00318765, throughput 12.907K wps
[Epoch 23 Batch 120/162] avg loss 0.00340892, throughput 12.8837K wps
[Epoch 23 Batch 150/162] avg loss 0.00362877, throughput 12.9286K wps
Begin Testing...
[Epoch 23] train avg loss 0.00335689, test acc 0.9189, test avg loss 0.211919, throughput 12.9624K wps
Observed Improvement.
Begin Testing...
[Epoch 24 Batch 30/162] avg loss 0.00308133, throughput 13.2971K wps
[Epoch 24 Batch 60/162] avg loss 0.00322222, throughput 12.7618K wps
[Epoch 24 Batch 90/162] avg loss 0.00350972, throughput 12.9163K wps
[Epoch 24 Batch 120/162] avg loss 0.00350159, throughput 12.9725K wps
[Epoch 24 Batch 150/162] avg loss 0.00325991, throughput 12.8635K wps
Begin Testing...
[Epoch 24] train avg loss 0.00329751, test acc 0.9256, test avg loss 0.212482, throughput 12.9498K wps
Observed Improvement.
Begin Testing...
[Epoch 25 Batch 30/162] avg loss 0.00332793, throughput 13.1855K wps
[Epoch 25 Batch 60/162] avg loss 0.00330972, throughput 12.8669K wps
[Epoch 25 Batch 90/162] avg loss 0.00319124, throughput 12.8338K wps
[Epoch 25 Batch 120/162] avg loss 0.00296958, throughput 12.8243K wps
[Epoch 25 Batch 150/162] avg loss 0.00289647, throughput 12.8597K wps
Begin Testing...
[Epoch 25] train avg loss 0.00314383, test acc 0.9211, test avg loss 0.207908, throughput 12.9087K wps
[Epoch 26 Batch 30/162] avg loss 0.00310868, throughput 13.1742K wps
[Epoch 26 Batch 60/162] avg loss 0.00310845, throughput 12.841K wps
[Epoch 26 Batch 90/162] avg loss 0.00281678, throughput 12.9677K wps
[Epoch 26 Batch 120/162] avg loss 0.00316341, throughput 12.9168K wps
[Epoch 26 Batch 150/162] avg loss 0.00312761, throughput 12.7848K wps
Begin Testing...
[Epoch 26] train avg loss 0.00305971, test acc 0.9244, test avg loss 0.208301, throughput 12.9332K wps
[Epoch 27 Batch 30/162] avg loss 0.00295243, throughput 13.3143K wps
[Epoch 27 Batch 60/162] avg loss 0.00301047, throughput 12.8524K wps
[Epoch 27 Batch 90/162] avg loss 0.00289972, throughput 12.9539K wps
[Epoch 27 Batch 120/162] avg loss 0.00299054, throughput 12.945K wps
[Epoch 27 Batch 150/162] avg loss 0.00282548, throughput 12.9308K wps
Begin Testing...
[Epoch 27] train avg loss 0.00296194, test acc 0.9244, test avg loss 0.203304, throughput 12.9935K wps
[Epoch 28 Batch 30/162] avg loss 0.00284307, throughput 13.2804K wps
[Epoch 28 Batch 60/162] avg loss 0.00269438, throughput 12.7853K wps
[Epoch 28 Batch 90/162] avg loss 0.00284769, throughput 12.9304K wps
[Epoch 28 Batch 120/162] avg loss 0.0026812, throughput 12.9114K wps
[Epoch 28 Batch 150/162] avg loss 0.00299686, throughput 12.838K wps
Begin Testing...
[Epoch 28] train avg loss 0.00282707, test acc 0.9189, test avg loss 0.203624, throughput 12.9407K wps
[Epoch 29 Batch 30/162] avg loss 0.00287656, throughput 13.1582K wps
[Epoch 29 Batch 60/162] avg loss 0.00273395, throughput 12.8914K wps
[Epoch 29 Batch 90/162] avg loss 0.00246119, throughput 12.9061K wps
[Epoch 29 Batch 120/162] avg loss 0.00316321, throughput 12.8972K wps
[Epoch 29 Batch 150/162] avg loss 0.00280932, throughput 12.9328K wps
Begin Testing...
[Epoch 29] train avg loss 0.00278947, test acc 0.9256, test avg loss 0.203383, throughput 12.9522K wps
Observed Improvement.
Begin Testing...
[Epoch 30 Batch 30/162] avg loss 0.00277728, throughput 13.147K wps
[Epoch 30 Batch 60/162] avg loss 0.00263046, throughput 12.7733K wps
[Epoch 30 Batch 90/162] avg loss 0.00279436, throughput 12.8304K wps
[Epoch 30 Batch 120/162] avg loss 0.00272032, throughput 12.833K wps
[Epoch 30 Batch 150/162] avg loss 0.00251176, throughput 12.8668K wps
Begin Testing...
[Epoch 30] train avg loss 0.00270112, test acc 0.9267, test avg loss 0.199243, throughput 12.8855K wps
Observed Improvement.
Begin Testing...
[Epoch 31 Batch 30/162] avg loss 0.00243131, throughput 13.2192K wps
[Epoch 31 Batch 60/162] avg loss 0.00280161, throughput 12.7386K wps
[Epoch 31 Batch 90/162] avg loss 0.0024761, throughput 12.8254K wps
[Epoch 31 Batch 120/162] avg loss 0.00248498, throughput 12.8877K wps
[Epoch 31 Batch 150/162] avg loss 0.00258896, throughput 12.9356K wps
Begin Testing...
[Epoch 31] train avg loss 0.00258888, test acc 0.9244, test avg loss 0.198463, throughput 12.9234K wps
[Epoch 32 Batch 30/162] avg loss 0.00259328, throughput 13.1038K wps
[Epoch 32 Batch 60/162] avg loss 0.00266962, throughput 12.6504K wps
[Epoch 32 Batch 90/162] avg loss 0.00252579, throughput 12.9251K wps
[Epoch 32 Batch 120/162] avg loss 0.00247007, throughput 12.9012K wps
[Epoch 32 Batch 150/162] avg loss 0.00241196, throughput 12.7334K wps
Begin Testing...
[Epoch 32] train avg loss 0.00252748, test acc 0.9278, test avg loss 0.197316, throughput 12.8618K wps
Observed Improvement.
Begin Testing...
[Epoch 33 Batch 30/162] avg loss 0.00239868, throughput 13.0745K wps
[Epoch 33 Batch 60/162] avg loss 0.00246449, throughput 12.81K wps
[Epoch 33 Batch 90/162] avg loss 0.00230626, throughput 12.8975K wps
[Epoch 33 Batch 120/162] avg loss 0.00267433, throughput 12.8569K wps
[Epoch 33 Batch 150/162] avg loss 0.00238924, throughput 12.9343K wps
Begin Testing...
[Epoch 33] train avg loss 0.00244268, test acc 0.9289, test avg loss 0.199155, throughput 12.9181K wps
Observed Improvement.
Begin Testing...
[Epoch 34 Batch 30/162] avg loss 0.00254187, throughput 13.2329K wps
[Epoch 34 Batch 60/162] avg loss 0.00221477, throughput 12.7217K wps
[Epoch 34 Batch 90/162] avg loss 0.00207434, throughput 12.8816K wps
[Epoch 34 Batch 120/162] avg loss 0.0024721, throughput 12.9115K wps
[Epoch 34 Batch 150/162] avg loss 0.00228037, throughput 12.9481K wps
Begin Testing...
[Epoch 34] train avg loss 0.00233763, test acc 0.9322, test avg loss 0.194734, throughput 12.9338K wps
Observed Improvement.
Begin Testing...
[Epoch 35 Batch 30/162] avg loss 0.00237067, throughput 13.2788K wps
[Epoch 35 Batch 60/162] avg loss 0.00238545, throughput 12.8472K wps
[Epoch 35 Batch 90/162] avg loss 0.00243876, throughput 12.8803K wps
[Epoch 35 Batch 120/162] avg loss 0.00230694, throughput 12.8234K wps
[Epoch 35 Batch 150/162] avg loss 0.00227332, throughput 12.9144K wps
Begin Testing...
[Epoch 35] train avg loss 0.00234529, test acc 0.9311, test avg loss 0.193353, throughput 12.9457K wps
[Epoch 36 Batch 30/162] avg loss 0.00208405, throughput 13.2862K wps
[Epoch 36 Batch 60/162] avg loss 0.00215239, throughput 12.8052K wps
[Epoch 36 Batch 90/162] avg loss 0.00209382, throughput 12.9736K wps
[Epoch 36 Batch 120/162] avg loss 0.00220958, throughput 12.9304K wps
[Epoch 36 Batch 150/162] avg loss 0.00214915, throughput 12.9651K wps
Begin Testing...
[Epoch 36] train avg loss 0.00214794, test acc 0.9367, test avg loss 0.191476, throughput 12.9885K wps
Observed Improvement.
Begin Testing...
[Epoch 37 Batch 30/162] avg loss 0.00206244, throughput 13.2151K wps
[Epoch 37 Batch 60/162] avg loss 0.00189788, throughput 12.8079K wps
[Epoch 37 Batch 90/162] avg loss 0.0021537, throughput 12.9319K wps
[Epoch 37 Batch 120/162] avg loss 0.00237231, throughput 12.9913K wps
[Epoch 37 Batch 150/162] avg loss 0.00218067, throughput 12.9483K wps
Begin Testing...
[Epoch 37] train avg loss 0.00211494, test acc 0.9289, test avg loss 0.191144, throughput 12.9779K wps
[Epoch 38 Batch 30/162] avg loss 0.00187782, throughput 13.226K wps
[Epoch 38 Batch 60/162] avg loss 0.00213014, throughput 12.7833K wps
[Epoch 38 Batch 90/162] avg loss 0.00201093, throughput 12.9028K wps
[Epoch 38 Batch 120/162] avg loss 0.00225919, throughput 12.9658K wps
[Epoch 38 Batch 150/162] avg loss 0.00208672, throughput 12.9563K wps
Begin Testing...
[Epoch 38] train avg loss 0.00204447, test acc 0.9344, test avg loss 0.191361, throughput 12.964K wps
[Epoch 39 Batch 30/162] avg loss 0.00213781, throughput 13.1015K wps
[Epoch 39 Batch 60/162] avg loss 0.00213607, throughput 12.7329K wps
[Epoch 39 Batch 90/162] avg loss 0.00203532, throughput 12.8912K wps
[Epoch 39 Batch 120/162] avg loss 0.00175335, throughput 12.904K wps
[Epoch 39 Batch 150/162] avg loss 0.00208331, throughput 12.8909K wps
Begin Testing...
[Epoch 39] train avg loss 0.002019, test acc 0.9311, test avg loss 0.191516, throughput 12.9022K wps
[Epoch 40 Batch 30/162] avg loss 0.00187683, throughput 13.1788K wps
[Epoch 40 Batch 60/162] avg loss 0.00183746, throughput 12.8034K wps
[Epoch 40 Batch 90/162] avg loss 0.00190911, throughput 12.894K wps
[Epoch 40 Batch 120/162] avg loss 0.00200912, throughput 12.8093K wps
[Epoch 40 Batch 150/162] avg loss 0.00204651, throughput 12.8283K wps
Begin Testing...
[Epoch 40] train avg loss 0.0019373, test acc 0.9322, test avg loss 0.19151, throughput 12.8978K wps
[Epoch 41 Batch 30/162] avg loss 0.00203129, throughput 13.2385K wps
[Epoch 41 Batch 60/162] avg loss 0.00177921, throughput 12.7927K wps
[Epoch 41 Batch 90/162] avg loss 0.00178983, throughput 12.7769K wps
[Epoch 41 Batch 120/162] avg loss 0.00171505, throughput 12.9153K wps
[Epoch 41 Batch 150/162] avg loss 0.00190752, throughput 12.9191K wps
Begin Testing...
[Epoch 41] train avg loss 0.00183589, test acc 0.9356, test avg loss 0.191103, throughput 12.9244K wps
[Epoch 42 Batch 30/162] avg loss 0.00149824, throughput 13.1127K wps
[Epoch 42 Batch 60/162] avg loss 0.00167313, throughput 12.875K wps
[Epoch 42 Batch 90/162] avg loss 0.00185168, throughput 12.927K wps
[Epoch 42 Batch 120/162] avg loss 0.00199876, throughput 12.9001K wps
[Epoch 42 Batch 150/162] avg loss 0.00198085, throughput 12.9406K wps
Begin Testing...
[Epoch 42] train avg loss 0.00181053, test acc 0.9344, test avg loss 0.189387, throughput 12.9505K wps
[Epoch 43 Batch 30/162] avg loss 0.00189074, throughput 13.1837K wps
[Epoch 43 Batch 60/162] avg loss 0.00165084, throughput 12.8269K wps
[Epoch 43 Batch 90/162] avg loss 0.00202938, throughput 12.9629K wps
[Epoch 43 Batch 120/162] avg loss 0.00168822, throughput 12.97K wps
[Epoch 43 Batch 150/162] avg loss 0.00160168, throughput 12.7338K wps
Begin Testing...
[Epoch 43] train avg loss 0.00175307, test acc 0.9389, test avg loss 0.187736, throughput 12.913K wps
Observed Improvement.
Begin Testing...
[Epoch 44 Batch 30/162] avg loss 0.00179889, throughput 13.1762K wps
[Epoch 44 Batch 60/162] avg loss 0.00186465, throughput 12.7347K wps
[Epoch 44 Batch 90/162] avg loss 0.00173629, throughput 12.9453K wps
[Epoch 44 Batch 120/162] avg loss 0.00167306, throughput 12.8678K wps
[Epoch 44 Batch 150/162] avg loss 0.00156696, throughput 12.8815K wps
Begin Testing...
[Epoch 44] train avg loss 0.00169965, test acc 0.9378, test avg loss 0.189173, throughput 12.9159K wps
[Epoch 45 Batch 30/162] avg loss 0.00160696, throughput 13.2492K wps
[Epoch 45 Batch 60/162] avg loss 0.00161223, throughput 12.8071K wps
[Epoch 45 Batch 90/162] avg loss 0.00166931, throughput 12.929K wps
[Epoch 45 Batch 120/162] avg loss 0.00163428, throughput 12.9822K wps
[Epoch 45 Batch 150/162] avg loss 0.00161478, throughput 12.9276K wps
Begin Testing...
[Epoch 45] train avg loss 0.00165192, test acc 0.9344, test avg loss 0.187533, throughput 12.9701K wps
[Epoch 46 Batch 30/162] avg loss 0.00158423, throughput 13.1632K wps
[Epoch 46 Batch 60/162] avg loss 0.00154725, throughput 12.7843K wps
[Epoch 46 Batch 90/162] avg loss 0.00164302, throughput 12.9581K wps
[Epoch 46 Batch 120/162] avg loss 0.00150778, throughput 12.9554K wps
[Epoch 46 Batch 150/162] avg loss 0.00169419, throughput 12.942K wps
Begin Testing...
[Epoch 46] train avg loss 0.00157767, test acc 0.9333, test avg loss 0.186847, throughput 12.9617K wps
[Epoch 47 Batch 30/162] avg loss 0.00133546, throughput 13.2032K wps
[Epoch 47 Batch 60/162] avg loss 0.00139733, throughput 12.9036K wps
[Epoch 47 Batch 90/162] avg loss 0.00153276, throughput 12.9426K wps
[Epoch 47 Batch 120/162] avg loss 0.00150322, throughput 12.9267K wps
[Epoch 47 Batch 150/162] avg loss 0.00147447, throughput 12.9297K wps
Begin Testing...
[Epoch 47] train avg loss 0.00146307, test acc 0.9367, test avg loss 0.187455, throughput 12.9772K wps
[Epoch 48 Batch 30/162] avg loss 0.00131487, throughput 13.2342K wps
[Epoch 48 Batch 60/162] avg loss 0.00140613, throughput 12.8698K wps
[Epoch 48 Batch 90/162] avg loss 0.00150624, throughput 12.9018K wps
[Epoch 48 Batch 120/162] avg loss 0.00172992, throughput 12.847K wps
[Epoch 48 Batch 150/162] avg loss 0.00140126, throughput 12.7902K wps
Begin Testing...
[Epoch 48] train avg loss 0.00145853, test acc 0.9356, test avg loss 0.18647, throughput 12.9244K wps
[Epoch 49 Batch 30/162] avg loss 0.00139211, throughput 13.2757K wps
[Epoch 49 Batch 60/162] avg loss 0.00143257, throughput 12.7577K wps
[Epoch 49 Batch 90/162] avg loss 0.0013755, throughput 12.9651K wps
[Epoch 49 Batch 120/162] avg loss 0.0014494, throughput 12.8963K wps
[Epoch 49 Batch 150/162] avg loss 0.00153992, throughput 12.7211K wps
Begin Testing...
[Epoch 49] train avg loss 0.00144255, test acc 0.9367, test avg loss 0.188981, throughput 12.901K wps
[Epoch 50 Batch 30/162] avg loss 0.00128853, throughput 13.2156K wps
[Epoch 50 Batch 60/162] avg loss 0.00139493, throughput 12.8035K wps
[Epoch 50 Batch 90/162] avg loss 0.00136882, throughput 12.8801K wps
[Epoch 50 Batch 120/162] avg loss 0.00155047, throughput 12.8284K wps
[Epoch 50 Batch 150/162] avg loss 0.00130476, throughput 12.9866K wps
Begin Testing...
[Epoch 50] train avg loss 0.00137, test acc 0.9356, test avg loss 0.195422, throughput 12.9411K wps
[Epoch 51 Batch 30/162] avg loss 0.00122614, throughput 13.1793K wps
[Epoch 51 Batch 60/162] avg loss 0.00133187, throughput 12.7994K wps
[Epoch 51 Batch 90/162] avg loss 0.00154857, throughput 12.9222K wps
[Epoch 51 Batch 120/162] avg loss 0.00130553, throughput 12.9374K wps
[Epoch 51 Batch 150/162] avg loss 0.00146291, throughput 12.8428K wps
Begin Testing...
[Epoch 51] train avg loss 0.00135169, test acc 0.9378, test avg loss 0.191906, throughput 12.9286K wps
[Epoch 52 Batch 30/162] avg loss 0.00129824, throughput 13.1402K wps
[Epoch 52 Batch 60/162] avg loss 0.00118087, throughput 12.7448K wps
[Epoch 52 Batch 90/162] avg loss 0.00130818, throughput 12.9173K wps
[Epoch 52 Batch 120/162] avg loss 0.00131782, throughput 12.9564K wps
[Epoch 52 Batch 150/162] avg loss 0.00141402, throughput 12.948K wps
Begin Testing...
[Epoch 52] train avg loss 0.00130736, test acc 0.9378, test avg loss 0.186271, throughput 12.9393K wps
[Epoch 53 Batch 30/162] avg loss 0.00121684, throughput 13.2874K wps
[Epoch 53 Batch 60/162] avg loss 0.00109955, throughput 12.7834K wps
[Epoch 53 Batch 90/162] avg loss 0.00129073, throughput 12.8958K wps
[Epoch 53 Batch 120/162] avg loss 0.00142804, throughput 12.9352K wps
[Epoch 53 Batch 150/162] avg loss 0.00122676, throughput 12.9441K wps
Begin Testing...
[Epoch 53] train avg loss 0.00124573, test acc 0.9356, test avg loss 0.193169, throughput 12.9639K wps
[Epoch 54 Batch 30/162] avg loss 0.00141578, throughput 13.2469K wps
[Epoch 54 Batch 60/162] avg loss 0.00117285, throughput 12.7973K wps
[Epoch 54 Batch 90/162] avg loss 0.00114824, throughput 12.9786K wps
[Epoch 54 Batch 120/162] avg loss 0.00111666, throughput 12.8827K wps
[Epoch 54 Batch 150/162] avg loss 0.00114239, throughput 12.937K wps
Begin Testing...
[Epoch 54] train avg loss 0.00120763, test acc 0.9367, test avg loss 0.187634, throughput 12.9638K wps
[Epoch 55 Batch 30/162] avg loss 0.0010194, throughput 13.1487K wps
[Epoch 55 Batch 60/162] avg loss 0.0010321, throughput 12.7862K wps
[Epoch 55 Batch 90/162] avg loss 0.00138385, throughput 12.8873K wps
[Epoch 55 Batch 120/162] avg loss 0.0010704, throughput 12.8741K wps
[Epoch 55 Batch 150/162] avg loss 0.00128454, throughput 12.8436K wps
Begin Testing...
[Epoch 55] train avg loss 0.00117433, test acc 0.9389, test avg loss 0.184978, throughput 12.9037K wps
Observed Improvement.
Begin Testing...
[Epoch 56 Batch 30/162] avg loss 0.00113546, throughput 13.187K wps
[Epoch 56 Batch 60/162] avg loss 0.00125723, throughput 12.8738K wps
[Epoch 56 Batch 90/162] avg loss 0.00110112, throughput 12.9598K wps
[Epoch 56 Batch 120/162] avg loss 0.00108005, throughput 12.9455K wps
[Epoch 56 Batch 150/162] avg loss 0.00134507, throughput 12.9574K wps
Begin Testing...
[Epoch 56] train avg loss 0.00117551, test acc 0.9333, test avg loss 0.187436, throughput 12.9779K wps
[Epoch 57 Batch 30/162] avg loss 0.00106544, throughput 13.2862K wps
[Epoch 57 Batch 60/162] avg loss 0.00114324, throughput 12.8005K wps
[Epoch 57 Batch 90/162] avg loss 0.00124408, throughput 12.9698K wps
[Epoch 57 Batch 120/162] avg loss 0.00114754, throughput 12.9724K wps
[Epoch 57 Batch 150/162] avg loss 0.000924603, throughput 12.9478K wps
Begin Testing...
[Epoch 57] train avg loss 0.0011036, test acc 0.9333, test avg loss 0.188113, throughput 12.9904K wps
[Epoch 58 Batch 30/162] avg loss 0.000968913, throughput 13.2169K wps
[Epoch 58 Batch 60/162] avg loss 0.0010089, throughput 12.8786K wps
[Epoch 58 Batch 90/162] avg loss 0.00108045, throughput 12.9811K wps
[Epoch 58 Batch 120/162] avg loss 0.00100219, throughput 12.9902K wps
[Epoch 58 Batch 150/162] avg loss 0.00108726, throughput 12.9556K wps
Begin Testing...
[Epoch 58] train avg loss 0.0010369, test acc 0.9356, test avg loss 0.186033, throughput 12.9966K wps
[Epoch 59 Batch 30/162] avg loss 0.00102605, throughput 13.2896K wps
[Epoch 59 Batch 60/162] avg loss 0.000861122, throughput 12.827K wps
[Epoch 59 Batch 90/162] avg loss 0.00112788, throughput 12.968K wps
[Epoch 59 Batch 120/162] avg loss 0.00103557, throughput 12.9159K wps
[Epoch 59 Batch 150/162] avg loss 0.000970085, throughput 12.9009K wps
Begin Testing...
[Epoch 59] train avg loss 0.0010142, test acc 0.9367, test avg loss 0.186481, throughput 12.9766K wps
Test loss 0.171716, test acc 0.9380
Total time cost 166.03s
9000 1000
[Epoch 0 Batch 30/162] avg loss 0.0149044, throughput 11.5694K wps
[Epoch 0 Batch 60/162] avg loss 0.0145361, throughput 12.8816K wps
[Epoch 0 Batch 90/162] avg loss 0.0134796, throughput 12.826K wps
[Epoch 0 Batch 120/162] avg loss 0.0131312, throughput 12.8408K wps
[Epoch 0 Batch 150/162] avg loss 0.0131959, throughput 12.8577K wps
Begin Testing...
[Epoch 0] train avg loss 0.0137689, test acc 0.6811, test avg loss 0.611316, throughput 12.5905K wps
Observed Improvement.
Begin Testing...
[Epoch 1 Batch 30/162] avg loss 0.0127667, throughput 13.2478K wps
[Epoch 1 Batch 60/162] avg loss 0.0123152, throughput 12.8799K wps
[Epoch 1 Batch 90/162] avg loss 0.0122798, throughput 12.9356K wps
[Epoch 1 Batch 120/162] avg loss 0.0120117, throughput 12.7834K wps
[Epoch 1 Batch 150/162] avg loss 0.0118938, throughput 12.8567K wps
Begin Testing...
[Epoch 1] train avg loss 0.0122016, test acc 0.7356, test avg loss 0.56214, throughput 12.938K wps
Observed Improvement.
Begin Testing...
[Epoch 2 Batch 30/162] avg loss 0.0111572, throughput 13.1446K wps
[Epoch 2 Batch 60/162] avg loss 0.0110583, throughput 12.8014K wps
[Epoch 2 Batch 90/162] avg loss 0.0110506, throughput 12.8583K wps
[Epoch 2 Batch 120/162] avg loss 0.0106997, throughput 12.7522K wps
[Epoch 2 Batch 150/162] avg loss 0.0103178, throughput 12.6932K wps
Begin Testing...
[Epoch 2] train avg loss 0.0108346, test acc 0.7956, test avg loss 0.518601, throughput 12.8466K wps
Observed Improvement.
Begin Testing...
[Epoch 3 Batch 30/162] avg loss 0.00995923, throughput 13.1214K wps
[Epoch 3 Batch 60/162] avg loss 0.0100407, throughput 12.8615K wps
[Epoch 3 Batch 90/162] avg loss 0.00979146, throughput 12.8841K wps
[Epoch 3 Batch 120/162] avg loss 0.00969141, throughput 12.7957K wps
[Epoch 3 Batch 150/162] avg loss 0.00970535, throughput 12.8621K wps
Begin Testing...
[Epoch 3] train avg loss 0.00985022, test acc 0.8344, test avg loss 0.47435, throughput 12.9094K wps
Observed Improvement.
Begin Testing...
[Epoch 4 Batch 30/162] avg loss 0.00931799, throughput 13.2813K wps
[Epoch 4 Batch 60/162] avg loss 0.00906745, throughput 12.8046K wps
[Epoch 4 Batch 90/162] avg loss 0.00877808, throughput 12.8226K wps
[Epoch 4 Batch 120/162] avg loss 0.00918093, throughput 12.8401K wps
[Epoch 4 Batch 150/162] avg loss 0.00841705, throughput 12.9439K wps
Begin Testing...
[Epoch 4] train avg loss 0.00889724, test acc 0.8633, test avg loss 0.435607, throughput 12.9273K wps
Observed Improvement.
Begin Testing...
[Epoch 5 Batch 30/162] avg loss 0.00841042, throughput 13.249K wps
[Epoch 5 Batch 60/162] avg loss 0.00838398, throughput 12.7539K wps
[Epoch 5 Batch 90/162] avg loss 0.00842907, throughput 12.7975K wps
[Epoch 5 Batch 120/162] avg loss 0.00786293, throughput 12.9213K wps
[Epoch 5 Batch 150/162] avg loss 0.00772782, throughput 12.8106K wps
Begin Testing...
[Epoch 5] train avg loss 0.00812961, test acc 0.8833, test avg loss 0.400389, throughput 12.9025K wps
Observed Improvement.
Begin Testing...
[Epoch 6 Batch 30/162] avg loss 0.00742209, throughput 13.2541K wps
[Epoch 6 Batch 60/162] avg loss 0.00741598, throughput 12.7168K wps
[Epoch 6 Batch 90/162] avg loss 0.00744054, throughput 12.8108K wps
[Epoch 6 Batch 120/162] avg loss 0.00715802, throughput 12.8688K wps
[Epoch 6 Batch 150/162] avg loss 0.0072793, throughput 12.9269K wps
Begin Testing...
[Epoch 6] train avg loss 0.00731388, test acc 0.8844, test avg loss 0.368259, throughput 12.767K wps
Observed Improvement.
Begin Testing...
[Epoch 7 Batch 30/162] avg loss 0.00687358, throughput 13.1999K wps
[Epoch 7 Batch 60/162] avg loss 0.00685555, throughput 12.7601K wps
[Epoch 7 Batch 90/162] avg loss 0.00685966, throughput 12.9482K wps
[Epoch 7 Batch 120/162] avg loss 0.00662428, throughput 12.8284K wps
[Epoch 7 Batch 150/162] avg loss 0.0066234, throughput 12.778K wps
Begin Testing...
[Epoch 7] train avg loss 0.0067606, test acc 0.8833, test avg loss 0.344006, throughput 12.8906K wps
[Epoch 8 Batch 30/162] avg loss 0.00635419, throughput 13.2378K wps
[Epoch 8 Batch 60/162] avg loss 0.00646117, throughput 12.8647K wps
[Epoch 8 Batch 90/162] avg loss 0.00614246, throughput 12.9713K wps
[Epoch 8 Batch 120/162] avg loss 0.0060258, throughput 12.952K wps
[Epoch 8 Batch 150/162] avg loss 0.00624691, throughput 12.8645K wps
Begin Testing...
[Epoch 8] train avg loss 0.00622919, test acc 0.8844, test avg loss 0.326067, throughput 12.9636K wps
Observed Improvement.
Begin Testing...
[Epoch 9 Batch 30/162] avg loss 0.00601674, throughput 13.1755K wps
[Epoch 9 Batch 60/162] avg loss 0.00581729, throughput 12.8106K wps
[Epoch 9 Batch 90/162] avg loss 0.00584508, throughput 12.9209K wps
[Epoch 9 Batch 120/162] avg loss 0.0058674, throughput 12.8757K wps
[Epoch 9 Batch 150/162] avg loss 0.00552138, throughput 12.8986K wps
Begin Testing...
[Epoch 9] train avg loss 0.00581375, test acc 0.8800, test avg loss 0.312642, throughput 12.9193K wps
[Epoch 10 Batch 30/162] avg loss 0.00538105, throughput 13.1721K wps
[Epoch 10 Batch 60/162] avg loss 0.00567024, throughput 12.8149K wps
[Epoch 10 Batch 90/162] avg loss 0.00551632, throughput 12.8515K wps
[Epoch 10 Batch 120/162] avg loss 0.00565348, throughput 12.7858K wps
[Epoch 10 Batch 150/162] avg loss 0.00536743, throughput 12.888K wps
Begin Testing...
[Epoch 10] train avg loss 0.00551318, test acc 0.8811, test avg loss 0.303163, throughput 12.9027K wps
[Epoch 11 Batch 30/162] avg loss 0.00533346, throughput 13.2094K wps
[Epoch 11 Batch 60/162] avg loss 0.00540994, throughput 12.8K wps
[Epoch 11 Batch 90/162] avg loss 0.00489572, throughput 12.8923K wps
[Epoch 11 Batch 120/162] avg loss 0.00512543, throughput 12.9526K wps
[Epoch 11 Batch 150/162] avg loss 0.00529109, throughput 12.8013K wps
Begin Testing...
[Epoch 11] train avg loss 0.00521201, test acc 0.8856, test avg loss 0.295112, throughput 12.9184K wps
Observed Improvement.
Begin Testing...
[Epoch 12 Batch 30/162] avg loss 0.00514023, throughput 13.2772K wps
[Epoch 12 Batch 60/162] avg loss 0.0050224, throughput 12.878K wps
[Epoch 12 Batch 90/162] avg loss 0.00524107, throughput 12.943K wps
[Epoch 12 Batch 120/162] avg loss 0.00475671, throughput 12.8626K wps
[Epoch 12 Batch 150/162] avg loss 0.0051803, throughput 12.9809K wps
Begin Testing...
[Epoch 12] train avg loss 0.00502374, test acc 0.8911, test avg loss 0.284468, throughput 12.9844K wps
Observed Improvement.
Begin Testing...
[Epoch 13 Batch 30/162] avg loss 0.00516187, throughput 13.2228K wps
[Epoch 13 Batch 60/162] avg loss 0.00501831, throughput 12.7857K wps
[Epoch 13 Batch 90/162] avg loss 0.00507864, throughput 12.9604K wps
[Epoch 13 Batch 120/162] avg loss 0.00492511, throughput 12.8494K wps
[Epoch 13 Batch 150/162] avg loss 0.00476394, throughput 12.824K wps
Begin Testing...
[Epoch 13] train avg loss 0.00492402, test acc 0.8922, test avg loss 0.279935, throughput 12.9254K wps
Observed Improvement.
Begin Testing...
[Epoch 14 Batch 30/162] avg loss 0.00492253, throughput 13.2239K wps
[Epoch 14 Batch 60/162] avg loss 0.0046083, throughput 12.8224K wps
[Epoch 14 Batch 90/162] avg loss 0.00454753, throughput 12.7153K wps
[Epoch 14 Batch 120/162] avg loss 0.00445973, throughput 12.9191K wps
[Epoch 14 Batch 150/162] avg loss 0.00447511, throughput 12.7845K wps
Begin Testing...
[Epoch 14] train avg loss 0.00460789, test acc 0.8911, test avg loss 0.273466, throughput 12.8895K wps
[Epoch 15 Batch 30/162] avg loss 0.00432426, throughput 13.1887K wps
[Epoch 15 Batch 60/162] avg loss 0.00429636, throughput 12.8478K wps
[Epoch 15 Batch 90/162] avg loss 0.00468844, throughput 13.0305K wps
[Epoch 15 Batch 120/162] avg loss 0.00443643, throughput 12.8737K wps
[Epoch 15 Batch 150/162] avg loss 0.004636, throughput 12.845K wps
Begin Testing...
[Epoch 15] train avg loss 0.00445615, test acc 0.8933, test avg loss 0.273912, throughput 12.9464K wps
Observed Improvement.
Begin Testing...
[Epoch 16 Batch 30/162] avg loss 0.00433593, throughput 13.325K wps
[Epoch 16 Batch 60/162] avg loss 0.00436575, throughput 12.7918K wps
[Epoch 16 Batch 90/162] avg loss 0.00477713, throughput 12.8691K wps
[Epoch 16 Batch 120/162] avg loss 0.00415313, throughput 12.9639K wps
[Epoch 16 Batch 150/162] avg loss 0.00443543, throughput 12.8311K wps
Begin Testing...
[Epoch 16] train avg loss 0.00436155, test acc 0.8956, test avg loss 0.267202, throughput 12.9503K wps
Observed Improvement.
Begin Testing...
[Epoch 17 Batch 30/162] avg loss 0.00432313, throughput 13.1932K wps
[Epoch 17 Batch 60/162] avg loss 0.00418542, throughput 12.7607K wps
[Epoch 17 Batch 90/162] avg loss 0.00437224, throughput 12.8809K wps
[Epoch 17 Batch 120/162] avg loss 0.00405612, throughput 12.82K wps
[Epoch 17 Batch 150/162] avg loss 0.00400356, throughput 12.7465K wps
Begin Testing...
[Epoch 17] train avg loss 0.00416786, test acc 0.8989, test avg loss 0.25751, throughput 12.8658K wps
Observed Improvement.
Begin Testing...
[Epoch 18 Batch 30/162] avg loss 0.00374152, throughput 13.1018K wps
[Epoch 18 Batch 60/162] avg loss 0.00438907, throughput 12.7655K wps
[Epoch 18 Batch 90/162] avg loss 0.00439783, throughput 12.8709K wps
[Epoch 18 Batch 120/162] avg loss 0.0038121, throughput 12.9283K wps
[Epoch 18 Batch 150/162] avg loss 0.00393527, throughput 12.922K wps
Begin Testing...
[Epoch 18] train avg loss 0.00402856, test acc 0.9000, test avg loss 0.256577, throughput 12.9216K wps
Observed Improvement.
Begin Testing...
[Epoch 19 Batch 30/162] avg loss 0.00420634, throughput 13.2446K wps
[Epoch 19 Batch 60/162] avg loss 0.00428065, throughput 12.7939K wps
[Epoch 19 Batch 90/162] avg loss 0.00374236, throughput 12.9512K wps
[Epoch 19 Batch 120/162] avg loss 0.0038835, throughput 12.785K wps
[Epoch 19 Batch 150/162] avg loss 0.00380708, throughput 12.928K wps
Begin Testing...
[Epoch 19] train avg loss 0.00397819, test acc 0.8967, test avg loss 0.250802, throughput 12.9388K wps
[Epoch 20 Batch 30/162] avg loss 0.00388107, throughput 13.1606K wps
[Epoch 20 Batch 60/162] avg loss 0.00330699, throughput 12.812K wps
[Epoch 20 Batch 90/162] avg loss 0.00412032, throughput 12.782K wps
[Epoch 20 Batch 120/162] avg loss 0.00381131, throughput 12.9442K wps
[Epoch 20 Batch 150/162] avg loss 0.00397148, throughput 12.9183K wps
Begin Testing...
[Epoch 20] train avg loss 0.00377408, test acc 0.9011, test avg loss 0.248216, throughput 12.9051K wps
Observed Improvement.
Begin Testing...
[Epoch 21 Batch 30/162] avg loss 0.00355975, throughput 13.1493K wps
[Epoch 21 Batch 60/162] avg loss 0.00368603, throughput 12.8077K wps
[Epoch 21 Batch 90/162] avg loss 0.00346229, throughput 12.9382K wps
[Epoch 21 Batch 120/162] avg loss 0.00356255, throughput 12.9337K wps
[Epoch 21 Batch 150/162] avg loss 0.00354514, throughput 12.9608K wps
Begin Testing...
[Epoch 21] train avg loss 0.00358022, test acc 0.8989, test avg loss 0.245656, throughput 12.9523K wps
[Epoch 22 Batch 30/162] avg loss 0.0032853, throughput 13.183K wps
[Epoch 22 Batch 60/162] avg loss 0.00385468, throughput 12.6645K wps
[Epoch 22 Batch 90/162] avg loss 0.00330519, throughput 12.8915K wps
[Epoch 22 Batch 120/162] avg loss 0.00345985, throughput 12.9004K wps
[Epoch 22 Batch 150/162] avg loss 0.00348626, throughput 12.9399K wps
Begin Testing...
[Epoch 22] train avg loss 0.0034629, test acc 0.8989, test avg loss 0.244814, throughput 12.9123K wps
[Epoch 23 Batch 30/162] avg loss 0.00329949, throughput 13.2176K wps
[Epoch 23 Batch 60/162] avg loss 0.00329967, throughput 12.795K wps
[Epoch 23 Batch 90/162] avg loss 0.00350283, throughput 12.9282K wps
[Epoch 23 Batch 120/162] avg loss 0.00350559, throughput 12.9839K wps
[Epoch 23 Batch 150/162] avg loss 0.00352986, throughput 12.9907K wps
Begin Testing...
[Epoch 23] train avg loss 0.00341461, test acc 0.9022, test avg loss 0.239073, throughput 12.9833K wps
Observed Improvement.
Begin Testing...
[Epoch 24 Batch 30/162] avg loss 0.00319426, throughput 13.1163K wps
[Epoch 24 Batch 60/162] avg loss 0.00328349, throughput 12.7165K wps
[Epoch 24 Batch 90/162] avg loss 0.00339074, throughput 12.8449K wps
[Epoch 24 Batch 120/162] avg loss 0.00320718, throughput 12.8492K wps
[Epoch 24 Batch 150/162] avg loss 0.00358699, throughput 12.892K wps
Begin Testing...
[Epoch 24] train avg loss 0.00333631, test acc 0.9033, test avg loss 0.237227, throughput 12.8749K wps
Observed Improvement.
Begin Testing...
[Epoch 25 Batch 30/162] avg loss 0.00312738, throughput 13.2741K wps
[Epoch 25 Batch 60/162] avg loss 0.00315829, throughput 12.7984K wps
[Epoch 25 Batch 90/162] avg loss 0.00359342, throughput 12.9572K wps
[Epoch 25 Batch 120/162] avg loss 0.00300588, throughput 12.9567K wps
[Epoch 25 Batch 150/162] avg loss 0.00312251, throughput 12.9424K wps
Begin Testing...
[Epoch 25] train avg loss 0.00322459, test acc 0.9044, test avg loss 0.235383, throughput 12.9779K wps
Observed Improvement.
Begin Testing...
[Epoch 26 Batch 30/162] avg loss 0.00347215, throughput 13.2112K wps
[Epoch 26 Batch 60/162] avg loss 0.00297209, throughput 12.7625K wps
[Epoch 26 Batch 90/162] avg loss 0.00317205, throughput 12.8044K wps
[Epoch 26 Batch 120/162] avg loss 0.00301544, throughput 12.8292K wps
[Epoch 26 Batch 150/162] avg loss 0.00304784, throughput 12.7802K wps
Begin Testing...
[Epoch 26] train avg loss 0.00312912, test acc 0.9089, test avg loss 0.237785, throughput 12.8697K wps
Observed Improvement.
Begin Testing...
[Epoch 27 Batch 30/162] avg loss 0.00295767, throughput 13.24K wps
[Epoch 27 Batch 60/162] avg loss 0.00284118, throughput 12.7919K wps
[Epoch 27 Batch 90/162] avg loss 0.00257352, throughput 12.9218K wps
[Epoch 27 Batch 120/162] avg loss 0.00321376, throughput 12.9261K wps
[Epoch 27 Batch 150/162] avg loss 0.00320092, throughput 12.9454K wps
Begin Testing...
[Epoch 27] train avg loss 0.00294936, test acc 0.9022, test avg loss 0.228746, throughput 12.9611K wps
[Epoch 28 Batch 30/162] avg loss 0.00278007, throughput 13.1398K wps
[Epoch 28 Batch 60/162] avg loss 0.00315518, throughput 12.8125K wps
[Epoch 28 Batch 90/162] avg loss 0.0029386, throughput 12.9149K wps
[Epoch 28 Batch 120/162] avg loss 0.0029692, throughput 12.9026K wps
[Epoch 28 Batch 150/162] avg loss 0.00286446, throughput 12.9788K wps
Begin Testing...
[Epoch 28] train avg loss 0.0029159, test acc 0.9044, test avg loss 0.228558, throughput 12.9487K wps
[Epoch 29 Batch 30/162] avg loss 0.00279143, throughput 13.1839K wps
[Epoch 29 Batch 60/162] avg loss 0.00287617, throughput 12.7553K wps
[Epoch 29 Batch 90/162] avg loss 0.00280559, throughput 12.9438K wps
[Epoch 29 Batch 120/162] avg loss 0.00295576, throughput 12.9859K wps
[Epoch 29 Batch 150/162] avg loss 0.0029338, throughput 12.9092K wps
Begin Testing...
[Epoch 29] train avg loss 0.00283974, test acc 0.9078, test avg loss 0.235092, throughput 12.9495K wps
[Epoch 30 Batch 30/162] avg loss 0.00266388, throughput 13.1379K wps
[Epoch 30 Batch 60/162] avg loss 0.00262251, throughput 12.8455K wps
[Epoch 30 Batch 90/162] avg loss 0.00264142, throughput 12.8586K wps
[Epoch 30 Batch 120/162] avg loss 0.00283148, throughput 12.8382K wps
[Epoch 30 Batch 150/162] avg loss 0.00261721, throughput 12.9624K wps
Begin Testing...
[Epoch 30] train avg loss 0.00268762, test acc 0.9000, test avg loss 0.223993, throughput 12.9247K wps
[Epoch 31 Batch 30/162] avg loss 0.00235224, throughput 13.0523K wps
[Epoch 31 Batch 60/162] avg loss 0.00270703, throughput 12.8032K wps
[Epoch 31 Batch 90/162] avg loss 0.00287196, throughput 12.8674K wps
[Epoch 31 Batch 120/162] avg loss 0.00268929, throughput 12.7608K wps
[Epoch 31 Batch 150/162] avg loss 0.00273948, throughput 12.785K wps
Begin Testing...
[Epoch 31] train avg loss 0.00268002, test acc 0.9067, test avg loss 0.233802, throughput 12.8576K wps
[Epoch 32 Batch 30/162] avg loss 0.00254461, throughput 13.2302K wps
[Epoch 32 Batch 60/162] avg loss 0.00271648, throughput 12.795K wps
[Epoch 32 Batch 90/162] avg loss 0.0024824, throughput 12.8482K wps
[Epoch 32 Batch 120/162] avg loss 0.00263571, throughput 12.9571K wps
[Epoch 32 Batch 150/162] avg loss 0.00245338, throughput 12.801K wps
Begin Testing...
[Epoch 32] train avg loss 0.00253814, test acc 0.9022, test avg loss 0.221204, throughput 12.9146K wps
[Epoch 33 Batch 30/162] avg loss 0.00246687, throughput 13.2219K wps
[Epoch 33 Batch 60/162] avg loss 0.00230145, throughput 12.7552K wps
[Epoch 33 Batch 90/162] avg loss 0.0025965, throughput 12.8861K wps
[Epoch 33 Batch 120/162] avg loss 0.00267617, throughput 12.9344K wps
[Epoch 33 Batch 150/162] avg loss 0.00236916, throughput 12.9148K wps
Begin Testing...
[Epoch 33] train avg loss 0.00251902, test acc 0.9067, test avg loss 0.219486, throughput 12.9306K wps
[Epoch 34 Batch 30/162] avg loss 0.00233148, throughput 13.0842K wps
[Epoch 34 Batch 60/162] avg loss 0.00255911, throughput 12.8186K wps
[Epoch 34 Batch 90/162] avg loss 0.00237948, throughput 12.8606K wps
[Epoch 34 Batch 120/162] avg loss 0.00250065, throughput 12.69K wps
[Epoch 34 Batch 150/162] avg loss 0.00247491, throughput 12.8249K wps
Begin Testing...
[Epoch 34] train avg loss 0.0024665, test acc 0.9044, test avg loss 0.219101, throughput 12.8526K wps
[Epoch 35 Batch 30/162] avg loss 0.00227058, throughput 13.1513K wps
[Epoch 35 Batch 60/162] avg loss 0.00216348, throughput 12.8164K wps
[Epoch 35 Batch 90/162] avg loss 0.00247131, throughput 12.77K wps
[Epoch 35 Batch 120/162] avg loss 0.00237915, throughput 12.8245K wps
[Epoch 35 Batch 150/162] avg loss 0.0021761, throughput 12.9725K wps
Begin Testing...
[Epoch 35] train avg loss 0.00230857, test acc 0.9044, test avg loss 0.218286, throughput 12.8928K wps
[Epoch 36 Batch 30/162] avg loss 0.0019923, throughput 13.2463K wps
[Epoch 36 Batch 60/162] avg loss 0.00253117, throughput 12.7833K wps
[Epoch 36 Batch 90/162] avg loss 0.00227234, throughput 12.9238K wps
[Epoch 36 Batch 120/162] avg loss 0.00240085, throughput 12.8542K wps
[Epoch 36 Batch 150/162] avg loss 0.00215444, throughput 12.8348K wps
Begin Testing...
[Epoch 36] train avg loss 0.00227465, test acc 0.9089, test avg loss 0.22084, throughput 12.9179K wps
Observed Improvement.
Begin Testing...
[Epoch 37 Batch 30/162] avg loss 0.00204825, throughput 13.2574K wps
[Epoch 37 Batch 60/162] avg loss 0.0023169, throughput 12.8449K wps
[Epoch 37 Batch 90/162] avg loss 0.00220128, throughput 12.9445K wps
[Epoch 37 Batch 120/162] avg loss 0.00214622, throughput 12.8682K wps
[Epoch 37 Batch 150/162] avg loss 0.00230456, throughput 12.783K wps
Begin Testing...
[Epoch 37] train avg loss 0.00218899, test acc 0.9078, test avg loss 0.214771, throughput 12.9259K wps
[Epoch 38 Batch 30/162] avg loss 0.00231691, throughput 13.2961K wps
[Epoch 38 Batch 60/162] avg loss 0.00202067, throughput 12.878K wps
[Epoch 38 Batch 90/162] avg loss 0.00188455, throughput 12.8948K wps
[Epoch 38 Batch 120/162] avg loss 0.00215297, throughput 12.766K wps
[Epoch 38 Batch 150/162] avg loss 0.00204674, throughput 12.7637K wps
Begin Testing...
[Epoch 38] train avg loss 0.00211921, test acc 0.9122, test avg loss 0.211947, throughput 12.9074K wps
Observed Improvement.
Begin Testing...
[Epoch 39 Batch 30/162] avg loss 0.00194425, throughput 13.2709K wps
[Epoch 39 Batch 60/162] avg loss 0.00209044, throughput 12.8173K wps
[Epoch 39 Batch 90/162] avg loss 0.00195489, throughput 12.8029K wps
[Epoch 39 Batch 120/162] avg loss 0.00207098, throughput 12.9447K wps
[Epoch 39 Batch 150/162] avg loss 0.00221376, throughput 12.8548K wps
Begin Testing...
[Epoch 39] train avg loss 0.00206543, test acc 0.9122, test avg loss 0.213823, throughput 12.9256K wps
Observed Improvement.
Begin Testing...
[Epoch 40 Batch 30/162] avg loss 0.00198039, throughput 13.2484K wps
[Epoch 40 Batch 60/162] avg loss 0.00180244, throughput 12.8293K wps
[Epoch 40 Batch 90/162] avg loss 0.00179999, throughput 12.8726K wps
[Epoch 40 Batch 120/162] avg loss 0.00197188, throughput 12.9624K wps
[Epoch 40 Batch 150/162] avg loss 0.00201233, throughput 12.9336K wps
Begin Testing...
[Epoch 40] train avg loss 0.00193711, test acc 0.9122, test avg loss 0.21195, throughput 12.9665K wps
Observed Improvement.
Begin Testing...
[Epoch 41 Batch 30/162] avg loss 0.00182594, throughput 13.3464K wps
[Epoch 41 Batch 60/162] avg loss 0.00180894, throughput 12.832K wps
[Epoch 41 Batch 90/162] avg loss 0.00211699, throughput 12.9012K wps
[Epoch 41 Batch 120/162] avg loss 0.00193402, throughput 12.9111K wps
[Epoch 41 Batch 150/162] avg loss 0.00197003, throughput 12.9702K wps
Begin Testing...
[Epoch 41] train avg loss 0.00193688, test acc 0.9156, test avg loss 0.211226, throughput 12.9861K wps
Observed Improvement.
Begin Testing...
[Epoch 42 Batch 30/162] avg loss 0.00183224, throughput 13.2435K wps
[Epoch 42 Batch 60/162] avg loss 0.00183633, throughput 12.8732K wps
[Epoch 42 Batch 90/162] avg loss 0.00181619, throughput 12.869K wps
[Epoch 42 Batch 120/162] avg loss 0.00210206, throughput 12.7522K wps
[Epoch 42 Batch 150/162] avg loss 0.00183456, throughput 12.7906K wps
Begin Testing...
[Epoch 42] train avg loss 0.00187616, test acc 0.9133, test avg loss 0.218983, throughput 12.8876K wps
[Epoch 43 Batch 30/162] avg loss 0.00191757, throughput 13.0947K wps
[Epoch 43 Batch 60/162] avg loss 0.00174067, throughput 12.7484K wps
[Epoch 43 Batch 90/162] avg loss 0.00167486, throughput 12.9314K wps
[Epoch 43 Batch 120/162] avg loss 0.00182289, throughput 12.9629K wps
[Epoch 43 Batch 150/162] avg loss 0.00188162, throughput 12.8721K wps
Begin Testing...
[Epoch 43] train avg loss 0.00183832, test acc 0.9111, test avg loss 0.208215, throughput 12.9151K wps
[Epoch 44 Batch 30/162] avg loss 0.00168034, throughput 13.258K wps
[Epoch 44 Batch 60/162] avg loss 0.00184183, throughput 12.8573K wps
[Epoch 44 Batch 90/162] avg loss 0.00180516, throughput 12.9957K wps
[Epoch 44 Batch 120/162] avg loss 0.00177736, throughput 12.961K wps
[Epoch 44 Batch 150/162] avg loss 0.00177426, throughput 12.9597K wps
Begin Testing...
[Epoch 44] train avg loss 0.00179153, test acc 0.9144, test avg loss 0.208571, throughput 13.0011K wps
[Epoch 45 Batch 30/162] avg loss 0.00152054, throughput 13.2215K wps
[Epoch 45 Batch 60/162] avg loss 0.00146355, throughput 12.7637K wps
[Epoch 45 Batch 90/162] avg loss 0.00196347, throughput 12.9098K wps
[Epoch 45 Batch 120/162] avg loss 0.00166551, throughput 12.8674K wps
[Epoch 45 Batch 150/162] avg loss 0.00175234, throughput 12.7747K wps
Begin Testing...
[Epoch 45] train avg loss 0.00168125, test acc 0.9156, test avg loss 0.206074, throughput 12.9062K wps
Observed Improvement.
Begin Testing...
[Epoch 46 Batch 30/162] avg loss 0.00158026, throughput 13.1315K wps
[Epoch 46 Batch 60/162] avg loss 0.0016609, throughput 12.821K wps
[Epoch 46 Batch 90/162] avg loss 0.00161309, throughput 12.8562K wps
[Epoch 46 Batch 120/162] avg loss 0.00161949, throughput 12.8759K wps
[Epoch 46 Batch 150/162] avg loss 0.00159633, throughput 12.8843K wps
Begin Testing...
[Epoch 46] train avg loss 0.00162107, test acc 0.9144, test avg loss 0.209989, throughput 12.9096K wps
[Epoch 47 Batch 30/162] avg loss 0.00142313, throughput 13.1063K wps
[Epoch 47 Batch 60/162] avg loss 0.0016474, throughput 12.8849K wps
[Epoch 47 Batch 90/162] avg loss 0.00155596, throughput 12.948K wps
[Epoch 47 Batch 120/162] avg loss 0.00151389, throughput 12.9412K wps
[Epoch 47 Batch 150/162] avg loss 0.0016481, throughput 12.9751K wps
Begin Testing...
[Epoch 47] train avg loss 0.0015434, test acc 0.9167, test avg loss 0.212312, throughput 12.9718K wps
Observed Improvement.
Begin Testing...
[Epoch 48 Batch 30/162] avg loss 0.00169523, throughput 13.358K wps
[Epoch 48 Batch 60/162] avg loss 0.00156147, throughput 12.789K wps
[Epoch 48 Batch 90/162] avg loss 0.00151557, throughput 12.9484K wps
[Epoch 48 Batch 120/162] avg loss 0.00145135, throughput 12.794K wps
[Epoch 48 Batch 150/162] avg loss 0.00146755, throughput 12.8138K wps
Begin Testing...
[Epoch 48] train avg loss 0.00151793, test acc 0.9167, test avg loss 0.207106, throughput 12.9276K wps
Observed Improvement.
Begin Testing...
[Epoch 49 Batch 30/162] avg loss 0.00138966, throughput 13.2145K wps
[Epoch 49 Batch 60/162] avg loss 0.0016313, throughput 12.8909K wps
[Epoch 49 Batch 90/162] avg loss 0.00155338, throughput 13.0047K wps
[Epoch 49 Batch 120/162] avg loss 0.00145316, throughput 12.8832K wps
[Epoch 49 Batch 150/162] avg loss 0.00187493, throughput 12.9681K wps
Begin Testing...
[Epoch 49] train avg loss 0.00156142, test acc 0.9178, test avg loss 0.206453, throughput 12.9854K wps
Observed Improvement.
Begin Testing...
[Epoch 50 Batch 30/162] avg loss 0.0013497, throughput 13.2928K wps
[Epoch 50 Batch 60/162] avg loss 0.00141723, throughput 12.7909K wps
[Epoch 50 Batch 90/162] avg loss 0.00138786, throughput 12.6839K wps
[Epoch 50 Batch 120/162] avg loss 0.00156434, throughput 12.8649K wps
[Epoch 50 Batch 150/162] avg loss 0.00144908, throughput 12.9053K wps
Begin Testing...
[Epoch 50] train avg loss 0.00142193, test acc 0.9189, test avg loss 0.205758, throughput 12.8951K wps
Observed Improvement.
Begin Testing...
[Epoch 51 Batch 30/162] avg loss 0.00131781, throughput 13.2859K wps
[Epoch 51 Batch 60/162] avg loss 0.00138914, throughput 12.8053K wps
[Epoch 51 Batch 90/162] avg loss 0.00140009, throughput 12.8254K wps
[Epoch 51 Batch 120/162] avg loss 0.00135733, throughput 12.8113K wps
[Epoch 51 Batch 150/162] avg loss 0.00134649, throughput 12.9565K wps
Begin Testing...
[Epoch 51] train avg loss 0.00135485, test acc 0.9156, test avg loss 0.207068, throughput 12.9359K wps
[Epoch 52 Batch 30/162] avg loss 0.00130622, throughput 13.3159K wps
[Epoch 52 Batch 60/162] avg loss 0.00131986, throughput 12.7157K wps
[Epoch 52 Batch 90/162] avg loss 0.00133555, throughput 12.965K wps
[Epoch 52 Batch 120/162] avg loss 0.00137769, throughput 12.8285K wps
[Epoch 52 Batch 150/162] avg loss 0.00126796, throughput 12.9567K wps
Begin Testing...
[Epoch 52] train avg loss 0.00132917, test acc 0.9167, test avg loss 0.210409, throughput 12.9561K wps
[Epoch 53 Batch 30/162] avg loss 0.0011897, throughput 13.1516K wps
[Epoch 53 Batch 60/162] avg loss 0.00120442, throughput 12.7655K wps
[Epoch 53 Batch 90/162] avg loss 0.00108677, throughput 12.8633K wps
[Epoch 53 Batch 120/162] avg loss 0.00144374, throughput 12.8488K wps
[Epoch 53 Batch 150/162] avg loss 0.00150412, throughput 12.781K wps
Begin Testing...
[Epoch 53] train avg loss 0.00130168, test acc 0.9144, test avg loss 0.205095, throughput 12.8804K wps
[Epoch 54 Batch 30/162] avg loss 0.00129654, throughput 13.1897K wps
[Epoch 54 Batch 60/162] avg loss 0.00123255, throughput 12.8459K wps
[Epoch 54 Batch 90/162] avg loss 0.00112739, throughput 12.8542K wps
[Epoch 54 Batch 120/162] avg loss 0.00112349, throughput 12.8702K wps
[Epoch 54 Batch 150/162] avg loss 0.00134743, throughput 12.8328K wps
Begin Testing...
[Epoch 54] train avg loss 0.00122714, test acc 0.9144, test avg loss 0.204756, throughput 12.9169K wps
[Epoch 55 Batch 30/162] avg loss 0.00106753, throughput 13.2157K wps
[Epoch 55 Batch 60/162] avg loss 0.00139447, throughput 12.8635K wps
[Epoch 55 Batch 90/162] avg loss 0.00114307, throughput 12.7997K wps
[Epoch 55 Batch 120/162] avg loss 0.00114807, throughput 12.9268K wps
[Epoch 55 Batch 150/162] avg loss 0.00131866, throughput 12.9672K wps
Begin Testing...
[Epoch 55] train avg loss 0.00121597, test acc 0.9178, test avg loss 0.207317, throughput 12.9409K wps
[Epoch 56 Batch 30/162] avg loss 0.00112452, throughput 13.3092K wps
[Epoch 56 Batch 60/162] avg loss 0.001238, throughput 12.8071K wps
[Epoch 56 Batch 90/162] avg loss 0.00106203, throughput 12.9522K wps
[Epoch 56 Batch 120/162] avg loss 0.00114146, throughput 12.8017K wps
[Epoch 56 Batch 150/162] avg loss 0.0012233, throughput 12.9513K wps
Begin Testing...
[Epoch 56] train avg loss 0.00115852, test acc 0.9144, test avg loss 0.20206, throughput 12.9636K wps
[Epoch 57 Batch 30/162] avg loss 0.00121636, throughput 13.26K wps
[Epoch 57 Batch 60/162] avg loss 0.001108, throughput 12.7499K wps
[Epoch 57 Batch 90/162] avg loss 0.00109438, throughput 12.8116K wps
[Epoch 57 Batch 120/162] avg loss 0.00114104, throughput 12.9306K wps
[Epoch 57 Batch 150/162] avg loss 0.00104248, throughput 12.9809K wps
Begin Testing...
[Epoch 57] train avg loss 0.00112722, test acc 0.9167, test avg loss 0.20521, throughput 12.9445K wps
[Epoch 58 Batch 30/162] avg loss 0.000976588, throughput 13.233K wps
[Epoch 58 Batch 60/162] avg loss 0.00133816, throughput 12.7705K wps
[Epoch 58 Batch 90/162] avg loss 0.00102538, throughput 12.9645K wps
[Epoch 58 Batch 120/162] avg loss 0.00117134, throughput 12.9088K wps
[Epoch 58 Batch 150/162] avg loss 0.00108341, throughput 12.8602K wps
Begin Testing...
[Epoch 58] train avg loss 0.00111412, test acc 0.9211, test avg loss 0.203672, throughput 12.9408K wps
Observed Improvement.
Begin Testing...
[Epoch 59 Batch 30/162] avg loss 0.00110478, throughput 13.2383K wps
[Epoch 59 Batch 60/162] avg loss 0.0010804, throughput 12.9028K wps
[Epoch 59 Batch 90/162] avg loss 0.00126469, throughput 12.9977K wps
[Epoch 59 Batch 120/162] avg loss 0.00102282, throughput 12.9844K wps
[Epoch 59 Batch 150/162] avg loss 0.00117668, throughput 12.9885K wps
Begin Testing...
[Epoch 59] train avg loss 0.00112329, test acc 0.9189, test avg loss 0.203096, throughput 13.0218K wps
Test loss 0.194319, test acc 0.9290
Total time cost 166.65s
9000 1000
[Epoch 0 Batch 30/162] avg loss 0.0148042, throughput 11.4716K wps
[Epoch 0 Batch 60/162] avg loss 0.0140582, throughput 12.7237K wps
[Epoch 0 Batch 90/162] avg loss 0.0136456, throughput 12.9024K wps
[Epoch 0 Batch 120/162] avg loss 0.0131129, throughput 12.8088K wps
[Epoch 0 Batch 150/162] avg loss 0.0130018, throughput 12.7587K wps
Begin Testing...
[Epoch 0] train avg loss 0.0136463, test acc 0.7078, test avg loss 0.586764, throughput 12.5366K wps
Observed Improvement.
Begin Testing...
[Epoch 1 Batch 30/162] avg loss 0.0122708, throughput 13.2397K wps
[Epoch 1 Batch 60/162] avg loss 0.0120097, throughput 12.7129K wps
[Epoch 1 Batch 90/162] avg loss 0.0120808, throughput 12.9159K wps
[Epoch 1 Batch 120/162] avg loss 0.0115398, throughput 12.7763K wps
[Epoch 1 Batch 150/162] avg loss 0.0114755, throughput 12.8797K wps
Begin Testing...
[Epoch 1] train avg loss 0.0118081, test acc 0.7822, test avg loss 0.531455, throughput 12.9038K wps
Observed Improvement.
Begin Testing...
[Epoch 2 Batch 30/162] avg loss 0.0108779, throughput 13.1898K wps
[Epoch 2 Batch 60/162] avg loss 0.0108997, throughput 12.7231K wps
[Epoch 2 Batch 90/162] avg loss 0.010415, throughput 12.8649K wps
[Epoch 2 Batch 120/162] avg loss 0.0105789, throughput 12.7951K wps
[Epoch 2 Batch 150/162] avg loss 0.0104828, throughput 12.7857K wps
Begin Testing...
[Epoch 2] train avg loss 0.0106249, test acc 0.8367, test avg loss 0.484494, throughput 12.8703K wps
Observed Improvement.
Begin Testing...
[Epoch 3 Batch 30/162] avg loss 0.0099823, throughput 13.3038K wps
[Epoch 3 Batch 60/162] avg loss 0.0100036, throughput 12.8341K wps
[Epoch 3 Batch 90/162] avg loss 0.00984409, throughput 12.8487K wps
[Epoch 3 Batch 120/162] avg loss 0.00947802, throughput 12.8861K wps
[Epoch 3 Batch 150/162] avg loss 0.00933718, throughput 12.8973K wps
Begin Testing...
[Epoch 3] train avg loss 0.00970772, test acc 0.8556, test avg loss 0.441537, throughput 12.9402K wps
Observed Improvement.
Begin Testing...
[Epoch 4 Batch 30/162] avg loss 0.00894241, throughput 13.2455K wps
[Epoch 4 Batch 60/162] avg loss 0.00912894, throughput 12.7722K wps
[Epoch 4 Batch 90/162] avg loss 0.00882241, throughput 12.8504K wps
[Epoch 4 Batch 120/162] avg loss 0.00849563, throughput 12.9282K wps
[Epoch 4 Batch 150/162] avg loss 0.00825965, throughput 12.9165K wps
Begin Testing...
[Epoch 4] train avg loss 0.00866268, test acc 0.8678, test avg loss 0.400804, throughput 12.9337K wps
Observed Improvement.
Begin Testing...
[Epoch 5 Batch 30/162] avg loss 0.00812146, throughput 13.2646K wps
[Epoch 5 Batch 60/162] avg loss 0.00814577, throughput 12.7526K wps
[Epoch 5 Batch 90/162] avg loss 0.00796083, throughput 12.7706K wps
[Epoch 5 Batch 120/162] avg loss 0.00785997, throughput 12.7572K wps
[Epoch 5 Batch 150/162] avg loss 0.00745858, throughput 12.937K wps
Begin Testing...
[Epoch 5] train avg loss 0.00786471, test acc 0.8756, test avg loss 0.361011, throughput 12.8954K wps
Observed Improvement.
Begin Testing...
[Epoch 6 Batch 30/162] avg loss 0.00748882, throughput 13.2106K wps
[Epoch 6 Batch 60/162] avg loss 0.00719576, throughput 12.7636K wps
[Epoch 6 Batch 90/162] avg loss 0.00744702, throughput 12.8789K wps
[Epoch 6 Batch 120/162] avg loss 0.00716182, throughput 12.8589K wps
[Epoch 6 Batch 150/162] avg loss 0.00671086, throughput 12.8628K wps
Begin Testing...
[Epoch 6] train avg loss 0.00716767, test acc 0.8922, test avg loss 0.333397, throughput 12.9069K wps
Observed Improvement.
Begin Testing...
[Epoch 7 Batch 30/162] avg loss 0.00670928, throughput 13.2812K wps
[Epoch 7 Batch 60/162] avg loss 0.00652602, throughput 12.8505K wps
[Epoch 7 Batch 90/162] avg loss 0.00647557, throughput 12.9736K wps
[Epoch 7 Batch 120/162] avg loss 0.00640199, throughput 12.8008K wps
[Epoch 7 Batch 150/162] avg loss 0.00683992, throughput 12.8361K wps
Begin Testing...
[Epoch 7] train avg loss 0.00658999, test acc 0.8956, test avg loss 0.306761, throughput 12.9348K wps
Observed Improvement.
Begin Testing...
[Epoch 8 Batch 30/162] avg loss 0.00603446, throughput 13.2315K wps
[Epoch 8 Batch 60/162] avg loss 0.00643101, throughput 12.7697K wps
[Epoch 8 Batch 90/162] avg loss 0.00624035, throughput 12.8959K wps
[Epoch 8 Batch 120/162] avg loss 0.00589253, throughput 12.9145K wps
[Epoch 8 Batch 150/162] avg loss 0.00605636, throughput 12.8466K wps
Begin Testing...
[Epoch 8] train avg loss 0.00610453, test acc 0.8967, test avg loss 0.288943, throughput 12.934K wps
Observed Improvement.
Begin Testing...
[Epoch 9 Batch 30/162] avg loss 0.00604538, throughput 13.1595K wps
[Epoch 9 Batch 60/162] avg loss 0.00580895, throughput 12.7511K wps
[Epoch 9 Batch 90/162] avg loss 0.005843, throughput 12.8001K wps
[Epoch 9 Batch 120/162] avg loss 0.00556758, throughput 12.9611K wps
[Epoch 9 Batch 150/162] avg loss 0.00557267, throughput 12.7332K wps
Begin Testing...
[Epoch 9] train avg loss 0.00582727, test acc 0.8978, test avg loss 0.276331, throughput 12.8606K wps
Observed Improvement.
Begin Testing...
[Epoch 10 Batch 30/162] avg loss 0.00570828, throughput 13.2627K wps
[Epoch 10 Batch 60/162] avg loss 0.00573331, throughput 12.681K wps
[Epoch 10 Batch 90/162] avg loss 0.00524246, throughput 12.8374K wps
[Epoch 10 Batch 120/162] avg loss 0.0054553, throughput 12.9506K wps
[Epoch 10 Batch 150/162] avg loss 0.00560219, throughput 12.8534K wps
Begin Testing...
[Epoch 10] train avg loss 0.00553793, test acc 0.8922, test avg loss 0.266281, throughput 12.9166K wps
[Epoch 11 Batch 30/162] avg loss 0.00523019, throughput 13.261K wps
[Epoch 11 Batch 60/162] avg loss 0.00554835, throughput 12.8179K wps
[Epoch 11 Batch 90/162] avg loss 0.00507643, throughput 12.9665K wps
[Epoch 11 Batch 120/162] avg loss 0.00517046, throughput 12.7777K wps
[Epoch 11 Batch 150/162] avg loss 0.00491481, throughput 12.9069K wps
Begin Testing...
[Epoch 11] train avg loss 0.00520593, test acc 0.8978, test avg loss 0.258158, throughput 12.946K wps
Observed Improvement.
Begin Testing...
[Epoch 12 Batch 30/162] avg loss 0.00523429, throughput 13.2452K wps
[Epoch 12 Batch 60/162] avg loss 0.00495219, throughput 12.7801K wps
[Epoch 12 Batch 90/162] avg loss 0.00514566, throughput 12.9499K wps
[Epoch 12 Batch 120/162] avg loss 0.00520154, throughput 12.9276K wps
[Epoch 12 Batch 150/162] avg loss 0.00489361, throughput 12.8399K wps
Begin Testing...
[Epoch 12] train avg loss 0.00504434, test acc 0.9056, test avg loss 0.246266, throughput 12.9343K wps
Observed Improvement.
Begin Testing...
[Epoch 13 Batch 30/162] avg loss 0.00461706, throughput 13.2532K wps
[Epoch 13 Batch 60/162] avg loss 0.00488229, throughput 12.7508K wps
[Epoch 13 Batch 90/162] avg loss 0.00468586, throughput 12.8671K wps
[Epoch 13 Batch 120/162] avg loss 0.00463971, throughput 12.8917K wps
[Epoch 13 Batch 150/162] avg loss 0.00514972, throughput 12.888K wps
Begin Testing...
[Epoch 13] train avg loss 0.00478351, test acc 0.9100, test avg loss 0.241086, throughput 12.9245K wps
Observed Improvement.
Begin Testing...
[Epoch 14 Batch 30/162] avg loss 0.0045793, throughput 13.2793K wps
[Epoch 14 Batch 60/162] avg loss 0.00447452, throughput 12.8042K wps
[Epoch 14 Batch 90/162] avg loss 0.00460662, throughput 12.8634K wps
[Epoch 14 Batch 120/162] avg loss 0.00512081, throughput 12.9412K wps
[Epoch 14 Batch 150/162] avg loss 0.00449664, throughput 12.9717K wps
Begin Testing...
[Epoch 14] train avg loss 0.0046167, test acc 0.9122, test avg loss 0.233767, throughput 12.971K wps
Observed Improvement.
Begin Testing...
[Epoch 15 Batch 30/162] avg loss 0.0046351, throughput 13.2828K wps
[Epoch 15 Batch 60/162] avg loss 0.00456478, throughput 12.8302K wps
[Epoch 15 Batch 90/162] avg loss 0.00438672, throughput 12.94K wps
[Epoch 15 Batch 120/162] avg loss 0.00436778, throughput 12.977K wps
[Epoch 15 Batch 150/162] avg loss 0.00438504, throughput 12.9292K wps
Begin Testing...
[Epoch 15] train avg loss 0.00445692, test acc 0.9122, test avg loss 0.229046, throughput 12.9774K wps
Observed Improvement.
Begin Testing...
[Epoch 16 Batch 30/162] avg loss 0.00421548, throughput 13.2294K wps
[Epoch 16 Batch 60/162] avg loss 0.00423923, throughput 12.7909K wps
[Epoch 16 Batch 90/162] avg loss 0.00443476, throughput 12.8058K wps
[Epoch 16 Batch 120/162] avg loss 0.00443325, throughput 12.8172K wps
[Epoch 16 Batch 150/162] avg loss 0.00405901, throughput 12.9217K wps
Begin Testing...
[Epoch 16] train avg loss 0.00429295, test acc 0.9167, test avg loss 0.223243, throughput 12.9002K wps
Observed Improvement.
Begin Testing...
[Epoch 17 Batch 30/162] avg loss 0.00401319, throughput 13.1709K wps
[Epoch 17 Batch 60/162] avg loss 0.00386209, throughput 12.7292K wps
[Epoch 17 Batch 90/162] avg loss 0.00449452, throughput 12.8533K wps
[Epoch 17 Batch 120/162] avg loss 0.00403405, throughput 12.9564K wps
[Epoch 17 Batch 150/162] avg loss 0.00409871, throughput 12.9559K wps
Begin Testing...
[Epoch 17] train avg loss 0.00411837, test acc 0.9167, test avg loss 0.219339, throughput 12.9188K wps
Observed Improvement.
Begin Testing...
[Epoch 18 Batch 30/162] avg loss 0.00391454, throughput 13.306K wps
[Epoch 18 Batch 60/162] avg loss 0.00422846, throughput 12.8434K wps
[Epoch 18 Batch 90/162] avg loss 0.00377621, throughput 12.8559K wps
[Epoch 18 Batch 120/162] avg loss 0.00401495, throughput 12.8582K wps
[Epoch 18 Batch 150/162] avg loss 0.00394351, throughput 12.9177K wps
Begin Testing...
[Epoch 18] train avg loss 0.00398811, test acc 0.9167, test avg loss 0.213151, throughput 12.9502K wps
Observed Improvement.
Begin Testing...
[Epoch 19 Batch 30/162] avg loss 0.0037234, throughput 13.3371K wps
[Epoch 19 Batch 60/162] avg loss 0.00383289, throughput 12.7987K wps
[Epoch 19 Batch 90/162] avg loss 0.00374546, throughput 12.8088K wps
[Epoch 19 Batch 120/162] avg loss 0.00369113, throughput 12.905K wps
[Epoch 19 Batch 150/162] avg loss 0.00388707, throughput 12.9594K wps
Begin Testing...
[Epoch 19] train avg loss 0.00380653, test acc 0.9167, test avg loss 0.21128, throughput 12.9592K wps
Observed Improvement.
Begin Testing...
[Epoch 20 Batch 30/162] avg loss 0.0037442, throughput 13.161K wps
[Epoch 20 Batch 60/162] avg loss 0.0036849, throughput 12.8778K wps
[Epoch 20 Batch 90/162] avg loss 0.00391063, throughput 12.8935K wps
[Epoch 20 Batch 120/162] avg loss 0.00375412, throughput 12.8663K wps
[Epoch 20 Batch 150/162] avg loss 0.00344612, throughput 12.8698K wps
Begin Testing...
[Epoch 20] train avg loss 0.00373698, test acc 0.9133, test avg loss 0.207851, throughput 12.9292K wps
[Epoch 21 Batch 30/162] avg loss 0.00382224, throughput 13.2618K wps
[Epoch 21 Batch 60/162] avg loss 0.00356697, throughput 12.8229K wps
[Epoch 21 Batch 90/162] avg loss 0.00347064, throughput 12.9335K wps
[Epoch 21 Batch 120/162] avg loss 0.00335645, throughput 12.916K wps
[Epoch 21 Batch 150/162] avg loss 0.00367571, throughput 12.7994K wps
Begin Testing...
[Epoch 21] train avg loss 0.00358636, test acc 0.9167, test avg loss 0.211793, throughput 12.9353K wps
Observed Improvement.
Begin Testing...
[Epoch 22 Batch 30/162] avg loss 0.0036098, throughput 13.3367K wps
[Epoch 22 Batch 60/162] avg loss 0.00319689, throughput 12.8261K wps
[Epoch 22 Batch 90/162] avg loss 0.0035132, throughput 12.949K wps
[Epoch 22 Batch 120/162] avg loss 0.003641, throughput 12.934K wps
[Epoch 22 Batch 150/162] avg loss 0.00346108, throughput 12.8093K wps
Begin Testing...
[Epoch 22] train avg loss 0.00351359, test acc 0.9178, test avg loss 0.203456, throughput 12.9666K wps
Observed Improvement.
Begin Testing...
[Epoch 23 Batch 30/162] avg loss 0.00328117, throughput 13.2732K wps
[Epoch 23 Batch 60/162] avg loss 0.00370724, throughput 12.7804K wps
[Epoch 23 Batch 90/162] avg loss 0.00349746, throughput 12.9395K wps
[Epoch 23 Batch 120/162] avg loss 0.00340839, throughput 12.9491K wps
[Epoch 23 Batch 150/162] avg loss 0.0030082, throughput 12.8599K wps
Begin Testing...
[Epoch 23] train avg loss 0.00341061, test acc 0.9189, test avg loss 0.200202, throughput 12.946K wps
Observed Improvement.
Begin Testing...
[Epoch 24 Batch 30/162] avg loss 0.00323023, throughput 13.2267K wps
[Epoch 24 Batch 60/162] avg loss 0.00337513, throughput 12.7383K wps
[Epoch 24 Batch 90/162] avg loss 0.00323589, throughput 12.9074K wps
[Epoch 24 Batch 120/162] avg loss 0.00331298, throughput 12.8667K wps
[Epoch 24 Batch 150/162] avg loss 0.00342413, throughput 12.8262K wps
Begin Testing...
[Epoch 24] train avg loss 0.00332278, test acc 0.9244, test avg loss 0.19717, throughput 12.9065K wps
Observed Improvement.
Begin Testing...
[Epoch 25 Batch 30/162] avg loss 0.00313932, throughput 13.1485K wps
[Epoch 25 Batch 60/162] avg loss 0.00311653, throughput 12.8485K wps
[Epoch 25 Batch 90/162] avg loss 0.00318401, throughput 12.956K wps
[Epoch 25 Batch 120/162] avg loss 0.00317402, throughput 12.9529K wps
[Epoch 25 Batch 150/162] avg loss 0.00299644, throughput 12.9396K wps
Begin Testing...
[Epoch 25] train avg loss 0.00313914, test acc 0.9222, test avg loss 0.195095, throughput 12.9631K wps
[Epoch 26 Batch 30/162] avg loss 0.00312051, throughput 13.1949K wps
[Epoch 26 Batch 60/162] avg loss 0.00294621, throughput 12.8516K wps
[Epoch 26 Batch 90/162] avg loss 0.00308229, throughput 12.962K wps
[Epoch 26 Batch 120/162] avg loss 0.0028425, throughput 12.9726K wps
[Epoch 26 Batch 150/162] avg loss 0.00319458, throughput 12.916K wps
Begin Testing...
[Epoch 26] train avg loss 0.00304266, test acc 0.9244, test avg loss 0.192774, throughput 12.9618K wps
Observed Improvement.
Begin Testing...
[Epoch 27 Batch 30/162] avg loss 0.00275458, throughput 13.2179K wps
[Epoch 27 Batch 60/162] avg loss 0.00279818, throughput 12.798K wps
[Epoch 27 Batch 90/162] avg loss 0.00309411, throughput 12.9384K wps
[Epoch 27 Batch 120/162] avg loss 0.00311899, throughput 12.9816K wps
[Epoch 27 Batch 150/162] avg loss 0.00292367, throughput 12.9617K wps
Begin Testing...
[Epoch 27] train avg loss 0.00296591, test acc 0.9267, test avg loss 0.189606, throughput 12.9777K wps
Observed Improvement.
Begin Testing...
[Epoch 28 Batch 30/162] avg loss 0.00268713, throughput 13.1413K wps
[Epoch 28 Batch 60/162] avg loss 0.00300695, throughput 12.6785K wps
[Epoch 28 Batch 90/162] avg loss 0.00283094, throughput 12.8451K wps
[Epoch 28 Batch 120/162] avg loss 0.00293541, throughput 12.6008K wps
[Epoch 28 Batch 150/162] avg loss 0.00298548, throughput 12.8271K wps
Begin Testing...
[Epoch 28] train avg loss 0.00289077, test acc 0.9256, test avg loss 0.1876, throughput 12.81K wps
[Epoch 29 Batch 30/162] avg loss 0.00270066, throughput 13.1304K wps
[Epoch 29 Batch 60/162] avg loss 0.00271835, throughput 12.7931K wps
[Epoch 29 Batch 90/162] avg loss 0.00276987, throughput 12.9574K wps
[Epoch 29 Batch 120/162] avg loss 0.00271939, throughput 12.9849K wps
[Epoch 29 Batch 150/162] avg loss 0.00271599, throughput 12.9636K wps
Begin Testing...
[Epoch 29] train avg loss 0.00275545, test acc 0.9211, test avg loss 0.190686, throughput 12.961K wps
[Epoch 30 Batch 30/162] avg loss 0.002673, throughput 13.222K wps
[Epoch 30 Batch 60/162] avg loss 0.00267495, throughput 12.7868K wps
[Epoch 30 Batch 90/162] avg loss 0.00268553, throughput 12.9247K wps
[Epoch 30 Batch 120/162] avg loss 0.0024704, throughput 12.9322K wps
[Epoch 30 Batch 150/162] avg loss 0.00281242, throughput 12.7798K wps
Begin Testing...
[Epoch 30] train avg loss 0.00265752, test acc 0.9278, test avg loss 0.189539, throughput 12.927K wps
Observed Improvement.
Begin Testing...
[Epoch 31 Batch 30/162] avg loss 0.00231297, throughput 13.1041K wps
[Epoch 31 Batch 60/162] avg loss 0.0028289, throughput 12.8304K wps
[Epoch 31 Batch 90/162] avg loss 0.00284901, throughput 12.9167K wps
[Epoch 31 Batch 120/162] avg loss 0.00266938, throughput 12.9491K wps
[Epoch 31 Batch 150/162] avg loss 0.00265996, throughput 12.9989K wps
Begin Testing...
[Epoch 31] train avg loss 0.00263652, test acc 0.9233, test avg loss 0.182266, throughput 12.9594K wps
[Epoch 32 Batch 30/162] avg loss 0.00234205, throughput 13.1291K wps
[Epoch 32 Batch 60/162] avg loss 0.00249018, throughput 12.7471K wps
[Epoch 32 Batch 90/162] avg loss 0.00262145, throughput 12.8013K wps
[Epoch 32 Batch 120/162] avg loss 0.00262936, throughput 12.886K wps
[Epoch 32 Batch 150/162] avg loss 0.00245373, throughput 12.897K wps
Begi