-
Notifications
You must be signed in to change notification settings - Fork 151
Expand file tree
/
Copy pathsentiment_raw_20180817.log
More file actions
412 lines (411 loc) · 18.5 KB
/
sentiment_raw_20180817.log
File metadata and controls
412 lines (411 loc) · 18.5 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
Namespace(batch_size=16, bucket_mult=100, bucket_num=10, bucket_ratio=0.0, bucket_type='fixed', clip=None, dropout=0.0, epochs=3, gpu=0, lm_model='standard_lstm_lm_200', log_interval=30, lr=0.005, no_pretrained=True, save_prefix='imdb_lstm_200', use_mean_pool=True, valid_ratio=0.1)
Use gpu0
Tokenize using spaCy...
Done! Tokenizing Time=8.51s, #Sentences=22500
Done! Tokenizing Time=1.36s, #Sentences=2500
Done! Tokenizing Time=8.90s, #Sentences=25000
Use FixedBucketSampler
FixedBucketSampler:
sample_num=22500, batch_num=1412
key=[59, 108, 157, 206, 255, 304, 353, 402, 451, 500]
cnt=[541, 1794, 4583, 4609, 2737, 1848, 1322, 1052, 780, 3234]
batch_size=[16, 16, 16, 16, 16, 16, 16, 16, 16, 16]
SentimentNet(
(embedding): HybridSequential(
(0): Embedding(33278 -> 200, float32)
)
(encoder): LSTM(200 -> 200, TNC, num_layers=2)
(agg_layer): AggregationLayer(
)
(output): HybridSequential(
(0): Dropout(p = 0.0, axes=())
(1): Dense(None -> 1, linear)
)
)
[22:26:59] src/storage/storage.cc:129: Using GPUPooledRoundedStorageManager.
[Epoch 0 Batch 30/1412] avg loss 0.0428669, throughput 106.378K wps
[Epoch 0 Batch 60/1412] avg loss 0.0570866, throughput 168.379K wps
[Epoch 0 Batch 90/1412] avg loss 0.0439298, throughput 156.44K wps
[Epoch 0 Batch 120/1412] avg loss 0.0443242, throughput 159.624K wps
[Epoch 0 Batch 150/1412] avg loss 0.0449614, throughput 152.496K wps
[Epoch 0 Batch 180/1412] avg loss 0.043559, throughput 152.676K wps
[Epoch 0 Batch 210/1412] avg loss 0.0432507, throughput 130.492K wps
[Epoch 0 Batch 240/1412] avg loss 0.0433857, throughput 152.49K wps
[Epoch 0 Batch 270/1412] avg loss 0.0433795, throughput 138.597K wps
[Epoch 0 Batch 300/1412] avg loss 0.0445076, throughput 142.928K wps
[Epoch 0 Batch 330/1412] avg loss 0.0427613, throughput 145.381K wps
[Epoch 0 Batch 360/1412] avg loss 0.0430574, throughput 127.357K wps
[Epoch 0 Batch 390/1412] avg loss 0.0413071, throughput 136.346K wps
[Epoch 0 Batch 420/1412] avg loss 0.0389483, throughput 151.582K wps
[Epoch 0 Batch 450/1412] avg loss 0.0370766, throughput 153.719K wps
[Epoch 0 Batch 480/1412] avg loss 0.0359703, throughput 101.234K wps
[Epoch 0 Batch 510/1412] avg loss 0.0341675, throughput 150.443K wps
[Epoch 0 Batch 540/1412] avg loss 0.0308559, throughput 158.564K wps
[Epoch 0 Batch 570/1412] avg loss 0.0322593, throughput 150.255K wps
[Epoch 0 Batch 600/1412] avg loss 0.0449493, throughput 136.193K wps
[Epoch 0 Batch 630/1412] avg loss 0.0400247, throughput 138.369K wps
[Epoch 0 Batch 660/1412] avg loss 0.0393196, throughput 162.616K wps
[Epoch 0 Batch 690/1412] avg loss 0.0382806, throughput 161.626K wps
[Epoch 0 Batch 720/1412] avg loss 0.0381384, throughput 150.525K wps
[Epoch 0 Batch 750/1412] avg loss 0.031074, throughput 144.569K wps
[Epoch 0 Batch 780/1412] avg loss 0.0291415, throughput 166.854K wps
[Epoch 0 Batch 810/1412] avg loss 0.0294662, throughput 156.329K wps
[Epoch 0 Batch 840/1412] avg loss 0.0297857, throughput 140.029K wps
[Epoch 0 Batch 870/1412] avg loss 0.0263667, throughput 147.856K wps
[Epoch 0 Batch 900/1412] avg loss 0.031217, throughput 173.415K wps
[Epoch 0 Batch 930/1412] avg loss 0.0288161, throughput 150.079K wps
[Epoch 0 Batch 960/1412] avg loss 0.0286358, throughput 159.721K wps
[Epoch 0 Batch 990/1412] avg loss 0.0283354, throughput 166.265K wps
[Epoch 0 Batch 1020/1412] avg loss 0.0261076, throughput 159.984K wps
[Epoch 0 Batch 1050/1412] avg loss 0.0241801, throughput 126.988K wps
[Epoch 0 Batch 1080/1412] avg loss 0.0307288, throughput 129.859K wps
[Epoch 0 Batch 1110/1412] avg loss 0.0267675, throughput 150.202K wps
[Epoch 0 Batch 1140/1412] avg loss 0.0276033, throughput 144.298K wps
[Epoch 0 Batch 1170/1412] avg loss 0.0224557, throughput 149.781K wps
[Epoch 0 Batch 1200/1412] avg loss 0.0258165, throughput 143.115K wps
[Epoch 0 Batch 1230/1412] avg loss 0.0224211, throughput 155.295K wps
[Epoch 0 Batch 1260/1412] avg loss 0.0237606, throughput 145.458K wps
[Epoch 0 Batch 1290/1412] avg loss 0.0237062, throughput 144.302K wps
[Epoch 0 Batch 1320/1412] avg loss 0.0273534, throughput 146.953K wps
[Epoch 0 Batch 1350/1412] avg loss 0.0191257, throughput 152.756K wps
[Epoch 0 Batch 1380/1412] avg loss 0.0249543, throughput 152.931K wps
[Epoch 0 Batch 1410/1412] avg loss 0.0228795, throughput 169.814K wps
Begin Testing...
[Batch 30/157] elapsed 0.57 s
[Batch 60/157] elapsed 0.46 s
[Batch 90/157] elapsed 0.37 s
[Batch 120/157] elapsed 0.27 s
[Batch 150/157] elapsed 0.25 s
Begin Testing...
[Batch 30/1563] elapsed 0.71 s
[Batch 60/1563] elapsed 0.65 s
[Batch 90/1563] elapsed 0.71 s
[Batch 120/1563] elapsed 0.71 s
[Batch 150/1563] elapsed 0.64 s
[Batch 180/1563] elapsed 0.62 s
[Batch 210/1563] elapsed 0.66 s
[Batch 240/1563] elapsed 0.61 s
[Batch 270/1563] elapsed 0.62 s
[Batch 300/1563] elapsed 0.59 s
[Batch 330/1563] elapsed 0.49 s
[Batch 360/1563] elapsed 0.54 s
[Batch 390/1563] elapsed 0.52 s
[Batch 420/1563] elapsed 0.35 s
[Batch 450/1563] elapsed 0.42 s
[Batch 480/1563] elapsed 0.45 s
[Batch 510/1563] elapsed 0.45 s
[Batch 540/1563] elapsed 0.42 s
[Batch 570/1563] elapsed 0.43 s
[Batch 600/1563] elapsed 0.41 s
[Batch 630/1563] elapsed 0.41 s
[Batch 660/1563] elapsed 0.40 s
[Batch 690/1563] elapsed 0.40 s
[Batch 720/1563] elapsed 0.38 s
[Batch 750/1563] elapsed 0.37 s
[Batch 780/1563] elapsed 0.36 s
[Batch 810/1563] elapsed 0.35 s
[Batch 840/1563] elapsed 0.36 s
[Batch 870/1563] elapsed 0.35 s
[Batch 900/1563] elapsed 0.32 s
[Batch 930/1563] elapsed 0.35 s
[Batch 960/1563] elapsed 0.28 s
[Batch 990/1563] elapsed 0.28 s
[Batch 1020/1563] elapsed 0.29 s
[Batch 1050/1563] elapsed 0.33 s
[Batch 1080/1563] elapsed 0.31 s
[Batch 1110/1563] elapsed 0.29 s
[Batch 1140/1563] elapsed 0.27 s
[Batch 1170/1563] elapsed 0.26 s
[Batch 1200/1563] elapsed 0.27 s
[Batch 1230/1563] elapsed 0.27 s
[Batch 1260/1563] elapsed 0.27 s
[Batch 1290/1563] elapsed 0.25 s
[Batch 1320/1563] elapsed 0.27 s
[Batch 1350/1563] elapsed 0.26 s
[Batch 1380/1563] elapsed 0.20 s
[Batch 1410/1563] elapsed 0.21 s
[Batch 1440/1563] elapsed 0.18 s
[Batch 1470/1563] elapsed 0.15 s
[Batch 1500/1563] elapsed 0.14 s
[Batch 1530/1563] elapsed 0.19 s
[Batch 1560/1563] elapsed 0.19 s
[Epoch 0] train avg loss 0.0339896, valid acc 0.8576, valid avg loss 0.365598, test acc 0.8383, test avg loss 0.386616, throughput 146.557K wps
Observed Improvement.
[Epoch 1 Batch 30/1412] avg loss 0.0164784, throughput 156.317K wps
[Epoch 1 Batch 60/1412] avg loss 0.0195543, throughput 194.057K wps
[Epoch 1 Batch 90/1412] avg loss 0.016039, throughput 161.211K wps
[Epoch 1 Batch 120/1412] avg loss 0.0180766, throughput 143.514K wps
[Epoch 1 Batch 150/1412] avg loss 0.0157969, throughput 132.522K wps
[Epoch 1 Batch 180/1412] avg loss 0.0183245, throughput 183.884K wps
[Epoch 1 Batch 210/1412] avg loss 0.0162369, throughput 154.258K wps
[Epoch 1 Batch 240/1412] avg loss 0.0172209, throughput 138.354K wps
[Epoch 1 Batch 270/1412] avg loss 0.0184088, throughput 143.766K wps
[Epoch 1 Batch 300/1412] avg loss 0.0153024, throughput 148.588K wps
[Epoch 1 Batch 330/1412] avg loss 0.0138965, throughput 154.2K wps
[Epoch 1 Batch 360/1412] avg loss 0.0164958, throughput 141.821K wps
[Epoch 1 Batch 390/1412] avg loss 0.0180795, throughput 137.314K wps
[Epoch 1 Batch 420/1412] avg loss 0.0179689, throughput 133.119K wps
[Epoch 1 Batch 450/1412] avg loss 0.0191183, throughput 127.396K wps
[Epoch 1 Batch 480/1412] avg loss 0.0160748, throughput 148.973K wps
[Epoch 1 Batch 510/1412] avg loss 0.0144047, throughput 167.895K wps
[Epoch 1 Batch 540/1412] avg loss 0.0147949, throughput 158.629K wps
[Epoch 1 Batch 570/1412] avg loss 0.0167364, throughput 165.848K wps
[Epoch 1 Batch 600/1412] avg loss 0.0153618, throughput 153.916K wps
[Epoch 1 Batch 630/1412] avg loss 0.0153536, throughput 145.819K wps
[Epoch 1 Batch 660/1412] avg loss 0.0158145, throughput 133.36K wps
[Epoch 1 Batch 690/1412] avg loss 0.0168222, throughput 125.331K wps
[Epoch 1 Batch 720/1412] avg loss 0.0188277, throughput 135.199K wps
[Epoch 1 Batch 750/1412] avg loss 0.0185904, throughput 138.009K wps
[Epoch 1 Batch 780/1412] avg loss 0.0172805, throughput 129.293K wps
[Epoch 1 Batch 810/1412] avg loss 0.0163323, throughput 146.436K wps
[Epoch 1 Batch 840/1412] avg loss 0.0198024, throughput 146.528K wps
[Epoch 1 Batch 870/1412] avg loss 0.016999, throughput 179.667K wps
[Epoch 1 Batch 900/1412] avg loss 0.0152223, throughput 166.778K wps
[Epoch 1 Batch 930/1412] avg loss 0.0202609, throughput 157.282K wps
[Epoch 1 Batch 960/1412] avg loss 0.0169, throughput 146.135K wps
[Epoch 1 Batch 990/1412] avg loss 0.0155935, throughput 147.879K wps
[Epoch 1 Batch 1020/1412] avg loss 0.0181931, throughput 187.976K wps
[Epoch 1 Batch 1050/1412] avg loss 0.0182481, throughput 142.725K wps
[Epoch 1 Batch 1080/1412] avg loss 0.0168207, throughput 148.122K wps
[Epoch 1 Batch 1110/1412] avg loss 0.0161095, throughput 165.792K wps
[Epoch 1 Batch 1140/1412] avg loss 0.0139229, throughput 149.435K wps
[Epoch 1 Batch 1170/1412] avg loss 0.0152732, throughput 127.445K wps
[Epoch 1 Batch 1200/1412] avg loss 0.0154432, throughput 173.635K wps
[Epoch 1 Batch 1230/1412] avg loss 0.0157494, throughput 135.453K wps
[Epoch 1 Batch 1260/1412] avg loss 0.0161098, throughput 155.932K wps
[Epoch 1 Batch 1290/1412] avg loss 0.0161368, throughput 142.513K wps
[Epoch 1 Batch 1320/1412] avg loss 0.0151052, throughput 157.653K wps
[Epoch 1 Batch 1350/1412] avg loss 0.0150935, throughput 142.506K wps
[Epoch 1 Batch 1380/1412] avg loss 0.0154601, throughput 160.373K wps
[Epoch 1 Batch 1410/1412] avg loss 0.0163084, throughput 161.016K wps
Begin Testing...
[Batch 30/157] elapsed 0.71 s
[Batch 60/157] elapsed 0.49 s
[Batch 90/157] elapsed 0.39 s
[Batch 120/157] elapsed 0.33 s
[Batch 150/157] elapsed 0.27 s
Begin Testing...
[Batch 30/1563] elapsed 0.77 s
[Batch 60/1563] elapsed 0.71 s
[Batch 90/1563] elapsed 0.74 s
[Batch 120/1563] elapsed 0.73 s
[Batch 150/1563] elapsed 0.69 s
[Batch 180/1563] elapsed 0.62 s
[Batch 210/1563] elapsed 0.58 s
[Batch 240/1563] elapsed 0.65 s
[Batch 270/1563] elapsed 0.54 s
[Batch 300/1563] elapsed 0.51 s
[Batch 330/1563] elapsed 0.48 s
[Batch 360/1563] elapsed 0.49 s
[Batch 390/1563] elapsed 0.39 s
[Batch 420/1563] elapsed 0.40 s
[Batch 450/1563] elapsed 0.44 s
[Batch 480/1563] elapsed 0.44 s
[Batch 510/1563] elapsed 0.35 s
[Batch 540/1563] elapsed 0.34 s
[Batch 570/1563] elapsed 0.36 s
[Batch 600/1563] elapsed 0.30 s
[Batch 630/1563] elapsed 0.30 s
[Batch 660/1563] elapsed 0.30 s
[Batch 690/1563] elapsed 0.27 s
[Batch 720/1563] elapsed 0.27 s
[Batch 750/1563] elapsed 0.30 s
[Batch 780/1563] elapsed 0.33 s
[Batch 810/1563] elapsed 0.32 s
[Batch 840/1563] elapsed 0.29 s
[Batch 870/1563] elapsed 0.24 s
[Batch 900/1563] elapsed 0.23 s
[Batch 930/1563] elapsed 0.23 s
[Batch 960/1563] elapsed 0.23 s
[Batch 990/1563] elapsed 0.27 s
[Batch 1020/1563] elapsed 0.29 s
[Batch 1050/1563] elapsed 0.25 s
[Batch 1080/1563] elapsed 0.22 s
[Batch 1110/1563] elapsed 0.23 s
[Batch 1140/1563] elapsed 0.23 s
[Batch 1170/1563] elapsed 0.22 s
[Batch 1200/1563] elapsed 0.21 s
[Batch 1230/1563] elapsed 0.25 s
[Batch 1260/1563] elapsed 0.27 s
[Batch 1290/1563] elapsed 0.26 s
[Batch 1320/1563] elapsed 0.23 s
[Batch 1350/1563] elapsed 0.22 s
[Batch 1380/1563] elapsed 0.22 s
[Batch 1410/1563] elapsed 0.23 s
[Batch 1440/1563] elapsed 0.22 s
[Batch 1470/1563] elapsed 0.19 s
[Batch 1500/1563] elapsed 0.17 s
[Batch 1530/1563] elapsed 0.14 s
[Batch 1560/1563] elapsed 0.13 s
[Epoch 1] train avg loss 0.0166272, valid acc 0.8852, valid avg loss 0.301081, test acc 0.8560, test avg loss 0.340057, throughput 149.671K wps
Observed Improvement.
[Epoch 2 Batch 30/1412] avg loss 0.00652912, throughput 168.676K wps
[Epoch 2 Batch 60/1412] avg loss 0.00728209, throughput 172.449K wps
[Epoch 2 Batch 90/1412] avg loss 0.0103104, throughput 174.691K wps
[Epoch 2 Batch 120/1412] avg loss 0.00847687, throughput 160.652K wps
[Epoch 2 Batch 150/1412] avg loss 0.0113538, throughput 170.461K wps
[Epoch 2 Batch 180/1412] avg loss 0.00837356, throughput 165.28K wps
[Epoch 2 Batch 210/1412] avg loss 0.00791863, throughput 173.254K wps
[Epoch 2 Batch 240/1412] avg loss 0.00816121, throughput 151.527K wps
[Epoch 2 Batch 270/1412] avg loss 0.00737816, throughput 153.706K wps
[Epoch 2 Batch 300/1412] avg loss 0.010716, throughput 171.591K wps
[Epoch 2 Batch 330/1412] avg loss 0.0114099, throughput 159.642K wps
[Epoch 2 Batch 360/1412] avg loss 0.00711972, throughput 159.317K wps
[Epoch 2 Batch 390/1412] avg loss 0.00753915, throughput 175.116K wps
[Epoch 2 Batch 420/1412] avg loss 0.010602, throughput 162.14K wps
[Epoch 2 Batch 450/1412] avg loss 0.0103454, throughput 174.899K wps
[Epoch 2 Batch 480/1412] avg loss 0.0115595, throughput 141.081K wps
[Epoch 2 Batch 510/1412] avg loss 0.00997689, throughput 168.937K wps
[Epoch 2 Batch 540/1412] avg loss 0.00886734, throughput 172.366K wps
[Epoch 2 Batch 570/1412] avg loss 0.00999336, throughput 175.769K wps
[Epoch 2 Batch 600/1412] avg loss 0.00907662, throughput 164.702K wps
[Epoch 2 Batch 630/1412] avg loss 0.00943337, throughput 177.321K wps
[Epoch 2 Batch 660/1412] avg loss 0.00882155, throughput 151.687K wps
[Epoch 2 Batch 690/1412] avg loss 0.0102559, throughput 157.662K wps
[Epoch 2 Batch 720/1412] avg loss 0.00911931, throughput 161.575K wps
[Epoch 2 Batch 750/1412] avg loss 0.0093397, throughput 150.315K wps
[Epoch 2 Batch 780/1412] avg loss 0.0117298, throughput 139.682K wps
[Epoch 2 Batch 810/1412] avg loss 0.00990772, throughput 154.829K wps
[Epoch 2 Batch 840/1412] avg loss 0.0114896, throughput 157.851K wps
[Epoch 2 Batch 870/1412] avg loss 0.0131169, throughput 164.421K wps
[Epoch 2 Batch 900/1412] avg loss 0.0117314, throughput 155.224K wps
[Epoch 2 Batch 930/1412] avg loss 0.00785715, throughput 157.559K wps
[Epoch 2 Batch 960/1412] avg loss 0.00978283, throughput 153.974K wps
[Epoch 2 Batch 990/1412] avg loss 0.00991021, throughput 142.764K wps
[Epoch 2 Batch 1020/1412] avg loss 0.0104802, throughput 171.805K wps
[Epoch 2 Batch 1050/1412] avg loss 0.00971933, throughput 167.467K wps
[Epoch 2 Batch 1080/1412] avg loss 0.00954951, throughput 165.625K wps
[Epoch 2 Batch 1110/1412] avg loss 0.0132563, throughput 152.675K wps
[Epoch 2 Batch 1140/1412] avg loss 0.00928522, throughput 169.422K wps
[Epoch 2 Batch 1170/1412] avg loss 0.00886832, throughput 146.858K wps
[Epoch 2 Batch 1200/1412] avg loss 0.0138692, throughput 184.118K wps
[Epoch 2 Batch 1230/1412] avg loss 0.00932438, throughput 161.471K wps
[Epoch 2 Batch 1260/1412] avg loss 0.0133991, throughput 163.4K wps
[Epoch 2 Batch 1290/1412] avg loss 0.00901605, throughput 168.981K wps
[Epoch 2 Batch 1320/1412] avg loss 0.00925615, throughput 159.753K wps
[Epoch 2 Batch 1350/1412] avg loss 0.00751305, throughput 141.891K wps
[Epoch 2 Batch 1380/1412] avg loss 0.0137352, throughput 142.459K wps
[Epoch 2 Batch 1410/1412] avg loss 0.0101656, throughput 168.485K wps
Begin Testing...
[Batch 30/157] elapsed 0.49 s
[Batch 60/157] elapsed 0.39 s
[Batch 90/157] elapsed 0.30 s
[Batch 120/157] elapsed 0.22 s
[Batch 150/157] elapsed 0.18 s
Begin Testing...
[Batch 30/1563] elapsed 0.59 s
[Batch 60/1563] elapsed 0.58 s
[Batch 90/1563] elapsed 0.55 s
[Batch 120/1563] elapsed 0.58 s
[Batch 150/1563] elapsed 0.62 s
[Batch 180/1563] elapsed 0.63 s
[Batch 210/1563] elapsed 0.56 s
[Batch 240/1563] elapsed 0.64 s
[Batch 270/1563] elapsed 0.59 s
[Batch 300/1563] elapsed 0.52 s
[Batch 330/1563] elapsed 0.53 s
[Batch 360/1563] elapsed 0.53 s
[Batch 390/1563] elapsed 0.52 s
[Batch 420/1563] elapsed 0.43 s
[Batch 450/1563] elapsed 0.41 s
[Batch 480/1563] elapsed 0.45 s
[Batch 510/1563] elapsed 0.41 s
[Batch 540/1563] elapsed 0.40 s
[Batch 570/1563] elapsed 0.40 s
[Batch 600/1563] elapsed 0.35 s
[Batch 630/1563] elapsed 0.37 s
[Batch 660/1563] elapsed 0.31 s
[Batch 690/1563] elapsed 0.29 s
[Batch 720/1563] elapsed 0.26 s
[Batch 750/1563] elapsed 0.26 s
[Batch 780/1563] elapsed 0.26 s
[Batch 810/1563] elapsed 0.27 s
[Batch 840/1563] elapsed 0.26 s
[Batch 870/1563] elapsed 0.27 s
[Batch 900/1563] elapsed 0.25 s
[Batch 930/1563] elapsed 0.25 s
[Batch 960/1563] elapsed 0.26 s
[Batch 990/1563] elapsed 0.26 s
[Batch 1020/1563] elapsed 0.25 s
[Batch 1050/1563] elapsed 0.24 s
[Batch 1080/1563] elapsed 0.26 s
[Batch 1110/1563] elapsed 0.23 s
[Batch 1140/1563] elapsed 0.26 s
[Batch 1170/1563] elapsed 0.30 s
[Batch 1200/1563] elapsed 0.27 s
[Batch 1230/1563] elapsed 0.27 s
[Batch 1260/1563] elapsed 0.26 s
[Batch 1290/1563] elapsed 0.23 s
[Batch 1320/1563] elapsed 0.21 s
[Batch 1350/1563] elapsed 0.23 s
[Batch 1380/1563] elapsed 0.22 s
[Batch 1410/1563] elapsed 0.20 s
[Batch 1440/1563] elapsed 0.18 s
[Batch 1470/1563] elapsed 0.17 s
[Batch 1500/1563] elapsed 0.15 s
[Batch 1530/1563] elapsed 0.16 s
[Batch 1560/1563] elapsed 0.14 s
[Epoch 2] train avg loss 0.00986556, valid acc 0.8720, valid avg loss 0.373289, test acc 0.8450, test avg loss 0.43176, throughput 161.126K wps
No Improvement.
Begin Testing...
[Batch 30/157] elapsed 0.60 s
[Batch 60/157] elapsed 0.49 s
[Batch 90/157] elapsed 0.34 s
[Batch 120/157] elapsed 0.25 s
[Batch 150/157] elapsed 0.25 s
Begin Testing...
[Batch 30/1563] elapsed 0.70 s
[Batch 60/1563] elapsed 0.66 s
[Batch 90/1563] elapsed 0.64 s
[Batch 120/1563] elapsed 0.63 s
[Batch 150/1563] elapsed 0.62 s
[Batch 180/1563] elapsed 0.65 s
[Batch 210/1563] elapsed 0.64 s
[Batch 240/1563] elapsed 0.56 s
[Batch 270/1563] elapsed 0.56 s
[Batch 300/1563] elapsed 0.50 s
[Batch 330/1563] elapsed 0.52 s
[Batch 360/1563] elapsed 0.54 s
[Batch 390/1563] elapsed 0.51 s
[Batch 420/1563] elapsed 0.44 s
[Batch 450/1563] elapsed 0.47 s
[Batch 480/1563] elapsed 0.41 s
[Batch 510/1563] elapsed 0.37 s
[Batch 540/1563] elapsed 0.42 s
[Batch 570/1563] elapsed 0.40 s
[Batch 600/1563] elapsed 0.38 s
[Batch 630/1563] elapsed 0.40 s
[Batch 660/1563] elapsed 0.38 s
[Batch 690/1563] elapsed 0.35 s
[Batch 720/1563] elapsed 0.30 s
[Batch 750/1563] elapsed 0.31 s
[Batch 780/1563] elapsed 0.30 s
[Batch 810/1563] elapsed 0.27 s
[Batch 840/1563] elapsed 0.29 s
[Batch 870/1563] elapsed 0.26 s
[Batch 900/1563] elapsed 0.28 s
[Batch 930/1563] elapsed 0.28 s
[Batch 960/1563] elapsed 0.24 s
[Batch 990/1563] elapsed 0.22 s
[Batch 1020/1563] elapsed 0.24 s
[Batch 1050/1563] elapsed 0.24 s
[Batch 1080/1563] elapsed 0.24 s
[Batch 1110/1563] elapsed 0.26 s
[Batch 1140/1563] elapsed 0.26 s
[Batch 1170/1563] elapsed 0.23 s
[Batch 1200/1563] elapsed 0.24 s
[Batch 1230/1563] elapsed 0.24 s
[Batch 1260/1563] elapsed 0.22 s
[Batch 1290/1563] elapsed 0.21 s
[Batch 1320/1563] elapsed 0.19 s
[Batch 1350/1563] elapsed 0.19 s
[Batch 1380/1563] elapsed 0.19 s
[Batch 1410/1563] elapsed 0.20 s
[Batch 1440/1563] elapsed 0.19 s
[Batch 1470/1563] elapsed 0.17 s
[Batch 1500/1563] elapsed 0.16 s
[Batch 1530/1563] elapsed 0.15 s
[Batch 1560/1563] elapsed 0.13 s
Best validation loss 0.301081, validation acc 0.8852
Best test loss 0.340057, test acc 0.8560
Total time cost 192.25s