-
Notifications
You must be signed in to change notification settings - Fork 28k
/
DataSourceReadBenchmark-jdk11-results.txt
431 lines (385 loc) · 42.4 KB
/
DataSourceReadBenchmark-jdk11-results.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
================================================================================================
SQL Single Numeric Column Scan
================================================================================================
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
SQL Single BOOLEAN Column Scan: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
SQL CSV 11004 11065 86 1.4 699.6 1.0X
SQL Json 7986 8011 35 2.0 507.7 1.4X
SQL Parquet Vectorized: DataPageV1 124 148 16 127.0 7.9 88.9X
SQL Parquet Vectorized: DataPageV2 101 115 12 155.0 6.5 108.4X
SQL Parquet MR: DataPageV1 1614 1620 8 9.7 102.6 6.8X
SQL Parquet MR: DataPageV2 1445 1446 2 10.9 91.9 7.6X
SQL ORC Vectorized 163 204 41 96.2 10.4 67.3X
SQL ORC MR 1407 1429 31 11.2 89.4 7.8X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Parquet Reader Single BOOLEAN Column Scan: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
---------------------------------------------------------------------------------------------------------------------------
ParquetReader Vectorized: DataPageV1 123 140 14 128.3 7.8 1.0X
ParquetReader Vectorized: DataPageV2 105 114 11 150.3 6.7 1.2X
ParquetReader Vectorized -> Row: DataPageV1 56 61 5 279.9 3.6 2.2X
ParquetReader Vectorized -> Row: DataPageV2 39 43 4 399.4 2.5 3.1X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
SQL Single TINYINT Column Scan: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
SQL CSV 13262 13310 67 1.2 843.2 1.0X
SQL Json 9104 9173 98 1.7 578.8 1.5X
SQL Parquet Vectorized: DataPageV1 136 172 31 115.4 8.7 97.3X
SQL Parquet Vectorized: DataPageV2 138 153 17 114.0 8.8 96.1X
SQL Parquet MR: DataPageV1 1789 1805 22 8.8 113.7 7.4X
SQL Parquet MR: DataPageV2 1631 1662 44 9.6 103.7 8.1X
SQL ORC Vectorized 210 252 33 74.8 13.4 63.0X
SQL ORC MR 1412 1437 36 11.1 89.7 9.4X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Parquet Reader Single TINYINT Column Scan: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
---------------------------------------------------------------------------------------------------------------------------
ParquetReader Vectorized: DataPageV1 171 183 14 92.0 10.9 1.0X
ParquetReader Vectorized: DataPageV2 175 184 9 90.1 11.1 1.0X
ParquetReader Vectorized -> Row: DataPageV1 88 95 12 179.0 5.6 1.9X
ParquetReader Vectorized -> Row: DataPageV2 88 92 4 179.0 5.6 1.9X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
SQL Single SMALLINT Column Scan: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
SQL CSV 14022 14236 303 1.1 891.5 1.0X
SQL Json 9763 9929 235 1.6 620.7 1.4X
SQL Parquet Vectorized: DataPageV1 173 226 38 90.7 11.0 80.9X
SQL Parquet Vectorized: DataPageV2 222 241 13 70.7 14.1 63.1X
SQL Parquet MR: DataPageV1 2069 2086 24 7.6 131.5 6.8X
SQL Parquet MR: DataPageV2 1771 1806 49 8.9 112.6 7.9X
SQL ORC Vectorized 203 263 37 77.6 12.9 69.2X
SQL ORC MR 1528 1552 34 10.3 97.2 9.2X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Parquet Reader Single SMALLINT Column Scan: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
---------------------------------------------------------------------------------------------------------------------------
ParquetReader Vectorized: DataPageV1 246 256 11 63.9 15.6 1.0X
ParquetReader Vectorized: DataPageV2 301 313 17 52.3 19.1 0.8X
ParquetReader Vectorized -> Row: DataPageV1 257 292 18 61.2 16.3 1.0X
ParquetReader Vectorized -> Row: DataPageV2 296 318 25 53.1 18.8 0.8X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
SQL Single INT Column Scan: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
SQL CSV 16153 16252 140 1.0 1027.0 1.0X
SQL Json 10406 10547 200 1.5 661.6 1.6X
SQL Parquet Vectorized: DataPageV1 159 207 33 99.1 10.1 101.8X
SQL Parquet Vectorized: DataPageV2 337 402 40 46.6 21.4 47.9X
SQL Parquet MR: DataPageV1 2160 2193 46 7.3 137.4 7.5X
SQL Parquet MR: DataPageV2 1892 1900 11 8.3 120.3 8.5X
SQL ORC Vectorized 297 340 42 53.0 18.9 54.5X
SQL ORC MR 1705 1732 38 9.2 108.4 9.5X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Parquet Reader Single INT Column Scan: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
---------------------------------------------------------------------------------------------------------------------------
ParquetReader Vectorized: DataPageV1 251 262 10 62.6 16.0 1.0X
ParquetReader Vectorized: DataPageV2 418 431 13 37.7 26.6 0.6X
ParquetReader Vectorized -> Row: DataPageV1 247 288 30 63.7 15.7 1.0X
ParquetReader Vectorized -> Row: DataPageV2 412 455 39 38.1 26.2 0.6X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
SQL Single BIGINT Column Scan: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
SQL CSV 19896 20026 183 0.8 1264.9 1.0X
SQL Json 12540 12634 132 1.3 797.3 1.6X
SQL Parquet Vectorized: DataPageV1 221 271 30 71.3 14.0 90.1X
SQL Parquet Vectorized: DataPageV2 546 564 23 28.8 34.7 36.5X
SQL Parquet MR: DataPageV1 2196 2211 21 7.2 139.6 9.1X
SQL Parquet MR: DataPageV2 2085 2089 6 7.5 132.5 9.5X
SQL ORC Vectorized 379 416 39 41.5 24.1 52.5X
SQL ORC MR 1858 1859 2 8.5 118.1 10.7X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Parquet Reader Single BIGINT Column Scan: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
---------------------------------------------------------------------------------------------------------------------------
ParquetReader Vectorized: DataPageV1 311 340 20 50.5 19.8 1.0X
ParquetReader Vectorized: DataPageV2 639 647 11 24.6 40.6 0.5X
ParquetReader Vectorized -> Row: DataPageV1 359 376 13 43.9 22.8 0.9X
ParquetReader Vectorized -> Row: DataPageV2 653 658 9 24.1 41.5 0.5X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
SQL Single FLOAT Column Scan: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
SQL CSV 17338 17710 526 0.9 1102.3 1.0X
SQL Json 11844 12121 392 1.3 753.0 1.5X
SQL Parquet Vectorized: DataPageV1 148 187 28 106.2 9.4 117.0X
SQL Parquet Vectorized: DataPageV2 147 183 31 106.8 9.4 117.7X
SQL Parquet MR: DataPageV1 2027 2033 9 7.8 128.9 8.6X
SQL Parquet MR: DataPageV2 1966 1981 21 8.0 125.0 8.8X
SQL ORC Vectorized 399 425 25 39.4 25.4 43.4X
SQL ORC MR 1748 1756 11 9.0 111.2 9.9X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Parquet Reader Single FLOAT Column Scan: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
---------------------------------------------------------------------------------------------------------------------------
ParquetReader Vectorized: DataPageV1 226 240 15 69.6 14.4 1.0X
ParquetReader Vectorized: DataPageV2 225 237 15 69.9 14.3 1.0X
ParquetReader Vectorized -> Row: DataPageV1 247 299 38 63.6 15.7 0.9X
ParquetReader Vectorized -> Row: DataPageV2 245 296 25 64.1 15.6 0.9X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
SQL Single DOUBLE Column Scan: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
SQL CSV 21052 21617 799 0.7 1338.4 1.0X
SQL Json 15822 16049 321 1.0 1005.9 1.3X
SQL Parquet Vectorized: DataPageV1 266 286 19 59.0 16.9 79.0X
SQL Parquet Vectorized: DataPageV2 277 291 14 56.8 17.6 76.0X
SQL Parquet MR: DataPageV1 2267 2275 12 6.9 144.1 9.3X
SQL Parquet MR: DataPageV2 2046 2064 26 7.7 130.1 10.3X
SQL ORC Vectorized 535 545 10 29.4 34.0 39.3X
SQL ORC MR 1976 2000 34 8.0 125.6 10.7X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Parquet Reader Single DOUBLE Column Scan: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
---------------------------------------------------------------------------------------------------------------------------
ParquetReader Vectorized: DataPageV1 314 337 25 50.1 20.0 1.0X
ParquetReader Vectorized: DataPageV2 309 323 14 50.8 19.7 1.0X
ParquetReader Vectorized -> Row: DataPageV1 331 348 13 47.5 21.1 0.9X
ParquetReader Vectorized -> Row: DataPageV2 332 347 11 47.4 21.1 0.9X
================================================================================================
SQL Single Numeric Column Scan in Struct
================================================================================================
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
SQL Single TINYINT Column Scan in Struct: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
-------------------------------------------------------------------------------------------------------------------------------------------
SQL ORC MR 2193 2196 4 7.2 139.5 1.0X
SQL ORC Vectorized (Nested Column Disabled) 2211 2222 16 7.1 140.6 1.0X
SQL ORC Vectorized (Nested Column Enabled) 268 310 32 58.7 17.0 8.2X
SQL Parquet MR: DataPageV1 2243 2280 53 7.0 142.6 1.0X
SQL Parquet Vectorized: DataPageV1 (Nested Column Disabled) 2747 2758 16 5.7 174.6 0.8X
SQL Parquet Vectorized: DataPageV1 (Nested Column Enabled) 155 174 22 101.7 9.8 14.2X
SQL Parquet MR: DataPageV2 2193 2203 13 7.2 139.5 1.0X
SQL Parquet Vectorized: DataPageV2 (Nested Column Disabled) 2709 2733 33 5.8 172.3 0.8X
SQL Parquet Vectorized: DataPageV2 (Nested Column Enabled) 150 174 27 104.7 9.6 14.6X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
SQL Single SMALLINT Column Scan in Struct: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
-------------------------------------------------------------------------------------------------------------------------------------------
SQL ORC MR 2322 2391 97 6.8 147.6 1.0X
SQL ORC Vectorized (Nested Column Disabled) 2362 2374 17 6.7 150.2 1.0X
SQL ORC Vectorized (Nested Column Enabled) 412 419 9 38.2 26.2 5.6X
SQL Parquet MR: DataPageV1 2393 2400 10 6.6 152.1 1.0X
SQL Parquet Vectorized: DataPageV1 (Nested Column Disabled) 2919 2922 4 5.4 185.6 0.8X
SQL Parquet Vectorized: DataPageV1 (Nested Column Enabled) 228 281 54 69.0 14.5 10.2X
SQL Parquet MR: DataPageV2 2223 2240 25 7.1 141.3 1.0X
SQL Parquet Vectorized: DataPageV2 (Nested Column Disabled) 2692 2712 28 5.8 171.2 0.9X
SQL Parquet Vectorized: DataPageV2 (Nested Column Enabled) 341 361 31 46.1 21.7 6.8X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
SQL Single INT Column Scan in Struct: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
-------------------------------------------------------------------------------------------------------------------------------------------
SQL ORC MR 2376 2380 6 6.6 151.0 1.0X
SQL ORC Vectorized (Nested Column Disabled) 2333 2378 64 6.7 148.4 1.0X
SQL ORC Vectorized (Nested Column Enabled) 430 451 20 36.6 27.3 5.5X
SQL Parquet MR: DataPageV1 2485 2501 22 6.3 158.0 1.0X
SQL Parquet Vectorized: DataPageV1 (Nested Column Disabled) 3017 3062 65 5.2 191.8 0.8X
SQL Parquet Vectorized: DataPageV1 (Nested Column Enabled) 189 235 34 83.1 12.0 12.6X
SQL Parquet MR: DataPageV2 2356 2376 29 6.7 149.8 1.0X
SQL Parquet Vectorized: DataPageV2 (Nested Column Disabled) 2823 2831 12 5.6 179.5 0.8X
SQL Parquet Vectorized: DataPageV2 (Nested Column Enabled) 408 447 25 38.5 26.0 5.8X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
SQL Single BIGINT Column Scan in Struct: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
-------------------------------------------------------------------------------------------------------------------------------------------
SQL ORC MR 2614 2633 26 6.0 166.2 1.0X
SQL ORC Vectorized (Nested Column Disabled) 2711 2776 91 5.8 172.4 1.0X
SQL ORC Vectorized (Nested Column Enabled) 556 598 35 28.3 35.4 4.7X
SQL Parquet MR: DataPageV1 2671 2673 2 5.9 169.8 1.0X
SQL Parquet Vectorized: DataPageV1 (Nested Column Disabled) 3148 3172 34 5.0 200.2 0.8X
SQL Parquet Vectorized: DataPageV1 (Nested Column Enabled) 282 369 63 55.8 17.9 9.3X
SQL Parquet MR: DataPageV2 2430 2443 19 6.5 154.5 1.1X
SQL Parquet Vectorized: DataPageV2 (Nested Column Disabled) 3047 3119 101 5.2 193.7 0.9X
SQL Parquet Vectorized: DataPageV2 (Nested Column Enabled) 577 628 62 27.3 36.7 4.5X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
SQL Single FLOAT Column Scan in Struct: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
-------------------------------------------------------------------------------------------------------------------------------------------
SQL ORC MR 2516 2527 15 6.3 160.0 1.0X
SQL ORC Vectorized (Nested Column Disabled) 2588 2591 4 6.1 164.5 1.0X
SQL ORC Vectorized (Nested Column Enabled) 577 589 10 27.3 36.7 4.4X
SQL Parquet MR: DataPageV1 2446 2480 49 6.4 155.5 1.0X
SQL Parquet Vectorized: DataPageV1 (Nested Column Disabled) 3011 3018 10 5.2 191.4 0.8X
SQL Parquet Vectorized: DataPageV1 (Nested Column Enabled) 173 263 70 91.2 11.0 14.6X
SQL Parquet MR: DataPageV2 2204 2216 16 7.1 140.2 1.1X
SQL Parquet Vectorized: DataPageV2 (Nested Column Disabled) 2733 2758 35 5.8 173.8 0.9X
SQL Parquet Vectorized: DataPageV2 (Nested Column Enabled) 173 207 24 90.8 11.0 14.5X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
SQL Single DOUBLE Column Scan in Struct: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
-------------------------------------------------------------------------------------------------------------------------------------------
SQL ORC MR 2566 2632 92 6.1 163.2 1.0X
SQL ORC Vectorized (Nested Column Disabled) 2627 2642 21 6.0 167.0 1.0X
SQL ORC Vectorized (Nested Column Enabled) 678 690 16 23.2 43.1 3.8X
SQL Parquet MR: DataPageV1 2497 2501 6 6.3 158.8 1.0X
SQL Parquet Vectorized: DataPageV1 (Nested Column Disabled) 3726 3752 36 4.2 236.9 0.7X
SQL Parquet Vectorized: DataPageV1 (Nested Column Enabled) 295 401 60 53.4 18.7 8.7X
SQL Parquet MR: DataPageV2 2417 2464 66 6.5 153.7 1.1X
SQL Parquet Vectorized: DataPageV2 (Nested Column Disabled) 3682 3697 20 4.3 234.1 0.7X
SQL Parquet Vectorized: DataPageV2 (Nested Column Enabled) 293 303 14 53.7 18.6 8.8X
================================================================================================
SQL Nested Column Scan
================================================================================================
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
SQL Nested Column Scan: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
-------------------------------------------------------------------------------------------------------------------------------------------
SQL ORC MR 20991 21114 172 0.0 20018.5 1.0X
SQL ORC Vectorized (Nested Column Disabled) 20899 21160 293 0.1 19931.0 1.0X
SQL ORC Vectorized (Nested Column Enabled) 9512 9580 75 0.1 9071.5 2.2X
SQL Parquet MR: DataPageV1 16203 16490 305 0.1 15452.5 1.3X
SQL Parquet Vectorized: DataPageV1 (Nested Column Disabled) 16718 16851 128 0.1 15943.3 1.3X
SQL Parquet Vectorized: DataPageV1 (Nested Column Enabled) 11510 11671 130 0.1 10976.8 1.8X
SQL Parquet MR: DataPageV2 15935 16063 109 0.1 15197.1 1.3X
SQL Parquet Vectorized: DataPageV2 (Nested Column Disabled) 16648 16869 193 0.1 15876.4 1.3X
SQL Parquet Vectorized: DataPageV2 (Nested Column Enabled) 8865 8996 87 0.1 8454.6 2.4X
================================================================================================
Int and String Scan
================================================================================================
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Int and String Scan: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
SQL CSV 14365 14389 34 0.7 1369.9 1.0X
SQL Json 11768 11819 73 0.9 1122.2 1.2X
SQL Parquet Vectorized: DataPageV1 2037 2047 14 5.1 194.2 7.1X
SQL Parquet Vectorized: DataPageV2 2460 2468 11 4.3 234.6 5.8X
SQL Parquet MR: DataPageV1 4289 4334 64 2.4 409.0 3.3X
SQL Parquet MR: DataPageV2 4098 4149 72 2.6 390.8 3.5X
SQL ORC Vectorized 2183 2206 33 4.8 208.2 6.6X
SQL ORC MR 3657 3697 56 2.9 348.8 3.9X
================================================================================================
Repeated String Scan
================================================================================================
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Repeated String: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
SQL CSV 8224 8262 53 1.3 784.3 1.0X
SQL Json 6795 6809 20 1.5 648.1 1.2X
SQL Parquet Vectorized: DataPageV1 587 602 14 17.9 56.0 14.0X
SQL Parquet Vectorized: DataPageV2 563 592 26 18.6 53.7 14.6X
SQL Parquet MR: DataPageV1 1682 1693 15 6.2 160.4 4.9X
SQL Parquet MR: DataPageV2 1562 1593 44 6.7 149.0 5.3X
SQL ORC Vectorized 447 491 52 23.5 42.6 18.4X
SQL ORC MR 1803 1835 46 5.8 171.9 4.6X
================================================================================================
Partitioned Table Scan
================================================================================================
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Partitioned Table: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
---------------------------------------------------------------------------------------------------------------------------------
Data column - CSV 19758 19973 303 0.8 1256.2 1.0X
Data column - Json 12715 12800 120 1.2 808.4 1.6X
Data column - Parquet Vectorized: DataPageV1 260 274 12 60.5 16.5 76.0X
Data column - Parquet Vectorized: DataPageV2 639 673 37 24.6 40.6 30.9X
Data column - Parquet MR: DataPageV1 2598 2638 57 6.1 165.2 7.6X
Data column - Parquet MR: DataPageV2 2535 2543 11 6.2 161.2 7.8X
Data column - ORC Vectorized 397 424 21 39.6 25.3 49.7X
Data column - ORC MR 2252 2378 178 7.0 143.2 8.8X
Partition column - CSV 6023 6057 48 2.6 382.9 3.3X
Partition column - Json 10133 10275 202 1.6 644.2 1.9X
Partition column - Parquet Vectorized: DataPageV1 50 67 18 313.5 3.2 393.9X
Partition column - Parquet Vectorized: DataPageV2 49 63 15 319.8 3.1 401.8X
Partition column - Parquet MR: DataPageV1 1288 1339 72 12.2 81.9 15.3X
Partition column - Parquet MR: DataPageV2 1347 1357 15 11.7 85.6 14.7X
Partition column - ORC Vectorized 53 69 19 299.6 3.3 376.3X
Partition column - ORC MR 1456 1512 79 10.8 92.6 13.6X
Both columns - CSV 19667 19925 364 0.8 1250.4 1.0X
Both columns - Json 14112 14138 36 1.1 897.2 1.4X
Both columns - Parquet Vectorized: DataPageV1 319 334 11 49.4 20.3 62.0X
Both columns - Parquet Vectorized: DataPageV2 725 735 14 21.7 46.1 27.3X
Both columns - Parquet MR: DataPageV1 2815 2848 47 5.6 179.0 7.0X
Both columns - Parquet MR: DataPageV2 2582 2638 80 6.1 164.1 7.7X
Both columns - ORC Vectorized 449 509 48 35.0 28.5 44.0X
Both columns - ORC MR 2437 2439 2 6.5 155.0 8.1X
================================================================================================
String with Nulls Scan
================================================================================================
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
String with Nulls Scan (0.0%): Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
SQL CSV 9544 9547 5 1.1 910.2 1.0X
SQL Json 10015 10102 123 1.0 955.1 1.0X
SQL Parquet Vectorized: DataPageV1 1381 1399 26 7.6 131.7 6.9X
SQL Parquet Vectorized: DataPageV2 1778 1780 2 5.9 169.6 5.4X
SQL Parquet MR: DataPageV1 3675 3708 47 2.9 350.5 2.6X
SQL Parquet MR: DataPageV2 3778 3812 47 2.8 360.3 2.5X
ParquetReader Vectorized: DataPageV1 937 954 24 11.2 89.4 10.2X
ParquetReader Vectorized: DataPageV2 1438 1440 4 7.3 137.1 6.6X
SQL ORC Vectorized 1061 1065 5 9.9 101.2 9.0X
SQL ORC MR 2899 2937 54 3.6 276.5 3.3X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
String with Nulls Scan (50.0%): Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
SQL CSV 7267 7292 35 1.4 693.0 1.0X
SQL Json 7658 7728 99 1.4 730.4 0.9X
SQL Parquet Vectorized: DataPageV1 1021 1055 48 10.3 97.4 7.1X
SQL Parquet Vectorized: DataPageV2 1411 1412 1 7.4 134.6 5.1X
SQL Parquet MR: DataPageV1 2866 2883 24 3.7 273.3 2.5X
SQL Parquet MR: DataPageV2 2880 2899 26 3.6 274.7 2.5X
ParquetReader Vectorized: DataPageV1 1000 1065 93 10.5 95.3 7.3X
ParquetReader Vectorized: DataPageV2 1288 1294 9 8.1 122.8 5.6X
SQL ORC Vectorized 1274 1311 51 8.2 121.5 5.7X
SQL ORC MR 2818 2884 92 3.7 268.8 2.6X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
String with Nulls Scan (95.0%): Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
SQL CSV 5408 5434 38 1.9 515.7 1.0X
SQL Json 4570 4693 175 2.3 435.8 1.2X
SQL Parquet Vectorized: DataPageV1 254 274 25 41.3 24.2 21.3X
SQL Parquet Vectorized: DataPageV2 316 336 22 33.2 30.1 17.1X
SQL Parquet MR: DataPageV1 1738 1768 42 6.0 165.8 3.1X
SQL Parquet MR: DataPageV2 1613 1619 10 6.5 153.8 3.4X
ParquetReader Vectorized: DataPageV1 265 274 10 39.5 25.3 20.4X
ParquetReader Vectorized: DataPageV2 326 335 9 32.1 31.1 16.6X
SQL ORC Vectorized 383 407 37 27.4 36.5 14.1X
SQL ORC MR 1543 1550 11 6.8 147.1 3.5X
================================================================================================
Single Column Scan From Wide Columns
================================================================================================
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Single Column Scan from 10 columns: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
SQL CSV 2192 2196 6 0.5 2090.2 1.0X
SQL Json 2994 3016 31 0.4 2855.4 0.7X
SQL Parquet Vectorized: DataPageV1 42 58 16 25.2 39.6 52.8X
SQL Parquet Vectorized: DataPageV2 62 79 19 16.8 59.5 35.2X
SQL Parquet MR: DataPageV1 184 201 22 5.7 175.6 11.9X
SQL Parquet MR: DataPageV2 171 192 26 6.1 163.0 12.8X
SQL ORC Vectorized 52 74 27 20.2 49.5 42.2X
SQL ORC MR 143 167 25 7.4 136.0 15.4X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Single Column Scan from 50 columns: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
SQL CSV 5245 5333 126 0.2 5001.7 1.0X
SQL Json 11916 12064 210 0.1 11363.9 0.4X
SQL Parquet Vectorized: DataPageV1 56 91 28 18.6 53.7 93.1X
SQL Parquet Vectorized: DataPageV2 76 99 22 13.7 72.9 68.6X
SQL Parquet MR: DataPageV1 194 221 33 5.4 185.5 27.0X
SQL Parquet MR: DataPageV2 184 211 23 5.7 175.0 28.6X
SQL ORC Vectorized 64 89 27 16.5 60.7 82.4X
SQL ORC MR 151 177 30 6.9 144.3 34.7X
OpenJDK 64-Bit Server VM 11.0.14+9-LTS on Linux 5.13.0-1021-azure
Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Single Column Scan from 100 columns: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
SQL CSV 9296 9407 157 0.1 8865.0 1.0X
SQL Json 23191 23509 450 0.0 22116.5 0.4X
SQL Parquet Vectorized: DataPageV1 87 119 24 12.1 82.5 107.5X
SQL Parquet Vectorized: DataPageV2 105 125 21 10.0 100.2 88.5X
SQL Parquet MR: DataPageV1 226 253 34 4.6 215.7 41.1X
SQL Parquet MR: DataPageV2 232 263 31 4.5 221.2 40.1X
SQL ORC Vectorized 78 101 20 13.4 74.6 118.9X
SQL ORC MR 173 191 28 6.1 164.8 53.8X