/
JsonBenchmark-results.txt
127 lines (114 loc) · 11.7 KB
/
JsonBenchmark-results.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
================================================================================================
Benchmark for performance of JSON parsing
================================================================================================
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
JSON schema inferring: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
No encoding 2084 2134 46 2.4 416.8 1.0X
UTF-8 is set 3077 3093 14 1.6 615.3 0.7X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
count a short column: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
No encoding 2854 2863 8 1.8 570.8 1.0X
UTF-8 is set 4066 4066 1 1.2 813.1 0.7X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
count a wide column: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
No encoding 3348 3368 26 0.3 3347.8 1.0X
UTF-8 is set 5215 5239 22 0.2 5214.7 0.6X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
select wide row: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
No encoding 11046 11102 54 0.0 220928.4 1.0X
UTF-8 is set 12135 12181 54 0.0 242697.4 0.9X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
Select a subset of 10 columns: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
Select 10 columns 2486 2488 2 0.4 2486.5 1.0X
Select 1 column 1505 1506 2 0.7 1504.6 1.7X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
creation of JSON parser per line: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
Short column without encoding 888 889 3 1.1 887.6 1.0X
Short column with UTF-8 1134 1136 2 0.9 1134.3 0.8X
Wide column without encoding 8012 8056 51 0.1 8012.4 0.1X
Wide column with UTF-8 9830 9844 22 0.1 9829.7 0.1X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
JSON functions: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
Text read 85 87 2 11.7 85.4 1.0X
from_json 1706 1711 4 0.6 1706.4 0.1X
json_tuple 1528 1534 7 0.7 1528.2 0.1X
get_json_object 1275 1286 17 0.8 1275.0 0.1X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
Dataset of json strings: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
Text read 369 370 1 13.6 73.8 1.0X
schema inferring 1880 1883 4 2.7 376.0 0.2X
parsing 3731 3737 8 1.3 746.1 0.1X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
Json files in the per-line mode: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
Text read 553 579 32 9.0 110.6 1.0X
Schema inferring 2195 2196 2 2.3 439.0 0.3X
Parsing without charset 4272 4274 3 1.2 854.3 0.1X
Parsing with UTF-8 5459 5464 6 0.9 1091.7 0.1X
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
Write dates and timestamps: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
Create a dataset of timestamps 102 112 13 9.8 101.9 1.0X
to_json(timestamp) 840 841 1 1.2 839.6 0.1X
write timestamps to files 692 696 4 1.4 692.0 0.1X
Create a dataset of dates 120 121 1 8.4 119.7 0.9X
to_json(date) 589 591 2 1.7 589.3 0.2X
write dates to files 442 445 2 2.3 442.3 0.2X
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
Read dates and timestamps: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
-----------------------------------------------------------------------------------------------------------------------------------------------------
read timestamp text from files 143 145 4 7.0 142.6 1.0X
read timestamps from files 2449 2469 17 0.4 2448.6 0.1X
infer timestamps from files 5579 5596 15 0.2 5578.8 0.0X
read date text from files 132 139 7 7.6 131.7 1.1X
read date from files 1017 1020 2 1.0 1017.5 0.1X
timestamp strings 227 230 3 4.4 227.2 0.6X
parse timestamps from Dataset[String] 2827 2830 3 0.4 2826.5 0.1X
infer timestamps from Dataset[String] 6001 6008 6 0.2 6001.2 0.0X
date strings 259 261 2 3.9 259.0 0.6X
parse dates from Dataset[String] 1382 1387 4 0.7 1382.3 0.1X
from_json(timestamp) 3557 3561 7 0.3 3556.8 0.0X
from_json(date) 2146 2148 2 0.5 2146.4 0.1X
infer error timestamps from Dataset[String] with default format 1989 1993 4 0.5 1989.3 0.1X
infer error timestamps from Dataset[String] with user-provided format 1922 1925 3 0.5 1922.1 0.1X
infer error timestamps from Dataset[String] with legacy format 1919 1923 4 0.5 1919.1 0.1X
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
Filters pushdown: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
w/o filters 14387 14399 12 0.0 143871.9 1.0X
pushdown disabled 13891 13899 7 0.0 138912.3 1.0X
w/ filters 782 784 2 0.1 7820.5 18.4X
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
Partial JSON results: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
parse invalid JSON 3100 3106 6 0.0 309967.6 1.0X