/
AggregateBenchmark-jdk11-results.txt
148 lines (117 loc) · 11 KB
/
AggregateBenchmark-jdk11-results.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
================================================================================================
aggregate without grouping
================================================================================================
OpenJDK 64-Bit Server VM 11.0.17+8 on Linux 5.15.0-1023-azure
Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
agg w/o group: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
agg w/o group wholestage off 56895 61603 2729 36.9 27.1 1.0X
agg w/o group wholestage on 1065 1092 28 1969.5 0.5 53.4X
================================================================================================
stat functions
================================================================================================
OpenJDK 64-Bit Server VM 11.0.17+8 on Linux 5.15.0-1023-azure
Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
stddev: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
stddev wholestage off 6564 6571 9 16.0 62.6 1.0X
stddev wholestage on 1187 1226 49 88.3 11.3 5.5X
OpenJDK 64-Bit Server VM 11.0.17+8 on Linux 5.15.0-1023-azure
Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
kurtosis: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
kurtosis wholestage off 31655 31720 92 3.3 301.9 1.0X
kurtosis wholestage on 1802 1829 18 58.2 17.2 17.6X
================================================================================================
aggregate with linear keys
================================================================================================
OpenJDK 64-Bit Server VM 11.0.17+8 on Linux 5.15.0-1023-azure
Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
Aggregate w keys: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
codegen = F 8990 9084 133 9.3 107.2 1.0X
codegen = T, hashmap = F 5029 5034 8 16.7 60.0 1.8X
codegen = T, row-based hashmap = T 2025 2139 203 41.4 24.1 4.4X
codegen = T, vectorized hashmap = T 1045 1067 25 80.2 12.5 8.6X
================================================================================================
aggregate with randomized keys
================================================================================================
OpenJDK 64-Bit Server VM 11.0.17+8 on Linux 5.15.0-1023-azure
Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
Aggregate w keys: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
codegen = F 8467 8581 162 9.9 100.9 1.0X
codegen = T, hashmap = F 5350 5612 237 15.7 63.8 1.6X
codegen = T, row-based hashmap = T 2543 2562 13 33.0 30.3 3.3X
codegen = T, vectorized hashmap = T 1618 1634 13 51.9 19.3 5.2X
================================================================================================
aggregate with string key
================================================================================================
OpenJDK 64-Bit Server VM 11.0.17+8 on Linux 5.15.0-1023-azure
Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
Aggregate w string key: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
codegen = F 3371 3446 106 6.2 160.7 1.0X
codegen = T, hashmap = F 1947 1981 54 10.8 92.8 1.7X
codegen = T, row-based hashmap = T 1262 1275 13 16.6 60.2 2.7X
codegen = T, vectorized hashmap = T 955 964 12 22.0 45.5 3.5X
================================================================================================
aggregate with decimal key
================================================================================================
OpenJDK 64-Bit Server VM 11.0.17+8 on Linux 5.15.0-1023-azure
Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
Aggregate w decimal key: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
codegen = F 3085 3086 1 6.8 147.1 1.0X
codegen = T, hashmap = F 1947 2057 155 10.8 92.9 1.6X
codegen = T, row-based hashmap = T 794 803 10 26.4 37.9 3.9X
codegen = T, vectorized hashmap = T 621 636 17 33.8 29.6 5.0X
================================================================================================
aggregate with multiple key types
================================================================================================
OpenJDK 64-Bit Server VM 11.0.17+8 on Linux 5.15.0-1023-azure
Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
Aggregate w multiple keys: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
codegen = F 5539 5632 133 3.8 264.1 1.0X
codegen = T, hashmap = F 3123 3232 155 6.7 148.9 1.8X
codegen = T, row-based hashmap = T 2016 2117 142 10.4 96.1 2.7X
codegen = T, vectorized hashmap = T 2579 2583 5 8.1 123.0 2.1X
================================================================================================
max function bytecode size of wholestagecodegen
================================================================================================
OpenJDK 64-Bit Server VM 11.0.17+8 on Linux 5.15.0-1023-azure
Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
max function bytecode size: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
codegen = F 483 551 46 1.4 737.7 1.0X
codegen = T, hugeMethodLimit = 10000 241 293 41 2.7 367.3 2.0X
codegen = T, hugeMethodLimit = 1500 176 198 22 3.7 267.9 2.8X
================================================================================================
cube
================================================================================================
OpenJDK 64-Bit Server VM 11.0.17+8 on Linux 5.15.0-1023-azure
Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
cube: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
cube wholestage off 2582 2612 42 2.0 492.5 1.0X
cube wholestage on 1421 1442 26 3.7 271.0 1.8X
================================================================================================
hash and BytesToBytesMap
================================================================================================
OpenJDK 64-Bit Server VM 11.0.17+8 on Linux 5.15.0-1023-azure
Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
BytesToBytesMap: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
UnsafeRowhash 253 254 1 82.8 12.1 1.0X
murmur3 hash 112 112 0 187.9 5.3 2.3X
fast hash 57 57 0 370.3 2.7 4.5X
arrayEqual 180 181 3 116.7 8.6 1.4X
Java HashMap (Long) 98 151 57 213.4 4.7 2.6X
Java HashMap (two ints) 110 163 55 190.2 5.3 2.3X
Java HashMap (UnsafeRow) 687 688 0 30.5 32.8 0.4X
LongToUnsafeRowMap (opt=false) 460 461 1 45.6 21.9 0.6X
LongToUnsafeRowMap (opt=true) 103 126 29 202.8 4.9 2.5X
BytesToBytesMap (off Heap) 697 699 2 30.1 33.2 0.4X
BytesToBytesMap (on Heap) 723 727 5 29.0 34.5 0.4X
Aggregate HashMap 55 58 7 377.9 2.6 4.6X