Speedup hashing of interned objects. #1053

ryzhyk · 2021-08-18T18:28:25Z

We recently changed the implementation of Hash for interned objects to
hash the value of the object itself instead of hashing the pointer,
which is non-deterministic in some scenarios. As a result, hashing
became much more expensive for interned types. The biggest user of
hashing is the internment library itself, which hashes objects in order
to intern them, so interning an object whose fields are also interned
values became many times more expensive. This affected, for example,
the performance of OVN in some scenarios.

To solve this, we change the way interned objects are hashed once again.
We already store a hash with each interned object; it is used to speed
up comparison operators by comparing hashes instead of values.
Similarly, we now hash an interned object by hashing the stored hash
instead of hashing the object itself. We also increase the size of the
stored hash to 64 bits.

Signed-off-by: Leonid Ryzhyk <lryzhyk@vmware.com>
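
The idea described above can be illustrated with a minimal Rust sketch. This is not the code from this PR: the `Interned` type, its fields, and the `hash_of` helper are hypothetical names chosen for illustration, the interning table that deduplicates values is omitted, and the equality check shown here (compare hashes first, then fall back to comparing values) is an assumption rather than the library's actual comparison logic.

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};
use std::sync::Arc;

/// Hypothetical interned handle: a shared value plus its precomputed 64-bit hash.
pub struct Interned<T> {
    value: Arc<T>,
    hash: u64,
}

impl<T: Hash> Interned<T> {
    /// Hash the value once, at creation time, and keep the result.
    /// (A real interner would also look the value up in a table to
    /// deduplicate it; that part is left out of this sketch.)
    pub fn new(value: T) -> Self {
        let mut hasher = DefaultHasher::new();
        value.hash(&mut hasher);
        let hash = hasher.finish();
        Interned { value: Arc::new(value), hash }
    }
}

impl<T> Hash for Interned<T> {
    /// Feed only the stored 64-bit hash to the hasher. The cost is constant
    /// regardless of how large or deeply nested the value is, and the result
    /// depends on the value, not on the address of the allocation.
    fn hash<H: Hasher>(&self, state: &mut H) {
        state.write_u64(self.hash);
    }
}

impl<T: PartialEq> PartialEq for Interned<T> {
    /// Assumption for this sketch: different hashes mean different values;
    /// equal hashes are confirmed by comparing the values themselves.
    fn eq(&self, other: &Self) -> bool {
        self.hash == other.hash && *self.value == *other.value
    }
}

impl<T: Eq> Eq for Interned<T> {}

/// Helper used only to demonstrate the behavior of the `Hash` impl.
pub fn hash_of<T: Hash>(v: &T) -> u64 {
    let mut hasher = DefaultHasher::new();
    v.hash(&mut hasher);
    hasher.finish()
}
```

With this shape, interning a value such as `(1u32, Interned<String>)` hashes only eight bytes for the inner interned field instead of re-hashing the whole string, which is the nested case the description identifies as the expensive one.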


mihaibudiu · 2021-08-18T18:45:45Z

Does this restore the OVN perf to its prior status?

blp · 2021-08-18T19:46:04Z

> Does this restore the OVN perf to its prior status?

Yes, my measurements show that:

                       GB    elapsed time    CPU s
0.38                 16.8        5:13        277.5
0.42.1               16.5        5:11        271.2
0.42.1 + interning   12.9        4:44        248.0
0.43                 16.5       10:05        564.2
0.45.1               16.6        9:52        550.6
0.45.1 + interning   12.8       11:47        665.1
0.45.1 + 64b intern  12.9       11:21        640.3
0.45.1 + PR#1053     12.8        5:08        267.4

ryzhyk requested a review from mihaibudiu August 18, 2021 18:28

mihaibudiu approved these changes Aug 18, 2021


ryzhyk merged commit 2ce3ba7 into vmware:master Aug 18, 2021

ryzhyk deleted the 64bit_hash branch August 18, 2021 19:46
