Performance improvement #4

israellot · 2019-06-27T08:22:05Z

This small change reverses the order of the loop over the origin. As a result, each landmark becomes the first occurrence of its hash, and collisions are visited in forward (left-to-right) order.
This greatly improves performance, especially for larger inputs with small changes.
In the original code, when matching a huge section, the lookup would find the landmark, which would be further to the right, and then cycle through the collisions from left to right, getting a bigger match on every pass. With this modification, the code can usually find the biggest match on the first pass and skip checking the other collisions if they fall inside the match already found.
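To illustrate the effect of the loop order described above, here is a minimal Python sketch of a chained hash index. It is not the library's actual code: the names `build_index`, `chain`, and `collide`, and the toy hash values, are assumptions made for illustration only. It shows how the insertion order decides which occurrence ends up as the landmark and in which direction the collisions are walked.

```python
def build_index(block_hashes, reverse_order):
    """Build a chained hash index over a list of per-block hash values.

    landmark[h] is the head of the chain for hash h (hypothetical layout);
    collide[i] is the block visited after block i, or -1 at the end.
    """
    n = len(block_hashes)
    landmark = {}
    collide = [-1] * n
    # Reverse insertion leaves the leftmost block at the head of each chain.
    order = range(n - 1, -1, -1) if reverse_order else range(n)
    for i in order:
        h = block_hashes[i]
        collide[i] = landmark.get(h, -1)  # link to the previous chain head
        landmark[h] = i                   # this block becomes the new head
    return landmark, collide


def chain(landmark, collide, h):
    """Return the block indices visited for hash h, in lookup order."""
    visited = []
    i = landmark.get(h, -1)
    while i >= 0:
        visited.append(i)
        i = collide[i]
    return visited


# Toy input: blocks 0, 2 and 3 share one hash; blocks 1 and 4 share another.
hashes = [7, 3, 7, 7, 3]

forward_index = build_index(hashes, reverse_order=False)
reverse_index = build_index(hashes, reverse_order=True)

forward_chain = chain(*forward_index, 7)  # head is the rightmost occurrence
reverse_chain = chain(*reverse_index, 7)  # head is the leftmost occurrence
```

With the reversed loop the chain starts at the first occurrence and proceeds left to right, which is the ordering the change relies on to reach the longest match early.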

endel · 2019-06-27T20:32:33Z

Hi @israellot, thanks a lot for this pull request! Sounds like you really dove deep into this!

I haven't been using this library lately, though. Can you confirm that the tests are still passing? Cheers!

israellot · 2019-06-27T21:39:50Z

Yes, the tests are still fine.
Thanks for the work you did years ago; it has been useful to me on a data sync project.

israellot added 2 commits June 27, 2019 10:13

performance tweak

264ea17

typo

a00eba4

endel merged commit c52680c into endel:master Aug 22, 2019

endel mentioned this pull request Feb 17, 2020

Test on data folder 2 not passing #7

Closed
