A filter is presented that reduces big ‘common substring problems' to a more manageable size. The approach uses ‘shingling’ and ‘fingerprinting’: In a first step the fingerprints of the reference string are scattered over a large hash map. Gathered in a second step, these hash values are matched against the values obtained from the test string.
-
Updated
Apr 30, 2023 - C++