Skip to content
This repository

HTTPS clone URL

Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP
branch: experimental

Feb 16, 2012

  1. John Whitney

    Delete file which was accidentally added.

    authored

Feb 15, 2012

  1. John Whitney

    Fix MSVC++ compile errors.

    authored

Feb 07, 2012

  1. John Whitney

    Make sure we actually process the match we found.

    authored
  2. John Whitney

    Use BDELTA_GLOBAL as a flag, instead of BDELTA_LOCAL. Also, fix BDelt…

    …a's Python wrapper for flag handling.
    authored
  3. John Whitney

    Add ability to require that the hole sides be ordered or the hole sid…

    …e be under a specified maximum.
    authored
  4. John Whitney

    Use flags instead of boolean parameters.

    authored
  5. John Whitney

    Use Cython to create a new Python wrapper which is much more function…

    …al than the old one.
    authored
  6. John Whitney

    Make BDelta usable again as a C library.

    authored
  7. John Whitney

    Checkpoint 5

    authored
  8. John Whitney

    Remove "THOROUGH" flag altogether, because it doesn't work anymore.

    authored
  9. John Whitney

    Small optimization.

    authored
  10. John Whitney

    Boost performance by quickly figuring out an appropriate hash table s…

    …ize.
    authored
  11. John Whitney

    bdelta_pass_2() should return void.

    authored
  12. John Whitney

    Clean up "verbose" output messages.

    authored
  13. John Whitney

    Remove useless "lastChecksum" check.

    authored
  14. John Whitney

    More Cleanups.

    authored
  15. John Whitney

    Fixes

    authored
  16. John Whitney

    Cleanups and important bug fix (prevent file read error.)

    authored
  17. John Whitney

    Remove old stats-collecting code.

    authored
  18. John Whitney

    Use "BDelta_Instance *" types instead of passing around "void *".

    authored
  19. John Whitney

    Add ability to specify "removeOverlap" when cleaning matches.

    authored
  20. John Whitney

    Unify hole-finding algorithm and add extra features to the lib.

    authored
  21. John Whitney

    Don't remove overlap prematurely.

    authored
  22. John Whitney

    Checkpoint 4

    authored
  23. John Whitney

    Better picking when more than one checksum matches.

    authored
  24. John Whitney

    Checkpoint 3

    authored
  25. John Whitney

    Add ability to switch inputs.

    authored
  26. John Whitney

    Fix hash table issues.

    Most importantly, make sure identical checksums are always adjacent
    when the list is sorted, since our "removal of repeated checksums" code
    depends on it.
    authored
  27. John Whitney

    Checkpoint 2

    authored
  28. John Whitney

    Checkpoint 1

    authored
  29. John Whitney

    New experimental ideas...

    This commit isn't meant to be used publicly.
    authored
  30. John Whitney

    Clean "THOROUGH" mode code.

    authored
  31. John Whitney

    Use STL lists in place of our custom double-linked list implementation.

    authored
  32. John Whitney

    Use STL list to store "potential" checksum matches.

    Besides the advantage of using the standard, this change actually
    reduces the code size on my machine - both source and compiled.
    authored
  33. John Whitney

    Select matches more carefully.

    Only recognize matches that are at least "blocksize * 2" bytes.
    Sometimes the "next pass" will find a better-suited match.
    
    Sample data sets:
    1: "*-12(Benefits for Organizations*-12(Benefits for Countries..."
    2: "-12(Benefits for Organizations-12(Benefits for Countries..."
    
    Without this patch, and with a blocksize of 16, the following blocks
    are hashed:
    "*-12(Benefits fo", "r Organizations*", "-12(Benefits for"
    This causes the front of dataset 2 to be linked against the third
    hashed block of dataset 1. (BAD!)
    
    With this patch, the hashes match, but the match won't be recognized
    because it's only 17 bytes (needs to be 32).
    The next pass hashes the following blocks:
    "*-12(Ben", "efits fo", "r Organi", "zations*", "-12(Bene", "fits for"
    Now there are two reliable hashes to latch onto.
    authored
Something went wrong with that request. Please try again.