Insertion of a key, A, that is a complete subset of another key, B, prevents A from being found #12

jamesmistry · 2015-03-03T06:35:09Z

Consider 2 key-value pairs:

"tester" => 1
"test" => 2

Inserting both of these in the radix tree, in either order, results in a tree that fails to match on lookups using key "test".

Leaf nodes are always nodes representing the right-most bytes of the key, and a leaf node is regarded as synonymous with a match. However, the node representing the superset key is not split to allow a node representing the end of the subset key (or a "string end" leaf node if the leaf == match semantics are to be preserved).

jamesmistry · 2015-03-03T06:46:33Z

Just read a comment you made in another thread about full prefixes not being supported, and using terminating characters instead. I'll close as this is obviously intended behaviour.

Summary: a facet of the libart implementation that is undocumented except for a series of issues (linked below) is this: an inserted key must not be a full prefix of another key that is already stored in the tree. The recommendation is to store keys that have a terminator character. armon/libart#4 armon/libart#12 armon/libart#14 armon/libart#17 Our usage in watchman can't guarantee to provide a NUL-terminated input, so this diff adjusts the key comparison routines to generate an implicit or synthetic NUL terminator character when comparing one character beyond the end of a key, and by asserting (or allowing ASAN to complain) if we try to look more than 1 character beyond the end. I also tidied up the function signatures of a couple of matching functions so that the semantics are clearer (return boolean true for success rather than integer 0) and removed some redundant casts when invoking callbacks. Test Plan: added an explicit test for the full-prefix issue. Built with -fsanitize=address and ran the integration tests as well as the sparse/unsparse checks I've been using for the pending list changes.

jamesmistry closed this as completed Mar 3, 2015

nmav mentioned this issue Oct 21, 2015

Adding a key which is a full prefix of another leads to valgrind warning #17

Open

cvermilion mentioned this issue Feb 21, 2017

Unicode strings with zero-byte values inside the key cannot be inserted kellydunn/go-art#6

Open

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Insertion of a key, A, that is a complete subset of another key, B, prevents A from being found #12

Insertion of a key, A, that is a complete subset of another key, B, prevents A from being found #12

jamesmistry commented Mar 3, 2015

jamesmistry commented Mar 3, 2015

Insertion of a key, A, that is a complete subset of another key, B, prevents A from being found #12

Insertion of a key, A, that is a complete subset of another key, B, prevents A from being found #12

Comments

jamesmistry commented Mar 3, 2015

jamesmistry commented Mar 3, 2015