StringComparators: No need to convert to UTF-8 for lexicographic comparison. #11171

gianm · 2021-04-27T19:28:19Z

The lexicographic orderings of UTF-8 byte sequences and of in-memory
UTF-16 strings are equivalent. So, we can skip the (expensive) conversion
and get an equivalent result. Thank you, Unicode!
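
To make the equivalence concrete, here is a minimal, self-contained sketch. It is not the code from this PR: the class and method names are hypothetical and chosen only for illustration. It compares two strings by their unsigned UTF-8 bytes (the conversion-based approach) and by Unicode code point directly on the in-memory strings. For well-formed strings the two always agree in sign. One caveat worth keeping in mind: Java's `String.compareTo` compares UTF-16 code units rather than code points, so it matches the UTF-8 byte order only as long as no supplementary character (encoded as a surrogate pair) is compared against a character in the range U+E000 to U+FFFF.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical illustration class; not part of the Druid codebase.
public class LexicographicOrderSketch {

    // Conversion-based approach: encode both strings as UTF-8 and compare
    // the bytes as unsigned values, lexicographically.
    public static int compareUtf8Bytes(String a, String b) {
        byte[] x = a.getBytes(StandardCharsets.UTF_8);
        byte[] y = b.getBytes(StandardCharsets.UTF_8);
        int n = Math.min(x.length, y.length);
        for (int i = 0; i < n; i++) {
            int c = Integer.compare(x[i] & 0xFF, y[i] & 0xFF);
            if (c != 0) {
                return c;
            }
        }
        return Integer.compare(x.length, y.length);
    }

    // Conversion-free approach: walk the in-memory UTF-16 strings and compare
    // code point by code point. Assumes well-formed input (no unpaired surrogates).
    public static int compareCodePoints(String a, String b) {
        int i = 0;
        int j = 0;
        while (i < a.length() && j < b.length()) {
            int ca = a.codePointAt(i);
            int cb = b.codePointAt(j);
            if (ca != cb) {
                return Integer.compare(ca, cb);
            }
            i += Character.charCount(ca);
            j += Character.charCount(cb);
        }
        // One string is a prefix of the other (or they are equal).
        return Integer.compare(a.length() - i, b.length() - j);
    }

    public static void main(String[] args) {
        String bmpHigh = "\uFF5E";                                 // U+FF5E, inside the BMP
        String supplementary = new String(Character.toChars(0x1F600)); // U+1F600, a surrogate pair in UTF-16

        String[][] pairs = {
            {"apple", "banana"},
            {"\u00E9", "z"},
            {bmpHigh, supplementary},
        };
        for (String[] p : pairs) {
            System.out.println(
                "utf8=" + Integer.signum(compareUtf8Bytes(p[0], p[1]))
                + " codePoint=" + Integer.signum(compareCodePoints(p[0], p[1]))
                + " compareTo=" + Integer.signum(p[0].compareTo(p[1])));
        }
    }
}
```

In the last pair, the UTF-8 and code point comparisons agree, while `compareTo` reports the opposite sign because the high surrogate 0xD83D sorts below 0xFF5E as a code unit. Whether that edge case matters depends on the data being compared.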


clintropolis

Heh, this has been like this since topN was added 😄 50a6839#diff-10d947f7e5fa97a2ece2629950472d210574627f9f728f2e934b8a8b001fa4bbR46

StringComparators: No need to convert to UTF-8 for lexicographic comp…

b263be4


gianm added the Performance and Area - Querying labels Apr 27, 2021

clintropolis approved these changes Apr 30, 2021


gianm merged commit 6d82c3c into apache:master Apr 30, 2021

gianm deleted the query-stringcomparators-skip-conversion branch April 30, 2021 17:54

clintropolis added this to the 0.22.0 milestone Aug 12, 2021

clintropolis mentioned this pull request Sep 3, 2021

[Draft] 0.22.0 Release Notes #11657

Closed

navis mentioned this pull request Sep 5, 2021

No need to convert to UTF-8 for lexicographic comparison metatron-app/metatron-discovery#3882

Open
