Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consider adding a Unicode normalization storage transformer #201

Open
jstriebel opened this issue Feb 2, 2023 · 0 comments
Open

Consider adding a Unicode normalization storage transformer #201

jstriebel opened this issue Feb 2, 2023 · 0 comments

Comments

@jstriebel
Copy link
Member

jstriebel commented Feb 2, 2023

Since unicode characters are allowed for node names unicode normalization is a useful feature, which is also recommended for users. It would be great to add a group storage transformer which normalizes all node-names of sub-groups and sub-arrays, so that only normalized names are stored and they can be retrieved by using any string that normalizes to the same form.

As recommended by the Unicode Standard Annex # 31 we also recommend to use the normalization form KC (NFKC) in the spec, which should also be used by this transformer (at least by default, there may be reasons to allow customizing this). NFC might also be used to fit filsystem storage normalization, see #56 (comment).

The main discussion around this topic so far happened in issue #56.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant