Releases · Goutte/godot-addon-unicode-normalizer

UnicodeNormalizer

This singleton helps normalize your Unicode strings by:

removing diacritics (decomposing, then keeping only the first character): "é" → "e"
substituting fallback characters: "Æ" → "AE"
looking up replacements with binary search, which keeps it fast

NormalizationMapping

This Resource is our database of replacements, used by the UnicodeNormalizer.
It is built from the official unicode.org data.

It is only about 16 KiB, and is derived from 1.9 MiB of raw data.
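
To illustrate why a binary-search lookup over a compact table stays fast, here is a minimal sketch of the general idea. It is not the addon's actual NormalizationMapping: the class layout, the `codes` and `replacements` arrays, and the `find_replacement` function are all hypothetical names chosen for the example.

```gdscript
# Hypothetical sketch, NOT the addon's real NormalizationMapping resource.
extends RefCounted

# Code points that have a replacement, sorted in ascending order.
var codes := PackedInt32Array([0x00C6, 0x00E9])  # "Æ", "é"
# Replacement for the code point stored at the same index in `codes`.
var replacements := PackedStringArray(["AE", "e"])

func find_replacement(character: String) -> String:
	var code := character.unicode_at(0)
	# bsearch() returns the insertion index, so we must check for a real match.
	var index := codes.bsearch(code)
	if index < codes.size() and codes[index] == code:
		return replacements[index]
	# No entry for this code point: keep the character unchanged.
	return character
```

With two parallel arrays sorted by code point, each lookup costs a logarithmic number of comparisons, and the data stays small because only characters that need a replacement are stored.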

Basic Usage

You can use the normalize method on the autoload singleton UnicodeNormalizer:

UnicodeNormalizer.normalize("Dès Noël, où un zéphyr haï me vêt")
# "Des Noel, ou un zephyr hai me vet"
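
One common use for this is comparing user input without caring about accents or case. The helper below is a small sketch of that idea; `matches_ignoring_accents` is a hypothetical function name, and only `UnicodeNormalizer.normalize` comes from the addon.

```gdscript
# Hypothetical helper: compares two strings, ignoring diacritics and case.
# Assumes the UnicodeNormalizer autoload singleton is enabled in the project.
func matches_ignoring_accents(left: String, right: String) -> bool:
	var normalized_left := UnicodeNormalizer.normalize(left).to_lower()
	var normalized_right := UnicodeNormalizer.normalize(right).to_lower()
	return normalized_left == normalized_right
```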

Advanced Usage

The UnicodeNormalizer is made to be extended, so you can tailor it to your font's capabilities and your needs.

Here, the font supports some French diacritics, but only uppercase characters:

# file "MyFontNormalizer.gd"
extends UnicodeNormalizerClass

var characters_in_my_font := "ABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789ÉÈÊËÀÂÄÔÖÙÛÜÇ"

func should_skip_character(character: String, _character_code: int) -> bool:
	return self.characters_in_my_font.contains(character)  # inefficient

func normalize(some_string: String) -> String:
	return super.normalize(some_string.to_upper())

This is a naive/inefficient implementation to keep the example short and simple.
A more performant implementation would use binary search on a sorted array.
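
As a rough sketch of that faster variant, the version below builds a sorted PackedInt32Array of supported code points once and then queries it with `bsearch()`. It assumes that `character_code` is the Unicode code point of `character`; the `_supported_codes` variable and `_build_supported_codes` function are hypothetical names introduced for this example.

```gdscript
# file "MyFontNormalizer.gd" (hypothetical variant of the example above)
extends UnicodeNormalizerClass

const CHARACTERS_IN_MY_FONT := "ABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789ÉÈÊËÀÂÄÔÖÙÛÜÇ"

# Sorted code points of the characters the font can render, built lazily.
var _supported_codes := PackedInt32Array()

func _build_supported_codes() -> void:
	for i in CHARACTERS_IN_MY_FONT.length():
		_supported_codes.append(CHARACTERS_IN_MY_FONT.unicode_at(i))
	# bsearch() only works on a sorted array.
	_supported_codes.sort()

# Assumption: character_code is the Unicode code point of `character`.
func should_skip_character(_character: String, character_code: int) -> bool:
	if _supported_codes.is_empty():
		_build_supported_codes()
	# bsearch() returns the insertion index, so we check for an exact match.
	var index := _supported_codes.bsearch(character_code)
	return index < _supported_codes.size() and _supported_codes[index] == character_code

func normalize(some_string: String) -> String:
	return super.normalize(some_string.to_upper())
```

The table is built on first use rather than in a constructor, so the sketch does not depend on how the base class initializes itself.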
