Byte-size string handler (for Python)

Handle Python strings according to their size in bytes

Encoded strings may need to fit a specific size in bytes to be accepted by some libraries or APIs. Let's take UTF-8 chars, for instance, they may need from 1 to 4 bytes (please refer to Wikipedia for details):

the first 128 characters (US-ASCII) need one byte;
the next 1,920 characters need two bytes to encode, which covers the remainder of almost all Latin-script alphabets, and also Greek, Cyrillic, Coptic, Armenian, Hebrew, Arabic, Syriac, Thaana and N'Ko alphabets, as well as Combining Diacritical Marks;
three bytes are needed for characters in the rest of the Basic Multilingual Plane, which contains virtually all characters in common use, including most Chinese, Japanese and Korean characters;
four bytes are needed for characters in the other planes of Unicode, which include less common CJK characters, various historic scripts, mathematical symbols, and emoji (pictographic symbols).

The `byte_size_string_handler` module

truncate_utf8(): given a string and maximum size, the function checks string's UTF-8 byte-size and truncates if needed. Implementation is based on StackOverflow question and answers.

How to contribute

Please make sure to take a moment and read the Code of Conduct.

Report issues

Please report bugs and suggest features via the GitHub Issues.

Before opening an issue, search the tracker for possible duplicates. If you find a duplicate, please add a comment saying that you encountered the problem as well.

Contribute code

Please make sure to read the Contributing Guide before making a pull request.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.circleci		.circleci
.github		.github
src		src
tests		tests
.coveragerc		.coveragerc
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Byte-size string handler (for Python)

Handle Python strings according to their size in bytes

The `byte_size_string_handler` module

How to contribute

Report issues

Contribute code

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

ricardolsmendes/byte-size-string-handler

Folders and files

Latest commit

History

Repository files navigation

Byte-size string handler (for Python)

Handle Python strings according to their size in bytes

The byte_size_string_handler module

How to contribute

Report issues

Contribute code

About

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

The `byte_size_string_handler` module

Packages