Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for UTF-8 emoji #46

Closed
wants to merge 1 commit into from
Closed

Add support for UTF-8 emoji #46

wants to merge 1 commit into from

Conversation

dfoody
Copy link
Contributor

@dfoody dfoody commented Mar 17, 2018

The original UTF-8 state tables where built before unicode emojis were introduced.
This change updates the state table to handle the unicode emojis for UTF-8.

For example, to recognize the UTF-8 spouting whale emoji:
jschardet.detect(new Buffer("f09f90b3", "hex"));

For example, to recognize the UTF-8 spouting whale emoji:
jschardet.detect(new Buffer("f09f90b3", "hex"));
@aadsm
Copy link
Owner

aadsm commented Jan 22, 2019

@dfoody sorry it took ages but I finally merged this and it will be available in v2.1.0.

@aadsm aadsm closed this Jan 22, 2019
yinyue200 added a commit to yinyue200/UTF-unknown that referenced this pull request Feb 19, 2021
304NotModified pushed a commit to CharsetDetector/UTF-unknown that referenced this pull request Mar 31, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants