Pile-of-poo test for `text-to-unicode` #1081

lionel-rowe · 2024-05-14T06:39:49Z

Describe the bug

The text-to-unicode tool fails the Pile of Poo Test:

> Whenever you’re working on a piece of JavaScript code that deals with strings or regular expressions in some way, just add a unit test that contains a pile of poo (💩) in a string, and see if anything breaks.

In other words, it fails to correctly handle any non-BMP code point (any code point above U+FFFF), which a JavaScript string stores as a pair of UTF-16 surrogate code units.
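To illustrate the difference, here is a minimal sketch. I haven't checked the tool's source, so `toEntitiesByCodeUnit` is only a guess at the kind of per-code-unit loop that would produce this output; both function names are hypothetical.

```javascript
// Hypothetical: converts per UTF-16 code unit (an assumption about the bug,
// not taken from the it-tools source).
function toEntitiesByCodeUnit(text) {
  let out = '';
  for (let i = 0; i < text.length; i++) {
    out += `&#${text.charCodeAt(i)};`;
  }
  return out;
}

// Hypothetical: converts per code point. Array.from iterates a string by
// code point, so a surrogate pair arrives as a single element.
function toEntitiesByCodePoint(text) {
  return Array.from(text, (ch) => `&#${ch.codePointAt(0)};`).join('');
}

const poo = '\u{1F4A9}'; // 💩

console.log(poo.length);                 // 2 (two UTF-16 code units)
console.log(toEntitiesByCodeUnit(poo));  // &#55357;&#56489;
console.log(toEntitiesByCodePoint(poo)); // &#128169;
```

BMP-only input gives the same result from both functions, which is probably why this goes unnoticed without a test like the one above.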

Also, decimal-encoded HTML/XML entities seem like an odd default choice to represent "Unicode". I'd expect the default to be `\u{...}` or maybe `U+...` notation, with `...` being hex digits. But offering HTML/XML entities as an alternative could be useful too.
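For concreteness, the notations mentioned above could look roughly like this. It's only a sketch: the `formatters` keys, the `convert` function, and the choice to space-separate `U+...` values are all my own assumptions, not anything from the it-tools code base.

```javascript
// Hypothetical notation names; each formatter takes a numeric code point.
const formatters = {
  escape: (cp) => `\\u{${cp.toString(16)}}`,
  uPlus: (cp) => `U+${cp.toString(16).toUpperCase().padStart(4, '0')}`,
  hexEntity: (cp) => `&#x${cp.toString(16).toUpperCase()};`,
  decEntity: (cp) => `&#${cp};`,
};

// Assumption: U+ notation is space-separated, the others are concatenated.
const separators = { escape: '', uPlus: ' ', hexEntity: '', decEntity: '' };

function convert(text, notation) {
  const format = formatters[notation];
  if (!format) throw new RangeError(`unknown notation: ${notation}`);
  // Array.from iterates by code point, so non-BMP characters stay whole.
  return Array.from(text, (ch) => format(ch.codePointAt(0))).join(separators[notation]);
}

const sample = 'a\u{1F4A9}'; // "a💩"

console.log(convert(sample, 'escape'));    // \u{61}\u{1f4a9}
console.log(convert(sample, 'uPlus'));     // U+0061 U+1F4A9
console.log(convert(sample, 'hexEntity')); // &#x61;&#x1F4A9;
console.log(convert(sample, 'decEntity')); // &#97;&#128169;
```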

What happened?

Result for 💩 on https://it-tools.tech/text-to-unicode:

`document.write('&#55357;&#56489;')` renders as ��, not 💩.
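The two numbers are the high and low UTF-16 surrogates of U+1F4A9. As far as I understand the HTML parsing rules, a numeric character reference to a surrogate is a parse error and is replaced with U+FFFD, which would explain the two replacement characters. A quick sketch to check the arithmetic (`combineSurrogates` is a hypothetical helper name):

```javascript
// Combines a UTF-16 surrogate pair into the code point it encodes.
function combineSurrogates(high, low) {
  const isHigh = high >= 0xd800 && high <= 0xdbff;
  const isLow = low >= 0xdc00 && low <= 0xdfff;
  if (!isHigh || !isLow) {
    throw new RangeError('not a valid surrogate pair');
  }
  return 0x10000 + ((high - 0xd800) << 10) + (low - 0xdc00);
}

const codePoint = combineSurrogates(55357, 56489);

console.log(codePoint);                       // 128169
console.log(codePoint.toString(16));          // 1f4a9
console.log(String.fromCodePoint(codePoint)); // 💩
```

So the single entity `&#128169;` (or `&#x1F4A9;`) is what would need to be emitted for the round trip through HTML to work.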

System information

Win 11, Chrome Version 124.0.6367.158 (Official Build) (64-bit)

Where did you encounter the bug?

Public app (it-tools.tech)

lionel-rowe added the bug and triage labels May 14, 2024

lionel-rowe assigned CorentinTh May 14, 2024

lionel-rowe linked a pull request May 14, 2024 that will close this issue

fix(text-to-unicode): handle non-BMP + more conversion options #1087

Open

sunnydanu mentioned this issue Oct 26, 2024

fix(text-to-unicode):-handle-non-bmp-+-more-conversion-options sunnydanu/godev.run#136

Open
