-
Notifications
You must be signed in to change notification settings - Fork 123
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unicode characters can sometimes not be read in the form they are emitted #1802
Comments
It seems, all these characters make sense within a quoted context. I mean, 1/4 has always been a regular printable character in Latin1. Hexadecimal sequences should be reserved for the truly ambiguous cases, like the nbsp. |
I now found out that applying #1799 reduces the list to 17 remaining cases, I have updated the list in the original post! |
triska
added a commit
to triska/scryer-prolog
that referenced
this issue
May 14, 2023
This addresses mthom#1802.
Resolved via #1805. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Let's ask Scryer Prolog whether there are any other cases like #1768, where the system itself emits a term that cannot be read back with
read/1
:With #1799 and #1800 already applied, I get the following 17 remaining cases:
For instance, taking the first example:
The overall impression I get from this is that the situation is very hopeful: If these remaining cases cannot be meaningfully checked for with one of the available character categorizations, then we can simply add these rather few cases as special cases that need to be emitted as hexadecimal escape sequences.
Low priority. For syntactic ISO conformance, #1771, #1773, #1778 etc. are far more important.
The text was updated successfully, but these errors were encountered: