Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Symbols need to support UTF-8 names #2172

Closed
cheald opened this issue Nov 11, 2014 · 2 comments
Closed

Symbols need to support UTF-8 names #2172

cheald opened this issue Nov 11, 2014 · 2 comments

Comments

@cheald
Copy link
Contributor

@cheald cheald commented Nov 11, 2014

MRI currently supports UTF-8 symbols:

x = :"\u{3042}" # => :あ

However, JRuby always treats symbols as US-ASCII, resulting in things like encoding errors when trying to inspect one:

> x = :"\u{3042}"
ArgumentError: invalid byte sequence in US-ASCII
from org/jruby/RubySymbol.java:221:in `inspect'

This is a source of a number of errors in the new stdlib test suite.

@enebo
Copy link
Member

@enebo enebo commented Nov 11, 2014

For some reason I think we have an issue for this but I don't see it offhand :)

@enebo enebo added the parser label Nov 11, 2014
@enebo enebo added this to the JRuby 1.7.18 milestone Nov 11, 2014
enebo added a commit that referenced this issue Dec 10, 2014
headius added a commit that referenced this issue Dec 10, 2014
@enebo
Copy link
Member

@enebo enebo commented Dec 10, 2014

This is fixed on 1.7 now but we also need equivalent changes on master. We might try and more elegantly do this on master. Leaving open for now.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Linked pull requests

Successfully merging a pull request may close this issue.

None yet
2 participants