-
-
Notifications
You must be signed in to change notification settings - Fork 283
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Force answer content to be always UTF-8 encoded #11
Comments
You might want to check this: http://github.com/axic/whois/commit/955d5157c3b92679e62cca57d469713dedcbe5d1 It implements this feature. |
I checked the commit, but it doesn't really close this issue. Instead, it only provides a limited solution for two specific TLDs. Also, this feature definitely need an extensive test suite. |
If you want to work on this feature, I suggest you to move to a dedicated branch. Thanks for your contribution. |
I've stopped getting encoding errors in my app after I passed whois answer through |
Very interesting method. Unfortunately, it's not that simple. There's a very wide range of possible encodings (so far, I counted more than 10) and there are cases where a multipart whois record is returned with several different encodings. I would love to find a solution that doesn't rely on third party Gems. Probably, it will be compatible with Ruby 1.9 only because Ruby 1.8 doesn't have encoding support. |
Create whois.fastdomain.com parser
Hi @weppos. I was hoping to revive this thread and ask what strategy you think reasonable for properly handling of the various character encodings received from Whois servers around the Internet? In particular I was wondering about providing a list of hints for character encodings returned by well-known Whois servers (i.e. those in lib/definitions), or if you had other thoughts? |
I tried this approach in the past. Unfortunately, it's not very effective. Maintaining that list is such a pain in the ***, because it can be very long and changes might not be immediately applied. Also, a few registries are able to return a response in more than one single encoding (this is insane, I know). The only solution is to guess the encoding at runtime. |
Closing as this is a very old topic and there is no viable solution the client can implement at the time. |
Internally WHOIS should always prefer UTF-8 encoding regardless server encoding.
The text was updated successfully, but these errors were encountered: