Rdatatable / data.table Public
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG REPORT] Query fails if the key contains Chinese Character on Windows7 #2462
Comments
This bug is due to this two lines ( It uses the I will try to file a PR later. Hopefully, it can help to settle this issue down (finally...). UPDATE The original understanding is not accurate, so I modified the comment. |
DESCRIPTION
Thanks for this great package. It helps us a lot. Currently, I'm working on a windows platform. If my data.table's key contains Chinese characters, the query will return wrong answer sometimes. Below is my minimal example. I can only get the right answer when the Chinese character key column gets converted to UTF-8 encoding.
@renkun-ken It would be great if you can confirm this bug for me because you are one of the data.table users who probably has the similar working environment - Windows + Chinese characters. I believe this bug actually is introduced by 03cd45f and nobody else reports this for almost two years, which is quite odd to me.
Minimal reproducible example
Dataset
Will fail (returns NA) if the encoding is native
Will succeed only if the encoding is converted to utf8
Now it returns the correct answer
1
.Note the dt's order now also becomes different, which is not supposed to happen.
sessionInfo
UPDATE Using the devel version to run the example.
The text was updated successfully, but these errors were encountered: