Clarify the behavior of Cyrillic characters #16

Open
andkov opened this issue Aug 19, 2022 · 1 comment
Labels: help wanted


andkov commented Aug 19, 2022

When performing string manipulations on Ukrainian (Cyrillic) characters, the code sometimes fails to recognize the matched characters and/or alters them:

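library(dplyr)    # provides %>% and mutate()
library(stringr)  # provides str_remove()

Sys.setlocale("LC_CTYPE", "russian")
Sys.setlocale("LC_CTYPE", "ukr")
d <- tibble::tribble(
  ~a, ~b,
  "громада", "область"
)
# Remove the first occurrence of "гр" from column a
d %>% mutate(a = str_remove(a, "гр"))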

[screenshot]

Please investigate this behavior and report possible solutions.
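As a possible starting point for that investigation, here is a minimal sketch that inspects how the session sees the string. It assumes the problem is related to the session locale or to how the string is encoded, which this thread has not confirmed. The variable names `x` and `pattern` are illustrative, and the strings are built from Unicode escapes so the result does not depend on the encoding of the script file.

```r
# Illustrative names; strings are written as Unicode escapes on purpose.
x       <- "\u0433\u0440\u043e\u043c\u0430\u0434\u0430"  # "громада"
pattern <- "\u0433\u0440"                                  # "гр"

# What locale is the session using, and does it report UTF-8 support?
print(Sys.getlocale("LC_CTYPE"))
print(l10n_info())

# How is the string declared and stored?
print(Encoding(x))               # declared encoding mark of the string
print(validUTF8(x))              # whether the bytes form valid UTF-8
print(nchar(x, type = "chars"))  # number of characters
print(nchar(x, type = "bytes"))  # number of bytes
print(utf8ToInt(pattern))        # code points of the pattern
```

If the character and byte counts of a string typed directly into the console differ from those of the escaped version, the literal is probably being re-encoded on input, which would point at the locale rather than at `str_remove()`.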

andkov added the help wanted label Aug 19, 2022

Tytser commented Aug 27, 2022

Have you tried performing this operation without setting the locale to ukr/rus? I get the correct result, even though both Sys.setlocale commands produce errors.
image
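The check without any `Sys.setlocale()` call could be sketched as follows. This is only an illustration under assumptions: the names `x`, `pattern`, and `expected` are hypothetical, the strings are written as Unicode escapes so that they are UTF-8 regardless of the session locale, and the stringr comparison runs only if that package is installed.

```r
# No Sys.setlocale() calls here: the session locale is left untouched.
x        <- "\u0433\u0440\u043e\u043c\u0430\u0434\u0430"  # "громада"
pattern  <- "\u0433\u0440"                                  # "гр"
expected <- "\u043e\u043c\u0430\u0434\u0430"              # "омада"

# Base R: remove the first occurrence, treating the pattern as a literal
base_result <- sub(pattern, "", x, fixed = TRUE)
print(identical(base_result, expected))

# stringr, as in the original report, if the package is available
if (requireNamespace("stringr", quietly = TRUE)) {
  stringr_result <- stringr::str_remove(x, pattern)
  print(identical(stringr_result, expected))
}
```

If both comparisons succeed on a machine where the original snippet fails, that would suggest the locale change, not the string functions, is what alters the characters.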
