Join GitHub today
GitHub is home to over 50 million developers working together to host and review code, manage projects, and build software together.Sign up
bytes, strings: Title does not treat Unicode punctuation as separators #34994
What version of Go are you using (
It looks like Unicode standard defines what is a word boundary (http://unicode.org/reports/tr29/#Word_Boundaries), but it's not something that could be incorporated into `strings.Title` (guessing from the comments in the linked issue). Title already treats ASCII punctuation as word boundaries (except underscore), but it doesn’t do that regarding Unicode punctuation. It looks to me as if it was supposed to be changed to also support latter, but for some reason no one ever did that. If the reason is that the output of Title mustn’t be changed now, should the documentation describing these parts of the behavior still be a BUG notice?…
Wiadomość napisana przez Ian Lance Taylor ***@***.***> w dniu 19.10.2019, o godz. 22:08: It's not obvious to me that we can change this now. If we do change it, does Unicode define the set of characters that break words? Is that locale dependent? CC @mpvl — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or unsubscribe.