Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some unicode hashtags federate badly #25451

Open
timorl opened this issue Jun 15, 2023 · 0 comments
Open

Some unicode hashtags federate badly #25451

timorl opened this issue Jun 15, 2023 · 0 comments
Labels
bug Something isn't working

Comments

@timorl
Copy link

timorl commented Jun 15, 2023

Steps to reproduce the problem

  1. Toot on a Mastodon instance with a unicode hashtag like #ć.
  2. Get the post at an Akkoma instance.
  3. Look at the hashtag #ć on that instance.

Expected behaviour

The post appears there

Actual behaviour

The post is not there, but it is under #c

Detailed description

Hashtags get "normalized" to ASCII(?) in some instances. Then during federation only the "normalized" variant gets sent, instead of the actual hashtag. See https://akkoma.dev/AkkomaGang/akkoma/issues/568 for some more technical details.

It would be nice to not perform this "normalization" at all, since 'ć' and 'c' are barely connected distinct letters, but failing that at least federating the actual hashtag would be preferable.

Specifications

I was on the Akkoma end of this interaction, but there were a couple Mastodon instances involved, and all had the problem.

@timorl timorl added the bug Something isn't working label Jun 15, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant