Skip to content
This repository has been archived by the owner. It is now read-only.

Add "·" MIDDLEDOT (U+00B7) support #63

Closed
jmontane opened this issue Dec 19, 2013 · 5 comments
Closed

Add "·" MIDDLEDOT (U+00B7) support #63

jmontane opened this issue Dec 19, 2013 · 5 comments

Comments

@jmontane
Copy link

@jmontane jmontane commented Dec 19, 2013

Hi,

MDIDDLEDOT (U+00B7) is very used as inner-word punctuation in Catalan, a mandatory diacritical char in Catalan ortography rules. Currently Twitter doesn't allow to use "·" in several places, so I request to improve its support in Twitter.

I requested it in Twitter support forum, without feedback. So, I request it here. If that's not the place, please, report it to L10N Twitter team.

For instance:

1.- It's unable to make hashtags like #il·lusió
2.- It's unable to set valid URLs like "http://www.l·l.cat" in user's profile
3.- It's unable to create or name list like "al·lucinant"

About 1 and 3
You can do a workaround using a legacy compatible characters ŀ (U+0140) / Ŀ (U+013F). According to Unicode, it's preferred to use their decomposition: l+· and L+·. So, the weird effect is that you can use ĿL in hashtags (#iŀlusió works fine), but not the preferred Unicode encoding L·L (#il·lusió fails).

About 2
MIDDLEDOT (U+00B7) is a valid char (between 2 Ls) in .CAT and .ES TLDs, and its allowed by RFC592

So, please, improve U+00B7 support in Twitter.

Thanks in advance.

Related links
https://dev.twitter.com/discussions/17723
http://www.unicode.org/reports/tr29/#Word_Boundaries
http://www.unicode.org/reports/tr31/
http://tools.ietf.org/search/rfc5892
http://www.python.org/dev/peps/pep-3131/ (search for U+00B7)

@jakl
Copy link
Contributor

@jakl jakl commented Dec 20, 2013

Thanks so much for your comprehensive description of the issue. It's recently become top priority for our international team to better support this and other unicode characters in tweet entities. I'll look into it shortly after the holidays.

@jmontane
Copy link
Author

@jmontane jmontane commented Mar 25, 2014

Is there any advance in this issue?

I hope i18n teams is working in it.

@jmontane
Copy link
Author

@jmontane jmontane commented May 21, 2014

I found a new bug related with U+00B7 and Twitter. Please, see this Tweet https://twitter.com/unjoanqualsevol/status/469148413486194688 There are 2 valid and registered URLs

@jmontane
Copy link
Author

@jmontane jmontane commented Oct 16, 2014

Hi,

Current Unicode UAX 31 cites 00B7 and its use in hashtags
http://www.unicode.org/reports/tr31/#Specific_Character_Adjustments

Is there any improvement or roadmap about this issue?

Regards,

@jmontane jmontane closed this Dec 11, 2014
@jmontane
Copy link
Author

@jmontane jmontane commented Dec 11, 2014

Closed and copied to new active repo

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Linked pull requests

Successfully merging a pull request may close this issue.

None yet
2 participants