Unicode turns into bytes, which turn into entities #1

Open
benkasminbullock opened this Issue Jun 11, 2011 · 7 comments

Comments

Projects
None yet
2 participants

If I enter Unicode text like すごい on cpanratings.perl.org, the text is read as individual bytes and each byte is then converted into its own HTML entity, rendering it unreadable.

See e.g.

http://cpanratings.perl.org/dist/Poem

This contains the Unicode code points U+3059, U+3054 and U+3044, encoded as UTF-8, with each UTF-8 byte then turned into a separate HTML entity.
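To make the symptom concrete, here is a minimal Python sketch of the difference between escaping per byte and escaping per code point. It is only an illustration: the site itself is not written in Python, and the function names `entities_per_byte` and `entities_per_code_point` are hypothetical, not taken from the cpanratings code.

```python
def entities_per_byte(text: str) -> str:
    """Reproduce the reported symptom: encode to UTF-8 first,
    then turn every single byte into a numeric HTML entity."""
    return "".join(f"&#{byte};" for byte in text.encode("utf-8"))


def entities_per_code_point(text: str) -> str:
    """Expected behaviour: one numeric entity per non-ASCII code point,
    ASCII characters passed through unchanged."""
    return "".join(ch if ord(ch) < 128 else f"&#{ord(ch)};" for ch in text)


if __name__ == "__main__":
    sample = "すごい"  # U+3059, U+3054, U+3044
    print(entities_per_byte(sample))
    print(entities_per_code_point(sample))
```

Each of the three characters takes three bytes in UTF-8 (for example U+3059 is E3 81 99), so the per-byte version produces nine entities such as `&#227;&#129;&#153;`, which a browser renders as three unrelated Latin-1 characters per original character. The per-code-point version produces `&#12377;&#12372;&#12356;`, which renders as すごい.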

Contributor

abh commented Jun 11, 2011

Thanks – I noticed that, too. I'll see if I can fix it over the weekend. Now sleep. :-)

@abh abh closed this in bc0e4a7 Jun 30, 2011

Contributor

abh commented Jun 30, 2011

Ben - this should be all fixed now; please let me know if that's not the case! :-)

benkasminbullock commented Jun 30, 2011

Yes, it seems OK now, and you have even recovered the old text from the previous reviews.

benkasminbullock commented Jun 30, 2011

But there is a really strange bug appearing now: when I try to edit the text of the review, I get the previous review's text, not the one which was entered.

Contributor

abh commented Jun 30, 2011

You mean after you submit the new one? The pages are cached, so wait 10 or 30 minutes (I forget which) and they should update. (Or tweak the page URL by adding ?x or some such to force a cache miss.)
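The cache-miss trick can be sketched in Python as below. This assumes, as the comment implies, that the cache is keyed on the full URL including the query string; the helper name `add_cache_buster` and its default parameter values are hypothetical.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def add_cache_buster(url: str, key: str = "x", value: str = "1") -> str:
    """Append a throwaway query parameter so the URL differs from the
    cached one. Existing query parameters are preserved."""
    parts = urlsplit(url)
    query = parse_qsl(parts.query, keep_blank_values=True)
    query.append((key, value))
    return urlunsplit(parts._replace(query=urlencode(query)))


if __name__ == "__main__":
    print(add_cache_buster("http://cpanratings.perl.org/dist/Poem"))
```

For the page mentioned at the top of this issue, the result is `http://cpanratings.perl.org/dist/Poem?x=1`. Changing `value` on each request gives a fresh URL every time.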

http://localrobot.com/

@abh abh reopened this Aug 1, 2011

Contributor

abh commented Aug 1, 2011

Grrh - I'm afraid I managed to break it again for old content (but it's working for new content). :-( Ben, can you verify that it's working ok for new reviews? Then I'll try to munge the database to restore the utf-8 data in old reviews.
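One possible shape for that kind of repair, sketched in Python, is shown here. It assumes the damaged reviews are stored in the per-byte entity form described at the top of this issue, which may not match what is actually in the database; the function name `repair_byte_entities` is hypothetical. Runs of entities in the range 128 to 255 are reassembled into bytes and decoded as UTF-8, and anything that does not decode cleanly is left untouched.

```python
import re

# A run of one or more three-digit numeric entities, e.g. "&#227;&#129;&#153;".
_ENTITY_RUN = re.compile(r"(?:&#\d{3};)+")
_ENTITY = re.compile(r"&#(\d+);")


def repair_byte_entities(text: str) -> str:
    """Turn runs of per-byte entities back into the characters they encode."""

    def fix(match: re.Match) -> str:
        values = [int(n) for n in _ENTITY.findall(match.group(0))]
        # Only non-ASCII byte values can be part of the damage described here.
        if any(v < 128 or v > 255 for v in values):
            return match.group(0)
        try:
            return bytes(values).decode("utf-8")
        except UnicodeDecodeError:
            # Not a valid UTF-8 sequence, so probably a genuine entity.
            return match.group(0)

    return _ENTITY_RUN.sub(fix, text)


if __name__ == "__main__":
    damaged = "&#227;&#129;&#153;&#227;&#129;&#148;&#227;&#129;&#132;"
    print(repair_byte_entities(damaged))
```

A lone entity such as `&#233;` is not a complete UTF-8 sequence, so it is left as it is rather than being misread as damaged data.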

It's working for newly-edited reviews (see top page).

abh pushed a commit that referenced this issue Apr 23, 2014

Merge pull request #1 from book/patch-1
Fix capitalization of my nickname