PacketEncoder corrupts UTF-8 message #157

evgeny-pasynkov · 2014-09-15T13:58:28Z

Hi,

Using SocketIOClient.send("русский"), I receive on client string "Ñ�Ñ�Ñ�Ñ�ÐºÐ¸Ð¹"

I think that the problem is in PacketEncoder lines 272-274:

                            String str = b.toString(CharsetUtil.ISO_8859_1);
                            if (enc.canEncode(str)) {
                                buf.writeBytes(str.getBytes(CharsetUtil.UTF_8));
                            }

the string gets corrupted when converting it with ISO_8859_1 charset, and then it is sent corrupted

The text was updated successfully, but these errors were encountered:

Maypeur · 2014-09-17T15:32:45Z

Hello,
I really don't know if it's a good solution but i used that to make it work in my case :

CharsetEncoder enc = CharsetUtil.UTF_8.newEncoder();
                            String str = b.toString(CharsetUtil.UTF_8);
                            if (enc.canEncode(str)) {
                                buf.writeBytes(str.getBytes(CharsetUtil.UTF_8));
                            } else {
                                buf.writeBytes(b);
                            }

mrniko · 2014-09-17T15:35:08Z

@evgeny-pasynkov hi!
Do use UTF-8 encoding for your sources?

evgeny-pasynkov · 2014-09-17T16:02:25Z

@mrniko What do you mean by "UTF-8 for sources"?

Is it default encoding for JVM process? If yes, then it isn't a good solution - I don't want my software to depend on server locale :)

Actually, converting my string to ISO_8859_1 damages it, so client cannot restore it further.

evgeny-pasynkov · 2014-09-17T16:03:50Z

@Maypeur your solution is tautology :) It is simply equivalent to "buf.writeBytes(b)"

Maypeur · 2014-09-17T16:10:04Z

@evgeny-pasynkov maybe ! I let it because there was a problem with websocket and accent !

mrniko · 2014-09-17T16:11:51Z

@evgeny-pasynkov could you purpose a better solution for this problem?

evgeny-pasynkov · 2014-09-17T16:14:15Z

@mrniko Could you point please to the mentioned websockets bug? Why not to simplify all this stuff to "buf.writeBytes(b)"?

mrniko · 2014-09-18T08:02:10Z

to avoid encoding problem

evgeny-pasynkov · 2014-09-18T08:06:51Z

What problems? In the code comment, you've mentioned the websockets bug. Which one?

BTW, socket.io had some UTF-8 encoding problems, they claimed to be fixed in socket.io 1.1.0
Check this topic: socketio/socket.io#1744

Maypeur · 2014-09-18T08:13:55Z

@evgeny-pasynkov it is for #137 , using new socket.io.client 1.1.0 and socket.io 1.7.3-SNAPSHOT make problem on UTF-8 characters, but with client 1.0.6 and 1.7.3-SNAPSHOT it's working, so now the bug is corrected in client maybe PacketEncoder need to be adapted. Only @mrniko can say what to do now !

mrniko · 2014-09-18T08:17:04Z

oh! i think to release next 1.7.4 version with this fix so it will be 1.1.0+ compatible only, ok?

evgeny-pasynkov · 2014-09-18T08:19:08Z

For me it is ok.

Maypeur · 2014-09-18T08:27:02Z

Since pre 1.1.0 versions have this major bug i think it must !

mrniko · 2014-09-24T06:42:23Z

please check

evgeny-pasynkov · 2014-09-24T09:25:32Z

The problem is fixed for my scenarios. Thank you!

mrniko · 2014-09-25T13:37:15Z

@Maypeur did you happy too?

Maypeur · 2014-09-25T13:42:18Z

UTF-8 is now correct, but i'm searching why sometimes the memory gow up and never decrease even if i got only 5 users !
Anyway thank you !!!

mrniko · 2014-09-26T05:26:28Z

@Maypeur take a look at memory dump

mrniko added the bug label Sep 19, 2014

mrniko added this to the 1.7.4 milestone Sep 23, 2014

mrniko self-assigned this Sep 23, 2014

mrniko pushed a commit that referenced this issue Sep 24, 2014

Encoding fixed. #157

3677978

mrniko closed this as completed Sep 24, 2014

karhuan mentioned this issue Oct 24, 2014

Outgoing UTF-8 messages are not encoded properly #174

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PacketEncoder corrupts UTF-8 message #157

PacketEncoder corrupts UTF-8 message #157

evgeny-pasynkov commented Sep 15, 2014

Maypeur commented Sep 17, 2014

mrniko commented Sep 17, 2014

evgeny-pasynkov commented Sep 17, 2014

evgeny-pasynkov commented Sep 17, 2014

Maypeur commented Sep 17, 2014

mrniko commented Sep 17, 2014

evgeny-pasynkov commented Sep 17, 2014

mrniko commented Sep 18, 2014

evgeny-pasynkov commented Sep 18, 2014

Maypeur commented Sep 18, 2014

mrniko commented Sep 18, 2014

evgeny-pasynkov commented Sep 18, 2014

Maypeur commented Sep 18, 2014

mrniko commented Sep 24, 2014

evgeny-pasynkov commented Sep 24, 2014

mrniko commented Sep 25, 2014

Maypeur commented Sep 25, 2014

mrniko commented Sep 26, 2014

PacketEncoder corrupts UTF-8 message #157

PacketEncoder corrupts UTF-8 message #157

Comments

evgeny-pasynkov commented Sep 15, 2014

Maypeur commented Sep 17, 2014

mrniko commented Sep 17, 2014

evgeny-pasynkov commented Sep 17, 2014

evgeny-pasynkov commented Sep 17, 2014

Maypeur commented Sep 17, 2014

mrniko commented Sep 17, 2014

evgeny-pasynkov commented Sep 17, 2014

mrniko commented Sep 18, 2014

evgeny-pasynkov commented Sep 18, 2014

Maypeur commented Sep 18, 2014

mrniko commented Sep 18, 2014

evgeny-pasynkov commented Sep 18, 2014

Maypeur commented Sep 18, 2014

mrniko commented Sep 24, 2014

evgeny-pasynkov commented Sep 24, 2014

mrniko commented Sep 25, 2014

Maypeur commented Sep 25, 2014

mrniko commented Sep 26, 2014