Force UTF-8 encoding on incoming events #16

MusikAnimal · 2018-12-27T17:16:38Z

If the "chunk" contains characters that aren't UTF-8, the library will fail with "invalid byte sequence in UTF-8 (ArgumentError)". This patch fixes this and ensures we work only with UTF-8.

francois2metz · 2018-12-27T17:21:16Z

Hi. Thanks for the contribution.

i'm fine with forcing UTF-8 (it's in the spec), but why encoding it as UTF-16be first?

MusikAnimal · 2018-12-27T19:37:36Z

Apparently with older Rubies (pre 2.1), encode is a no-op if the string has the same encoding, so you have to encode into a different encoding then back to utf-8. Ruby pre-2.1 is really old, so maybe we don't care? This is deserving of a comment, at least.

The more modern solution I think would use scrub (introduced with 2.1), as with chunk.scrub('?').

In my applications I use str.force_encoding('utf-8') but that didn't seem to work here, for whatever reason.

francois2metz · 2018-12-28T13:44:01Z

Ruby 2.1 is no longer maintained, so I'm ok to drop support of it.

Could you add a testcase please?

Then I'll check what browsers do in this case.

MusikAnimal · 2018-12-28T17:58:35Z

Sure thing, though I'm not sure how to run the tests. Is this standard rspec? With rspec spec/ I get the errors undefined method `assert_equal' for nil:NilClass

I may not get back to this until the new year, FYI. Thanks for the help!

MusikAnimal · 2018-12-28T18:06:32Z

Never mind, I got the tests to run, and they pass :) I'll add a test case when I return January 1.

If the "chunk" contains characters that aren't UTF-8, the library will fail with "invalid byte sequence in UTF-8 (ArgumentError)". This patch fixes this and ensures we work only with UTF-8.

MusikAnimal · 2019-01-04T23:58:17Z

@francois2metz Not sure if you were waiting on a ping. I added a test case :) No rush to merge obviously, just letting you know.

francois2metz · 2019-01-05T13:34:45Z

Hi. Thanks for letting me know. I'll take a look later.

francois2metz · 2019-01-11T15:00:21Z

Thanks for the patch! 💛 💜 💙 💚

francois2metz · 2019-01-11T15:09:14Z

I released the version 0.3.1 with the fix.

MusikAnimal force-pushed the patch-1 branch from 15616c0 to 15c52df Compare December 28, 2018 17:52

Force UTF-8 encoding on incoming events

5151e1f

If the "chunk" contains characters that aren't UTF-8, the library will fail with "invalid byte sequence in UTF-8 (ArgumentError)". This patch fixes this and ensures we work only with UTF-8.

MusikAnimal force-pushed the patch-1 branch from 15c52df to 5151e1f Compare December 30, 2018 22:05

francois2metz merged commit 5151e1f into francois2metz:master Jan 11, 2019

MusikAnimal deleted the patch-1 branch January 11, 2019 17:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Force UTF-8 encoding on incoming events #16

Force UTF-8 encoding on incoming events #16

MusikAnimal commented Dec 27, 2018

francois2metz commented Dec 27, 2018

MusikAnimal commented Dec 27, 2018 •

edited

francois2metz commented Dec 28, 2018

MusikAnimal commented Dec 28, 2018

MusikAnimal commented Dec 28, 2018

MusikAnimal commented Jan 4, 2019

francois2metz commented Jan 5, 2019

francois2metz commented Jan 11, 2019

francois2metz commented Jan 11, 2019

Force UTF-8 encoding on incoming events #16

Force UTF-8 encoding on incoming events #16

Conversation

MusikAnimal commented Dec 27, 2018

francois2metz commented Dec 27, 2018

MusikAnimal commented Dec 27, 2018 • edited

francois2metz commented Dec 28, 2018

MusikAnimal commented Dec 28, 2018

MusikAnimal commented Dec 28, 2018

MusikAnimal commented Jan 4, 2019

francois2metz commented Jan 5, 2019

francois2metz commented Jan 11, 2019

francois2metz commented Jan 11, 2019

MusikAnimal commented Dec 27, 2018 •

edited