feat(node): add string_decoder module #11095

sgtcoolguy · 2019-07-31T13:50:37Z

JIRA: https://jira.appcelerator.org/browse/TIMOB-27286

Description:
Follow on to #11084 which adds Buffer

This adds the string_decoder module, which is sort of a safer way to interact with Buffers when dealing with character data.

Basically a string decoder will temporarily cache/buffer up incomplete multi-byte characters until they're completed before outputting them. This is handy if you expect multi-byte utf8/utf16/base64 data and want to spit it out as you go.

Caveats
V8 and Node handle invalid UTF-8 multi-byte characters by replacing invalid/incomplete multi-byte sequences with '\ufffd'. This implementation does not - and on a platform like iOS can result in undefined being returned. I've left comments about this in here for now - but the "fix" here is 2-fold:

Have iOS/Windows explicitly handle UTF-8 by injecting the replacement characters (there's a link to a gist that shows this for iOS, but it's a fair amount of code) - Note that this would fix our Ti.Buffer handling of invalid data as well.
Have this module do the checking/replacement. Given that the data lives across the JS/native boundary this could be performance-heavy. If our Buffer module got changed to use Uint8Array in the majority of cases, this may not be such an issue - but being backed by a Ti.Buffer and traversing the bridge for every byte means it would be pretty expensive to iterate through to find and validate every multi-byte char sequence.

The only "invalid" case we do handle is when you call decoder.end() and it has a buffered incomplete multi-byte character.

sgtcoolguy · 2019-07-31T13:51:22Z

⚠️ Note that this should be held until #11084 is merged (and then this should be rebased)

build · 2019-07-31T15:56:23Z

common/Resources/ti.internal/extensions/node/string_decoder.js

+	 * - charLength: expected number of bytes for the incomplete character
+	 * - index: index in the buffer where the incomplete character begins
+	 * @param {Buffer} _buffer Buffer we are checking to see if it has an incompelte "character" at the end
+	 * @returns {IncompleteCharObject}


⚠️ common/Resources/ti.internal/extensions/node/string_decoder.js line 104 – Unexpected @returns tag; function has no return statement. (valid-jsdoc)

build · 2019-07-31T15:56:25Z

	Messages
📖	💾 Here's the generated SDK zipfile.
📖	✅ All tests are passing Nice one! All 4400 tests are passing. (There are 474 tests skipped)
📖	✊ The commits in this PR match our conventions! Feel free to Rebase and Merge this PR when ready.

Generated by 🚫 dangerJS against b23069a

Fixes TIMOB-27286

janvennemann

Looks good! Just added two minor notes in the test suite but nothing important.

I guess we have to see if and how the caveats have any impact on third-party modules that use string_decoder. Regarding a fix for this i'd vote for the actual fix in Ti.Buffer on iOS to get the correct behavior natively, assuming that we don't make the switch to UIInt8Array for our buffers anytime soon.

janvennemann · 2019-09-10T15:29:53Z

tests/Resources/string_decoder.addontest.js

+		});
+	});
+
+	// FIXME: I'm using ES6 classes which doesn't allow non-new constructor!


Is this still an issue? Looks like you are just delegating to the ES6 classes so it should work fine?

janvennemann · 2019-09-10T15:48:11Z

tests/Resources/string_decoder.addontest.js

+			should(decoder.write(Buffer.from([]))).eql('');
+			should(decoder.end()).eql('');
+			// now writing an empty buffer doesn't append anything!
+			// testEnd('base64', Buffer.of(0x61), Buffer.of(), 'YQ==');


Are these a placeholder for some missing tests?

sgtcoolguy · 2019-09-17T13:53:44Z

Merged manually to master.

sgtcoolguy added ios android feature windows labels Jul 31, 2019

build added this to the 8.2.0 milestone Jul 31, 2019

build reviewed Jul 31, 2019

View reviewed changes

feat(node): add string_decoder module

daf7826

Fixes TIMOB-27286

sgtcoolguy force-pushed the node-string-decoder branch from 7571c7b to daf7826 Compare August 6, 2019 15:34

sgtcoolguy requested a review from janvennemann August 6, 2019 17:00

sgtcoolguy added 3 commits August 29, 2019 16:03

Merge branch 'master' into node-string-decoder

eeb98c7

Merge branch 'master' into node-string-decoder

4278af7

Merge branch 'master' into node-string-decoder

b23069a

sgtcoolguy modified the milestones: 8.2.0, 8.3.0 Sep 5, 2019

janvennemann approved these changes Sep 10, 2019

View reviewed changes

sgtcoolguy closed this Sep 17, 2019

sgtcoolguy deleted the node-string-decoder branch September 17, 2019 13:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(node): add string_decoder module #11095

feat(node): add string_decoder module #11095

sgtcoolguy commented Jul 31, 2019

sgtcoolguy commented Jul 31, 2019

build Jul 31, 2019

build commented Jul 31, 2019 •

edited

janvennemann left a comment

janvennemann Sep 10, 2019

janvennemann Sep 10, 2019

sgtcoolguy commented Sep 17, 2019

feat(node): add string_decoder module #11095

feat(node): add string_decoder module #11095

Conversation

sgtcoolguy commented Jul 31, 2019

sgtcoolguy commented Jul 31, 2019

build Jul 31, 2019

Choose a reason for hiding this comment

build commented Jul 31, 2019 • edited

janvennemann left a comment

Choose a reason for hiding this comment

janvennemann Sep 10, 2019

Choose a reason for hiding this comment

janvennemann Sep 10, 2019

Choose a reason for hiding this comment

sgtcoolguy commented Sep 17, 2019

build commented Jul 31, 2019 •

edited