fix: decode string with specified buffer size #19

toyobayashi · 2023-01-06T12:21:25Z

Fixes #17

RReverser · 2023-01-06T13:33:01Z

This won't work correctly with pthreads, because TextDecoder works only on unshared memory. Note how Emscripten has to work around that with custom helper that switches to .slice(...) instead of .subarray(...) when memory is shared: https://github.com/emscripten-core/emscripten/blob/0d7014e1b20ead9739f1fce54b316152413601c3/src/runtime_strings.js#L115

RReverser · 2023-01-06T13:34:59Z

packages/emnapi/src/error.ts

+  abort('FATAL ERROR: ' +
+    (location_len === -1 ? UTF8ToString(location) : emnapiRt.utf8Decoder.decode(HEAPU8.subarray(location, location + location_len))) +
+    ' ' +
+    (message_len === -1 ? UTF8ToString(message) : emnapiRt.utf8Decoder.decode(HEAPU8.subarray(message, message + message_len)))


Maybe better to extract this into a helper emnapiRt.readString? Could be a bit cleaner than duplicating === -1 ? ... everywhere.

RReverser · 2023-01-06T13:36:38Z

packages/runtime/src/util.ts

+export const utf8Decoder: { decode: (input: BufferSource) => string } = typeof TextDecoder === 'function'
+  ? new TextDecoder()
+  : {
+      decode (input: BufferSource) {


I'd suggest to either exclude the fallback, or at least guard it under #if TEXTDECODER == 2 etc like Emscripten does.

It's quite a bit of code for something that none of the modern engines needs.

RReverser · 2023-01-06T13:37:39Z

packages/runtime/src/util.ts

+          throw new TypeError('The "input" argument must be an instance of ArrayBuffer or ArrayBufferView')
+        }
+        const bytes = isArrayBuffer ? new Uint16Array(input) : new Uint16Array(input.buffer, input.byteOffset, input.byteLength / 2)
+        return String.fromCharCode.apply(String, bytes as any)


This is often tempting, but will fail with stack overflow on any large strings (probably worth adding a test for that). It's important to use a loop here instead.

toyobayashi · 2023-01-06T13:53:31Z

Thanks. working in progress

toyobayashi · 2023-01-06T14:56:53Z

Done. c8f1175

RReverser · 2023-01-06T15:45:34Z

Looks good. I think it would be still worth adding regression tests for 2 mentioned cases - one to check that strings work when compiled with -pthread, and one for large strings.

toyobayashi added 2 commits January 6, 2023 20:20

fix: decode string with specified buffer size

b7e758d

fix: string decode length

dd5146d

toyobayashi merged commit 458121b into main Jan 6, 2023

RReverser reviewed Jan 6, 2023

View reviewed changes

toyobayashi deleted the fix-string-decoding branch April 6, 2023 16:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: decode string with specified buffer size #19

fix: decode string with specified buffer size #19

toyobayashi commented Jan 6, 2023

RReverser commented Jan 6, 2023

RReverser Jan 6, 2023

RReverser Jan 6, 2023

RReverser Jan 6, 2023 •

edited

Loading

toyobayashi commented Jan 6, 2023

toyobayashi commented Jan 6, 2023

RReverser commented Jan 6, 2023

fix: decode string with specified buffer size #19

fix: decode string with specified buffer size #19

Conversation

toyobayashi commented Jan 6, 2023

RReverser commented Jan 6, 2023

RReverser Jan 6, 2023

Choose a reason for hiding this comment

RReverser Jan 6, 2023

Choose a reason for hiding this comment

RReverser Jan 6, 2023 • edited Loading

Choose a reason for hiding this comment

toyobayashi commented Jan 6, 2023

toyobayashi commented Jan 6, 2023

RReverser commented Jan 6, 2023

RReverser Jan 6, 2023 •

edited

Loading