Improve text encoding API #564

dcodeIO · 2019-03-28T18:02:50Z

This PR reworks the text encoding API on top of the runtime branch so a user can specify how to handle null termination.

Also splits the API into safe and unsafe parts. Note that where an ArrayBuffer is returned, an unsafe user can simply changetype<usize>(theBuffer) due to new headerless buffer (and string) layout and pass it to a C-API. The alternative is to return usize from the ___Raw functions, which can't reuse the new runtime API for allocating/reallocation as well as just returning an ArrayBuffer can. Thoughts?

(I don't particularly like the TextEncoder API btw because it news)

MaxGraey · 2019-03-28T18:06:44Z

std/assembly/encoding.ts

+
+  // @ts-ignore: decorator
+  @unsafe
+  export function decodeRaw(buf: usize, len: i32): string {


It looks like this could be internal because decode already provide more general interface. WDYT?

The idea is that someone who gets just a str and a str_len from C has a way to make a string from it, as the higher level decode function wants an ArrayBuffer which must have a runtime header with payloadLength.

So may be makes sense add optional len to decode as well?

That'd look like UTF8.decode(changetype<ArrayBuffer>(str), false, str_len) then. Not sure, feels somewhat wrong and would do different things depending of whether str_len is given or not. For instance, the function does not know whether it is called with an actual ArrayBuffer or not, so it doesn't know when it can length-check and when it can't.

UTF8.decode(changetype<ArrayBuffer>(str), 0, true) ?

I actually had something similar in mind initially, but gave up on it when I wasn't able to min(len, buf.byteLength) because buf could be a c pointer. Essentially, if len is given, it must not check byteLength because buf might be something unsafe, in turn making the entire function unsafe when specifying buf as an ArrayBuffer plus len. Hence I made a dedicated unsafe function.

dcodeIO · 2019-03-28T21:50:58Z

I split this into UTF8Encoder and UTF8Decoder now (UTF16 respectively) and implemented the static methods. That better?

MaxGraey · 2019-03-28T21:51:54Z

Great! Now much better

dcodeIO · 2019-03-28T22:00:03Z

Usage with a C-API would be:

// having 'str' and 'str_len'
var theString = UTF8Decoder.decodeUnsafe(str, str_len);

// having 'str' (null terminated)
var theString = UTF8Decoder.decodeNullTerminatedUnsafe(str);

// having 'theString', wanting 'str' and 'str_len'
var buffer = UTF8Encoder.encode(theString);
myCAPI(changetype<usize>(buffer), buffer.byteLength);

// having 'theString', wanting 'str' (null terminated)
var buffer = UTF8Encoder.encode(theString, true);
myCAPI(changetype<usize>(buffer));

dcodeIO · 2019-06-20T00:59:41Z

Superseded by #679

Improve text encoding API

aa38d06

MaxGraey reviewed Mar 28, 2019

View reviewed changes

split into encoders and decoders

a742537

Merge branch 'runtime' into runtime-encoding

5b7193b

vgrichina mentioned this pull request Apr 20, 2019

Fix get<bool> in near.ts nearprotocol/near-runtime-ts#14

Merged

MaxGraey mentioned this pull request May 15, 2019

make String toUTF8 and fromUTF8 more ergonomic #591

Closed

dcodeIO mentioned this pull request Jun 19, 2019

Implement text encoding helpers on top of the new runtime #679

Merged

dcodeIO closed this Jun 20, 2019

dcodeIO deleted the runtime-encoding branch September 20, 2019 09:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve text encoding API #564

Improve text encoding API #564

dcodeIO commented Mar 28, 2019

MaxGraey Mar 28, 2019

dcodeIO Mar 28, 2019 •

edited

MaxGraey Mar 28, 2019

dcodeIO Mar 28, 2019 •

edited

MaxGraey Mar 28, 2019

dcodeIO Mar 28, 2019 •

edited

dcodeIO commented Mar 28, 2019

MaxGraey commented Mar 28, 2019

dcodeIO commented Mar 28, 2019

dcodeIO commented Jun 20, 2019

Improve text encoding API #564

Improve text encoding API #564

Conversation

dcodeIO commented Mar 28, 2019

MaxGraey Mar 28, 2019

Choose a reason for hiding this comment

dcodeIO Mar 28, 2019 • edited

Choose a reason for hiding this comment

MaxGraey Mar 28, 2019

Choose a reason for hiding this comment

dcodeIO Mar 28, 2019 • edited

Choose a reason for hiding this comment

MaxGraey Mar 28, 2019

Choose a reason for hiding this comment

dcodeIO Mar 28, 2019 • edited

Choose a reason for hiding this comment

dcodeIO commented Mar 28, 2019

MaxGraey commented Mar 28, 2019

dcodeIO commented Mar 28, 2019

dcodeIO commented Jun 20, 2019

dcodeIO Mar 28, 2019 •

edited

dcodeIO Mar 28, 2019 •

edited

dcodeIO Mar 28, 2019 •

edited