
Don't automatically copy python objects into javascript #1167

Merged — 19 commits into pyodide:master from py2js-no-auto-copy on Feb 2, 2021

Conversation

@hoodmane (Member) commented Jan 22, 2021

This is #1152 again. This implements the main changes proposed in #900.

This PR adjusts python2js to wrap Python list and dict objects in PyProxy objects instead of copying them into JavaScript. This has the advantage that it allows JavaScript and Python to share a reference to the same Python list or dict. Automatic copying is lossy and inconvenient, it's asymmetric (js2python proxies a JavaScript Array or object into a JsProxy rather than copying it), and it causes difficulties in a wide variety of circumstances. For instance, it makes it impossible in general to get the repr of an object returned from runPython, it requires special-case code to handle pyodide.globals (which is a PyProxy of a dict), etc.

This adds new APIs, deepCopyToJavascript and shallowCopyToJavascript, as PyProxy methods; using these recovers the old behavior.
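To make the deep/shallow distinction concrete, here is a plain-JavaScript analogy (an illustration only, not the pyodide implementation): a shallow copy converts only the top level, so nested containers stay shared, while a deep copy converts recursively.

```javascript
// Plain-JS analogy for shallow vs. deep copy semantics
// (the real PyProxy methods convert Python objects):
const inner = [1, 2];
const outer = [inner];

const shallow = [...outer];                     // one level copied; inner is shared
const deep = JSON.parse(JSON.stringify(outer)); // fully copied; nothing shared

inner.push(3);
console.log(shallow[0].length); // 3 — the shallow copy sees the mutation
console.log(deep[0].length);    // 2 — the deep copy does not
```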

@phorward (Contributor) left a comment


Hi @hoodmane, thank you for setting up your PR again, and apologies for the seesaw.
The solution for the failing tests, as mentioned in #1152, is obviously simpler than I first thought.

I was thinking the right approach would be to merge this as-is and then preferably fix the ergonomics problems before the next release, but I could see adding more to this PR.

I'm okay with this approach anyway; next time, we'd better test twice against current master.

Oh also the docs on type conversions and the changelog should be updated.

It would be nice if you can do that.

Presumably implementing toJSON on PyProxy would make the tests pass more painlessly.

And this as well. When all changes are done and maybe @rth is also okay with this, we can merge again.

@hoodmane (Member, Author) commented Jan 22, 2021

Right, well if I add these other changes to this branch and it is decided that they should be merged separately I can always split them off into other PRs. I'm a little bit unsure how selenium handles serialization, the selenium docs are surprisingly vague on the topic. Presumably a little bit of experimentation will make it clear though.

@hoodmane (Member, Author)

Okay so I don't really understand how selenium serializes stuff. It definitely doesn't do JSON.stringify. The following assertion passes:

    assert selenium.run_js(
        """
        return {
            toJSON : () => {
                console.log("Running toJSON!");
                return [1,2,3];
            }
        };
        """
    ) == dict(toJSON={})

Whereas if it was doing JSON.stringify then the result would be [1,2,3].
As another test:

    selenium.run_js(
        """
        let a = {};
        a.a = a;
        return a;
        """
    )

This causes selenium.common.exceptions.WebDriverException: Message: unknown error: Maximum call stack size exceeded, whereas if it were using JSON.stringify it would say something like: Uncaught TypeError: cyclic object value.
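For reference, both JSON.stringify behaviors mentioned above are easy to confirm in plain JavaScript, which is what rules it out as selenium's serializer:

```javascript
// JSON.stringify does honor toJSON:
const obj = { toJSON: () => [1, 2, 3] };
console.log(JSON.stringify(obj)); // "[1,2,3]"

// and it throws a TypeError on a cyclic object rather than
// overflowing the call stack:
const a = {};
a.a = a;
let error;
try {
  JSON.stringify(a);
} catch (e) {
  error = e;
}
console.log(error instanceof TypeError); // true
```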

@dalcde You have any idea how this works? I guess maybe the right thing to do is to just avoid returning a pyproxy from selenium at all like I've been doing.

@phorward (Contributor) left a comment


Hello @hoodmane, thank you for the update, do you think we can give it another try?

@hoodmane (Member, Author)

Yeah @phorward I'd appreciate it if you could merge this. (I just merged master so this time there definitely won't be any surprise conflicts!)

@phorward phorward merged commit b3a965a into pyodide:master Feb 2, 2021
@hoodmane hoodmane deleted the py2js-no-auto-copy branch February 2, 2021 18:57
casatir added a commit to casatir/pyodide that referenced this pull request Feb 3, 2021
@casatir casatir mentioned this pull request Feb 3, 2021
@rth (Member) commented Feb 3, 2021

This adds new APIs, deepCopyToJavascript and shallowCopyToJavascript, as PyProxy methods; using these recovers the old behavior.

In terms of naming those are a bit verbose, particularly if they are to be used often. How about deepCopyToJS or deepCopyJS (or some other shorter name)?

@rth (Member) commented Feb 3, 2021

Also it would be good to document PyProxy and its public methods somewhere in https://pyodide.readthedocs.io/en/latest/usage/api-reference.html as currently these methods only appear in the changelog and nowhere else in the documentation.

@hoodmane (Member, Author) commented Feb 4, 2021

How about deepCopyToJS or deepCopyJS (or some other shorter name)?

I'm happy with any of these. If you have a preference I can open a PR. Maybe even copyToJS, copy2Js, deepCopy2Js. Is it better to keep the word deep in there? What about shallowCopyToJs and copyToJs? I guess deepCopyToJs is my favorite?

@hoodmane (Member, Author) commented Feb 4, 2021

document PyProxy and its public methods somewhere

I think JsProxy also needs this. type_conversions.md has a fair amount of relevant material about the proxies, but it might be good to repeat it in the API docs, since a narrative document isn't ideal as a reference.

I have a local branch with a bunch of docs updates for recent changes and the changes in my various open PRs, my thought was to batch all the docs updates into one PR.

@rth (Member) commented Feb 4, 2021

Let's continue the naming discussion in #1192

my thought was to batch all the docs updates into one PR.

Sounds good.

hoodmane pushed a commit to hoodmane/pyodide that referenced this pull request Feb 5, 2021
@hoodmane hoodmane mentioned this pull request Feb 5, 2021
@daoxian (Contributor) commented Feb 6, 2021

Hello @hoodmane, thank you for the magnificent APIs (deep/shallowCopyToJavascript), which eliminate the heavy cost of converting big arrays from Python to JavaScript. I think the shallowCopy version is designed to pass a memory reference rather than copy bulk content, but I find the performance is not as good as I imagined, as shown below:

[screenshot: timing measurements]

It takes about 1–7 seconds for shallowCopyToJavascript() to complete the memory reference plus, I guess, some necessary metadata copying. However, that's not adequate for realtime computation. Any suggestions for better conversion performance?

@hoodmane (Member, Author) commented Feb 6, 2021

Hi @daoxian. Thanks for reporting this bad behavior. In general, though, I think it's preferable to open this sort of question as an issue rather than on the PR.

The handling of memory buffers is currently not good; this is an issue we hope to fix at some point. I think the current behavior is to make a JavaScript Array of 1080 JavaScript Arrays of 1920 3-byte Uint8Arrays. The innermost layer is sliced out of the buffer memory, so saying arr[700][500][0] = 255 will adjust the original data, but populating those large JavaScript Arrays is very inefficient (remember that a JavaScript Array is a hashmap).
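The "sliced out of the buffer memory" part can be seen with a plain Uint8Array (a sketch of the view semantics, not the actual conversion code): subarray produces a view over the same bytes, so writes through it mutate the original buffer.

```javascript
// A subarray is a view into the same underlying memory, not a copy:
const buf = new Uint8Array([10, 20, 30, 40, 50, 60]);
const row = buf.subarray(3, 6); // view over bytes 3..5
row[0] = 255;                   // write through the view...
console.log(buf[3]); // 255 — ...and the original buffer changed
```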

My guess is it would do much better if you transposed the array to have shape (3, 1080, 1920), though of course this isn't very ergonomic if for instance you are intending to use this data to back an image.
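A rough count of the wrapper Arrays that the nested-Array conversion described above would allocate shows why the transpose helps (wrapperCount is a hypothetical helper that just counts objects, not the actual pyodide conversion code):

```javascript
// Count the wrapper Arrays a nested-Array conversion would allocate
// for a given shape: every axis except the last contributes
// prod(shape[0..i]) Arrays; the last axis becomes Uint8Array views
// sliced from the underlying buffer.
function wrapperCount(shape) {
  let count = 0;
  let prod = 1;
  for (let i = 0; i < shape.length - 1; i++) {
    prod *= shape[i];
    count += prod;
  }
  return count;
}

console.log(wrapperCount([1080, 1920, 3])); // 2074680 wrapper Arrays to populate
console.log(wrapperCount([3, 1080, 1920])); // 3243 — orders of magnitude fewer
```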

Would you look at #1168? That issue is closely related to the poor performance here. If you have an opinion about that discussion it would be helpful. One thing that limits my ability to fix the Buffer handling code is that I don't really know what the use cases look like, so I really appreciate any input you have. Of course if you want to take a stab at improving the buffer conversion code, let me know and I can tell you what I think you'll need to know.

@daoxian (Contributor) commented Feb 6, 2021

My guess is it would do much better if you transposed the array to have shape (3, 1080, 1920), though of course this isn't very ergonomic if for instance you are intending to use this data to back an image.

Indeed, it's much faster after transposing the image to (3, 1080, 1920).

@hoodmane (Member, Author) commented Feb 6, 2021

Could you copy your most recent post to #1202 and continue the discussion there?
