Write http.client in terms of Web APIs #140

mdboom · 2018-09-05T15:54:52Z

This might not even be possible given blocking issues.

However, if we could write http.client in terms of Web APIs, we might be able to get things like pip partially working. As it stands, Python libraries built on top of raw sockets don't (can't) work.


anshuldutt21 · 2019-12-19T08:13:57Z

Hi, I would like to contribute to this issue.

rth · 2021-06-05T12:22:25Z

If I'm not mistaken, @kikocorreoso mentioned that another possibility could be to look at the work done in Brython, where some of these standard library modules might be re-implemented in JavaScript.

I quickly checked, and for instance http.client, according to git commit messages, is identical to upstream, while _socket.py doesn't do anything, but maybe I'm missing something. Also, brython-dev/brython#1032 (comment) suggests that there are no replacements for low-level network connectivity. I also couldn't find anything about it in Skulpt.

Anyway it would indeed be a good idea to read earlier discussion on this subject in the issue tracker of these projects.

kikocorreoso · 2021-06-08T09:31:58Z

@rth my comment was more in the vein of using JS libs instead of Python libs when it makes sense. Some of them may already be implemented in Brython, Batavia, Skulpt, etc., and could be reused in some way.

For instance, re was very slow as it was implemented in pure Python in Brython so @PierreQuentel reimplemented the functionality in JS.
brython-dev/brython#1519

I suppose re is in WASM in Pyodide, so maybe this example is not very useful. I was thinking more of pure Python libs that have been rewritten in JS to adapt some behaviour to the browser, for performance reasons, etc.

I don't know if this could help in terms of "DoNotReinventTheWheel", performance,...

rth · 2021-06-08T09:43:39Z

> this could help in terms of "DoNotReinventTheWheel"

Yes, absolutely. Thanks for your comment! We should definitely look at what could be used/adapted in JS before implementing stuff :)

hoodmane · 2021-06-08T14:19:56Z

The main issue we have to deal with is that http.client is a synchronous API and the relevant web APIs are asynchronous. I implemented a small piece of the http.client API on one of my comlink/syncifiers branches as a proof of concept, and I think it works quite well.

However, I think it may be possible to use Emscripten pthreads instead. If we can get it working with pthreads, I think that would reduce how much code we need and potentially also lead to other useful features, though I think it will also allow much less fine-grained control than my comlink approach. In particular, Emscripten will take care of creating the web workers and deciding where the work should be done.

I'm really curious how and whether pthreads can make Python threading work. To do that, I guess the backing data of the wasm module needs to be stored in a SharedArrayBuffer and all the Python interpreter code needs to be copied into multiple workers. Seems complicated, but maybe Emscripten pthreads does it...
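To make the sync/async mismatch concrete, here is a minimal sketch that runs on desktop CPython only: a coroutine is driven on a background thread's event loop while the caller blocks for the result. It is not the comlink/syncifiers implementation. `BlockingBridge`, `fake_fetch`, and `blocking_get` are hypothetical names, and `fake_fetch` merely stands in for an asynchronous web API.

```python
import asyncio
import threading


class BlockingBridge:
    """Run coroutines on a background event loop and block for their results."""

    def __init__(self):
        self._loop = asyncio.new_event_loop()
        # The loop runs in its own thread so the calling thread is free to block.
        self._thread = threading.Thread(target=self._loop.run_forever, daemon=True)
        self._thread.start()

    def call(self, coro, timeout=None):
        # Schedule the coroutine on the background loop, then wait synchronously.
        future = asyncio.run_coroutine_threadsafe(coro, self._loop)
        return future.result(timeout)

    def close(self):
        self._loop.call_soon_threadsafe(self._loop.stop)
        self._thread.join()
        self._loop.close()


async def fake_fetch(url):
    # Hypothetical stand-in for an asynchronous web API call.
    await asyncio.sleep(0.01)
    return f"response for {url}"


def blocking_get(bridge, url):
    # Synchronous facade of the kind http.client callers expect.
    return bridge.call(fake_fetch(url))
```

The blocking call only works here because a second OS thread keeps the event loop running. In the browser that role would have to be played by web workers, which is why the choice between the comlink approach and Emscripten pthreads matters.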


datakurre · 2021-08-12T13:48:56Z

@hoodmane I'd like to check and try out the piece of the http.client API that you implemented, but I was unable to find the correct branch. Would you be able to link your version here?

hoodmane · 2021-08-12T18:54:54Z

Yeah, the actual partial http_client implementation is here:
https://github.com/hoodmane/pyodide/blob/comlink-demo/src/pyodide-py/pyodide/http_client.py

It uses a comlink fork which I have here:
https://github.com/hoodmane/pyodide/tree/comlink-demo/comlink

The actual demo is here:
https://github.com/hoodmane/pyodide/tree/comlink-demo/demos/syncio

I can't remember how well this stuff works. My plan is to work on the comlink port in this separate repository:
https://github.com/hoodmane/synclink
I suppose it would be good to make a comparable demo that uses that repo for the comlink fork.

datakurre · 2021-08-13T20:12:41Z

@hoodmane Thank you for the links. Unfortunately, that ended up being too much for me to get working within the time I had, but at least it took me through learning how to build a working Pyodide from source 💪

rth · 2021-11-13T18:07:29Z

Interesting work @hoodmane! So what do you think the next steps on this should be?
Threading #237 doesn't look that far away; most major browsers now support it, I think.

For this comlink demo, I think it would help to make this a bit more visible? Maybe move some of it to the pyodide org?

Otherwise, taking a different approach, isn't there some proxy that could change the MIME type of a response from binary to text/plain, so that we could still fetch it with pyodide.open_url? Either an external proxy or even a service worker? Though I guess the latter, even if it works, is not very different in complexity from running a web worker.

ricardoprins · 2022-05-23T18:35:11Z

I'm just too lazy to read everything - I confess.

Since this (and consequently #398) has a significant impact on pyscript (I'm surfing the hype as well), I want to help get this done. So, what are the necessary steps to finish this task (and, indirectly, to solve the requests issue)?

I wanna help, but I want to understand the "bigger picture" first.

hoodmane · 2022-05-23T19:06:40Z

@ricardoprins can we set up a meeting?

ricardoprins · 2022-05-23T19:20:50Z

Sure, that would be great.

hoodmane · 2022-05-23T19:24:34Z

It's weird that GitHub has no DM feature. I guess you could use private repos for that purpose as a hack.

iuriguilherme · 2022-06-10T06:26:05Z

I have a question. Why make it blocking when there's aiohttp?

rth · 2022-06-10T06:53:52Z

Because there are a lot of libraries that are sync and use http.client (either directly or via requests, etc.) and won't be able to use aiohttp or another async function as a replacement. Unless #2664 is implemented, but it would take time.

For the cases where async use is possible, we have added pyodide.http.pyfetch, which has a somewhat similar API (but without the session context manager).
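For illustration, a minimal sketch of async use with pyodide.http.pyfetch. The helper name `fetch_json` and its `fetch` parameter are hypothetical; the parameter exists only so the function can be exercised outside the browser with a stand-in. Inside Pyodide it falls back to the real `pyfetch` and relies on the `ok`, `status`, and `json()` members of the response object.

```python
import asyncio


async def fetch_json(url, fetch=None):
    """Fetch `url` and return its decoded JSON body."""
    if fetch is None:
        # Only importable when running inside Pyodide.
        from pyodide.http import pyfetch as fetch
    response = await fetch(url)
    if not response.ok:
        raise OSError(f"request failed with status {response.status}")
    return await response.json()
```

Inside Pyodide this would be awaited from async code, for example `data = await fetch_json(url)`. A library that calls http.client synchronously has no place to put that `await`, which is the limitation this issue is about.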

rtpg · 2022-09-02T09:39:40Z

Serious question: while synchronicity is a problem for JS because of the interaction model with the top level, if Pyodide's code evaluation entrypoints were all async (that is to say, runCode is also async), then the Python-level code could all be synchronous and things could be papered over with Asyncify at the Python/C FFI layer, maybe? After all, CPython itself already has a similar yielding concept in place.

Given there is in theory full control of the Python VM here I want to believe there is a way forward that doesn't involve too much pain.

hoodmane · 2022-09-02T15:37:55Z

In order for Asyncify to work, all C, C++, Rust, Fortran, etc. code would have to be linked with it, both in the main module and in side modules. I think the performance cost would be significant, and we would probably have to find and fix bugs in Asyncify. If someone does this and profiles it to be okay for performance, we might consider it. But I think the costs are too high.

twinsant · 2022-10-28T02:13:20Z

So, what's the progress?

ross-spencer · 2023-01-27T12:32:39Z

Can I ask a question about security?

My understanding currently is that if you try and POST data you'll get an error such as:

Traceback (most recent call last):
  File "<console>", line 1, in <module>
  File "/lib/python3.10/urllib/request.py", line 216, in urlopen
    return opener.open(url, data, timeout)
  File "/lib/python3.10/urllib/request.py", line 519, in open
    response = self._open(req, data)
  File "/lib/python3.10/urllib/request.py", line 536, in _open
    result = self._call_chain(self.handle_open, protocol, protocol +
  File "/lib/python3.10/urllib/request.py", line 496, in _call_chain
    result = func(*args)
  File "/lib/python3.10/urllib/request.py", line 1377, in http_open
    return self.do_open(http.client.HTTPConnection, req)
  File "/lib/python3.10/urllib/request.py", line 1351, in do_open
    raise URLError(err)
urllib.error.URLError: <urlopen error [Errno 23] Host is unreachable>

If this feature is implemented, will it be entirely up to the underlying library to "promise" not to forward data loaded into the browser to another source? Are there other controls on this type of issue?

rth · 2023-01-27T13:33:21Z

> Can I ask a question about security?

Sure. That error doesn't mean you cannot make that post request, only that you can't make it with urllib. Making it with pyodide.http.pyfetch (or via JS functions) would work. So generally libraries can make arbitrary network connections both when running on host Python and in the browser.
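As a rough sketch of such a POST, assuming the code runs inside Pyodide: `pyfetch` forwards its keyword arguments to the browser's fetch() as request options. The names `build_post_options` and `post_json` are hypothetical helpers introduced here, not part of Pyodide.

```python
import json


def build_post_options(payload):
    """Build fetch()-style options for a JSON POST request."""
    return {
        "method": "POST",
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps(payload),
    }


async def post_json(url, payload):
    # Only importable when running inside Pyodide.
    from pyodide.http import pyfetch

    response = await pyfetch(url, **build_post_options(payload))
    return response.status
```

The request is still subject to the browser's own rules, such as CORS checks on the target server's response.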

You can whitelist the allowed domains in the browser with CORS apparently.

alekssamos · 2023-05-20T19:15:54Z

Since PHP and WordPress exist, maybe you can make sockets?

I found a project where they compile PHP and SQLite and run WordPress.
https://wordpress.wasmlabs.dev/

I think PHP can use sockets somehow.
Otherwise, how does the browser interact with this PHP?
Maybe you can still add sockets to Pyodide?
Then libraries such as urllib, requests, aiohttp, and httpx would work.
Or is it impossible, so that it has to be done exclusively through JS?
And can WebSockets (ws) be done?

What do you think about it?

Yes, I read the FAQ (1, 2, 3) where it was mentioned, but since I found PHP + WordPress, I wanted to ask again.
I searched here; there have already been similar topics.
So, are these limitations of the WebAssembly virtual machine itself, of the browser, or restrictions only on the Pyodide side?

mdboom added the enhancement label on Sep 5, 2018
rth added the help wanted label on Jul 8, 2020
rth mentioned this issue on Dec 24, 2020: Stdlib Support? #933 (Closed)
This was referenced on Dec 31, 2020: Including PsychoPy in the packages directory #986 (Closed); Pyodide inside a Flask webapp? #487 (Closed)
rth mentioned this issue on Feb 10, 2021: Remove pyodide_py.open_url #1228 (Closed)
rth mentioned this issue on Apr 20, 2021: Use wheel package to create .whl if not available? #1501 (Open)
hoodmane mentioned this issue on Apr 20, 2021: Synchronous IO #1503 (Open)
hoodmane added the roadmap label on Apr 20, 2021
hoodmane mentioned this issue on Jul 3, 2021: pyodide - requests module is not available #398 (Closed)
rth mentioned this issue on Nov 13, 2021: Discussion / WIP: Add a very minimal requests shim and its dependencies #1956 (Closed)
rth mentioned this issue on Nov 27, 2021: Review CPython patches / tests and contribute upstream #2000 (Closed)
matthewfeickert mentioned this issue on Mar 25, 2022: Try to get pyhf to run in Pyolite / JupyterLite scikit-hep/pyhf#1775 (Closed)
ryanking13 mentioned this issue on Jun 8, 2022: Add python-socketio package #2670 (Open)
rth mentioned this issue on Sep 2, 2022: BUG: open_url works for binary files in the main thread #3062 (Closed)
ross-spencer mentioned this issue on Jan 30, 2023: Describe use of analytics in README simonw/datasette-lite#61 (Closed)
westurner mentioned this issue on Mar 27, 2023: [feat] Support JupyterLite notebooks Kanaries/pygwalker#36 (Closed)
ryanking13 mentioned this issue on May 11, 2023: Add stub for the _ssl module #529 (Closed)
JeffersGlass mentioned this issue on Jul 24, 2023: Requesting the module "Requests" to be part of pyodide #3997 (Closed)
joemarshall mentioned this issue on Dec 7, 2023: added requests and (direct from git) urllib3 #4332 (Merged)
