feat: separate and optimize async and sync clients #4116

judahrand · 2023-08-11T11:57:23Z

What does this PR address?

A draft attempt at addressing #4115

Currently, this introduces a lot of duplicate code. However, I think it should be possible to reduce this significantly.

I believe by having separate sync and async implementations optimal performance can be achieved for both.

I'd be interested to get thoughts and feedback on this PR as well as advice on how to benchmark it against main as I believe it should see some performance benefit across multiple requests on the same client.

Fixes #(issue)

Before submitting:

Does the Pull Request follow Conventional Commits specification naming? Here are GitHub's
guide on how to create a pull request.
Does the code follow BentoML's code style, pre-commit run -a script has passed (instructions)?
Did you read through contribution guidelines and follow development guidelines?
Did your changes require updates to the documentation? Have you updated
those accordingly? Here are documentation guidelines and tips on writting docs.
Did you write tests to cover your changes?

sauyon

For backwards compatibility, the client implementations will have to retain the sync/async methods; perhaps the default client could instantiate a sync / async client?

judahrand · 2023-08-15T09:10:56Z

For backwards compatibility, the client implementations will have to retain the sync/async methods; perhaps the default client could instantiate a sync / async client?

@sauyon The latest changes should maintain backwards compatibility. Is this sort of a solution acceptable? If so I'll push forwards with getting the tests passing and perhaps also look at splitting the tests up by sync and async implementation.

src/bentoml/_internal/client/__init__.py

judahrand · 2023-08-17T11:01:37Z

I think that this PR has gotten a bit muddled. This is something which I am keen to address as at the moment I think the client implementations are quite poor. However, I may take another stab at this later understanding more about all the various moving parts.

There seems to be a lot which needs changes to:

make the synchronous clients work reliably and efficiently
make the async clients work efficiently
separate the client implementation from the core BentoML package to avoid carrying huge numbers of irrelevant dependencies around in client side software.

src/bentoml/_internal/client/__init__.py

sauyon

A few things that I think need changing. Haven't reviewed the async client code entirely carefully yet.

@aarnphm can you help review the grpc client?

src/bentoml/_internal/client/__init__.py

src/bentoml/_internal/client/http.py

src/bentoml/_internal/io_descriptors/base.py

sauyon · 2023-08-18T06:10:23Z

I think that this PR has gotten a bit muddled. This is something which I am keen to address as at the moment I think the client implementations are quite poor. However, I may take another stab at this later understanding more about all the various moving parts.

I think apart from the IO descriptor change we should get this in, then probably circle back after we rework those.

There seems to be a lot which needs changes to:

make the synchronous clients work reliably and efficiently

make the async clients work efficiently

Yep.

Somewhat related, there was at one point an issue where connections were being dropped and we were running into strange errors when we reused a connection which we haven't gotten to the bottom of, which I probably could have added a comment for, but we really need to figure that out.

separate the client implementation from the core BentoML package to avoid carrying huge numbers of irrelevant dependencies around in client side software.

We probably need to think more carefully about this one in general; a lot of our dependencies are only required for servers / only required for clients, etc. @parano @bojiang something to think about, I think.

aarnphm · 2023-08-18T15:19:22Z

FWIW I think we can separate out a bentoml-client package, and by default bentoml will include bentoml-client

judahrand · 2023-08-18T16:14:53Z

FWIW I think we can separate out a bentoml-client package, and by default bentoml will include bentoml-client

Agreed. I also wonder if the serialization/deserialization logic should live in a separate package (given that both the client and server will want to depend on it). This is something a lot of Rust projects do very very well (see arrow-rs). They split out all their subcomponents into sub crates.

I believe PDM might even be able to cope with this setup in Python? https://pdm.fming.dev/latest/usage/advanced/#use-pdm-to-manage-a-monorepo

aarnphm · 2023-08-18T16:16:05Z

FWIW I think we can separate out a bentoml-client package, and by default bentoml will include bentoml-client

Agreed. I also wonder if the serialization/deserialization logic should live in a separate package (given that both the client and server will want to depend on it). This is something a lot of Rust projects do very very well (see arrow-rs). They split out all their subcomponents into sub crates.

I believe PDM might even be able to cope with this setup in Python? pdm.fming.dev/latest/usage/advanced/#use-pdm-to-manage-a-monorepo

this will be added to bentoml-core 🎉

aarnphm · 2023-08-18T16:17:54Z

@ssheng do you think it is good to refactor our repo into monorepo architecture now?

Maybe good to also separate SDK and CLI as well as client

We might also want to think about bentoml-core development 🤔

sauyon · 2023-08-18T23:19:59Z

I don't know that it needs to be a monorepo.

sauyon · 2023-08-29T23:45:21Z

Would it be ok if I took this one over so we can get it in while we work on the rework?

judahrand · 2023-08-30T10:01:09Z

Would it be ok if I took this one over so we can get it in while we work on the rework?

100% - sorry for letting this one lag. Various demands on time etc etc etc - same as everyone else 🤣

frostming

I am mostly fine with it.

src/bentoml/_internal/client/__init__.py

src/bentoml/_internal/client/http.py

Co-Authored-By: Judah Rand <17158624+judahrand@users.noreply.github.com>

For more information, see https://pre-commit.ci

sauyon · 2023-10-10T04:11:03Z

@aarnphm @frostming should be ready for another look!

For more information, see https://pre-commit.ci

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

judahrand requested a review from a team as a code owner August 11, 2023 11:57

judahrand requested review from sauyon and removed request for a team August 11, 2023 11:57

judahrand marked this pull request as draft August 11, 2023 11:57

judahrand force-pushed the client-attempt-2 branch 3 times, most recently from c14b693 to 46e8dd5 Compare August 11, 2023 12:07

sauyon requested changes Aug 14, 2023

View reviewed changes

judahrand force-pushed the client-attempt-2 branch from 1fd8efa to 34940ff Compare August 15, 2023 09:10

judahrand force-pushed the client-attempt-2 branch 2 times, most recently from 45c29c8 to e916f29 Compare August 15, 2023 10:44

aarnphm reviewed Aug 16, 2023

View reviewed changes

src/bentoml/_internal/client/__init__.py Outdated Show resolved Hide resolved

judahrand requested a review from aarnphm August 16, 2023 14:00

judahrand force-pushed the client-attempt-2 branch 2 times, most recently from 5077691 to 5b3eefa Compare August 16, 2023 14:29

sauyon reviewed Aug 18, 2023

View reviewed changes

src/bentoml/_internal/client/__init__.py Show resolved Hide resolved

sauyon reviewed Aug 18, 2023

View reviewed changes

judahrand force-pushed the client-attempt-2 branch from cc85ea2 to 27f5279 Compare August 18, 2023 12:07

sauyon mentioned this pull request Aug 29, 2023

feat(http-client): async and sync implementation #3845

Closed

5 tasks

sauyon mentioned this pull request Aug 30, 2023

refactor(client): move async and sync implementation #3853

Closed

frostming reviewed Oct 10, 2023

View reviewed changes

src/bentoml/_internal/client/__init__.py Outdated Show resolved Hide resolved

src/bentoml/_internal/client/http.py Outdated Show resolved Hide resolved

sauyon force-pushed the client-attempt-2 branch from 9dd5261 to 943982c Compare October 10, 2023 02:27

larme added this to the 1.1.7 milestone Oct 10, 2023

sauyon force-pushed the client-attempt-2 branch from 943982c to 25c5a67 Compare October 10, 2023 03:54

sauyon and others added 6 commits October 9, 2023 21:05

add httpx dependency

b05de32

separate sync and async clients

3de5ff5

Co-Authored-By: Judah Rand <17158624+judahrand@users.noreply.github.com>

ci: auto fixes from pre-commit.ci

07ed947

For more information, see https://pre-commit.ci

address review comments and some other fixes

d8a1d71

ci: auto fixes from pre-commit.ci

a464a04

For more information, see https://pre-commit.ci

set default timeout to 300s

3e1cd6b

sauyon force-pushed the client-attempt-2 branch from 6d24963 to 3e1cd6b Compare October 10, 2023 04:10

minor fixes

fc8b726

sauyon force-pushed the client-attempt-2 branch from 08803c5 to fc8b726 Compare October 10, 2023 04:18

sauyon and others added 2 commits October 9, 2023 21:38

fix sync client

059bb07

ci: auto fixes from pre-commit.ci

dd3139c

For more information, see https://pre-commit.ci

aarnphm previously approved these changes Oct 12, 2023

View reviewed changes

merge: branch 'main'@github.com:bentoml/BentoML -> client-attempt-2

4bcecdd

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

aarnphm dismissed their stale review via 4bcecdd October 12, 2023 18:09

aarnphm approved these changes Oct 12, 2023

View reviewed changes

aarnphm merged commit 1e8902a into bentoml:main Oct 12, 2023
1 of 41 checks passed

sauyon mentioned this pull request Oct 31, 2023

bug: connects are created for every request #4115

Open

aarnphm mentioned this pull request Nov 8, 2023

infra: update to use Ruff formatter #4269

Merged

This was referenced Nov 17, 2023

chore(version): checking using importlib.metadata #4285

Closed

fix(dependencies): lock cattrs<23.2 for now #4292

Merged

docs: update quickstart with OpenLLM #4295

Merged

fix(docs): correct server implementation #4297

Merged

This was referenced Dec 22, 2023

feat: support .python-version symlink #4354

Merged

fix(with_config): annotate return type #4355

Merged

chore(generated): new stubs for proto 4 #4374

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: separate and optimize async and sync clients #4116

feat: separate and optimize async and sync clients #4116

judahrand commented Aug 11, 2023 •

edited

sauyon left a comment

judahrand commented Aug 15, 2023 •

edited

judahrand commented Aug 17, 2023

sauyon left a comment

sauyon commented Aug 18, 2023 •

edited

aarnphm commented Aug 18, 2023

judahrand commented Aug 18, 2023

aarnphm commented Aug 18, 2023 •

edited

aarnphm commented Aug 18, 2023

sauyon commented Aug 18, 2023

sauyon commented Aug 29, 2023

judahrand commented Aug 30, 2023

frostming left a comment

sauyon commented Oct 10, 2023

feat: separate and optimize async and sync clients #4116

feat: separate and optimize async and sync clients #4116

Conversation

judahrand commented Aug 11, 2023 • edited

What does this PR address?

Before submitting:

sauyon left a comment

Choose a reason for hiding this comment

judahrand commented Aug 15, 2023 • edited

judahrand commented Aug 17, 2023

sauyon left a comment

Choose a reason for hiding this comment

sauyon commented Aug 18, 2023 • edited

aarnphm commented Aug 18, 2023

judahrand commented Aug 18, 2023

aarnphm commented Aug 18, 2023 • edited

aarnphm commented Aug 18, 2023

sauyon commented Aug 18, 2023

sauyon commented Aug 29, 2023

judahrand commented Aug 30, 2023

frostming left a comment

Choose a reason for hiding this comment

sauyon commented Oct 10, 2023

judahrand commented Aug 11, 2023 •

edited

judahrand commented Aug 15, 2023 •

edited

sauyon commented Aug 18, 2023 •

edited

aarnphm commented Aug 18, 2023 •

edited