
feat(0.5): update support for OpenLLM 0.5 #22442

Closed
wants to merge 7 commits into from

Conversation

Contributor

@aarnphm aarnphm commented Jun 3, 2024

This PR updates the internal wrapper for OpenLLM

Since OpenLLM v0.5 is a breaking release, I have also taken this opportunity to separate the API and local inference components into two separate classes:

  • OpenLLMAPI will be responsible for handling remote servers going forward (async, sync, and streaming supported)
  • OpenLLM will now be used only for local inference, and will support only batching and synchronous generation.

Hopefully this makes it a bit more future-proof. In terms of testing, I have run through some manual workflows and will include tests on our end.

updates from #19894, cc @baskaryan

Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>

vercel bot commented Jun 3, 2024

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name      | Status   | Preview       | Updated (UTC)
langchain | ✅ Ready | Visit Preview | Jun 4, 2024 10:29pm

@aarnphm aarnphm marked this pull request as ready for review June 3, 2024 20:45
@dosubot dosubot bot added size:XL This PR changes 500-999 lines, ignoring generated files. 🤖:refactor A large refactor of a feature(s) or restructuring of many files labels Jun 3, 2024
Comment on lines -126 to -132
model_name: Optional[str] = None,
*,
model_id: Optional[str] = None,
server_url: Optional[str] = None,
server_url: str,
timeout: int = 30,
server_type: Literal["grpc", "http"] = "http",
embedded: bool = True,
Collaborator

should we keep around the old args and raise a warning if they're passed in, so that this isn't a breaking change?
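A hypothetical sketch of that suggestion is below. The deprecated argument names are taken from the diff above (model_name, embedded, server_type), but the class name and handling logic here are illustrative assumptions, not code from this PR.

```python
import warnings


class OpenLLMAPICompat:
    """Toy sketch: accept the removed keyword args and warn instead of breaking."""

    # Argument names removed in the 0.5 signature (per the diff above).
    _DEPRECATED = ("model_name", "embedded", "server_type")

    def __init__(self, server_url: str, timeout: int = 30, **kwargs):
        for name in self._DEPRECATED:
            if name in kwargs:
                warnings.warn(
                    f"`{name}` is no longer used as of OpenLLM 0.5 "
                    "and will be ignored.",
                    DeprecationWarning,
                    stacklevel=2,
                )
                kwargs.pop(name)
        if kwargs:
            # Anything left over is a genuine typo, not a legacy arg.
            raise TypeError(f"unexpected keyword arguments: {sorted(kwargs)}")
        self.server_url = server_url
        self.timeout = timeout
```

With this pattern, old call sites keep working for a deprecation cycle while emitting a DeprecationWarning, and only truly unknown keywords raise.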

Contributor Author

Sure, I think we can do that.

Contributor Author

Oh, this is for the new class OpenLLMAPI; for OpenLLM we can keep backward compat.

Collaborator

ah, missed the rename :+1:

Contributor Author

aarnphm commented Jun 4, 2024

@baskaryan since the OpenLLM implementation here won't support _acall or async anymore, it would be a breaking change regardless.

Contributor Author

aarnphm commented Jun 15, 2024

bump @baskaryan when you have bandwidth. sorry for the ping :)

Labels
community Related to langchain-community 🤖:refactor A large refactor of a feature(s) or restructuring of many files size:XL This PR changes 500-999 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants