Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for streaming LLM generation #680

Closed
FBR65 opened this issue Feb 29, 2024 · 2 comments
Closed

Add support for streaming LLM generation #680

FBR65 opened this issue Feb 29, 2024 · 2 comments
Assignees
Milestone

Comments

@FBR65
Copy link

FBR65 commented Feb 29, 2024

Hi,

the Error is thrown in line 61 litellm.py:


results.append(result["choices"][0]["message"]["content"])

'CustomStreamWrapper' object is not subscriptable


Call is
from txtai.pipeline import LLM

MODEL_NAME = "huggingface/TheBloke/leo-hessianai-70B-chat-GPTQ"
llm = LLM(path=MODEL_NAME,method="litellm", api_base=api_base,stream=True)


This works fine:

import litellm
from litellm import completion

MODEL_NAME = "huggingface/TheBloke/leo-hessianai-70B-chat-GPTQ"
messages = [{"content": "C", "role": "user"}] # LiteLLM follows the OpenAI format
api_base = "http://127.0.0.1:8080"

CALLING ENDPOINT

response=completion(model=MODEL_NAME, messages=messages, api_base=api_base,stream=True)
for part in response:
print(part.choices[0].delta.content or "")

@davidmezzetti
Copy link
Member

Thank you for the issue. I'll see if streaming support can be added in.

@davidmezzetti davidmezzetti changed the title Litelm streaming throws Error Support streaming LLM Generation Jul 12, 2024
@davidmezzetti
Copy link
Member

It's been a while on this but streaming will be added shortly.

@davidmezzetti davidmezzetti self-assigned this Jul 12, 2024
@davidmezzetti davidmezzetti added this to the v7.3.0 milestone Jul 12, 2024
@davidmezzetti davidmezzetti changed the title Support streaming LLM Generation Add support for streaming LLM Generation Jul 12, 2024
@davidmezzetti davidmezzetti changed the title Add support for streaming LLM Generation Add support for streaming LLM generation Jul 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants