Add support for streaming LLM generation #680

FBR65 · 2024-02-29T06:58:08Z

Hi,

The error is thrown at line 61 of litellm.py:

results.append(result["choices"][0]["message"]["content"])

'CustomStreamWrapper' object is not subscriptable

The call is:

from txtai.pipeline import LLM

MODEL_NAME = "huggingface/TheBloke/leo-hessianai-70B-chat-GPTQ"
api_base = "http://127.0.0.1:8080"
llm = LLM(path=MODEL_NAME, method="litellm", api_base=api_base, stream=True)

Calling LiteLLM directly with stream=True works fine:

import litellm
from litellm import completion

MODEL_NAME = "huggingface/TheBloke/leo-hessianai-70B-chat-GPTQ"
messages = [{"content": "C", "role": "user"}] # LiteLLM follows the OpenAI format
api_base = "http://127.0.0.1:8080"

# Call the endpoint and print each streamed chunk as it arrives
response = completion(model=MODEL_NAME, messages=messages, api_base=api_base, stream=True)
for part in response:
    print(part.choices[0].delta.content or "")
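
To illustrate the difference between the two code paths, here is a minimal sketch that does not call LiteLLM at all. FakeStream and content are hypothetical names made up for this example: FakeStream only imitates a streaming response that can be iterated but not subscripted, and content shows one possible way to handle both response shapes. It is not how txtai or LiteLLM implement this.

```python
from types import SimpleNamespace


class FakeStream:
    """Hypothetical stand-in for a streaming response: iterable, not subscriptable."""

    def __init__(self, tokens):
        self.tokens = tokens

    def __iter__(self):
        # Yield chunks shaped like part.choices[0].delta.content
        for token in self.tokens:
            delta = SimpleNamespace(content=token)
            yield SimpleNamespace(choices=[SimpleNamespace(delta=delta)])


def content(result):
    """Return the full text from a subscriptable response or a stream of chunks."""
    if hasattr(result, "__getitem__"):
        # Non-streaming shape: the same lookup that fails on a stream
        return result["choices"][0]["message"]["content"]

    # Streaming shape: join the chunk deltas, treating a None delta as empty
    return "".join(part.choices[0].delta.content or "" for part in result)


if __name__ == "__main__":
    print(content({"choices": [{"message": {"content": "Hi"}}]}))
    print(content(FakeStream(["Hel", "lo", None])))
```

Subscripting a FakeStream raises the same kind of TypeError as the one reported above, which is why the streamed result has to be consumed by iteration instead.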

davidmezzetti · 2024-02-29T23:32:24Z

Thank you for the issue. I'll see if streaming support can be added in.

davidmezzetti · 2024-07-12T16:27:11Z

It's been a while on this but streaming will be added shortly.

davidmezzetti changed the title ~~Litelm streaming throws Error~~ Support streaming LLM Generation Jul 12, 2024

davidmezzetti self-assigned this Jul 12, 2024

davidmezzetti added this to the v7.3.0 milestone Jul 12, 2024

davidmezzetti changed the title ~~Support streaming LLM Generation~~ Add support for streaming LLM Generation Jul 12, 2024

davidmezzetti changed the title ~~Add support for streaming LLM Generation~~ Add support for streaming LLM generation Jul 12, 2024

davidmezzetti closed this as completed in dd1067c Jul 12, 2024
