
refactor assistant streaming and create OpenAI compliant base class #425

Merged

pmeier merged 5 commits into main from http-api-assistants on May 28, 2024

Conversation


@pmeier pmeier commented May 27, 2024

This came from an offline discussion with @nenb and supersedes #424. It also paves the way for #375.

The two main changes are:

  1. Factor out the streaming protocol logic, i.e. SSE and JSONL streaming, to avoid code duplication and to make switching easy when the protocol is the only difference between assistants (see the sketch below). The latter part led directly to 2.
  2. Implement a generic OpenAI-compliant base class that allows selecting the streaming protocol and an arbitrary URL, as well as optional model selection.

More details inline.
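To illustrate the first change, here is a minimal sketch of what factored-out protocol helpers could look like. The function names, the httpx usage, and the exact parsing are assumptions for illustration, not the code from this PR:

import json
from typing import Any, AsyncIterator

import httpx


async def sse_stream(
    client: httpx.AsyncClient, url: str, **kwargs: Any
) -> AsyncIterator[dict]:
    # SSE: payloads arrive as "data: {...}" lines; OpenAI terminates the
    # stream with the sentinel "data: [DONE]".
    async with client.stream("POST", url, **kwargs) as response:
        async for line in response.aiter_lines():
            if not line.startswith("data:"):
                continue
            data = line[len("data:") :].strip()
            if data == "[DONE]":
                break
            yield json.loads(data)


async def jsonl_stream(
    client: httpx.AsyncClient, url: str, **kwargs: Any
) -> AsyncIterator[dict]:
    # JSONL: every non-empty line is a complete JSON object.
    async with client.stream("POST", url, **kwargs) as response:
        async for line in response.aiter_lines():
            if line.strip():
                yield json.loads(line)

With helpers of this shape, an assistant only has to pick which generator to consume; the request-building code stays identical.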

pmeier (Member, Author)

This file was renamed to _http_api.py but has enough changes for git to not recognize it as such.



class HttpApiAssistant(Assistant):
    _API_KEY_ENV_VAR: Optional[str]
pmeier (Member, Author)

The API key is now optional. See #375 for a discussion.
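For illustration only, the lookup could then be skipped when no variable is named. The helper below is hypothetical, not code from this PR:

import os
from typing import Optional


def resolve_api_key(env_var: Optional[str]) -> Optional[str]:
    # Hypothetical helper: assistants that need no key (e.g. ones talking
    # to a local server) set _API_KEY_ENV_VAR = None and skip the lookup.
    if env_var is None:
        return None
    return os.environ.get(env_var)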

@@ -21,8 +21,8 @@ def _make_system_content(self, sources: list[Source]) -> str:
        )
        return instruction + "\n\n".join(source.content for source in sources)

async def _call_api(
pmeier (Member, Author)

The def _call_api abstraction for def answer was just a remnant of an old implementation that I forgot to clean up earlier:

async def answer(
    self, prompt: str, sources: list[Source], *, max_new_tokens: int = 256
) -> AsyncIterator[str]:
    async for chunk in self._call_api(
        prompt, sources, max_new_tokens=max_new_tokens
    ):
        yield chunk

This PR removes it, and all subclasses simply implement def answer directly.
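With the indirection gone, a subclass now looks roughly like this; a sketch, where the _stream helper name is an assumption standing in for each assistant's actual request logic:

class SomeAssistant(HttpApiAssistant):
    async def answer(
        self, prompt: str, sources: list[Source], *, max_new_tokens: int = 256
    ) -> AsyncIterator[str]:
        # No _call_api indirection anymore: stream the HTTP response and
        # yield text chunks straight from here.
        async for chunk in self._stream(prompt, sources, max_new_tokens):
            yield chunk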



-class AnthropicApiAssistant(ApiAssistant):
+class AnthropicAssistant(HttpApiAssistant):
pmeier (Member, Author)

Drive-by rename to align it with the other provider base classes.

pmeier (Member, Author)

This is a demonstration of how easy it is, after this PR, to add new OpenAI-compliant assistants.

yield cast(str, choice["delta"]["content"])


class OpenaiAssistant(OpenaiCompliantHttpApiAssistant):
pmeier (Member, Author)

The public OpenAI API fits the new scheme nicely.
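To make the "new scheme" concrete: adding another OpenAI-compliant provider might boil down to a handful of class attributes. The attribute names below are assumptions modelled on the PR description, not necessarily the exact names in the diff:

class MyProviderAssistant(OpenaiCompliantHttpApiAssistant):
    # Hypothetical provider: streams JSONL and serves a single model,
    # so no model field is sent in the request.
    _STREAMING_PROTOCOL = "jsonl"  # "sse" for OpenAI / Azure OpenAI
    _MODEL = None

    @property
    def _url(self) -> str:
        base_url = os.environ.get("MY_PROVIDER_BASE_URL", "http://localhost:9000")
        return f"{base_url}/v1/chat/completions"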

Comment on lines +21 to +24
@pytest.mark.parametrize(
    "assistant",
    [assistant for assistant in HTTP_API_ASSISTANTS if assistant._API_KEY_ENV_VAR],
)
pmeier (Member, Author)

See #375


@property
def _url(self) -> str:
    base_url = os.environ.get("RAGNA_LLAMAFILE_BASE_URL", "http://localhost:8080")
pmeier (Member, Author)

@nenb is port 8080 the default?

nenb (Contributor)

This seems to be the case.


pmeier commented May 27, 2024

If this PR is accepted, I'll have a go at #376 and bring it up to speed.

@nenb nenb (Contributor) left a comment

Great work, delighted that ragna can connect with many local LLMs so easily now, thank you!



pmeier commented May 28, 2024

Touching on #424 (comment):

> me having a different (incorrect?) definition of what a compliant API is

That is certainly up for debate. I think the most practical definition, given the variety of cases here, is: any REST API is OpenAI-compliant if it uses the same request and response schema as OpenAI.

For practicality reasons, we allow the following deviations:

  • The model can be passed in the request (OpenAI, Ollama), but doesn't have to be when the deployment only serves a single model (Azure OpenAI, Llamafile). A sketch follows below.
  • Streaming can be performed either with SSE (OpenAI, Azure OpenAI) or with JSONL (Llamafile, Ollama).
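A sketch of what the first deviation amounts to when building the request body. The helper is hypothetical; the field names follow OpenAI's chat completions schema:

from typing import Optional


def build_chat_request(prompt: str, model: Optional[str]) -> dict:
    body = {
        "messages": [{"role": "user", "content": prompt}],
        "stream": True,
    }
    if model is not None:
        # Omitted for single-model deployments like Azure OpenAI or Llamafile.
        body["model"] = model
    return body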

@pmeier pmeier merged commit a45bd90 into main May 28, 2024
10 checks passed
@pmeier pmeier deleted the http-api-assistants branch May 28, 2024 07:32